bart-large-chinese

Maintained By
fnlp

BART-Large-Chinese

PropertyValue
Parameter Count407M
Model TypeText-to-Text Generation
ArchitectureBART Large
PaperCPT: A Pre-Trained Unbalanced Transformer

What is bart-large-chinese?

BART-Large-Chinese is a sophisticated text-to-text generation model specifically designed for Chinese language tasks. Developed by FNLP, this model represents a significant advancement in Chinese natural language processing, featuring an expanded vocabulary of 51,271 tokens and extended position embeddings up to 1024 tokens.

Implementation Details

The model incorporates several technical improvements from its previous versions, including a comprehensive vocabulary update that encompasses traditional Chinese characters and removes redundant tokens. The architecture leverages the BART framework while being optimized for Chinese language characteristics.

  • Extended max position embeddings from 512 to 1024
  • Enhanced vocabulary with 6,800+ additional Chinese characters
  • Optimized token embedding structure
  • F32 tensor type for precise computations

Core Capabilities

  • Text generation and completion
  • Summarization (demonstrated by LCSTS benchmark)
  • Classification tasks (shown in AFQMC and IFLYTEK benchmarks)
  • Sequence-to-sequence transformations

Frequently Asked Questions

Q: What makes this model unique?

The model stands out for its comprehensive Chinese language support, including both simplified and traditional characters, along with its robust performance across various NLP tasks, achieving strong results in benchmarks like AFQMC (75.81%) and LCSTS (40.90%).

Q: What are the recommended use cases?

This model is particularly well-suited for Chinese text generation tasks, summarization, and general sequence-to-sequence applications. It's important to note that users should utilize BertTokenizer instead of the original BartTokenizer for optimal performance.

The first platform built for prompt engineering