Draft:DuXiu

DuXiu
Native name
读秀
Type of site
Digital library, Bibliographic database
Available inChinese
Country of originChina
OwnerSuperStar Digital Library (Chaoxing / 超星)
URLwww.duxiu.com
Launched2007
Content license
Subscription (institutional)

DuXiu (pinyin: Dúxiù; lit. 'reading excellence'), also known as the DuXiu Knowledge Search Database or DuXiu Academic Search, is a Chinese digital library and full-text bibliographic search platform developed and operated by SuperStar Digital Library (Chinese: 超星数字图书馆, also known as Chaoxing). Launched in 2007, it is the largest digital library of books published in mainland China and is widely used by universities and research institutions across China and internationally.[1]

Overview

DuXiu provides full-text search capabilities across a catalog of over three million Chinese-language book volumes, spanning all major academic disciplines including humanities, social sciences, natural sciences, engineering, and medicine.[2] The platform is distinguished by allowing users to search within the text of books rather than only searching titles or metadata. Access to full content is provided through institutional subscription; users may preview a limited number of pages online, and may request up to 50 pages or 20 percent of a book (whichever is smaller) delivered by email per week through a document delivery system embedded in the platform.[3]

DuXiu's collection covers virtually all books published in the People's Republic of China from 1949 onward, as well as a significant portion of pre-1949 publications, making it an essential resource for researchers in Chinese studies, history, and literature.[4]

Piracy and shadow library copies

The books in DuXiu were developed through large-scale digitization of Chinese publications by the SuperStar Digital Library Group, primarily to supply institutional subscribers. According to Anna's Archive, DuXiu content had long circulated informally on the Chinese internet, typically sold by resellers for less than one yuan per title, and was also distributed via Chinese file-hosting services.[5]

In November 2023, Anna's Archive announced it had acquired a full copy of the DuXiu collection, which it described as "the largest Chinese non-fiction book collection in the world," comprising approximately 200 TB of scanned academic books.[5] The collection was subsequently released publicly without embargo, completing what Anna's Archive described as a two-year effort to archive Chinese-language content from multiple sources.[6]

See also

References

  1. ^ Li, Aiguo (2009). "Digitizing Chinese Books: A Case Study of the SuperStar DuXiu Scholar Search Engine". D-Lib Magazine. Retrieved 2026-06-03.
  2. ^ "DuXiu Knowledge Search Database". Duke University Libraries. Retrieved 2026-06-03.
  3. ^ "Duxiu / 读秀". Cornell University Library. Retrieved 2026-06-03.
  4. ^ "Duxiu — what the?". Johns Hopkins University Sheridan Libraries Blog. September 2012. Retrieved 2026-06-03.
  5. ^ a b Anna Archivist (November 4, 2023). "Exclusive access for LLM companies to largest Chinese non-fiction book collection in the world". Anna's Blog. Retrieved 2026-06-03.
  6. ^ Anna Archivist. "We finished the Chinese release". Anna's Blog. Retrieved 2026-06-03.

Content Disclaimer

Informasi ini disarikan dari Wikipedia dan disajikan kembali untuk tujuan edukasi. Konten tersedia di bawah lisensi CC BY-SA 3.0. Kami tidak bertanggung jawab atas ketidakakuratan data yang bersumber dari kontribusi publik tersebut.

  1. The information displayed on this website is sourced in part or in whole from Wikipedia and has been adapted for the purpose of restating it. We strive to provide accurate and relevant information, however:
  2. There is no guarantee of absolute accuracy. Wikipedia is an open, collaborative project that can be edited by anyone, so information is subject to change.
  3. It is not intended to constitute professional advice. The content displayed is for informational and educational purposes only. For important decisions (e.g., medical, legal, or financial), please consult a professional.
  4. Content copyright. Wikipedia is licensed under the Creative Commons Attribution-ShareAlike License (CC BY-SA). This means that content may be reused with appropriate attribution and shared under a similar license.
  5. Responsible use. Any risk arising from the use of information from this website is entirely the responsibility of the user.