Support Chinese Characters (non-ASCII) by UTF-8 by ElevenyChen · Pull Request #40 · radiolarian/AO3Scraper

ElevenyChen · 2023-12-16T13:55:57Z

1. Chinese Character Support: The script now correctly handles Chinese characters in CSV and text file outputs.
2. Progress Tracking: It prints the scraping progress, showing the current work ID and count, enhancing user experience during long scraping sessions.
3. Richer Text File Output: Output text files are now reformatted to include detailed metadata such as title, author, rating, relationship, language, status, and chapters, offering more context for each work.
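
For readers who want a sense of what these three changes involve, here is a minimal sketch. It is not the code from this PR: the function name `write_works`, the dictionary keys, and the sample work are all hypothetical. It assumes the essential fix is to open the output files with an explicit `encoding="utf-8"`, and it shows a progress line and a metadata header in the text output.

```python
import csv
import os
import tempfile


def write_works(works, csv_file, txt_file):
    """Write works to a CSV file and a text file, printing progress.

    `works` is a list of dicts. The keys used here are hypothetical
    and are not taken from the PR.
    """
    fields = ["work_id", "title", "author", "language"]
    # extrasaction="ignore" skips keys (such as "body") that are not CSV columns.
    writer = csv.DictWriter(csv_file, fields, extrasaction="ignore")
    writer.writeheader()
    total = len(works)
    for count, work in enumerate(works, start=1):
        # Progress line: current work ID and running count.
        print(f"Scraping work {work['work_id']} ({count}/{total})")
        writer.writerow(work)
        # Metadata header in the text output, followed by the work body.
        for field in fields[1:]:
            txt_file.write(f"{field.capitalize()}: {work[field]}\n")
        txt_file.write("\n" + work["body"] + "\n\n")


# Hypothetical sample data containing Chinese characters.
works = [{
    "work_id": "12345",
    "title": "月光",
    "author": "example_author",
    "language": "中文",
    "body": "正文内容。",
}]

out_dir = tempfile.mkdtemp()
csv_path = os.path.join(out_dir, "works.csv")
txt_path = os.path.join(out_dir, "works.txt")

# An explicit encoding avoids depending on the platform default;
# newline="" is what the csv module expects for files it writes to.
with open(csv_path, "w", encoding="utf-8", newline="") as csv_file, \
        open(txt_path, "w", encoding="utf-8") as txt_file:
    write_works(works, csv_file, txt_file)

with open(csv_path, encoding="utf-8") as f:
    csv_text = f.read()
with open(txt_path, encoding="utf-8") as f:
    txt_text = f.read()
```

If the CSV is meant to be opened in a spreadsheet program that does not detect UTF-8 on its own, `encoding="utf-8-sig"` (which writes a byte order mark) may be a better choice for the CSV file.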

caizhuoyue77 · 2024-06-05T02:33:36Z

When I try to scrape Chinese articles, I can't get the Chinese characters, only the Pinyin. Why?

ElevenyChen · 2024-06-05T02:57:10Z

> When I try to scrape Chinese articles, I can't get the Chinese characters, only the Pinyin.

Hi, I've successfully scraped Chinese characters with my version of the code. See an example. Since the owner of the main branch hasn't responded to my request, you can use the version from my fork directly.

ElevenyChen added 2 commits December 16, 2023 05:47

Add files via upload

bf026ee

1. Support Chinese characters in CSV and output .txt files now! 2. Print out working progress when scraping

Add files via upload

806115a

3. Reformatted the .txt files. Include more information about the work.

ElevenyChen closed this Jun 5, 2024

ElevenyChen reopened this Jun 5, 2024

ElevenyChen wants to merge 2 commits into radiolarian:master from ElevenyChen:master
