@@ -13,7 +13,6 @@ PyTokenCounter is a Python library designed to simplify text tokenization and to
1313 - [ String Tokenization and Counting] ( #string-tokenization-and-counting )
1414 - [ File and Directory Tokenization and Counting] ( #file-and-directory-tokenization-and-counting )
1515 - [ Token Mapping] ( #token-mapping )
16- - [ Ignored Files] ( #ignored-files )
1716- [ Maintainers] ( #maintainers )
1817- [ Acknowledgements] ( #acknowledgements )
1918- [ Contributing] ( #contributing )
@@ -1035,27 +1034,6 @@ print(mapped)
10351034
10361035---
10371036
1038- ## Ignored Files
1039-
1040- When the functions are set to exclude binary files (default behavior), the following file extensions are ignored:
1041-
1042- | Category | Extensions |
1043- | ---------------------------------| -----------------------------------------------------------------------------------------------------------------------------------------------------------------|
1044- | ** Image formats** | ` .png ` , ` .jpg ` , ` .jpeg ` , ` .gif ` , ` .bmp ` , ` .webp ` , ` .avif ` , ` .tiff ` , ` .tif ` , ` .ico ` , ` .svgz ` |
1045- | ** Video formats** | ` .mp4 ` , ` .mkv ` , ` .mov ` , ` .avi ` , ` .wmv ` , ` .flv ` , ` .webm ` , ` .m4v ` , ` .mpeg ` , ` .mpg ` , ` .3gp ` , ` .3g2 ` |
1046- | ** Audio formats** | ` .mp3 ` , ` .wav ` , ` .flac ` , ` .ogg ` , ` .aac ` , ` .m4a ` , ` .wma ` , ` .aiff ` , ` .ape ` , ` .opus ` |
1047- | ** Compressed archives** | ` .zip ` , ` .rar ` , ` .7z ` , ` .tar ` , ` .gz ` , ` .bz2 ` , ` .xz ` , ` .lz ` , ` .zst ` , ` .cab ` , ` .deb ` , ` .rpm ` , ` .pkg ` |
1048- | ** Disk images** | ` .iso ` , ` .dmg ` , ` .img ` , ` .vhd ` , ` .vmdk ` |
1049- | ** Executables & Libraries** | ` .exe ` , ` .msi ` , ` .bat ` , ` .dll ` , ` .so ` , ` .bin ` , ` .o ` , ` .a ` , ` .dylib ` |
1050- | ** Fonts** | ` .ttf ` , ` .otf ` , ` .woff ` , ` .woff2 ` , ` .eot ` |
1051- | ** Documents** | ` .pdf ` , ` .ps ` , ` .eps ` |
1052- | ** Design & Graphics** | ` .psd ` , ` .ai ` , ` .indd ` , ` .sketch ` |
1053- | ** 3D & CAD files** | ` .blend ` , ` .stl ` , ` .step ` , ` .iges ` , ` .fbx ` , ` .glb ` , ` .gltf ` , ` .3ds ` , ` .obj ` , ` .cad ` |
1054- | ** Virtual Machines & Firmware** | ` .qcow2 ` , ` .vdi ` , ` .vhdx ` , ` .rom ` , ` .bin ` , ` .img ` |
1055- | ** Miscellaneous binaries** | ` .dat ` , ` .pak ` , ` .sav ` , ` .nes ` , ` .gba ` , ` .nds ` , ` .iso ` , ` .jar ` , ` .class ` , ` .wasm ` |
1056-
1057- ---
1058-
10591037## Maintainers
10601038
10611039- [ Kaden Gruizenga] ( https://github.com/kgruiz )
0 commit comments