232feabb24 Iterating over the year directories is (mostly? entirely?) redundant with the numbered directories.
Leonard Richardson
2013-12-25 22:06:27 -05:00
cb3e7a950b Added Rosetta stone gibberish.
Leonard Richardson
2013-12-18 09:13:28 -05:00
1eb08cb1ff Yield the first token.
Leonard Richardson
2013-12-16 12:07:40 -05:00
5858e73870 Added a SentenceAssembler to queneau.
Leonard Richardson
2013-12-15 10:58:16 -05:00
d2258334cf Added a couple more links.
Leonard Richardson
2013-12-15 10:34:11 -05:00
34ec933758 Improved README
Leonard Richardson
2013-12-15 10:30:25 -05:00
1ed840ab70 Added a class for diagnosing Unicode strings and a few more alphabets.
Leonard Richardson
2013-12-15 10:16:06 -05:00
daa063a32e Made example filename more generic.
Leonard Richardson
2013-12-04 09:19:17 -05:00
8b34e9f8d1 Added disclaimer.
Leonard Richardson
2013-12-03 17:08:46 -05:00
277f8851f5 Added a very simple scheduler because I'm sick of dealing with huge standard deviations.
Leonard Richardson
2013-12-03 17:01:25 -05:00
36c297f8a9 Added a port of the word filter.
Leonard Richardson
2013-12-01 21:22:44 -05:00
1d74cebcad Added as much of a modifier alphabet as I could find.
Leonard Richardson
2013-12-01 06:57:23 -05:00
75506ae8e4 Added another indicator of the start of the text.
Leonard Richardson
2013-11-30 18:41:56 -05:00
0abe661395 Correctly identify the etext ID from a numeric filename.
Leonard Richardson
2013-11-30 18:14:26 -05:00
d0cdf7f945 Automatically provide the RDF graph for each PG text (if possible), and search that graph for language information more reliable than the stuff inside the header.
Leonard Richardson
2013-11-30 17:20:57 -05:00
7df369a250 Added a lot of other ways for the etext part of a book to end.
Leonard Richardson
2013-11-30 09:34:55 -05:00
abbec27c53 Added a Markov generator that tried to keep brackets and quotes balanced.
Leonard Richardson
2013-11-29 16:35:18 -05:00
49ce43e570 Made the API for the Markov chain module consistent with the API for the Queneau assembly module.
Leonard Richardson
2013-11-29 09:00:19 -05:00
a8dc086fa6 Tweaked the ebooks algorithm and added a Markov chain algorithm.
Leonard Richardson
2013-11-28 19:54:00 -05:00
b5276928ab Improved performance a bit and increased the preference for lines that begin with capital letters.
Leonard Richardson
2013-11-27 13:22:29 -05:00
71565c638d Added a mapping of old-style Project Gutenberg filenames to new-style ebook IDs.
Leonard Richardson
2013-11-27 10:08:05 -05:00
bf6462653c Made the ebook generator go through the pre-2007 ebooks.
Leonard Richardson
2013-11-26 18:48:22 -05:00
1bb5220fcc Try to get all the way through the corpus.
Leonard Richardson
2013-11-26 18:08:49 -05:00
7fbc3d47b6 Derive encoding from filename if possible.
Leonard Richardson
2013-11-26 15:23:20 -05:00
7b4733e236 Derive encoding from filename if possible.
Leonard Richardson
2013-11-26 15:21:56 -05:00
9dbd816e8d We can now parse every plain-text document in the Project Gutenberg DVD.
Leonard Richardson
2013-11-26 15:18:45 -05:00
b66a5240e8 Added the ability to extract the 'best' version of each text on a Project Gutenberg CD or DVD.
Leonard Richardson
2013-11-26 12:36:07 -05:00
01bb4d70f7 Fixed text in use.
Leonard Richardson
2013-11-26 08:30:04 -05:00
9479250a6c Added english.py.
Leonard Richardson
2013-11-26 08:27:55 -05:00
e02ea08a3b Remove obviously unbalanced quote marks.
Leonard Richardson
2013-11-26 08:24:34 -05:00
84431d7f4e Added a number of horse_ebooks-like tweaks to improve the quality of the selected quotes.
Leonard Richardson
2013-11-26 08:22:33 -05:00
97df55de06 Added a basic Project Gutenberg tool and an exciting new text sampler that supplies @horse_ebooks-style hilarity.
Leonard Richardson
2013-11-25 23:01:28 -05:00
ab1fb1b909 Tweaked probabilities and improved the looks of gradients.
Leonard Richardson
2013-11-25 08:54:30 -05:00
be43fe753a Added gibberish gradients.
Leonard Richardson
2013-11-24 22:41:45 -05:00
d0ed2cae39 Added more block and box drawing charsets.
Leonard Richardson
2013-11-23 12:44:19 -05:00
160aa33939 Added some alphanumeric mosaic sets.
Leonard Richardson
2013-11-14 15:13:43 -05:00
dbd5822a7b Bring back the 'choice among Latin alphabets', using a wide variety of cool alphabets assembled by @tef for the unicodefuckery project.
Leonard Richardson
2013-11-14 14:58:53 -05:00
e8493734c7 Added composite gibberish, which is like two, two, two gibberishes in one!
Leonard Richardson
2013-11-14 14:37:59 -05:00
8677df6f8e Bumped up limited vocabularies.
Leonard Richardson
2013-10-18 17:18:15 -04:00
f36f26cbc7 Fixed 'choose one alphabet.'
Leonard Richardson
2013-10-18 17:09:57 -04:00
615ce1fd19 Un-inverted inverted logic.
Leonard Richardson
2013-10-18 16:56:57 -04:00
c6713ccfdd Made short strings a little longer.
Leonard Richardson
2013-10-18 16:56:34 -04:00
a8989b6d9a Added a crossout alphabet.
Leonard Richardson
2013-10-18 14:50:37 -04:00
bb162595e0 Tweaked lengths and added a symbology alphabet.
Leonard Richardson
2013-10-18 14:43:16 -04:00
2faee6e02c Added fill mosaic as a glitch charset.
Leonard Richardson
2013-10-18 14:18:39 -04:00
e856df22df Exclude the large CJK charsets from the big jumble of linguistic charsets to avoid them dominating.
Leonard Richardson
2013-10-18 14:13:29 -04:00
35d758b1c0 Added shading mosaic as a glitch character.
Leonard Richardson
2013-10-18 10:44:49 -04:00
6c85851522 Added more mouths to the emoticon generator.
Leonard Richardson
2013-10-18 10:14:44 -04:00
34eda12841 Added video game cheat codes.
Leonard Richardson
2013-10-18 09:52:46 -04:00
c791dc3432 Added a generator that takes from ALL the linguistic alphabets.
Leonard Richardson
2013-10-18 09:43:43 -04:00
554797fbd8 Added some more charsets and a Gibberish generator that combines linguistic alphabets.
Leonard Richardson
2013-10-18 09:12:33 -04:00
906e44dbda Tweaked probabilities.
Leonard Richardson
2013-10-08 09:11:09 -04:00
87aa9e7275 Class name fix
Richard Magahiz
2013-10-08 07:41:02 -04:00
1dec9548d1 Added two-dimensional mosaics and game boards.
Leonard Richardson
2013-10-07 09:29:18 -04:00
7c26796e1c Added dots as a glitch alphabet.
Leonard Richardson
2013-10-07 08:57:25 -04:00
66069ff150 Added dots as a glitch alphabet.
Leonard Richardson
2013-10-07 08:53:36 -04:00
96dce8e9ee Added custom alphabets to coexist alongside the standard code pages.
Leonard Richardson
2013-10-07 08:48:35 -04:00
d27eed43ce Added custom alphabets to coexist alongside the standard code pages.
Leonard Richardson
2013-10-07 08:41:07 -04:00
63befc30b4 Switched to a 'Monster Manual' style randomness generator. Added many more shape-based and mosaic charsets.
Leonard Richardson
2013-10-06 13:34:59 -04:00
6ebf24256a Added some custom character sets for shapes and mosaics.
Leonard Richardson
2013-10-06 10:19:27 -04:00
ac8b3f12c5 Fixed article bug.
Leonard Richardson
2013-09-30 22:11:57 -04:00
6349329a69 Added dinosaur names (courtesy Rob Dubbin/Wikipedia) and an example script for generating dinosaur names.
Leonard Richardson
2013-09-30 22:08:16 -04:00
7d01b09953 Added dinosaur names (courtesy Rob Dubbin/Wikipedia) and an example script for generating dinosaur names.
Leonard Richardson
2013-09-30 22:08:08 -04:00
60fe763727 Added limited-vocabulary and emoticon-based gibberish.
Leonard Richardson
2013-09-30 09:47:37 -04:00
cd60b4006c Normalize generated words so that we count characters the same way Twitter does.
Leonard Richardson
2013-09-30 09:01:00 -04:00
fbfae11aaf Updated README.
Leonard Richardson
2013-09-29 13:48:06 -04:00
eaf5f4aca6 Fixed Twitter wrapper bug.
Leonard Richardson
2013-09-29 13:46:47 -04:00
ffceb082c7 Added example scripts for printing gibberish and corrupting text.
Leonard Richardson
2013-09-27 17:53:51 -04:00
88d17bbc50 Added a Corruptor class for corrupting text with diacritical marks.
Leonard Richardson
2013-09-27 17:30:18 -04:00
378c270f3e Added various word length algorithms.
Leonard Richardson
2013-09-27 17:13:25 -04:00
6459ae971b Create an overarching D&D-type table for rolling up tweet formats.
Leonard Richardson
2013-09-27 16:41:54 -04:00
e0661aedbe Added a way to pick a random language script and a way to make an alphabet just a little weirder.
Leonard Richardson
2013-09-27 16:30:46 -04:00
196902a434 Added a way to pick a random language script and a way to make an alphabet just a little weirder.
Leonard Richardson
2013-09-27 16:09:22 -04:00
7d32f9ff3b Corrected my misunderstanding of the Han unification plane.
Leonard Richardson
2013-09-27 15:49:56 -04:00
f69720bea4 Refactored the Weird Twitter code and added variants for Japanese text, CJK ideographs, and math.
Leonard Richardson
2013-09-27 15:48:19 -04:00
4fe10cfcb5 Got the alphabets organized, wrote a generator for 'Weird Twitter'-type alphabets.
Leonard Richardson
2013-09-27 14:45:28 -04:00
241f93fa55 Added a dataset containing all the named characters in Unicode, organized by code sheet; and a script that generates Unicode art.
Leonard Richardson
2013-09-26 20:28:25 -04:00
7c0d49d2b9 Mention other sample scripts.
Leonard Richardson
2013-07-20 21:53:32 -04:00
b34d6b63ac Removed extraneous :s
Leonard Richardson
2013-07-20 21:46:20 -04:00
e0ce5af50c Removed huge amount of board game data to make the repo easier to check out.
Leonard Richardson
2013-07-20 21:16:03 -04:00
e0dbffd461 Fixed the dialogue assembler, and added another example--transcripts from the Apollo 11 mission.
Leonard Richardson
2013-07-20 21:07:44 -04:00
03e1609555 Added a board game example, and a composite assembler which chooses between other assemblers in proportion to how many items each has been given.
Leonard Richardson
2013-07-12 14:53:23 -04:00
f5aac793f6 Fixed bug that limited sonnets to 10 lines.
Leonard Richardson
2013-07-12 09:00:15 -04:00
76446d52ff Added Shakespearean sonnets.
Leonard Richardson
2013-07-11 23:12:47 -04:00
88e9cf00ef Added __init__.py
Leonard Richardson
2013-07-09 08:23:05 -04:00
c6e8dfc4b2 Updated readme.
Leonard Richardson
2013-07-09 08:20:03 -04:00
b71c8dddfd Get some examples up.
Leonard Richardson
2013-07-09 08:08:26 -04:00