Commit Graph

  • cfb1d23cb5 Merge branch 'master' of https://github.com/leonardr/olipy Leonard Richardson 2013-12-29 11:23:24 -05:00
  • 727ad1d1e2 Added list of bad words. Leonard Richardson 2013-12-29 11:23:09 -05:00
  • acb0b6ad3b Added some word lists from COHA. Leonard Richardson 2013-12-29 11:19:13 -05:00
  • 4d6125816c Merge branch 'master' of https://github.com/leonardr/olipy Leonard Richardson 2013-12-25 23:46:21 -05:00
  • 232feabb24 Iterating over the year directories is (mostly? entirely?) redundant with the numbered directories. Leonard Richardson 2013-12-25 22:06:27 -05:00
  • cb3e7a950b Added Rosetta stone gibberish. Leonard Richardson 2013-12-18 09:13:28 -05:00
  • 1eb08cb1ff Yield the first token. Leonard Richardson 2013-12-16 12:07:40 -05:00
  • 5858e73870 Added a SentenceAssembler to queneau. Leonard Richardson 2013-12-15 10:58:16 -05:00
  • d2258334cf Added a couple more links. Leonard Richardson 2013-12-15 10:34:11 -05:00
  • 34ec933758 Improved README Leonard Richardson 2013-12-15 10:30:25 -05:00
  • 1ed840ab70 Added a class for diagnosing Unicode strings and a few more alphabets. Leonard Richardson 2013-12-15 10:16:06 -05:00
  • 040346dbf8 Merge branch 'master' of https://github.com/leonardr/olipy Leonard Richardson 2013-12-04 09:19:46 -05:00
  • daa063a32e Made example filename more generic. Leonard Richardson 2013-12-04 09:19:17 -05:00
  • 8b34e9f8d1 Added disclaimer. Leonard Richardson 2013-12-03 17:08:46 -05:00
  • 277f8851f5 Added a very simple scheduler because I'm sick of dealing with huge standard deviations. Leonard Richardson 2013-12-03 17:01:25 -05:00
  • 36c297f8a9 Added a port of the word filter. Leonard Richardson 2013-12-01 21:22:44 -05:00
  • 1d74cebcad Added as much of a modifier alphabet as I could find. Leonard Richardson 2013-12-01 06:57:23 -05:00
  • 75506ae8e4 Added another indicator of the start of the text. Leonard Richardson 2013-11-30 18:41:56 -05:00
  • 0abe661395 Correctly identify the etext ID from a numeric filename. Leonard Richardson 2013-11-30 18:14:26 -05:00
  • d0cdf7f945 Automatically provide the RDF graph for each PG text (if possible), and search that graph for language information more reliable than the stuff inside the header. Leonard Richardson 2013-11-30 17:20:57 -05:00
  • 7df369a250 Added a lot of other ways for the etext part of a book to end. Leonard Richardson 2013-11-30 09:34:55 -05:00
  • abbec27c53 Added a Markov generator that tried to keep brackets and quotes balanced. Leonard Richardson 2013-11-29 16:35:18 -05:00
  • 49ce43e570 Made the API for the Markov chain module consistent with the API for the Queneau assembly module. Leonard Richardson 2013-11-29 09:00:19 -05:00
  • a8dc086fa6 Tweaked the ebooks algorithm and added a Markov chain algorithm. Leonard Richardson 2013-11-28 19:54:00 -05:00
  • b5276928ab Improved performance a bit and increased the preference for lines that begin with capital letters. Leonard Richardson 2013-11-27 13:22:29 -05:00
  • 71565c638d Added a mapping of old-style Project Gutenberg filenames to new-style ebook IDs. Leonard Richardson 2013-11-27 10:08:05 -05:00
  • bf6462653c Made the ebook generator go through the pre-2007 ebooks. Leonard Richardson 2013-11-26 18:48:22 -05:00
  • 1bb5220fcc Try to get all the way through the corpus. Leonard Richardson 2013-11-26 18:08:49 -05:00
  • 7fbc3d47b6 Derive encoding from filename if possible. Leonard Richardson 2013-11-26 15:23:20 -05:00
  • 7b4733e236 Derive encoding from filename if possible. Leonard Richardson 2013-11-26 15:21:56 -05:00
  • 9dbd816e8d We can now parse every plain-text document in the Project Gutenberg DVD. Leonard Richardson 2013-11-26 15:18:45 -05:00
  • b66a5240e8 Added the ability to extract the 'best' version of each text on a Project Gutenberg CD or DVD. Leonard Richardson 2013-11-26 12:36:07 -05:00
  • 01bb4d70f7 Fixed text in use. Leonard Richardson 2013-11-26 08:30:04 -05:00
  • 9479250a6c Added english.py. Leonard Richardson 2013-11-26 08:27:55 -05:00
  • e02ea08a3b Remove obviously unbalanced quote marks. Leonard Richardson 2013-11-26 08:24:34 -05:00
  • 84431d7f4e Added a number of horse_ebooks-like tweaks to improve the quality of the selected quotes. Leonard Richardson 2013-11-26 08:22:33 -05:00
  • 97df55de06 Added a basic Project Gutenberg tool and an exciting new text sampler that supplies @horse_ebooks-style hilarity. Leonard Richardson 2013-11-25 23:01:28 -05:00
  • ab1fb1b909 Tweaked probabilities and improved the looks of gradients. Leonard Richardson 2013-11-25 08:54:30 -05:00
  • be43fe753a Added gibberish gradients. Leonard Richardson 2013-11-24 22:41:45 -05:00
  • d0ed2cae39 Added more block and box drawing charsets. Leonard Richardson 2013-11-23 12:44:19 -05:00
  • 160aa33939 Added some alphanumeric mosaic sets. Leonard Richardson 2013-11-14 15:13:43 -05:00
  • dbd5822a7b Bring back the 'choice among Latin alphabets', using a wide variety of cool alphabets assembled by @tef for the unicodefuckery project. Leonard Richardson 2013-11-14 14:58:53 -05:00
  • e8493734c7 Added composite gibberish, which is like two, two, two gibberishes in one\! Leonard Richardson 2013-11-14 14:37:59 -05:00
  • 8677df6f8e Bumped up limited vocabularies. Leonard Richardson 2013-10-18 17:18:15 -04:00
  • f36f26cbc7 Fixed 'choose one alphabet.' Leonard Richardson 2013-10-18 17:09:57 -04:00
  • 615ce1fd19 Un-inverted inverted logic. Leonard Richardson 2013-10-18 16:56:57 -04:00
  • c6713ccfdd Made short strings a little longer. Leonard Richardson 2013-10-18 16:56:34 -04:00
  • a8989b6d9a Added a crossout alphabet. Leonard Richardson 2013-10-18 14:50:37 -04:00
  • bb162595e0 Tweaked lengths and added a symbology alphabet. Leonard Richardson 2013-10-18 14:43:16 -04:00
  • 2faee6e02c Added fill mosaic as a glitch charset. Leonard Richardson 2013-10-18 14:18:39 -04:00
  • e856df22df Exclude the large CJK charsets from the big jumble of linguistic charsets to avoid them dominating. Leonard Richardson 2013-10-18 14:13:29 -04:00
  • 35d758b1c0 Added shading mosaic as a glitch character. Leonard Richardson 2013-10-18 10:44:49 -04:00
  • 6c85851522 Added more mouths to the emoticon generator. Leonard Richardson 2013-10-18 10:14:44 -04:00
  • 34eda12841 Added video game cheat codes. Leonard Richardson 2013-10-18 09:52:46 -04:00
  • c791dc3432 Added a generator that takes from ALL the linguistic alphabets. Leonard Richardson 2013-10-18 09:43:43 -04:00
  • 554797fbd8 Added some more charsets and a Gibberish generator that combines linguistic alphabets. Leonard Richardson 2013-10-18 09:12:33 -04:00
  • 906e44dbda Tweaked probabilities. Leonard Richardson 2013-10-08 09:11:09 -04:00
  • 87aa9e7275 Class name fix Richard Magahiz 2013-10-08 07:41:02 -04:00
  • 1dec9548d1 Added two-dimensional mosaics and game boards. Leonard Richardson 2013-10-07 09:29:18 -04:00
  • 7c26796e1c Added dots as a glitch alphabet. Leonard Richardson 2013-10-07 08:57:25 -04:00
  • 66069ff150 Added dots as a glitch alphabet. Leonard Richardson 2013-10-07 08:53:36 -04:00
  • 96dce8e9ee Added custom alphabets to coexist alongside the standard code pages. Leonard Richardson 2013-10-07 08:48:35 -04:00
  • d27eed43ce Added custom alphabets to coexist alongside the standard code pages. Leonard Richardson 2013-10-07 08:41:07 -04:00
  • 63befc30b4 Switched to a 'Monster Manual' style randomness generator. Added many more shape-based and mosaic charsets. Leonard Richardson 2013-10-06 13:34:59 -04:00
  • 6ebf24256a Added some custom character sets for shapes and mosaics. Leonard Richardson 2013-10-06 10:19:27 -04:00
  • ac8b3f12c5 Fixed article bug. Leonard Richardson 2013-09-30 22:11:57 -04:00
  • 6349329a69 Added dinosaur names (courtesy Rob Dubbin/Wikipedia) and an example script for generating dinosaur names. Leonard Richardson 2013-09-30 22:08:16 -04:00
  • 7d01b09953 Added dinosaur names (courtesy Rob Dubbin/Wikipedia) and an example script for generating dinosaur names. Leonard Richardson 2013-09-30 22:08:08 -04:00
  • 60fe763727 Added limited-vocabulary and emoticon-based gibberish. Leonard Richardson 2013-09-30 09:47:37 -04:00
  • cd60b4006c Normalize generated words so that we count characters the same way Twitter does. Leonard Richardson 2013-09-30 09:01:00 -04:00
  • fbfae11aaf Updated README. Leonard Richardson 2013-09-29 13:48:06 -04:00
  • eaf5f4aca6 Fixed Twitter wrapper bug. Leonard Richardson 2013-09-29 13:46:47 -04:00
  • ffceb082c7 Added example scripts for printing gibberish and corrupting text. Leonard Richardson 2013-09-27 17:53:51 -04:00
  • 88d17bbc50 Added a Corruptor class for corrupting text with diacritical marks. Leonard Richardson 2013-09-27 17:30:18 -04:00
  • 378c270f3e Added various word length algorithms. Leonard Richardson 2013-09-27 17:13:25 -04:00
  • 6459ae971b Create an overarching D&D-type table for rolling up tweet formats. Leonard Richardson 2013-09-27 16:41:54 -04:00
  • e0661aedbe Added a way to pick a random language script and a way to make an alphabet just a little weirder. Leonard Richardson 2013-09-27 16:30:46 -04:00
  • 196902a434 Added a way to pick a random language script and a way to make an alphabet just a little weirder. Leonard Richardson 2013-09-27 16:09:22 -04:00
  • 7d32f9ff3b Corrected my misunderstanding of the Han unification plane. Leonard Richardson 2013-09-27 15:49:56 -04:00
  • f69720bea4 Refactored the Weird Twitter code and added variants for Japanese text, CJK ideographs, and math. Leonard Richardson 2013-09-27 15:48:19 -04:00
  • 4fe10cfcb5 Got the alphabets organized, wrote a generator for 'Weird Twitter'-type alphabets. Leonard Richardson 2013-09-27 14:45:28 -04:00
  • 241f93fa55 Added a dataset containing all the named characters in Unicode, organized by code sheet; and a script that generates Unicode art. Leonard Richardson 2013-09-26 20:28:25 -04:00
  • 7c0d49d2b9 Mention other sample scripts. Leonard Richardson 2013-07-20 21:53:32 -04:00
  • b34d6b63ac Removed extraneous :s Leonard Richardson 2013-07-20 21:46:20 -04:00
  • e0ce5af50c Removed huge amount of board game data to make the repo easier to check out. Leonard Richardson 2013-07-20 21:16:03 -04:00
  • e0dbffd461 Fixed the dialogue assembler, and added another example--transcripts from the Apollo 11 mission. Leonard Richardson 2013-07-20 21:07:44 -04:00
  • 03e1609555 Added a board game example, and a composite assembler which chooses between other assemblers in proportion to how many items each has been given. Leonard Richardson 2013-07-12 14:53:23 -04:00
  • f5aac793f6 Fixed bug that limited sonnets to 10 lines. Leonard Richardson 2013-07-12 09:00:15 -04:00
  • 76446d52ff Added Shakesperean sonnets. Leonard Richardson 2013-07-11 23:12:47 -04:00
  • 88e9cf00ef Added __init__.py Leonard Richardson 2013-07-09 08:23:05 -04:00
  • c6e8dfc4b2 Updated readme. Leonard Richardson 2013-07-09 08:20:03 -04:00
  • b71c8dddfd Get some examples up. Leonard Richardson 2013-07-09 08:08:26 -04:00
  • a47ffbb9a2 Initial commit leonardr 2013-05-22 09:42:41 -07:00