root/BADataMunger


Mode:

Legend:

Added
Modified
Copied or renamed
Rev Chgset Date Author Log Message
(edit) @1215 [1215] 10/25/07 17:04:33 thomase two stylesheets whereby one of our KML point files can be rendered into a …
(edit) @1180 [1180] 10/16/07 18:03:21 thomase add all required suppression directives to keep non-native places (i.e, …
(edit) @1179 [1179] 10/16/07 18:02:08 thomase support earthworks, quarries, walls and mines
(edit) @1178 [1178] 10/16/07 18:01:20 thomase generate suppression directives for gismixer from directory files
(edit) @1177 [1177] 10/16/07 18:00:45 thomase provide a more flexible range of suppression options when evaluating …
(edit) @1159 [1159] 10/12/07 17:14:58 thomase doh
(edit) @1158 [1158] 10/12/07 17:11:00 thomase PleiadesEntity/extensions
(edit) @1157 [1157] 10/12/07 17:07:42 thomase created
(edit) @1156 [1156] 10/10/07 13:47:52 thomase config parameters for map 22
(edit) @1082 [1082] 08/27/07 14:53:00 thomase modify some logging and handle capitalization variation in feature types …
(edit) @1069 [1069] 08/23/07 17:53:39 thomase append references for feature names properly
(edit) @1059 [1059] 08/21/07 11:25:46 thomase markup subcomponents of bibliographic references/citations and attempt to …
(edit) @1058 [1058] 08/21/07 11:24:04 thomase process just the geography and dir stuff to produce frankenformat for …
(edit) @1057 [1057] 08/21/07 11:21:26 thomase add support for cleanup of xlink attributes
(edit) @1021 [1021] 08/15/07 17:59:18 thomase massive regular expression voodoo to insert tei bibliographic tagging for …
(edit) @951 [951] 08/10/07 05:12:11 thomase do just the bibliographic munging, separate from the place munging
(edit) @950 [950] 08/10/07 05:11:32 thomase placesaver.py has been using this for a while!
(edit) @949 [949] 08/10/07 05:09:12 thomase copy recordInfo nodes from 'library' mods to 'student' mods
(edit) @864 [864] 07/10/07 15:19:25 thomase minor: change warning message to info message
(edit) @863 [863] 07/10/07 14:23:11 thomase Add option via config file to suppress individual features.
(edit) @862 [862] 07/10/07 13:56:17 thomase handles the new "use case" that surfaced with Map 38: unlabeled point …
(edit) @858 [858] 06/22/07 15:01:35 thomase Handle multiple locations, types and approximation indicators per place.
(edit) @857 [857] 06/19/07 13:50:38 thomase Fix an xpath construction error in the mods mixing cascade.
(edit) @856 [856] 06/19/07 13:50:09 thomase Move control over mixing of GIS data from the pipeline level down to a new …
(edit) @847 [847] 06/18/07 16:23:12 thomase More aggressive error checking on the gis/dir mixing steps.
(edit) @846 [846] 06/18/07 16:11:53 thomase entering disambiguator numbers correctly is a good idea
(edit) @845 [845] 06/18/07 15:11:41 thomase Fixed logic bug that prevented unlocated places from getting written to …
(edit) @844 [844] 06/17/07 06:50:16 thomase Incorporate the mods mixing process (enhancing the records with data from …
(edit) @843 [843] 06/15/07 16:26:34 thomase Magically expand shorthand references to RE, NPauly, KlPauly? and PECS
(edit) @842 [842] 06/15/07 15:41:33 thomase All certainty measures, plus name-wise references.
(edit) @841 [841] 06/14/07 16:51:55 thomase properly handle unlocateds and falsae
(edit) @840 [840] 06/14/07 16:41:57 thomase Sane dirpath specification and data mixing.
(edit) @837 [837] 06/14/07 13:56:03 thomase properly write classicationSection for feature names (with an internal …
(edit) @836 [836] 06/14/07 12:09:50 thomase Save place data to xml
(edit) @827 [827] 06/13/07 15:38:37 thomase gotta have config files!
(edit) @826 [826] 06/12/07 16:54:45 thomase rudimentary and buggy mixing map data with directory data
(edit) @825 [825] 06/12/07 14:10:41 thomase Saving full place information using the Pleiades frankenformat. Partial …
(edit) @824 [824] 06/12/07 14:08:16 thomase Better xpath construction using namespaces. Copy all the nodes we need …
(edit) @823 [823] 06/12/07 14:07:16 thomase Changed namespace cleanup calls to use the generic one for BADataMunger, …
(edit) @816 [816] 05/29/07 12:38:15 thomase one transform to rule them all
(edit) @815 [815] 05/29/07 12:36:22 thomase support for the TEI namespace
(edit) @814 [814] 05/25/07 17:41:03 thomase all sorts of nifty stuff to deal with names; needs more testing
(edit) @813 [813] 05/25/07 17:40:39 thomase pick up all the details from the library copy
(edit) @812 [812] 05/23/07 13:13:23 thomase added horizontal ellipsis to the list of things that gets normalized, and …
(edit) @799 [799] 04/30/07 13:04:18 thomase Parsing tables by type and handling name variants.
(edit) @798 [798] 04/30/07 13:02:45 thomase boundary condition = strip
(edit) @796 [796] 04/25/07 17:55:57 thomase First steps in parsing the directory tables.
(edit) @795 [795] 04/25/07 17:29:30 thomase Identify the directory listing tables and organize them for further …
(edit) @794 [794] 04/25/07 16:46:15 thomase Add saving of biblio in mods format as part of the "cycle".
(edit) @793 [793] 04/25/07 16:45:43 thomase Better trapping and reporting of failure conditions when trying to match.
(edit) @792 [792] 04/25/07 16:44:58 thomase Handle the case of a directory listing that contains no abbreviation …
(edit) @791 [791] 04/25/07 13:36:49 thomase Added more checks to make title matching more robust, yet more flexible.
(edit) @790 [790] 04/25/07 13:36:26 thomase Fixed bad code that was borking html character entities for characters …
(edit) @789 [789] 04/25/07 13:35:25 thomase Fixed bad code that was borking html character entities for characters …
(edit) @788 [788] 04/24/07 16:47:30 thomase Take a modsCollection file produced through biblioextraction and enrich it …
(edit) @787 [787] 04/19/07 12:44:54 thomase Clean up namespace mess created by lxml etree.
(edit) @786 [786] 04/18/07 07:36:53 thomase Save extracted bibliographic works to a MODS file as a single …
(edit) @785 [785] 04/18/07 07:36:09 thomase Get the works in the "Abbreviations" table in the directory listing div.
(edit) @784 [784] 04/16/07 17:39:26 thomase Strip unwanted Word formatting. Normalize non-breaking hyphens and spaces …
(edit) @777 [777] 04/12/07 15:00:25 thomase Strip all stylistic components and suppress some pointless elements to get …
(edit) @772 [772] 04/07/07 07:54:20 thomase String together a series of transformations and operations to munge a …
(edit) @771 [771] 04/06/07 15:31:48 thomase Moved remotely
(edit) @770 [770] 04/06/07 15:31:32 thomase Moved remotely
(edit) @769 [769] 04/06/07 15:31:18 thomase Moved remotely
(add) @768 [768] 04/06/07 15:31:04 thomase Created folder remotely
Note: See TracRevisionLog for help on using the revision log.