User:Jheald/todo
Helps
Some things to do
Short term
Railways
Maps
- Follow up with Maproom, https://en.wikipedia.org/wiki/User:Maproom/archive_7
- London boroughs
- Maps falling between two areas
- Map templates
- User:Jheald/map cataloguing
DACS
- DACS ID Search
get the update files to Magnus for MnM -- Done
50 or so old IDs still to track down. Reduced to 3 remaining
subclasses vs instance of
- follow up on qy results from Project Chat (tractors etc)
- terminal subclasses
tinyurl.com/y9wq25wd
- wine
- discussion of "index mineral"
- User:Jheald/subclasses
- metaclass diagram
- fix remaining food instances
cases of multiple listed building items linking to Commonscats
- follow up on results from qy last year
London Underground
- Circle line network query:
tinyurl.com/y8hmeb2d
-- rather a lot of connections missing
- Updated query, showing only current connections:
tinyurl.com/yaqrwtl6
- Lokal's grapher: Whole tube; DLR; Piccadilly line
- Uploaded from file -- but there are still issues
- CSV file of London connections -- http://markdunne.github.io/2016/04/10/The-London-Tube-as-a-Graph/
- Bonnie & Clyde issue
- Discussion on WD (Wikidata_talk:WikiProject_Railways#Bonnie_and_Clyde_on_the_London_Underground) and on en (en:Wikipedia_talk:WikiProject_London_Transport#Underground_stations_on_Wikidata_with_related_surface_stations)
- Compare also:
- Stations with split articles on different wikis
tinyurl.com/y8da3o9k
- "London group" main-line stations - state of play:
tinyurl.com/y8cm8hso
- Blackfriars and Liverpool Street currently anomalous
- Query for LU stations with no English article
tinyurl.com/ycdzmpl8
- Query for stations marked as part of LU, but without adjacent station info:
tinyurl.com/ycdhxnvn
- Some are disused (mark with City Road tube station (Q1093983), end time perhaps)
- Some are shared with heavy rail
- LU stations with adjacency information not derived from recent upload
tinyurl.com/y8yj4r8k
- distinguish fast services & stopping services on eg Jubilee line
-- but is subclass of (P279) the right relationship ?
- can use connecting service (P1192) for express connections
- Distances from terminal Wikidata:Request_a_query#Virtual_graph_? (roughly), to add direction info to links.
- write-up for John Cummings.
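One rough way to approach the "distances from terminal" item above, if the adjacency data were exported from the query service rather than computed in SPARQL: a breadth-first search that counts hops from a chosen terminal, then labels each link as heading towards or away from it. This is only a sketch. The station names are placeholders, hop count stands in for real distance, and the `adjacency` dictionary is assumed to have been built from adjacent-station statements beforehand.

```python
from collections import deque


def distances_from(terminal, adjacency):
    """Hop count from `terminal` to every reachable station (breadth-first search)."""
    dist = {terminal: 0}
    queue = deque([terminal])
    while queue:
        station = queue.popleft()
        for neighbour in adjacency.get(station, ()):
            if neighbour not in dist:
                dist[neighbour] = dist[station] + 1
                queue.append(neighbour)
    return dist


def direction(a, b, dist):
    """Label the link a -> b relative to the terminal; None if undecidable."""
    if a not in dist or b not in dist or dist[a] == dist[b]:
        return None
    return "away" if dist[b] > dist[a] else "towards"


# Placeholder line with a short branch: T - A - B - C, plus A - X
adjacency = {
    "T": ["A"],
    "A": ["T", "B", "X"],
    "B": ["A", "C"],
    "C": ["B"],
    "X": ["A"],
}
dist = distances_from("T", adjacency)
```

On a loop such as the Circle line, two neighbours can be equally far from the terminal, which is why `direction` returns `None` rather than guessing.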
RAs
- Dermod O'Brien (Q5262733) -- Hon RA or not? See talk page on :en. RA can be contacted via https://www.royalacademy.org.uk/contact-us
- RA new IDs -- scrape to see how many existing IDs can be mapped to new ones, based on "named as" (which will need to be scraped). Also gather up other details. Start with RAs. Then matches from here. Then MnM catalogue.
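A possible shape for the name-based mapping step, once both sets of "named as" strings have been scraped: normalise the names, index the new IDs by normalised name, and accept a match only where exactly one new ID carries that name. This is a sketch under assumptions. The IDs below are invented placeholders, the two dictionaries stand in for whatever the scrape produces, and the normalisation rule (strip accents, lower-case, collapse punctuation) is one plausible choice rather than a tested one.

```python
import re
import unicodedata
from collections import defaultdict


def normalise(name):
    """Strip accents, lower-case, and collapse punctuation and whitespace."""
    text = unicodedata.normalize("NFKD", name)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()


def match_ids(old, new):
    """Map old IDs to new IDs where the normalised name is unambiguous.

    `old` and `new` are dicts of ID -> scraped "named as" string.
    Returns (matched, unresolved): a dict old_id -> new_id, and a list of
    old IDs with zero or several candidates, left for manual checking.
    """
    by_name = defaultdict(list)
    for new_id, name in new.items():
        by_name[normalise(name)].append(new_id)

    matched, unresolved = {}, []
    for old_id, name in old.items():
        candidates = by_name.get(normalise(name), [])
        if len(candidates) == 1:
            matched[old_id] = candidates[0]
        else:
            unresolved.append(old_id)
    return matched, unresolved


# Invented placeholder IDs, for illustration only
old = {"old-1": "Dermod O'Brien", "old-2": "John Smith", "old-3": "Nobody Known"}
new = {"new-1": "Dermod O’Brien", "new-2": "John Smith", "new-3": "John  Smith"}
matched, unresolved = match_ids(old, new)
```

Ambiguous names are deliberately left unresolved, so that they can go through the MnM catalogue instead of being guessed.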
Thesauruses
- User:Jheald/thesauruses -- attempt to survey what's out there
- User:Jheald/bmt
- Take out existing dubious values.
tinyurl.com/y83lzfpt
Add "historic state" values.
- User:Jheald/aat
- Use Open Refine on current state of costume part of AAT thesaurus.
- Extract Europeana costume thesaurus
- Wikidata:Property proposal/broader concept
- Experiment with using "Part of" as a qualifier. (advantage -- link search may still work)
- Should allow path queries
- Does it allow graph output, without messing up?
- AAT test query:
tinyurl.com/yctm5t2k
LoC thesauruses
- http://id.loc.gov/ and http://id.loc.gov/download/ -- Examples:
- Genre/Forms Thesaurus
- regions/nations, ethnic groups & relationships thesaurus
- download; put into tree
- set separator (P4155) to allow it in Library of Congress authority ID (P244) w/o breaking uniqueness
Library of Congress classifications headings
- example: Library of Congress Classification:Class P, subclass PD -- Germanic languages (Q6542661)
- scrape wikipages and add Library of Congress Classification (P1149) to items for the subjects identified
- qualify with object of statement has role (P3831) -> minimum (Q10585806) / maximum (Q10578722) if there's a range
- also include statement is subject of (P805) as qualifier, pointing to the item for the wiki article, for the top-level codes
- add main subject (P921) in reverse direction
- would also be good case to populate new "broader concept" qualifier, when available
Equivalent class
- equivalent class (P1709) etc -- time to bin these? -- instances: tinyurl.com/y7u98l93
- Do we have properties for all of these? eg http://heritagedata.org/live/schemes/eh_tbm/concepts/97716.html
- cf argument in Proposed Properties over schema ID
geoshapes
- DE election map:
tinyurl.com/y79k2q5a
- count by country:
tinyurl.com/yb6vxy33
- UK examples:
tinyurl.com/y92xpn7l
MSBI
- Worth looking up in MnM:
tinyurl.com/y8v6qsg4
- Now at Property_talk:P2914/missing
- Would be worth mining the dates of birth / dates of death some time.
Commons links
- Some sitelinks to galleries replaced by sitelinks to categories. All but about 4 should have P935s. -- discussion
- A few P935s added for dab pages. About 350 dab pages do have sitelinks and P935s; the majority do not. -- See here
- P373s -> sitelinks :
tinyurl.com/y9r72244 (query 1); tinyurl.com/y998lmnr (based on Multichill)
BHL
Properties to populate
Art UK painters list
en:Wikipedia:GLAM/Your_paintings/header
- Some Art UK identifiers have changed
Run validation script for existing identifiers -- Done
Try to find new identifiers, for old ones which have become invalid -- Done
- Fix templates on en-wiki : en:Template:Art UK bio
Remove old local data -- Done
Add category for links specified locally; and check -- Done
- Check other remaining links to old BBC site [1] Underway: list
- Need to check for Art UK links not templated Underway: list
- Roll out template to remaining artists -- help requested at PetScan
Add number of paintings, Art UK name/title to template (?); also other presentation modes
- Add info to verified identifiers
add quantity (P1114) and retrieved (P813) as qualifiers on all the identifier claims -- Done
check for no quantity: tinyurl.com/zcm84f8; no reference: tinyurl.com/z882kdz
add subject named as (P1810) to indicate name in database -- Done
replace quantity (P1114) with new property number of works (P3740) -- Done: tinyurl.com/jctdna5
- add family name (P734) -- though care needed with de, von, van de, etc.
- add birth date / death date, work period dates, with reference
- compare with dates from other sources.
Would be good if Art UK would syndicate their updates, to avoid the need to scrape -- suggested; not possible
- Reflect to tracking pages
Check for old Q-numbers in the list, that have since been merged/redirected: tinyurl.com/z6873fu
- Update identifiers that have changed, for Q-numbers in the list -- some may have been missed
- re-check for identifiers that exist on Wikidata ID, but not in the list -- this could be re-run
Remove duplicates, but regenerate ones which *do* have two identifiers
re-do counts
examine Error 404s
- Update changed identifiers that have no Wikidata ID
- cf Google search: perl lwp detecting redirected pages
- re-check for Commons cats
- Mix'n'match
- go through the new items that were added to the pages (to do from 1900)
- items with no RKDs (esp 1850s?)
- Update portal
- live bubble charts to show progress ? <-- probably not possible
- historical trend charts ?
- Scripts
work around WDQ problem -- Done
- automate ?
- Paintings -- try to match accession numbers in collection. <-- Multichill is on this
- Current Art UK painting links: tinyurl.com/zvac8ua Collections with accession numbers: tinyurl.com/hfx8ml4
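For the identifier re-checks noted above (changed identifiers, Error 404s, and the "detecting redirected pages" search), a Python equivalent of the perl lwp approach might look like the following. It is a sketch only: it uses the standard-library `urllib` with redirects switched off, so that a 3xx response surfaces as an error carrying its `Location` header. It assumes the site signals a changed identifier with an HTTP redirect, which is a guess about how the site behaves rather than something confirmed here.

```python
import urllib.error
import urllib.request


class NoRedirect(urllib.request.HTTPRedirectHandler):
    """Refuse to follow redirects, so 3xx responses are raised as HTTPError."""

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        return None


def classify(status, location=None):
    """Turn an HTTP status (and optional Location header) into a verdict."""
    if 300 <= status < 400 and location:
        return ("redirected", location)
    if status == 404:
        return ("missing", None)
    if 200 <= status < 300:
        return ("ok", None)
    return ("other", None)


def check_url(url, timeout=10):
    """Fetch `url` once, without following redirects, and classify the result."""
    opener = urllib.request.build_opener(NoRedirect)
    try:
        with opener.open(url, timeout=timeout) as response:
            return classify(response.status)
    except urllib.error.HTTPError as err:
        return classify(err.code, err.headers.get("Location"))
```

Running `check_url` over the identifier URLs would give one list of redirect targets to harvest as candidate new identifiers and another of 404s to examine by hand. A polite delay between requests is left out here.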
PASE
Check link syntax (email someone) -- contacted ? alternative found
Ask for URL form for numerical domesday person identifier -- contacted
Scrape templates and add statements <-- Andy Mabbett has already added ~ 900
check non-uniqueness <-- already checked by somebody else
- check if any more still to do -- eg examples where more than one PASE template was on the same page
- (but what then to do?)
- PASE Domesday person -- see Wikidata:Property proposal/PASE Domesday person ID
- PASE Domesday place -- see Wikidata:Property proposal/PASE Domesday place
Position within image
- Ask Shonagon for tool that will find eg P180 = cat, or P180 = random, then challenge the user to find it
- IIIF rendering has broken: phab:T89552 ; Example 1, Example 2
- Try to persuade somebody to implement the formatting for the link (post a bug ?)
- Image without border property: cf keep border property on Commons
Category analysis
- Motivation: Auto-populating of categories phab:T120439
- Motivation: Template to show what is (or maybe should be) in categories: cf c:Template:category contains, Commons VP presentation
- Specifying criteria: Wikidata:Property proposal/category contains
- see also User:Jheald/todo/categories -- existing "category combines"; characteristic category labels
- how to deal with criteria such as eg "19th century artists" ?
Commons
- Structured data
- Categories: advantages of creating CommonsData items for categories? what props might such items contain?
- Tricky cases, eg maps / engravings, where there may not be detailed Q-numbers on main wikidata
- Where are the gaps in the initial proposal?
- Comment at c:Commons_talk:Structured_data#Pages_update
- values of applies to part (P518) on depicts (P180)
tinyurl.com/ycu9jr9y
- Sitelinks & P373 survey
Update, work around WDQ
- Use WD triples service for large downloads ?
- Low-hanging fruit to add more P373s (see queries on query page)
- Problem: many of these time out (even ones that used to run)
- Sitelinked articles that are the target of a category's main topic (P301):
tinyurl.com/y7geofcn
(11,265)
- Galleries blocking category sitelinks: only 476 w/o P910 have an associated commonscat
tinyurl.com/ycrpjdfo
- Creator template information
- Dates ?
UK
- see User:Jheald/todo/UK
- cf Wikidata:WikiProject_UK_and_Ireland, Wikidata:WikiProject_Historical_Place
- User:Jheald/scratch/historical places - scratch page
- strange {...} behaviour: phab:T158648
- fill out data for existing items on historical adms -- eg items, P31s
- identifiers/external links:
- vision of britain (email Humphrey -- contacted) -- see Vision of Britain place ID (P3616), Vision of Britain unit ID (P3615)
- VoB place progress: User:Jheald/todo/VoB/places
- track VoB unit types vs Wikidata items -- see User:Jheald/todo/VoB/levels
- Open Domesday places -- OpenDomesday settlement ID (P3118) first upload done
- Placenames (KEPN) -- KEPN ID (P3639) first upload done
- Placenames (EPNS gazetteer) -- Survey of English Place-Names ID (P3627)
- VCH places -- British History Online VCH ID (P3628) -- also see discussion re bibliographic items first upload done
- official identifiers for places (en:ONS coding system) -- GSS code (2011) (P836) -- data
- geonames first pass at data completion for civil parishes done
- add 'quantity' for county classes etc.
- Heritage collisions
- match Commons place subcats, by county
BL maps
** restart geoparse_update on November 9th ** -- done
- next steps
- cf also Wikidata:WikiProject_Maps / m:Wikimaps User Group
Wikidata matching
- Proposed: Wikidata:Property_proposal/Commons_map_category
- Start with a trial upload of county maps ? U.S. civil war ?
Prepare UK categories -- done
- Machine learning meet-up https://docs.google.com/document/d/1eHWcaFEPW8EGkoZrUwjugvMMaSMeBXU8AQQdW35wwSk/edit#
OS map series
- (Category: c:Category:Ordnance_Survey_1st_series_1:10560)
- Keep building galleries
- Add c:Template:Geographic_location for sheet-to-sheet navigation (use thumbnails)
- Categorise by village
- Add alt version gallery for jpgs / tiffs
Property / class browser
- tinyurl.com/h59hxvw -- start from here ?
Thoughts
- Recent changes on a particular group of pages, or to a particular identifier
Phabricator
- Access to categories from SPARQL: see phab:T157676
- Access to image sizes from SPARQL: see phab:T157798
- Problems searching reference URL (P854): see phab:T157811