Help

Difference between revisions of "Download datasets"

(Created page with "=== Requirements === Java Runtime Environment. Ubuntu: <code>sudo apt-get install default-jre</code> === Install === * Open [https://github.com/MarcoFalke/wiki-java-tools/re...")
 
Line 36: Line 36:
 
=== Note ===
 
=== Note ===
 
There are also ways to use a category name as input, then to do API queries in order to get the list of files, download them. For a start point on API queries, see [https://codepen.io/hugolpz/pen/ByoKOK this pen].
 
There are also ways to use a category name as input, then to do API queries in order to get the list of files, download them. For a start point on API queries, see [https://codepen.io/hugolpz/pen/ByoKOK this pen].
 +
 +
[[Category:Lingua Libre:Help]]

Revision as of 17:47, 19 January 2021

Requirements

Java Runtime Environment.

Ubuntu: sudo apt-get install default-jre

Install

  • Open GitHub Wiki-java-tools project page.
  • Find the last Imker release.
  • Download Imker_vxx.xx.xx.zip archive
  • Extract the .zip file
  • Run as follow :
    • On Windows : start the .exe file.
    • On Ubuntu, open shell then : $java -jar imker-cli.jar -o ./myFolder/ -c 'CategoryName'

Manual

Imker -- Wikimedia Commons batch downloading tool.

Usage: java -jar imker-cli.jar [options]
  Options:
    --category, -c
       Use the specified Wiki category as download source.
    --domain, -d
       Wiki domain to fetch from
       Default: commons.wikimedia.org
    --file, -f
       Use the specified local file as download source.
  * --outfolder, -o
       The output folder.
    --page, -p
       Use the specified Wiki page as download source.

The download source must be ONE of the following:
 ↳ A Wiki category (Example: --category="Denver, Colorado")
 ↳ A Wiki page (Example: --page="Sandboarding")
 ↳ A local file (Example: --file="Documents/files.txt"; One filename per line!)

Note

There are also ways to use a category name as input, then to do API queries in order to get the list of files, download them. For a start point on API queries, see this pen.