Split-Wikipedia

1.6

Splits a Wikipedia XML database dump into text-only articles. Articles are placed
in an "Articles" directory, which is further divided into subdirectories of 5,000
articles each.
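
The layout described above can be illustrated with a minimal sketch. It is not the script's actual implementation: the helper names Get-SafeFileName and Get-ArticleDirectory, the inline sample XML, and the temporary output path are all assumptions made for illustration, and the conversion of wiki markup to plain text is not shown.

```powershell
# Hypothetical helper: replace characters that are invalid in file names.
function Get-SafeFileName {
    param([string] $Title)
    $invalid = [System.IO.Path]::GetInvalidFileNameChars()
    $chars = $Title.ToCharArray() | ForEach-Object {
        if ($invalid -contains $_) { '_' } else { $_ }
    }
    -join $chars
}

# Hypothetical helper: map a zero-based article index to its subdirectory,
# so that each subdirectory holds at most $BatchSize articles.
function Get-ArticleDirectory {
    param(
        [int] $Index,
        [string] $Root = 'Articles',
        [int] $BatchSize = 5000
    )
    $bucket = [int][math]::Floor([double]$Index / $BatchSize)
    Join-Path $Root $bucket
}

# Tiny stand-in for a real dump (assumed shape: page > title, page > revision > text).
$sample = @'
<mediawiki>
  <page><title>AC/DC</title><revision><text>Rock band.</text></revision></page>
  <page><title>Zebra</title><revision><text>Striped animal.</text></revision></page>
</mediawiki>
'@

# Write into a temporary "Articles" directory so the sketch is safe to run anywhere.
$root = Join-Path ([System.IO.Path]::GetTempPath()) 'Articles'

# XmlReader streams the input instead of loading it all into memory,
# which matters for a multi-gigabyte dump.
$reader = [System.Xml.XmlReader]::Create([System.IO.StringReader]::new($sample))
$index = 0
try {
    while ($reader.ReadToFollowing('page')) {
        [void] $reader.ReadToFollowing('title')
        $title = $reader.ReadElementContentAsString()
        [void] $reader.ReadToFollowing('text')
        $text = $reader.ReadElementContentAsString()

        $dir = Get-ArticleDirectory -Index $index -Root $root
        [void] (New-Item -ItemType Directory -Path $dir -Force)
        $file = Join-Path $dir ((Get-SafeFileName $title) + '.txt')
        Set-Content -LiteralPath $file -Value $text -Encoding UTF8
        $index++
    }
}
finally {
    $reader.Dispose()
}
```

With this scheme, articles 0 through 4,999 land in subdirectory 0, articles 5,000 through 9,999 in subdirectory 1, and so on. Capping each directory at a fixed size keeps any single directory listing manageable when the dump contains millions of pages.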

Installation Options

Copy and paste the following command to install this package using PowerShellGet:

Install-Script -Name Split-Wikipedia

Copy and paste the following command to install this package using Microsoft.PowerShell.PSResourceGet:

Install-PSResource -Name Split-Wikipedia

You can deploy this package directly to Azure Automation. Note that deploying packages with dependencies will deploy all the dependencies to Azure Automation.

Manually download the .nupkg file to your system's default download location. Note that the file won't be unpacked, and won't include any dependencies.

Package Details

Author(s)

  • Lee Holmes

Functions

GetSafeFilename

Dependencies

This script has no dependencies.

Version History

Version                 Downloads   Last updated
1.6 (current version)   698         12/12/2016
1.5                     26          12/10/2016
1.3                     23          12/9/2016
1.2                     22          12/9/2016
1.1                     25          12/9/2016
1.0                     25          12/9/2016