A kasida ta gaba zamuyi dubi ne akan Wikipedia2text. Wannan shi ne karamin rubutun Shell wanda zai bamu damar duba labaran Wikipedia daga na'ura mai kwakwalwa. Tare da shi kuma za mu iya buɗe labarin da aka zaɓa a cikin kowane mai bincike na rubutu. Aikace-aikace ne kwatankwacin Wikit, wanda shine aikace-aikace don bincika Wikipedia daga tashar don labarai da taƙaitawa, game da abin da muka riga muka buga labarin a cikin wannan shafin.
Yawancinmu ba da daɗewa ba ko kuma daga baya za mu ja Wikipedia. Ko dai neman bayanai game da kowane kamfani ko wasu bayanai game da kusan duk wani abin da ya zo a hankali ko yake bukatar tuntuba. Lokacin da muke bincike a cikin google, a tsoho mahaɗin zuwa Wikipedia yawanci yana cikin Top 5. Wikipedia tana da abubuwa kusan miliyan 40, kusan a cikin fiye da 299 harsuna daban daban. Fadin haka Wikipedia na Turanci shine mafi girma.
Wannan rubutun harsashi yi amfani da mai bincike na rubutu don tuntuba da wakiltar labaran Wikipedia. Za a buga fitarwa azaman daidaitaccen fitarwa. Shin a halin yanzu tana goyon bayan yaruka 30 na Wikipedia.
Kamar yadda na ce, wannan rubutun yana amfani da burauzan rubutu don tambaya da bayar da labarai na Wikipedia. Ta haka ne muna buƙatar shigar da kowane ɗayan masu zuwa yanayin rubutu masu bincike. Sannan zamu iya ci gaba da sanya Wikipedia2text. Muddin bamu girka burauzar don tashar ba, ba zata bamu damar yin bincike ba.
Yadda ake girka Wikipedia2text akan Ubuntu
A kan rarrabawar tushen Debian wannan kunshin an haɗa shi a cikin tsoho ajiya. Koyaya, za mu kuma iya shigar da shi a cikin sauran rarrabawa.
A cikin Debian, Ubuntu da abubuwan banbanci, za mu iya girka wannan rubutun ta amfani da mai sarrafa kunshin. Mun buɗe tashar (Ctrl + Alt + T) kuma kawai zamu rubuta umarnin mai zuwa:
sudo apt install wikipedia2text
Kodayake idan ba mu son shigar da komai to za mu kuma sami damar ma'ajiyar clone da matsar da Wikipedia2text file kawai zuwa / bin directory. Zamu iya barinshi da suna iri ɗaya ko kuma idan kuna son sanya shi gajere da sauƙin tunawa, sake masa suna kamar haka wikicli (ko wani sunan da kake so), kamar yadda zan yi a misali mai zuwa. Don yin wannan, muna buɗe tashar (Ctrl + Alt + T) kuma rubuta kowane ɗayan umarnin masu zuwa:
git clone https://github.com/chrisbra/wikipedia2text sudo mv wikipedia2text/wikipedia2text /bin/wiki-cli rm -Rf wikipedia2text/
Yadda ake amfani da Wikipedia2text
Duba taimakon Wikipedia2text
Lokacin da muka ƙaddamar da umarni ba tare da wata hujja ba, ta tsohuwa za a nuna mana shafin taimako na kayan aiki. A ciki zamu iya tuntuɓar dukkan zaɓuɓɓukan da zamu sami wadatar amfani dasu yayin ƙaddamar da rubutun.
Daga cikin su ina son haskaka wanda zai ba mu dama duba harsuna masu tallafi. Kamar yadda kake gani a cikin hoton da ya gabata, zai nuna mana kalmomin da zamu iya amfani dasu yayin tuntuɓar talifofin cikin yarukan daban-daban.
Don ƙaddamar da wannan rubutun kuma tuntuɓi taimakon, kawai za mu buɗe tashar (Ctrl + Alt + T) kuma buga:
wiki-cli
Karanta labarin
Don karanta kowane labarin Wikipedia, gudanar da umarni mai zuwa. Dole ne ku yi latsa sandar sararin samaniya don cigaba zuwa shafi na gaba daga labarin:
wiki-cli -p ubuntu
Karanta labari a cikin wani yare
Za mu iya karanta labarin a cikin wasu yarukan da Wikipedia ke goyan baya, wanda na riga na ambata layi a sama. A gaskiya rubutun yana tallafawa harsuna 30. Don yin wannan, gudanar da umarni mai zuwa. A cikin wannan misalin zamu gwada harshen Faransanci:
wiki-cli -pl fr arch linux
Bude wata kasida a cikin rubutun rubutun
Don buɗe labarin Wikipedia a rubutun da za mu girka a baya, gudanar da umarnin mai zuwa:
wiki-cli -po opensuse
Sami URL ɗin labarin
Idan kana so san adireshin tambaya, zaku iya samun wannan bayanin ta aiwatar da umarni mai zuwa:
wiki-cli -u debian
Cire Wikipedia2text
A yayin da muke amfani da kayan aiki don shigar da wannan rubutun akan kwamfutarmu, zamu iya cire shi cikin sauƙi kamar yadda muke yi koyaushe. Mun buɗe tashar (Ctrl + Alt + T) kuma rubuta umarnin mai zuwa a ciki:
sudo apt remove wikipedia2text
Idan muna so mu kalla a lambar tushe na wannan mai amfani, zamu iya ganin sa a cikin aikin shafin GitHub.