www.wikidata.uk-ua.nina.az
Cya stattya ye sirim perekladom z inshoyi movi Mozhlivo vona stvorena za dopomogoyu mashinnogo perekladu abo perekladachem yakij nedostatno volodiye oboma movami Bud laska dopomozhit polipshiti pereklad listopad 2016 Cyu stattyu treba vikifikuvati dlya vidpovidnosti standartam yakosti Vikipediyi Bud laska dopomozhit dodavannyam dorechnih vnutrishnih posilan abo vdoskonalennyam rozmitki statti traven 2016 Cya stattya mistit tekst sho ne vidpovidaye enciklopedichnomu stilyu Bud laska dopomozhit udoskonaliti cyu stattyu pogodivshi stil vikladu zi stilistichnimi pravilami Vikipediyi Mozhlivo storinka obgovorennya mistit zauvazhennya shodo potribnih zmin traven 2016 Vidobuvannya znan angl knowledge extraction ros izvlechenie znanij ye stvorennya znan zi strukturovanih relyacijnih baz danih XML i nestrukturovanih teksti dokumenti zobrazhennya dzherel Otrimane znannya povinno buti zberezhene u formati pridatnomu dlya avtomatichnogo chitannya ta interpretaciyi Takozh znannya povinni buti predstavleni takim chinom shob polegshiti logichnij visnovok Popri te sho ce metodichno shozhe na vidobuvannya informaciyi angl Data Mining NLP i ETL zberigannya danih osnovnimi kriteriyami ye te sho rezultat vidobuvannya vihodit za ramki stvorennya strukturovanoyi informaciyi abo peretvorennya yiyi v relyacijnu shemu Ce vimagaye abo povtornogo vikoristannya nayavnih formalnih znan povtorne vikoristannya identifikatoriv abo ontologij abo generaciyu shemi sho ce na osnovi vihidnih danih Grupa RDB2RDF W3C 1 v danij chas koli standartizuye movu dlya vidobutku RDF angl Resource Description Framework z relyacijnih baz danih She odnim populyarnim prikladom vidobutku znan ye peretvorennya Vikipediyi v strukturovani dani a takozh vidobrazhennya do nayavnih znan div DBpedia i Freebase Zmist 1 Oglyad 2 Prikladi 2 1 Zv yazuvannya ob yektiv 2 2 Relyacijni bazi danih v RDF 3 Vityag z strukturovanih dzherel v RDF 3 1 1 1 Vidobrazhennya z tablic BD Vidi na RDF Entities Vlastivosti Znachennya 3 2 Skladni vidobrazhennya relyacijnih baz danih v RDF 3 3 XML 3 4 Oglyad metodiv Instrumenti 4 Vityag z prirodnih dzherel movi 4 1 Tradicijne viluchennya informaciyi IE 4 2 Ontologiya na osnovi viluchennya informaciyi OBIE 4 3 Ontologiya navchannya ON 4 4 Semantichna anotaciya SA 4 5 instrumenti 5 Viyavlennya znan 5 1 Vhidni dani 5 2 vihidni formati 6 Div takozh 7 PrimitkiOglyad RedaguvatiPislya standartizaciyi mov predstavlennya znan takih yak RDF i OWL bagato doslidzhen bulo provedeno v oblasti osoblivo shodo peretvorennya relyacijnih baz danih v RDF zadachi identifikaciyi viyavlennya znan i navchannya ontologij Zagalnij proces vikoristovuye tradicijni metodi dobuvannya danih vijmannya peretvorennya i zavantazhennya ETL yaki peretvoryuyut dani z dzherel u strukturovani formati Nastupni kriteriyi mozhut buti vikoristani dlya klasifikaciyi pidhodiv v cij temi deyaki z nih vikoristovuyutsya lishe dlya vidobutku z relyacijnih baz danih 2 Dzherelo Dzherela danih yaki vikoristovuyutsya Tekst relyacijni bazi danih XML CSVEkspoziciya V yakomu viglyadi dobuvayutsya dani fajl ontologiyi semantichna baza danih Yak mozhna zrobiti zapit Sinhronizaciya Chi vikonuyetsya proces vidobutku znan odin raz dlya otrimannya dampa abo rezultat sinhronizuyetsya z dzherelom Statichna abo dinamichna sinhronizaciya Chi zapisuyutsya zmini rezultativ nazad dvonapravlena sinhronizaciya Povtorne vikoristannya slovnikiv Instrument zdatnij povtorno vikoristovuvati nayavni slovniki pri vidobutku Napriklad stovpchik tablici FirstName mozhut buti zistavlenni z foaf firstName Deyaki avtomatichni pidhodi ne zdatni zistavlyati slovniki Avtomatizaciya Stupin v yakij vidobutok vimagaye vtruchannya avtomatizovanij Dopomoga operatora GUI napivavtomatichnij avtomatichnij Potribna ontologiya predmetnoyi oblasti Potribno pobuduvati vidobrazhennya u vzhe zadanu ontologiyu Tak chinom sho abo stvoryuyetsya vidobrazhennya abo otrimuyetsya shema z dzherela navchannya ontologij en Prikladi RedaguvatiZv yazuvannya ob yektiv Redaguvati DBpedia Spotlight OpenCalais en Dandelion dataTXT nedostupne posilannya Zemanta API Extractiv Arhivovano 29 bereznya 2017 u Wayback Machine ta PoolParty Extractor Arhivovano 26 chervnya 2012 u Wayback Machine analizuyut vilnij tekst cherez rozpiznavannya imenovanih sutnostej a potim usuvaye neodnoznachnist kandidativ cherez rozpiznavannya imen en ta pov yazuye znajdeni ob yekti zi shovishem znan DBpedia 3 div demo Dandelion dataTXT DBpedia Spotlight abo PoolParty Extractor Prezident Obama Arhivovano 12 zhovtnya 2008 u Wayback Machine u seredu zaklikav Kongres prodovzhiti podatkovi pilgi dlya studentiv vklyuchenih do ekonomichnih stimuliv u minulomu roci stverdzhuyuchi sho politika zabezpechuye bilsh shedru dopomogu Yak prezident Obama pov yazanij z resursom DBpedia Linked data en dodatkova informaciya mozhe buti otrimana avtomatichno i Semantic Reasoner en mozhe napriklad zrobiti visnovok sho zgadana osoba maye tip osobi z vikoristannyam FOAF programne zabezpechennya i prezidentiv tipu Spoluchenih Shtativ za dopomogoyu YAGO Prikladi Metodi yaki rozpiznayut tilki ob yekti abo posilannya na statti Vikipediyi ta inshih cilej yaki ne zabezpechuyut podalshe viluchennya strukturovanih danih i formalnih znan Relyacijni bazi danih v RDF Redaguvati Triplify D2R servera Ultrawrap i Virtuoso RDF Pereglyadi instrumentiv yaki transformuyut relyacijni baz danih RDF V hodi cogo procesu voni dozvolyayut povtorno vikoristovuvati isnuyuchi slovniki i ontologiyi v procesi peretvorennya Pri peretvorenni tipovih relyacijnih tablic z im yam koristuvachiv odin stovpec napriklad name abo sukupnist stovpciv napriklad first name i last name povinen nadati URI stvorenogo ob yekta Zazvichaj vikoristovuyetsya pervinnij klyuch Kozhen drugij stovpec mozhe buti zaluchen yak vidnoshennya z ciyeyu organizaciyeyu Potim vikoristovuyutsya vlastivosti z formalno viznachenoyu semantikoyu i povtorno interpretuvati informaciyu Napriklad stovpec v tablici koristuvacha z im yam marriedTo mozhe buti viznachena yak simetrichne vidnoshennya i stovpchik homepage mozhe buti peretvorenij u vlasnist vid FOAF Slovnik nazivayetsya FOAF golovna storinka takim chinom kvalifikuye jogo yak funkcionalna vlastivist zvorotnogo Potim kozhen zapis tablici koristuvacha mozhe buti ekzemplyarom klasu FOAF Lyudina Ontologiya naselennya Krim znannya predmetnoyi oblasti u formi ontologiyi mozhut buti stvoreni z status id abo stvorenih vruchnu pravil yaksho status id 2 zapis vidnositsya do klasu Vchiteli abo semi avtomatichni metodi ontologiya navchannya Os priklad peretvorennya Im ya odruzhenij domashnya storinka statusPeter Mary http example org Peters page nedostupne posilannya z bereznya 2019 1Claus Eva http example org Claus page nedostupne posilannya z bereznya 2019 2 Peter marriedTo Mary marriedTo a owl SymmetricProperty Peter foaf homepage lt http example org Peters page gt Peter a foaf Person Peter a Student Claus a Teacher Vityag z strukturovanih dzherel v RDF Redaguvati1 1 Vidobrazhennya z tablic BD Vidi na RDF Entities Vlastivosti Znachennya Redaguvati Pri stvorenni vistavi RDB v problemnij oblasti vidpravnoyu tochkoyu chasto ye sutnist zv yazok diagrama ERD Yak pravilo kozhnim ob yektom predstavlenomu u viglyadi tablici bazi danih kozhnij atribut sutnosti staye stovpec v cij tablici i vidnosini mizh ob yektami poznachayutsya zovnishnimi klyuchami Kozhna tablicya yak pravilo viznachaye konkretnij klas suti kozhen stovpec odin z jogo atributiv Kozhen ryadok v tablici opisuye ekzemplyar sutnosti odnoznachno identifikuyetsya pervinnim klyuchem Ryadki tablici v sukupnosti opisuyut nabir sutnostej V ekvivalentnij RDF predstavlennya odnogo i togo zh naboru sutnostej Kozhen stovpec u tablici ye atributom tobto predikat Kozhne znachennya stovpcya ye znachennya atributa tobto ob yekt Kozhna klavisha ryadok yavlyaye soboyu identifikator ob yekta tobto sub yekt Kozhen ryadok ye ekzemplyarom sutnosti Kozhen ryadok ekzemplyar ob yekta predstavlena v RDF kolekciyeyu trijok iz zagalnim sub yektom identifikator ob yekta Takim chinom shob zrobiti ekvivalentne uyavlennya na osnovi RDF semantiki osnovne vidobrazhennya algoritmu bude viglyadati nastupnim chinom stvoriti RDFS klas dlya kozhnoyi tablici konvertuvati vsi pervinni klyuchi ta zovnishni klyuchi v IRIs priznachiti predikat IRI dlya kozhnogo stovpchika priznachiti RDF tip predikata dlya kozhnogo ryadka pov yazuyuchi jogo z ISS klasu IRI vidpovidaye tablici dlya kozhnogo stovpchika yakij ne ye ni chastinoyu pervinnogo abo zovnishnogo klyucha pobuduvati potrijnij yakij mistit pervinnij klyuch IRI yak sub yekta stovpec IRI yak predikata i znachennya stovpcya yak ob yekt Najpersha zgadka cogo osnovnogo abo pryamogo vidobrazhennya mozhna znajti v porivnyanni Tim Berners Li modeli ER do modeli RDF 4 Skladni vidobrazhennya relyacijnih baz danih v RDF Redaguvati 1 1 zgaduvane vishe nadaye zastarili dani u viglyadi RDF pryamim shlyahom dodatkovi utochnennya mozhut buti vikoristani dlya pidvishennya korisnosti RDF vivedennya vidpovidnogo do Use Cases Yak pravilo vtrachayetsya informaciya v procesi peretvorennya v sutnist zv yazok diagrami ERD dlya relyacijnih tablic podrobici mozhna znajti v ob yektno relyacijnomu impedansi i povinna buti zvorotnya inzheneriya Z konceptualnoyi tochki zoru pidhodi do vidobutku mozhut nadhoditi z dvoh napryamkiv Pershij napryamok namagayetsya vityagti abo diznatisya shemu OWL z danoyi shemi bazi danih Ranni pidhodi vikoristovuvali fiksovanu kilkist stvorenih vruchnu pravil vidobrazhennya dlya utochnennya vidobrazhennya 1 1 5 6 7 Bilsh skladni metodi z vikoristannyam evristiki abo algoritmiv navchannya shob viklikati shematichnu informaciyu metodi perekrivatisya z navchannyam ontologij U toj chas yak deyaki pidhodi namagayutsya vityagti informaciyu zi strukturi vlastivoyi shemoyu SQL 8 analizuyuchi napriklad zovnishni klyuchi inshi analizuyut zmist i znachennya v tablicyah dlya stvorennya konceptualnih iyerarhij 9 napriklad stovpci z dekilkoma znachennyami ye kandidatami dlya stanovlennya kategoriyi Drugij napryamok namagayetsya vidobraziti shemu i jogo vmist vzhe isnuyuchoyi ontologiyi predmetnoyi oblasti divis takozh virivnyuvannya ontologiyi Chasto odnak vidpovidna ontologiya ne isnuye i povinen buti stvorenij pershim XML Redaguvati Tak yak XML strukturovana u viglyadi dereva bud yaki dani mozhut buti legko predstavleni v RDF yakij strukturovanij u viglyadi grafika XML2RDF ye odnim iz prikladiv takogo pidhodu yakij vikoristovuye RDF porozhni vuzli i peretvoryuye XML elementi i atributi vlastivostej RDF Tema odnak ye bilsh skladnim yak i v razi relyacijnih baz danih U relyacijnoyi tablici pervinnij klyuch ye idealnim kandidatom shob stati predmetom zdobutih trijok XML element odnak mozhut buti peretvoreni v zalezhnosti vid kontekstu yak sub yekt predikat abo ob yekt potrijnij XSLT mozhe buti vikoristanij standartnij movu peretvorennya vruchnu peretvoriti XML v RDF Oglyad metodiv Instrumenti Redaguvati Name Data Source Data Exposition Data Synchronisation Mapping Language Vocabulary Reuse Mapping Automat Req Domain Ontology Uses GUIA Direct Mapping of Relational Data to RDF Arhivovano 9 travnya 2016 u Wayback Machine Relational Data SPARQL ETL dynamic N A false automatic false falseCSV2RDF4LOD Arhivovano 22 serpnya 2016 u Wayback Machine CSV ETL static RDF true manual false falseConvert2RDF Arhivovano 22 veresnya 2016 u Wayback Machine Delimited text file ETL static RDF DAML true manual false trueD2R Server Arhivovano 26 lyutogo 2012 u Wayback Machine RDB SPARQL bi directional D2R Map true manual false falseDartGrid RDB own query language dynamic Visual Tool true manual false trueDataMaster Arhivovano 21 travnya 2016 u Wayback Machine RDB ETL static proprietary true manual true trueGoogle Refine s RDF Extension CSV XML ETL static none semi automatic false trueKrextor XML ETL static xslt true manual true falseMAPONTO Arhivovano 2 chervnya 2016 u Wayback Machine RDB ETL static proprietary true manual true falseMETAmorphoses Arhivovano 17 kvitnya 2016 u Wayback Machine RDB ETL static proprietary xml based mapping language true manual false trueMappingMaster CSV ETL static MappingMaster true GUI false trueODEMapster RDB ETL static proprietary true manual true trueOntoWiki CSV Importer Plug in DataCube amp Tabular CSV ETL static The RDF Data Cube Vocaublary true semi automatic false truePoolparty Extraktor PPX Arhivovano 26 chervnya 2012 u Wayback Machine XML Text LinkedData dynamic RDF SKOS true semi automatic true falseRDBToOnto RDB ETL static none false automatic the user furthermore has the chance to fine tune results false trueRDF 123 Arhivovano 20 lipnya 2011 u Wayback Machine CSV ETL static false false manual false trueRDOTE RDB ETL static SQL true manual true trueRelational OWL RDB ETL static none false automatic false falseT2LD Arhivovano 20 lipnya 2011 u Wayback Machine CSV ETL static false false automatic false falseThe RDF Data Cube Vocabulary Multidimensional statistical data in spreadsheets Data Cube Vocabulary true manual falseTopBraid Composer CSV ETL static SKOS false semi automatic false trueTriplify Arhivovano 6 sichnya 2009 u Wayback Machine RDB LinkedData dynamic SQL true manual false falseUltrawrap RDB SPARQL ETL dynamic R2RML true semi automatic false trueVirtuoso RDF Views Arhivovano 5 veresnya 2014 u Wayback Machine RDB SPARQL dynamic Meta Schema Language true semi automatic false trueVirtuoso Sponger Arhivovano 5 veresnya 2014 u Wayback Machine structured and semi structured data sources SPARQL dynamic Virtuoso PL amp XSLT true semi automatic false falseVisAVis RDB RDQL dynamic SQL true manual true trueXLWrap Spreadsheet to RDF Arhivovano 8 travnya 2016 u Wayback Machine CSV ETL static TriG Syntax true manual false falseXML to RDF Arhivovano 11 travnya 2016 u Wayback Machine XML ETL static false false automatic false falseVityag z prirodnih dzherel movi RedaguvatiNajbilsha chastina informaciyi sho mistitsya v biznes dokumentah blizko 80 10 koduyetsya prirodnoyu movoyu i otzhe nestrukturovana Oskilki nestrukturovani dani ye dosit skladnim zavdannyam dlya viluchennya znan bilsh skladni metodi neobhidni yaki yak pravilo postavlyayut girshi rezultati v porivnyanni z nestrukturovanimi danimi Potencial dlya masovogo pridbannya zdobutih znan prote povinni kompensuvati pidvishenu skladnist i znizhennya yakosti vidobutku Nadali prirodni dzherela movi rozumiyutsya yak dzherela informaciyi de dani navedeni nestrukturovanim chinom yak zvichajnij tekst Yaksho danij tekst dodatkovo vbudovanij v rozmitki dokumenta e G HTML dokument zgadani sistemi zazvichaj vidalyayut elementi rozmitki avtomatichno Tradicijne viluchennya informaciyi IE Redaguvati Tradicijne viluchennya informaciyi 11 ye tehnologiyeyu obrobki prirodnoyi movi yake vityaguye informaciyu z tekstiv prirodnoyu movoyu yak pravilo i strukturi danih vidpovidnim chinom Vidi informaciyi sho pidlyagaye identifikovanogo povinni buti vkazani yak model pered pochatkom procesu tomu ves proces tradicijnogo viluchennya informaciyi zalezhnij IE rozdilenij na nastupni p yat pidzadach viznannya Nazvanij ob yekt VNO Rezolyuciya koreferentnosti RK Shablon budivelnogo elementu ShB Shablon stavlennya konstrukciyi ShS Shablon virobnictva scenarij ShV Zavdannya nazvanogo rozpiznavannya osobi ye viznati i klasifikuvati vsi nazvani ob yekti sho mistyatsya v teksti prisvoyennya imeni ob yekta do viznachenoyi kategoriyi Ce pracyuye shlyahom zastosuvannya gramatiki na osnovi metodiv abo statistichnih modelej Dozvil konferentnogsti viznachaye ekvivalentni ob yekti yaki buli viznani NEK v teksti Isnuyut dva vidi vidpovidnih vidnosin ekvivalentnosti Pershij z nih vidnositsya do vidnosin mizh dvoma riznimi predstavlenimi sub yektami napriklad IBM Europe i IBM a drugij do vidnosin mizh sub yektom i yih anaforicheskih posilan napriklad vin i IBM Obidva vidi mozhut buti viznani vidpovidno do rezolyuciyi koreferentnosti Pid chas budivnictva elementa shablonu sistema identifikuye IE opisovi vlastivosti sutnostej viznanih NEK i CO Ci vlastivosti vidpovidayut zvichajnim yakostyam yak chervonij abo velikij Shablonna konstrukciya vidnoshennya viznachaye vidnosini yaki isnuyut mizh elementami shablonu Ci vidnosini mozhut buti dekilkoh vidiv takih yak roboti z pitannya abo znahodzhennya z obmezhennyam sho obidva domeni i diapazon vidpovidayut sub yektam U shabloni scenariyu zdijsnyuyutsya podiyi yaki opisani v teksti voni budut viznacheni i strukturovani shodo osib viznanih Nyu Jorku i SO i vidnosin yaki buli viznacheni TR Ontologiya na osnovi viluchennya informaciyi OBIE Redaguvati Ontologiya na osnovi viluchennya informaciyi ye polem viluchennya informaciyi za dopomogoyu yakoyi shonajmenshe odna ontologiya vikoristovuyetsya dlya upravlinnya procesom dobuvannya informaciyi z tekstiv prirodnoyu movoyu Sistema OBIE vikoristovuye metodi tradicijnoyi viluchennya informaciyi dlya identifikaciyi ponyat ekzemplyari i vidnosini vikoristovuvanih ontologij v teksti yaki budut strukturovani z ontologiyeyu pislya procesu Takim chinom vhidna ontologiya ye modellyu informaciyi yaku neobhidno vityagti Ontologiya navchannya ON Redaguvati Vivchennya Ontologiyi ye avtomatichnim abo napivavtomatichnim stvorennya ontologij vklyuchayuchi vityag terminiv vidpovidnoyi oblasti vid prirodnogo tekstu movi Oskilki budivlya ontologij vruchnu ye nadzvichajno trudomistkim i zajmaye bagato chasu ye velika motivaciya dlya avtomatizaciyi procesu Semantichna anotaciya SA Redaguvati Pid chas semantichnoyi anotaciyi 12 tekst prirodnoyu movoyu dopovnyuyetsya metadanimi chasto predstavleni v RDFa yaki povinni skladati semantiku terminiv sho mistyatsya mashini zrozumilim U comu procesi yakij yak pravilo napivavtomatichna znannya vidobuvayetsya v tomu sensi sho zv yazok mizh leksichnih terminiv i ponyat napriklad z ontologiyeyu vstanovlyuyetsya Takim chinom znannya zdobuvayetsya sho znachennya termina v obroblenomu konteksti buv priznachenij i otzhe sens tekstu gruntuyetsya na mashinozchituvanih danih z mozhlivistyu zrobiti visnovki Semantichne anotuvannya yak pravilo rozdileni na nastupni dvi pidzadachi ekstrakciya Terminologiya Ob yekt zv yazuvannya Na rivni viluchennya terminologiyi leksichni termini z tekstu vityaguyutsya Dlya ciyeyi meti tokenizator viznachaye spochatku kordoni sliv i virishuye skorochiti Zgodom termini z tekstu yaki vidpovidayut koncepciyi vityaguyutsya za dopomogoyu leksikonu predmetno oriyentovanogo shob zv yazati ci po suti posilannya Po suti pov yazuyuchi 13 zv yazok mizh vidobutih leksichnih terminiv z vihidnogo tekstu i ponyat z ontologiyi abo bazi znan takih yak vstanovleno DBpedia Dlya cogo kandidati koncepciyi viyavlyayutsya vidpovidno v dekilkoh znachennyah termina za dopomogoyu leksikonu I nareshti kontekst terminiv analizuyetsya z metoyu viznachennya najbilsh pidhodyashoyi odnoznachnisti i priznachiti termin dlya pravilnoyi koncepciyi instrumenti Redaguvati Nastupni kriteriyi mozhut buti vikoristani dlya klasifikaciyi instrumentiv yaki vityaguyut znannya z tekstiv prirodnoyu movoyu Dzherelo Yaki formati vvedennya mozhut buti obrobleni za dopomogoyu instrumentu napriklad prostij tekst HTML abo PDF Dostup do Paradigm Chi mozhe instrument zapituvati dzherela danih abo potrebuye cilogo dampa dlya procesu ekstrakciyi Sinhronizaciya danih Ye rezultatom procesu ekstrakciyi sinhronizovanij z dzherelom Vikoristannya Output Ontology Chi zv yazani instrument rezultat z ontologiyeyu Mapping Avtomatizaciya Yak ce avtomatizovanij proces ekstrakciyi ruchnij napivavtomatichnij abo avtomatichnij vimagaye Ontologiya Chi potribno instrument ontologiyi dlya viluchennya Vikoristannya grafichnogo interfejsu koristuvacha Chi nadaye instrument grafichnij interfejs koristuvacha Pidhid Yakij pidhid IS OBIE PR abo SA vikoristovuyetsya instrumentom Vityagnuti Sutnosti Yaki tipi sutnostej napriklad nazvani osobi ponyattya abo vidnoshennya mozhut buti vilucheni za dopomogoyu instrumentu Zastosovuvani metodi Yaki metodi zastosovuyutsya napriklad NLP statistichni metodi klasterizaciya abo mashinnogo navchannya Vihid modeli Yaka model vikoristovuyetsya dlya predstavlennya rezultatu instrumentu e G RDF abo OWL Pidtrimuvani domeni Yaki domeni pidtrimuyutsya napriklad ekonomika abo biologiya Pidtrimuvani Movi Yaki movi mozhut buti obrobleni napriklad anglijsku chi nimecku U navedenij nizhche tablici harakterizuyetsya deyaki instrumenti dlya zdobuttya znan z prirodnih dzherel movi Nazva Dzherelo dostup do Paradigm Data Synchronization Uses Output Ontology Mapping Automation Requires Ontology Uses GUI Approach Extracted Entities Applied Techniques Output Model Supported Domains Supported LanguagesAeroText 14 plain text HTML XML SGML dump no yes automatic yes yes IE named entities relationships events linguistic rules proprietary domain independent English Spanish Arabic Chinese indonesianAlchemyAPI Arhivovano 1 serpnya 2013 u Wayback Machine 15 plain text HTML automatic yes SA multilingualANNIE Arhivovano 15 bereznya 2016 u Wayback Machine Arhivovano 15 bereznya 2016 u Wayback Machine 16 plain text dump yes yes IE finite state algorithms multilingualASIUM Arhivovano 11 chervnya 2017 u Wayback Machine Arhivovano 11 chervnya 2017 u Wayback Machine 17 plain text dump semi automatic yes OL concepts concept hierarchy NLP clusteringAttensity Exhaustive Extraction 18 automatic IE named entities relationships events NLPDandelion API Arhivovano 28 travnya 2016 u Wayback Machine plain text HTML URL REST no no automatic no yes SA named entities concepts statistical methods JSON domain independent multilingualDBpedia Spotlight 19 plain text HTML dump SPARQL yes yes automatic no yes SA annotation to each word annotation to non stopwords NLP statistical methods machine learning RDFa domain independent EnglishEntityClassifier eu Arhivovano 3 bereznya 2016 u Wayback Machine plain text HTML dump yes yes automatic no yes IE OL SA annotation to each word annotation to non stopwords rule based grammar XML domain independent English German DutchFRED Arhivovano 8 travnya 2016 u Wayback Machine Arhivovano 8 travnya 2016 u Wayback Machine 20 plain text PDF and Word via Sheldon Arhivovano 20 travnya 2016 u Wayback Machine dump REST yes automatic no yes OL IE SA concepts concept hierarchy frames events relationships named entities negation modality tense entity linking schema alignment sentiment via Sentilo Arhivovano 18 chervnya 2016 u Wayback Machine NLP SPARQL heuristical rules ontology design patterns RDF OWL Turtle NT JSON LD DAG diagrams domain independent English multilingual inputK Extractor 21 22 plain text HTML XML PDF MS Office e mail dump SPARQL yes yes automatic no yes IE OL SA concepts named entities instances concept hierarchy generic relationships user defined relationships events modality tense entity linking event linking sentiment NLP machine learning heuristic rules RDF OWL proprietary XML domain independent English SpanishiDocument Arhivovano 21 chervnya 2021 u Wayback Machine 23 HTML PDF DOC SPARQL yes yes OBIE instances property values NLP personal businessNetOwl Extractor Arhivovano 9 kvitnya 2016 u Wayback Machine Arhivovano 9 kvitnya 2016 u Wayback Machine 24 plain text HTML XML SGML PDF MS Office dump No Yes Automatic yes Yes IE named entities relationships events NLP XML JSON RDF OWL others multiple domains English Arabic Chinese Simplified and Traditional French Korean Persian Farsi and Dari Russian SpanishOntoGen Arhivovano 30 bereznya 2010 u Wayback Machine Arhivovano 30 bereznya 2010 u Wayback Machine 25 semi automatic yes OL concepts concept hierarchy non taxonomic relations instances NLP machine learning clusteringOntoLearn Arhivovano 9 serpnya 2017 u Wayback Machine Arhivovano 9 serpnya 2017 u Wayback Machine 26 plain text HTML dump no yes automatic yes no OL concepts concept hierarchy instances NLP statistical methods proprietary domain independent EnglishOntoLearn Reloaded Arhivovano 4 bereznya 2016 u Wayback Machine plain text HTML dump no yes automatic yes no OL concepts concept hierarchy instances NLP statistical methods proprietary domain independent EnglishOntoSyphon Arhivovano 10 bereznya 2016 u Wayback Machine Arhivovano 10 bereznya 2016 u Wayback Machine 27 HTML PDF DOC dump search engine queries no yes automatic yes no OBIE concepts relations instances NLP statistical methods RDF domain independent EnglishontoX Arhivovano 27 travnya 2016 u Wayback Machine 28 plain text dump no yes semi automatic yes no OBIE instances datatype property values heuristic based methods proprietary domain independent language independentOpenCalais Arhivovano 24 zhovtnya 2008 u Wayback Machine plain text HTML XML dump no yes automatic yes no SA annotation to entities annotation to events annotation to facts NLP machine learning RDF domain independent English French SpanishPoolParty Extractor Arhivovano 17 travnya 2016 u Wayback Machine Arhivovano 17 travnya 2016 u Wayback Machine 29 plain text HTML DOC ODT dump no yes automatic yes yes OBIE named entities concepts relations concepts that categorize the text enrichments NLP machine learning statistical methods RDF OWL domain independent English German Spanish FrenchRosoka Arhivovano 10 travnya 2016 u Wayback Machine Arhivovano 10 travnya 2016 u Wayback Machine 30 plain text HTML XML SGML PDF MS Office dump Yes Yes Automatic no Yes IE named entities relationships attributes concepts NLP XML JSON RDF others multiple domains Multilingual 230 SCOOBIE Arhivovano 11 chervnya 2018 u Wayback Machine plain text HTML dump no yes automatic no no OBIE instances property values RDFS types NLP machine learning RDF RDFa domain independent English GermanSemTag Arhivovano 11 chervnya 2017 u Wayback Machine Arhivovano 11 chervnya 2017 u Wayback Machine 31 32 HTML dump no yes automatic yes no SA machine learning database record domain independent language independentsmart FIX Arhivovano 17 travnya 2016 u Wayback Machine plain text HTML PDF DOC e Mail dump yes no automatic no yes OBIE named entities NLP machine learning proprietary domain independent English German French Dutch polishText2Onto Arhivovano 2 travnya 2016 u Wayback Machine Arhivovano 2 travnya 2016 u Wayback Machine 33 plain text HTML PDF dump yes no semi automatic yes yes OL concepts concept hierarchy non taxonomic relations instances axioms NLP statistical methods machine learning rule based methods OWL deomain independent English German SpanishText To Onto Arhivovano 15 travnya 2013 u Wayback Machine Arhivovano 15 travnya 2013 u Wayback Machine 34 plain text HTML PDF PostScript dump semi automatic yes yes OL concepts concept hierarchy non taxonomic relations lexical entities referring to concepts lexical entities referring to relations NLP machine learning clustering statistical methods GermanThatNeedle Arhivovano 13 travnya 2016 u Wayback Machine Plain Text dump automatic no concepts relations hierarchy NLP proprietary JSON multiple domains EnglishThe Wiki Machine 35 plain text HTML PDF DOC dump no yes automatic yes yes SA annotation to proper nouns annotation to common nouns machine learning RDFa domain independent English German Spanish French Portuguese Italian RussianThingFinder 36 IE named entities relationships events multilingualViyavlennya znan RedaguvatiViyavlennya znan opisuye proces avtomatichnogo poshuku velikih obsyagiv danih dlya modelej yaki mozhna vvazhati znannya pro dani 37 Vin chasto opisuyetsya yak viluchennya znan z vhidnih danih Viyavlennya znan rozvinulasya z oblasti intelektualnogo analizu danih a takozh tisno pov yazana z neyu yak z tochki zoru metodologiyi ta terminologiyi 38 Najbilsh vidoma gilka intelektualnogo analizu danih ye viyavlennya znan takozh vidomij yak viyavlennya znan v bazah danih KDD Tak samo yak i bagato inshih form viyavlennya znan stvoryuye abstrakciyi vhidnih danih Znannya otrimani v procesi mozhut stati dodatkovi dani yaki mozhut buti vikoristani dlya podalshogo vikoristannya i vidkrittya Chasto rezultati vid viyavlennya znan ne diyevi vidkrittya znannya diyevi takozh vidomij yak domen privodom intelektualnogo analizu danih maye na meti viyaviti ta dostaviti diyevi znannya ta ideyi Inshim perspektivnim zastosuvannya viyavlennya znan v oblasti modernizaciyi programnogo zabezpechennya viyavlennya slabkosti i dotrimannya yakih peredbachaye rozuminnya isnuyuchih programnih artefaktiv Cej proces pov yazanij z koncepciyeyu zvorotnoyi inzheneriyi Yak pravilo znannya otrimani z isnuyuchogo programnogo zabezpechennya predstavleni u viglyadi modelej v yakij konkretni zapiti mozhut buti zrobleni pri potrebi Vidnosini suti ye najchastishim formatom predstavlennya znan otrimanih z isnuyuchogo programnogo zabezpechennya Ob yekt Management Group OMG rozrobila specifikaciyi znannya Discovery Metamodel KDM yakij viznachaye ontologiyu dlya zasobiv programnogo zabezpechennya ta yih vidnosin z metoyu vikonannya viyavlennya znan vsi nayavni kodi Viyavlennya znan z isnuyuchih programnih sistem takozh vidomij yak programne zabezpechennya vidobutku korisnih kopalin tisno pov yazana z vidobutkom korisnih kopalin danih oskilki isnuyuchi programni artefakti mistyat velichezne znachennya dlya upravlinnya rizikami ta vartosti biznesu klyuch dlya ocinki ta rozvitku programnih sistem Zamist togo shob vidobutok okremih naboriv danih girnichodobuvnoyi promislovosti programnogo zabezpechennya fokusuyetsya na metadanih takih yak potoki procesu napriklad potoki danih potoki upravlinnya amp nazvati karti arhitektura shemi baz danih i biznes pravila umovi procesu Vhidni dani Redaguvati bazi danih relyacijni dani baza danih skladskij dokument Informacijne shovishe programne zabezpechennya vihidni dani fajli konfiguraciyi pobudova scenariyiv Tekst koncepciya girnichodobuvnoyi promislovosti diagrami molekula sens poslidovnosti vidobutok potoku danih Navchannya vid zminyuyutsya v chasi potokiv danih v ramkah koncepciyi drejfu Vebvihidni formati Redaguvati Model danih Metadani metamodeli ontologiya uyavlennya znan tegi znan biznes pravila Znannya Discovery Metamodel KDM Modelyuvannya biznes procesiv notaciya BPMN promizhne predstavlennya Resource Description Framework RDF metriki programnogo zabezpechennyaDiv takozh Redaguvatiklasterizaciya arheologiya danih Vidobutok danih intelektualnogo analizu danih domenu privodom Intelektualnogo analizu danih v silskomu gospodarstvi Vityag peretvorennya zavantazhennya informaciya Vidobutok Podannya znan i visnovokPrimitki Redaguvati RDB2RDF Working Group Website http www w3 org 2001 sw rdb2rdf Arhivovano 11 travnya 2016 u Wayback Machine charter http www w3 org 2009 08 rdb2rdf charter Arhivovano 20 bereznya 2016 u Wayback Machine R2RML RDB to RDF Mapping Language http www w3 org TR r2rml Arhivovano 10 zhovtnya 2021 u Wayback Machine Pomilka cituvannya Nekorektnij teg lt ref gt nazva RDB2RDF viznachena kilka raziv z riznim vmistom Pomilka cituvannya Nekorektnij teg lt ref gt nazva RDB2RDF viznachena kilka raziv z riznim vmistom LOD2 EU Deliverable 3 1 1 Knowledge Extraction from Structured Sources http static lod2 eu Deliverables deliverable 3 1 1 pdf Arhivovano 27 serpnya 2011 u Wayback Machine Life in the Linked Data Cloud www opencalais com Arhiv originalu za 24 listopada 2009 Procitovano 10 listopada 2009 Wikipedia has a Linked Data twin called DBpedia DBpedia has the same structured information as Wikipedia but translated into a machine readable format Tim Berners Lee 1998 Relational Databases on the Semantic Web Arhivovano 16 bereznya 2016 u Wayback Machine Retrieved February 20 2011 Hu et al 2007 Discovering Simple Mappings Between Relational Database Schemas and Ontologies In Proc of 6th International Semantic Web Conference ISWC 2007 2nd Asian Semantic Web Conference ASWC 2007 LNCS 4825 pages 225 238 Busan Korea 11 15 November 2007 http citeseerx ist psu edu viewdoc download doi 10 1 1 97 6934 amp rep rep1 amp type pdf Arhivovano 15 zhovtnya 2012 u Wayback Machine R Ghawi and N Cullot 2007 Database to Ontology Mapping Generation for Semantic Interoperability In Third International Workshop on Database Interoperability InterDB 2007 http le2i cnrs fr IMG publications InterDB07 Ghawi pdf Arhivovano 4 bereznya 2016 u Wayback Machine Li et al 2005 A Semi automatic Ontology Acquisition Method for the Semantic Web WAIM volume 3739 of Lecture Notes in Computer Science page 209 220 Springer http dx doi org 10 1007 11563952 19 Arhivovano 26 lipnya 2008 u Wayback Machine Tirmizi et al 2008 Translating SQL Applications to the Semantic Web Lecture Notes in Computer Science Volume 5181 2008 Database and Expert Systems Applications http citeseer ist psu edu viewdoc download jsessionid 15E8AB2A37BD06DAE59255A1AC3095F0 doi 10 1 1 140 3169 amp rep rep1 amp type pdf Arhivovano 4 bereznya 2016 u Wayback Machine Farid Cerbah 2008 Learning Highly Structured Semantic Repositories from Relational Databases The Semantic Web Research and Applications volume 5021 of Lecture Notes in Computer Science Springer Berlin Heidelberg http www tao project eu resources publications cerbah learning highly structured semantic repositories from relational databases pdf Arhivovano 20 lipnya 2011 u Wayback Machine Wimalasuriya Daya C Dou Dejing 2010 Ontology based information extraction An introduction and a survey of current approaches Journal of Information Science 36 3 p 306 323 http ix cs uoregon edu dou research papers jis09 pdf Arhivovano 11 kvitnya 2016 u Wayback Machine retrieved 18 06 2012 Cunningham Hamish 2005 Information Extraction Automatic Encyclopedia of Language and Linguistics 2 p 665 677 http gate ac uk sale ell2 ie main pdf Arhivovano 5 bereznya 2016 u Wayback Machine retrieved 18 06 2012 Erdmann M Maedche Alexander Schnurr H P Staab Steffen 2000 From Manual to Semi automatic Semantic Annotation About Ontology based Text Annotation Tools Proceedings of the COLING http www ida liu se ext epa cis 2001 002 paper pdf Arhivovano 3 bereznya 2016 u Wayback Machine retrieved 18 06 2012 Rao Delip McNamee Paul Dredze Mark 2011 Entity Linking Finding Extracted Entities in a Knowledge Base Multi source Multi lingual Information Extraction and Summarization http www cs jhu edu delip entity linking pdf nedostupne posilannya z bereznya 2019 retrieved 18 06 2012 Rocket Software Inc 2012 technology for extracting intelligence from text http www rocketsoftware com products aerotext Arhivovano 21 chervnya 2013 u Wayback Machine retrieved 18 06 2012 Orchestr8 2012 AlchemyAPI Overview http www alchemyapi com api Arhivovano 1 serpnya 2013 u Wayback Machine retrieved 18 06 2012 The University of Sheffield 2011 ANNIE a Nearly New Information Extraction System http gate ac uk sale tao splitch6 html chap annie Arhivovano 15 bereznya 2016 u Wayback Machine retrieved 18 06 2012 ILP Network of Excellence ASIUM LRI http www ai ijs si ilpnet2 systems asium html Arhivovano 11 chervnya 2017 u Wayback Machine retrieved 18 06 2012 Attensity 2012 Exhaustive Extraction http www attensity com products technology semantic server exhaustive extraction Arhivovano 11 lipnya 2012 u Wayback Machine retrieved 18 06 2012 Mendes Pablo N Jakob Max Garcia Silva Andres Bizer Christian 2011 DBpedia Spotlight Shedding Light on the Web of Documents Proceedings of the 7th International Conference on Semantic Systems p 1 8 http www wiwiss fu berlin de en institute pwo bizer research publications Mendes Jakob GarciaSilva Bizer DBpediaSpotlight ISEM2011 pdf Arhivovano 5 kvitnya 2012 u Wayback Machine retrieved 18 06 2012 Presutti Valentina Draicchio Francesco Gangemi Aldo 2012 Knowledge Extraction based on Discourse Representation Theory and Linguistic Frames Proceedings of the Conference on Knowledge Engineering and Knowledge Management EKAW2012 LNCS Springer http www researchgate net profile Aldo Gangemi publication 262175193 Knowledge extraction based on discourse representation theory and linguistic frames links 5488b1bb0cf268d28f08fde6 pdf retrieved 18 01 2015 Balakrishna Mithun Moldovan Dan 2013 Automatic Building of Semantically Rich Domain Models from Unstructured Data Proceedings of the Twenty Sixth International Florida Artificial Intelligence Research Society Conference FLAIRS p 22 27 http www aaai org ocs index php FLAIRS FLAIRS13 paper view 5909 6036 Arhivovano 4 bereznya 2016 u Wayback Machine retrieved 11 08 2014 2 Moldovan Dan Blanco Eduardo 2012 Polaris Lymba s Semantic Parser Proceedings of the Eight International Conference on Language Resources and Evaluation LREC p 66 72 http www lrec conf org proceedings lrec2012 pdf 176 Paper pdf Arhivovano 12 serpnya 2014 u Wayback Machine retrieved 11 08 2014 Adrian Benjamin Maus Heiko Dengel Andreas 2009 iDocument Using Ontologies for Extracting Information from Text http www dfki uni kl de maus dok AdrianMausDengel09 pdf Arhivovano 4 bereznya 2016 u Wayback Machine retrieved 18 06 2012 SRA International Inc 2012 NetOwl Extractor http www sra com netowl entity extraction Arhivovano 24 veresnya 2012 u Wayback Machine retrieved 18 06 2012 Fortuna Blaz Grobelnik Marko Mladenic Dunja 2007 OntoGen Semi automatic Ontology Editor Proceedings of the 2007 conference on Human interface Part 2 p 309 318 http analytics ijs si blazf papers OntoGen2 HCII2007 pdf Arhivovano 18 veresnya 2013 u Wayback Machine retrieved 18 06 2012 Missikoff Michele Navigli Roberto Velardi Paola 2002 Integrated Approach to Web Ontology Learning and Engineering Computer 35 11 p 60 63 http wwwusers di uniroma1 it velardi IEEE C pdf Arhivovano 19 travnya 2017 u Wayback Machine retrieved 18 06 2012 McDowell Luke K Cafarella Michael 2006 Ontology driven Information Extraction with OntoSyphon Proceedings of the 5th international conference on The Semantic Web p 428 444 http turing cs washington edu papers iswc2006McDowell final pdf Arhivovano 10 bereznya 2016 u Wayback Machine retrieved 18 06 2012 Yildiz Burcu Miksch Silvia 2007 ontoX A Method for Ontology Driven Information Extraction Proceedings of the 2007 international conference on Computational science and its applications 3 p 660 673 http publik tuwien ac at files pub inf 4769 pdf Arhivovano 5 lipnya 2017 u Wayback Machine retrieved 18 06 2012 semanticweb org 2011 PoolParty Extractor http semanticweb org wiki PoolParty Extractor Arhivovano 4 bereznya 2016 u Wayback Machine retrieved 18 06 2012 IMT Holdings Corp 2013 Rosoka http www rosoka com content capabilities Arhivovano 10 travnya 2016 u Wayback Machine retrieved 08 08 2013 Dill Stephen Eiron Nadav Gibson David Gruhl Daniel Guha R Jhingran Anant Kanungo Tapas Rajagopalan Sridhar Tomkins Andrew Tomlin John A Zien Jason Y 2003 SemTag and Seeker Bootstraping the Semantic Web via Automated Semantic Annotation Proceedings of the 12th international conference on World Wide Web p 178 186 http www2003 org cdrom papers refereed p831 p831 dill html Arhivovano 11 chervnya 2017 u Wayback Machine retrieved 18 06 2012 Uren Victoria Cimiano Philipp Iria Jose Handschuh Siegfried Vargas Vera Maria Motta Enrico Ciravegna Fabio 2006 Semantic annotation for knowledge management Requirements and a survey of the state of the art Web Semantics Science Services and Agents on the World Wide Web 4 1 p 14 28 http staffwww dcs shef ac uk people J Iria iria jws06 pdf nedostupne posilannya z travnya 2019 retrieved 18 06 2012 Cimiano Philipp Volker Johanna 2005 Text2Onto A Framework for Ontology Learning and Data Driven Change Discovery Proceedings of the 10th International Conference of Applications of Natural Language to Information Systems 3513 p 227 238 http www cimiano de Publications 2005 nldb05 nldb05 pdf Arhivovano 14 travnya 2013 u Wayback Machine retrieved 18 06 2012 Maedche Alexander Volz Raphael 2001 The Ontology Extraction amp Maintenance Framework Text To Onto Proceedings of the IEEE International Conference on Data Mining http users csc calpoly edu fkurfess Events DM KM 01 Volz pdf Arhivovano 4 bereznya 2016 u Wayback Machine retrieved 18 06 2012 Machine Linking We connect to the Linked Open Data cloud http thewikimachine fbk eu html index html Arhivovano 19 lipnya 2012 u Wayback Machine retrieved 18 06 2012 Inxight Federal Systems 2008 Inxight ThingFinder and ThingFinder Professional http inxightfedsys com products sdks tf Arhivovano 29 chervnya 2012 u Wayback Machine retrieved 18 06 2012 Frawley William F et al 1992 Knowledge Discovery in Databases An Overview AI Magazine Vol 13 No 3 57 70 online full version http www aaai org ojs index php aimagazine article viewArticle 1011 Arhivovano 4 bereznya 2016 u Wayback Machine Fayyad U et al 1996 From Data Mining to Knowledge Discovery in Databases AI Magazine Vol 17 No 3 37 54 online full version http www aaai org ojs index php aimagazine article viewArticle 1230 Arhivovano 4 travnya 2016 u Wayback Machine Pomilka cituvannya Teg lt ref gt z nazvoyu lod2 eu viznachenij u lt references gt v grupi nichogo ne mistit Pomilka cituvannya Teg lt ref gt z nazvoyu Fayyad1996 viznachenij u lt references gt v grupi nichogo ne mistit Pomilka cituvannya Teg lt ref gt z nazvoyu Adrian viznachenij u lt references gt v grupi nichogo ne mistit Pomilka cituvannya Teg lt ref gt z nazvoyu Orchestr8 viznachenij u lt references gt v grupi nichogo ne mistit Pomilka cituvannya Teg lt ref gt z nazvoyu Rocket Software Inc viznachenij u lt references gt v grupi nichogo ne mistit Pomilka cituvannya Teg lt ref gt z nazvoyu Yildiz viznachenij u lt references gt v grupi nichogo ne mistit Otrimano z https uk wikipedia org w index php title Vidobuvannya znan amp oldid 36831486