Dimitris Sacharidis
Text
Mitigating Data Sparsity in Integrated Data through Text Conceptualization
We propose THOR, a novel method for extracting information from text that, unlike related approaches, relies neither on complex rules nor on models trained on large annotated corpora. Instead, THOR is lightweight and exploits integrated data and its schema without the need for human annotations.
Md Ataur Rahman, Sergi Nadal, Oscar Romero, Dimitris Sacharidis
Rank A*
TokenJoin: Efficient Filtering for Set Similarity Join with Maximum Weighted Bipartite Matching
We propose TokenJoin, a method for linking complex records, i.e., identifying similar pairs among a collection of complex records. A complex record is a set of simpler text entities, such as a set of addresses.
Alexandros Zeakis, Dimitrios Skoutas, Dimitris Sacharidis, Odysseas Papapetrou, Manolis Koubarakis
Rank A*