Surveys and Analysis
- Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions (TKDE 2014) [Paper] 🌟
- Neural Entity Linking: A Survey of Models based on Deep Learning (2020) [Paper]
- a survey of state-of-the-art neural entity linking models
- a survey of entity embedding techniques;
- a discussion of recent domain-independent (zero-shot) and cross-lingual EL approaches;
- a survey of EL applications to modeling word representations.
- Error analysis of the well known DeepED model [Link]
- Towards holistic Entity Linking: Survey and directions (Information Systems 2021) [Link]
- Neural Collective Entity Linking (COLING 2018) [Paper]
- KG-Based Approach to Named Entity Disambiguation for Healthcare Applications — GraphAware (KGC 2022) [Video]
- Recent Trends in Word Sense Disambiguation : A Survey (IJCAI 2021, Survey Track) [Paper]
Notes and Discussions
- Candiate Entity Ranking [Notes]
- NLP-progress for Entity Linking [Notes] [GitHub]
- Recent Trend for Entity Linking [Notes]
- Summary of Entity Linking [Notes 1] [Notes 2, check the appendices here]
- Entity Coherence in Entity Linking [Blog]
- Zero-Shot Entity Linking by Reading Entity Descriptions (ACL 2019) [Paper][Code and Datasets]
- Keyphrase Overlap Relatedness for Entity Disambiguation (CIKM 2012), LSH 🌟
- Old is Gold: Linguistic Driven Approach for Entity and Relation Linking of Short Text (NAACL 2019), with Relation Linking
- Improving Entity Linking by Modeling Latent Relations between Mentions (ACL 2018)
- Entity Linking for Tweets (ACL 2013)
- Pangloss: Fast Entity Linking in Noisy Text Environments (KDD 2018) [Presentation] 🌟
- THINKER - Entity Linking System for Turkish Language (TKDE 2018) 🌟
- SHINE+: A General Framework for Domain-Specific Entity Linking with Heterogeneous Information Networks (TKDE 2018) 🌟
- SVM ensembles for named entity disambiguation (Computing, 102(4), 2020) [Paper]
- Attention-Based Joint Entity Linking with Entity Embedding (Information 10.2, 2019)
- Entity Disambiguation Leveraging Multi-Perspective Attention (IEEE Access 2019)
- CLEEK: A Chinese Long-text Corpus for Entity Linking (LREC 2020)
- A Novel Approach for Analyzing Entity Linking Between Words and Entities for a Knowledge Base Using an Attention-Based Bilinear Joint Learning and Weighted Summation Model (IEEE Access 2019)
- Fast and accurate entity linking via graph embedding (GRADES-NDA'19) [Paper] [Slides]
- A contrastive deep learning approach for entity linkage (ICDM 2020)
- REL: An Entity Linker Standing on the Shoulders of Giants (SIGIR 2020) [GitHub, API and Code]
- A probabilistic model for linking named entities in web text with heterogeneous information networks(SIGMOD 2014)[Paper] 🌟
- Read, Retrospect, Select: An MRC Framework to Short Text Entity Linking (AAAI 2021) [Paper]
- Pivot-based Candidate Retrieval for Cross-lingual Entity Linking (WWW 2021) [Video]
- Medical Entity Disambiguation using Graph Neural Networks (SIGMOD 2021) 🌟
- Linking Entities across Relations and Graphs (ACM Transactions on Database Systems 2024) [Paper] - From Prof. Wenfei Fan
- Unsupervised Named Entity Disambiguation for Low Resource Domains (EMNLP 2024) [Paper]
Global Coherence
- Relational Inference for Wikification (ACL 2013)
- Robust Disambiguation of Named Entities in Text (EMNLP 2011)
- Liege: Link Entities in Web Lists with Knowledge Base (KDD 2012)🌟
- Collective entity linking in web text: A graph-based method (SIGIR 2011)
- Collective Annotation of Wikipedia Entities in Web Text (KDD 2009)🌟
- Local and Global Algorithms for Disambiguation to Wikipedia (ACL 2011)
- Learning entity representation for entity disambiguation (ACL 2013)
- Robust named entity disambiguation with random walks (Semantic Web 2018)
- Graph ranking for collective named entity disambiguation (ACL 2014)
- Personalized page rank for named entity disambiguation (ACL 2015)
- Collective entity resolution with multi-focal attention (2016)
- To link or not to link? a study on end-to-end tweet entity linking (NAACL 2013)
- An entity-topic model for entity linking (EMNLP 2012)
- Deep joint entity disambiguation with local neural attention (EMNLP 2017)
- Improving entity linking by modeling latent relations between mentions (ACL 2018)
- Neural Collective Entity Linking Based on Recurrent Random Walk Network Learning (IJCAI 2019), introduces external knowledge to model the semantic interdependence between different EL decisions
- ELDEN: Improved Entity Linking using Densified Knowledge Graphs (NAACL-HLT 2018) [Paper][Code], supervised EL system
- KBPearl: A Knowledge Base Population System Supported by Joint Entity and Relation Linking (VLDB 2020) [Paper], with relation linking 🌟
- Joint Embedding in Named Entity Linking on Sentence Level [Paper]
- Improving Entity Linking by Modeling Latent Relations between Mentions (ACL 2018) [Paper]
- Short text understanding through lexical-semantic analysis (ICDE 2015) (Best Paper Award) 🌟 [Paper]
- A collective entity linking algorithm with parallel computing on large-scale knowledge base (The Journal of Supercomputing, 2020) [Paper]
- A Novel Path-based Entity Relatedness Measure for Efficient Collective Entity Linking (ISWC 2020)
- Collective disambiguation in entity linking based on topic coherence in semantic graphs (2020) [Paper]
- Collective List-Only Entity Linking: A Graph-Based Approach (IEEE Access 2018) [Paper]
Relax the Global Coherence Assumption
- Joint entity linking with deep reinforcement learning (WWW 2019) [Paper]
Reinforcement learning, apply LSTM to be able to maintain long term memory for previous decisions.
- Learning Dynamic Context Augmentation for Global Entity Linking (DCA, ACL 2019) [Paper]
Reinforcement learning, previous decisions are collected as dynamic context to improve the following predictions.
- Global Entity Disambiguation with Pretrained Contextualized Embeddings of Words and Entities (2020) [Paper]
- BERT+MLM.
- Note: Paper 1,2,3 address ED as a sequential decision task that disambiguates mentions one by one, and uses words and already disambiguated entities to disambiguate new mentions.
- Joint Learning of Local and Global Features for Entity Linking via Neural Networks (COLING 2016), [Paper], CNN+RNN
- Global-RNN utilizes convolutional neural networks to induce the representations for local contexts and takes advantage of recurrent neural networks to adaptively compress variable length sequences of predictions for global constraints.
- Dynamic Graph Convolutional Networks for Entity Linking (WWW 2020) [Paper]
- Resorts to GNN to automatically decide the relevant linked nodes and then generate the global feature vector for every node.
- Then a score function is proposed to directly utilize the feature to compute the ranking score and do not need other additional inference steps
- Pair-Linking for Collective Entity Disambiguation: Two Could Be Better Than All (TKDE 2018) [Paper]🌟
- With a demo: CoNEREL: Collective Information Extraction in News Articles (SIGIR 2018 demo) [Paper]
- CoNEREL: Collective Information Extraction in News Articles (SIGIR 2018, demo of Paper 6) [Paper]
- KBPearl: A Knowledge Base Population System Supported by Joint Entity and Relation Linking (VLDB 2020) 🌟
- Joint Entity Linking for Web Tables with Hybrid Semantic Matching (ICCS 2020)
- Converts table entity linking into a sequence decision problem and uses hybrid semantic features to disambiguate the mentions in web tables
- Using Knowledge Base Semantics in Context-Aware Entity Linking (DocEng 2019) [Paper]
- Supervised CEL. Retained the sum, max@1,max@2 and max@3 as global contextual features, which can be seen as a kind of flexibility in selecting and aggregating the relatedness score.
- High Quality Candidate Generation and Sequential Graph Attention Network for Entity Linking (WWW 2020) [Paper] [Code] [Video], BERT+SeqGAT
- Graph-based models treat all candidate entities equally which may introduce much noise information
- Sequence models can only observe previous referred entities, ignoring the relevance between the current mention and its subsequent entities
- Contribution: (1) propose a multi-strategy based candidate generation method to generate high recall candidate sets; (2) design a Sequential Graph Attention Network (SeqGAT) which combines the advantages of graph and sequence methods
- The same first author as "Joint entity linking with deep reinforcement learning (WWW 2019)"
- LoG: a locally-global model for entity disambiguation (WWW 2021) [Paper]
- Derives meaningful local neighbors for each mention in a more linguistic way by utilizing dependency parse tree.
- Locates key sentences in the document and then detect keywords at sentence level, and integrates the topical coherence with both mention neighbors and keywords for ED.
- Employs neural ED approach which combines basic deep neural network model with Graph Attention Network (GAT) to utilize local features and global features.
- Neural collective entity linking (COLING 2018) [Paper]
- Topical coherence only need to hold among neighboring mentions
- TENET: Joint Entity and Relation Linking with Coherence Relaxation (SIGMOD 2021) 🌟
- Entity Disambiguation with Entity Definitions (EACL 2023) [Paper]
Some related works
- Evaluating the Impact of Knowledge Graph Context on Entity Disambiguation Models (CIKM 2020, short) [Paper]
- Employ SPASQL to fetch triples of target entities and incorporate the triples as kg context in pre-trained ED models. DCA is used as baselines on the AIDA-CONLL dataset.
- Attention-based Deep Reinforcement Learning Model for Pair-Wise Interaction Recommendation (ICISCE 2019)
- Claim that the one-pass sequential decision should consider both positive feedback and negative feedback.
- Improving Entity Linking through Semantic Reinforced Entity Embeddings (ACL 2020) [Paper] [Data and Code] [Details]
- Fine-grained semantic types of entities can let the linking models learn contextual commonality about semantic relatedness.
- fine-grained semantic words appear frequently as apposition (e.g., Defense contractor Raytheon), coreference (e.g., the company) or anonymous mentions (e.g., American defense firms). These fine-grained types of entities can help capture local contexts and relations of entities.
- Improving Entity Linking by Modeling Latent Entity Type Information (AAAI 2020) [Paper]
- Conduct error analysis of the well known DeepED1 model (Ganea and Hofmann 2017) on the development set of AIDA-CoNLL, and found that more than half of their error cases fall into the category of type errors where the predicted entity’s type is different from the golden entity’s type.
- Inject latent entity type information into the entity embeddings by modeling the immediate context surrounding the mention.
- Apply pre-trained BERT to represent the entity context.
- A joint model for entity analysis: Coreference, typing, and linking (TACL 2014)
- Joint entity recognition and disambiguation (EMNLP 2015)
- J-nerd: joint named entity recognition and disambiguation with rich linguistic features (TACL 2016)
- Paper 3,4,5 integrate type information into the entity linking task by jointly NER+EL, which captures the mutual dependency between them using structured CRF. These methods mainly differ in the design of hand-engineered features.
- Joint learning of named entity recognition and entity linking. (ACL: Student Research Workshop, 2019)
- Multi-task learning using learned features by extending Stack-LSTM.
- Paper 3,4,5,6 rely on extensive annotation of the type of mentions
- DeepType: Multilingual Entity Linking by Neural Type System Evolution (AAAI 2018) [Paper]
Note: how to combine the outputs of multiple EL tools for providing a unified set of entity annotations?
- Better Together - An Ensemble Learner for Combining the Results of Ready-made Entity Linking Systems (SAC 2020) [Paper]
- A Novel Ensemble Method for Named Entity Recognition and Disambiguation Based on Neural Network (ISWC 2018)
- MicroNeel: Combining NLP Tools to Perform Named Entity Detection and Linking on Microposts (Final Workshop 7 December 2016, Naples. 2016)
- Combining open source annotators for entity linking through weighted voting (SEM 2015)
- Joint posterior revision of NLP annotations via ontological knowledge (IJCAI 2018) [Paper] [Slides] [Details] 🌟
- (Extended Version of Paper 5) Knowledge-driven joint posterior revision of named entity classification and linking (Journal of Web Semantics, 2020) [Paper]
- (Extended reading for Paper 5, same research group) An Ontology-Driven Probabilistic Soft Logic Approach to Improve NLP Entity Annotations (ISWC 2018) [Slides]
- Joint Learning of Named Entity Recognition and Entity Linking (ACL 2019) [Paper] [Notes 1] [Notes 2] [Notes 3] [Notes 4]
- Multi-task learning of NER and EL based on Stack-LSTM approach.
- Supervised EL system with learned features.
- Future extension: training entity contextual embeddings and extend it to be cross-lingual.
- Re-ranking for joint named-entity recognition and linking (CIKM 2013) [Paper]
- The reranking model then chooses among the set of all possible mention and entity link labelings for the whole phrase to determine the best choice. It can use features for known relationships between the television channel ABC and the television program The_View to encourage these as outputs. For efficiency, we use the pipeline models to prune the set of all possible candidate mentions and entity links to a manageable size while maintaining high recall. The reranking model can then use more sophisticated features for collective classification over this pruned set.
- Rely on existing NER tools. Only NER is beneficial to EL, not vice versa.
- Hand-engineered features.
- Uses a large number of heuristically obtained Noun phrase (NP) chunks and word n-grams as additional input to the EL stage.
- To link or not to link? a study on end-to-end tweet entity linking (NAACL 2013)
- Only suitable for short-text such as tweets.
- Joint entity recognition and disambiguation (EMNLP 2015) [Paper]
- NER is beneficial to EL. EL is also beneficial to NER.
- Supervised EL system.
- Hand-engineered features.
- J-nerd: joint named entity recognition and disambiguation with rich linguistic features (TACL 2016) [Paper] [Code]
- Supervised, non-linear probabilistic graphical model that captures mention spans, mention types, and the mapping of mentions to entities in a knowledge base.
- Hand-engineered features.
- Relies on fully labeled training data where each tagged entity needs to have an NER and EL label.
- A Joint Model for Entity Analysis: Coreference, Typing, and Linking (TACL 2014) [Paper] [System] [GitHub]
- Joint learning of entity typing, EL, and coreference.
- Hand-engineered features.
- Contextualized End-to-End Neural Entity Linking [Paper]
- An end-to-end differentiable neural EL model that jointly performs MD and ED, based on BERT, while eliminating external knowledge so that we can study the impact of external knowledge to the EL model.
- Other version: "YELM: End-to-End Contextualized Entity Linking" (2019) [Paper]
- Noise-robust Named Entity Understanding for Virtual Assistants [Paper]
- Combining NER and EL information in a joint reranking module for noisy spoken language queries in the context of a digital voice assistant, our proposed framework improves accuracy in both tasks.
- The main idea is to consider all possible spans as potential mentions and learn contextual similarity scores over their entity candidates that are useful for both MD and ED decisions.
General
- No Noun Phrase Left Behind: Detecting and Typing Unlinkable Entities (EMNLP-ACL 2012)
- Main idea: Train a classifier wuth features primarily derived from a timestamped corpus
- Main contribution: usage patterns di er across time between in-KB entities and out-of-KB entities and they operationalize this idea by computing the best t line (least-squares regression ) for usage over time.
- Entity Linking at Web Scale (AKBC-WEKEX 2012)
- Explores handling these unlinkable entities over 3 steps: Detect Entities, Predict Types, Disambiguation
- The Birth of Collective Memories: Analyzing Emerging Entities in Text streams (JASIST 2018)
- Track entities that emerge in public discourse (unstructured text) to gain insights into how these are added to Wikipedia.
Unsupervised
- Using encyclopedic knowledge for named entity disambiguation (2006) [PDF]
- Nlpr_kbp in tac 2009 kbp track: A two-stage method to entity linking (TAC workshop 2009)
- LCC approaches to knowledge base population at TAC 2010 (TAC workshop 2010)
- Linking entities to a knowledge base with query expansion (EMNLP 2011)
- LINDEN: Linking named entities with knowledge base via semantic knowledge (WWW 2012)
- Linking named entities in tweets with knowledge base via user interest modeling (KDD 2013)
- Tagme: On-the-fly annotation of short text fragments (by wikipedia entities) (CIKM 2010)
- From names to entities using thematic context distance (CIKM 2011)
- Mining evidences for named entity disambiguation (KDD 2013)
- Paper 1-9: If the score of a mention is smaller than a NIL threshold, then this mention is predicted as unlinkable. The threshold is learned from the training data.
- Novel Entity Discovery from Web Tables (WWW 2020) [PDF]
- A feature-based method, the first endeavor on mining the unlinked cells in web tables.
- Identifies not only out-of-KB (“novel”) information but also novel aliases for in-KB (“known”) entities
- Exploring Multiple Feature Spaces for Novel Entity Discovery (AAAI 2016)
Discover novel entities in news and Web data by exploring multiple feature spaces, including context, neural embedding, topical, query and lexical spaces
- The Birth of Collective Memories: Analyzing Emerging Entities in Text streams (JASIST 69, 2018)
- Track entities that emerge in public discourse to gain insights into how these are added to Wikipedia.
- Knowledge-driven joint posterior revision of named entity classification and linking (Journal of Web Semantics, 2020) [Paper]
Supervised
- NUS-I2R: Learning a combined system for entity linking (TAC workshop 2010)
- LCC approaches to knowledge base population at TAC 2010 (TAC workshop 2010)
- I2R-NUS-MSRA at TAC 2011: Entity linking (TAC workshop 2011)
- Entity linking with effective acronym expansion, instance selection and topic modeling (IJCAI 2011)
- Cross-lingual cross-document coreference with entity linking (TAC workshop 2011)
Utilize the binary classification technique (such as SVM). Positive --> entity_top, Negative --> NIL.
- Learning to link entities with knowledge base (NAACL 2010)
- Local and global algorithms for disambiguation to wikipedia (ACL 2011)
- Design some features for for unlinkable mention prediction, such as the score of the top-ranked candidate and whether the entity mention is detected by some NER as a named entity.
- Entity disambiguation for knowledge base population (COLING 2010) [Paper]
- HLTCOE efforts in entity linking at TAC KBP 2010 (TAC workshop 2011)
- Incorporate the unlinkable mention prediction process into the entity ranking process. They added a NIL entity into the candidate entity set, and considered NIL as a distinct candidate.
- A generative entity-mention model for linking entities with knowledge base (ACL 2011)
- The model assumes that for the entity mention which refers to some specific entity, the probability of this entity mention generated by this specific entity’s model should be significantly higher than the probability of this mention generated by a general language model.
- No Noun Phrase Left Behind: Detecting and Typing Unlinkable Entities (EMNLP-CoNLL ’12)
- Train a classifier with features primarily derived from a timestamped corpus.
- Wembedder: Wikidata entity embedding web service [Intro] [Github] [Web service]
- Web service: only the "most similar" service.
- 100-dimention and 50-dimention, parsed by numpy.memmap. However, I can only read one float (instead of a vector) for each entity. No sure whether there is mistake.
- EntEval: A Holistic Evaluation Benchmark for Entity Representations (EMNLP 2019) [Paper]
- DRIN: Dynamic Relation Interactive Network for Multimodal Entity Linking (2023) [Paper]
- Multi-Grained Multimodal Interaction Network for Entity Linking (2023) [Paper]
- Generative Multimodal Entity Linking (Arxiv 2023) [Paper]
- Multimodal Entity Linking: A New Dataset and A Baseline (MM 2021) [Paper]
- Self-Supervised Enhancement for Named Entity Disambiguation via Multimodal Graph Convolution (IEEE Transactions on Neural Networks and Learning Systems 2022) [Paper]
- Large language models struggle to learn long-tail knowledge (PMLR 2023) [Paper]
- Exploring the In-context Learning Ability of Large Language Model for Biomedical Concept Linking (Arxiv 2023) [Paper]
- Instructed Language Models with Retrievers Are Powerful Entity Linkers (Arxiv 2023) [Paper]
- Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs (Arxiv 2023) [Paper]
- Use Large Language Models for Named Entity Disambiguation in Academic Knowledge Graphs (EIMSS 2023) [Paper)]
- TagMe [Python API] [API] [GitHub1] [GitHub2]
- Stanford NER [Link]
- DBpedia Spotlight [Link]
- NLTK Tagger [Link]
- SpaCy [Link1] [Link2]
- spacy-llm [Link]
- EARL (including Relation Linking) [Link]
- Falcon (including Relatoin Linking) [DBpedia version] [Wikidata version]
- MonkeyLearn [Link]
- GERBIL - General Entity Annotator Benchmark [Link]
- PIKES [Link]