Skip to content

This repository contains the data introduced in the paper "Reconnaissance de défigements dans des tweets en français par similarité d'alignements textuels" (TALN 2023).

Notifications You must be signed in to change notification settings

JulienBez/DefigementTALN2023

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 

Repository files navigation

DefigementTALN2023

This repository contains data files introduced in our paper "Reconnaissance de défigements dans des tweets en français par similarité d'alignements textuels" (TALN 2023). 2 files can be found :

  • seeds.json : a list of every expressions used to collect tweets with Twitter's API.
  • tweets.json : a list containing the ids of every tweets we used for our analysis.

For an up-to-date version of this dataset, see the FrUIT corpus. For an up-to-date version of the scripts used to extract both multiwords and unfrozen multiwords expressions, see the ASMR repository.

About

This repository contains the data introduced in the paper "Reconnaissance de défigements dans des tweets en français par similarité d'alignements textuels" (TALN 2023).

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published