References¶

[1]

David M Blei, Andrew Y Ng, and Michael I Jordan. Latent dirichlet allocation. Journal of machine Learning research, 3(Jan):993–1022, 2003.

[2]

Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. Efficient estimation of word representations in vector space. In International Conference on Learning Representations (ICLR). 2013.

[3]

Bang Liu, Fred X Han, Di Niu, Linglong Kong, Kunfeng Lai, and Yu Xu. Story forest: extracting events and telling stories from breaking news. ACM Transactions on Knowledge Discovery from Data (TKDD), 14(3):1–28, 2020.

[4]

Yuwei Wang, Zhenfeng Liu, Chenwei Zhu, Meng Jiang, Yixuan Zhang, and Yizhou Wang. Kpgnn: knowledge-guided pattern graph neural network for social event detection. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management, 1025–1034. 2020.

[5]

Hao Peng, Ruitong Zhang, Shaoning Li, Yuwei Cao, Shirui Pan, and Philip S. Yu. Reinforced, incremental and cross-lingual event detection from social messages. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(1):980–998, 2022.

[6]

Dionysios Karamouzas, Ioannis Mademlis, and Ioannis Pitas. Public opinion monitoring through collective semantic analysis of tweets. Social Network Analysis and Mining, 12(1):91–112, 2022.

[7]

Yang Liu and Yi-Fang Brook Wu. Fned: a deep network for fake news early detection on social media. ACM Transactions on Information Systems (TOIS), 38(3):1–33, 2020.

[8]

Mahdi Abavisani, Liwei Wu, Shengli Hu, Joel Tetreault, and Alejandro Jaimes. Multimodal categorization of crisis events in social media. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 14679–14689. 2020.

[9]

Yuwei Cao, Hao Peng, Zhengtao Yu, and Philip S. Yu. Hierarchical and incremental structural entropy minimization for unsupervised social event detection. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 38, 8255–8264. 2024.

[10]

Yuwei Cao, Hao Peng, Jia Wu, Yingtong Dou, Jianxin Li, and Philip S. Yu. Knowledge-preserving incremental social event detection via heterogeneous gnns. In Proceedings of the Web Conference 2021, 3383–3395. 2021.

[11]

Jiaqian Ren, Lei Jiang, Hao Peng, Yuwei Cao, Jia Wu, Philip S. Yu, and Lifang He. From known to unknown: quality-aware self-improving graph neural network for open set social event detection. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 1696–1705. 2022.

[12]

Alex Graves and Jürgen Schmidhuber. Framewise phoneme classification with bidirectional lstm and other neural network architectures. Neural networks, 18(5-6):602–610, 2005.

[13]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. BERT: pre-training of deep bidirectional transformers for language understanding. In Proceeding of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, 4171–4186. Minneapolis: Association for Computational Linguistics, 2019.

[14]

Yue Zhao, Zain Nasrullah, and Zheng Li. Pyod: a python toolbox for scalable outlier detection. Journal of machine learning research, 20(96):1–7, 2019.

[15]

Kay Liu, Yingtong Dou, Xueying Ding, Xiyang Hu, Ruitong Zhang, Hao Peng, Lichao Sun, and Philip S. Yu. Pygod: a python library for graph outlier detection. Journal of Machine Learning Research, 25(141):1–9, 2024.

[16]

Jeffrey Pennington, Richard Socher, and Christopher D Manning. Glove: global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), 1532–1543. 2014.

[17]

Matt Kusner, Yu Sun, Nicholas Kolkin, and Kilian Weinberger. From word embeddings to document distances. In International conference on machine learning, 957–966. PMLR, 2015.

[18]

Jiaqian Ren, Hao Peng, Lei Jiang, Zhifeng Hao, Jia Wu, Shengxiang Gao, Zhengtao Yu, and Qiang Yang. Toward cross-lingual social event detection with hybrid knowledge distillation. ACM Transactions on Knowledge Discovery from Data, 18(9):1–36, 2024.

[19]

Jiaqian Ren, Hao Peng, Lei Jiang, Zhiwei Liu, Jia Wu, Zhengtao Yu, and Philip S. Yu. Uncertainty-guided boundary learning for imbalanced social event detection. IEEE Transactions on Knowledge and Data Engineering, pages 1–15, 2023.

[20]

Pu Li, Xiaoyan Yu, Hao Peng, Yantuan Xian, Linqin Wang, Li Sun, Jingyun Zhang, and Philip S. Yu. Relational prompt-based pre-trained language models for social event detection. ACM Transactions on Information Systems, pages 1–43, 2024.

[21]

Zhiwei Yang, Yuecen Wei, Haoran Li, Qian Li, Lei Jiang, Li Sun, Xiaoyan Yu, Chunming Hu, and Hao Peng. Adaptive differentially private structural entropy minimization for unsupervised social event detection. In Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2950–2960. 2024.

[22]

Nils Reimers and Iryna Gurevych. Sentence-bert: sentence embeddings using siamese bert-networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 3982–3992. 2019.

[23]

Jiaqian Ren, Lei Jiang, Hao Peng, Zhiwei Liu, Jia Wu, and Philip S. Yu. Evidential temporal-aware graph-based social event detection via dempster-shafer theory. In 2022 IEEE International Conference on Web Services (ICWS), 331–336. IEEE, 2022.

[24]

Yuanyuan Guo, Zehua Zang, Hang Gao, Xiao Xu, Rui Wang, Lixiang Liu, and Jiangmeng Li. Unsupervised social event detection via hybrid graph contrastive learning and reinforced incremental clustering. Knowledge-Based Systems, 284:111225, 2024.

[25]

Xiaoyan Yu, Yifan Wei, Shuaishuai Zhou, Zhiwei Yang, Li Sun, Hao Peng, Liehuang Zhu, and Philip S. Yu. Towards effective, efficient and unsupervised social event detection in the hyperbolic space. In Proceedings of the AAAI Conference on Artificial Intelligence, 1–11. 2025.

[26]

Xiaoyan Yu, Yifan Wei, Pu Li, Shuaishuai Zhou, Hao Peng, Li Sun, Liehuang Zhu, and Philip S. Yu. Dame: personalized federated social event detection with dual aggregation mechanism. In Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 3052–3062. 2024.

[27]

Lars Buitinck, Gilles Louppe, Mathieu Blondel, Fabian Pedregosa, Andreas Mueller, Olivier Grisel, Vlad Niculae, Peter Prettenhofer, Alexandre Gramfort, Jaques Grobler, Robert Layton, Jake VanderPlas, Arnaud Joly, Brian Holt, and Gaël Varoquaux. API design for machine learning software: experiences from the scikit-learn project. In ECML PKDD Workshop: Languages for Data Mining and Machine Learning, 108–122. 2013.

[28]

T Wolf. Huggingface's transformers: state-of-the-art natural language processing. arXiv preprint arXiv:1910.03771, pages 1–8, 2019.

[29]

Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, and others. Pytorch: an imperative style, high-performance deep learning library. Advances in neural information processing systems, 32:1–12, 2019.

[30]

Mahmud Hasan, Mehmet A Orgun, and Rolf Schwitter. A survey on real-time event detection from the twitter data stream. Journal of Information Science, 44(4):443–463, 2018.

[31]

Andrew J McMinn, Yashar Moshfeghi, and Joemon M Jose. Building a large-scale corpus for evaluating event detection on twitter. In Proceedings of the 22nd ACM international conference on Information & Knowledge Management, 409–418. 2013.

[32]

Alaa Alharbi and Mark Lee. Kawarith: an arabic twitter corpus for crisis events. In Proceedings of the Sixth Arabic Natural Language Processing Workshop, 42–52. 2021.

[33]

Xiaozhi Wang, Ziqi Wang, Xu Han, Wangyi Jiang, Rong Han, Zhiyuan Liu, Juanzi Li, Peng Li, Yankai Lin, and Jie Zhou. Maven: a massive general domain event detection dataset. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 1652–1671. 2020.

[34]

Alexandra Olteanu, Carlos Castillo, Fernando Diaz, and Sarah Vieweg. Crisislex: a lexicon for collecting and filtering microblogged communications in crises. In Proceedings of the international AAAI conference on web and social media, volume 8, 376–385. 2014.

[35]

Alexandra Olteanu, Sarah Vieweg, and Carlos Castillo. What to expect when the unexpected happens: social media communications across crises. In Proceedings of the 18th ACM conference on computer supported cooperative work & social computing, 994–1009. 2015.

[36]

Firoj Alam, Ferda Ofli, and Muhammad Imran. Crisismmd: multimodal twitter datasets from natural disasters. In Proceedings of the international AAAI conference on web and social media, volume 12. 2018.

[37]

Firoj Alam, Hassan Sajjad, Muhammad Imran, and Ferda Ofli. Crisisbench: benchmarking crisis-related social media datasets for humanitarian information processing. In Proceedings of the International AAAI conference on web and social media, volume 15, 923–932. 2021.

[38]

Shumin Deng, Ningyu Zhang, Jiaojian Kang, Yichi Zhang, Wei Zhang, and Huajun Chen. Meta-learning with dynamic-memory-based prototypical network for few-shot event detection. In Proceedings of the 13th international conference on web search and data mining, 151–159. 2020.

[39]

Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz, and Patrick Meier. Extracting information nuggets from disaster-related messages in social media. Iscram, 201(3):791–801, 2013.

[40]

Firoj Alam, Ferda Ofli, Muhammad Imran, and Michael Aupetit. A twitter tale of three hurricanes: harvey, irma, and maria. arXiv preprint arXiv:1805.05144, 2018.

[41]

Firoj Alam, Shafiq Joty, and Muhammad Imran. Graph based semi-supervised learning with convolution neural networks to classify crisis related tweets. In Proceedings of the international AAAI conference on web and social media, volume 12. 2018.

[42]

Firoj Alam, Umair Qazi, Muhammad Imran, and Ferda Ofli. Humaid: human-annotated disaster incidents data from twitter with deep learning benchmarks. In Proceedings of the International AAAI Conference on Web and social media, volume 15, 933–942. 2021.