Analyzing Multilingual French and Russian Text using NLTK, spaCy, and Stanza

This lesson covers tokenization, part-of-speech tagging, and lemmatization, as well as automatic language detection, for non-English and multilingual text. You’ll learn how to use the Python packages NLTK, spaCy, and Stanza to analyze a multilingual Russian and French text.

Saved in:
Bibliographic Details
Main Author: Ian Goodale
Format: Article
Language:English
Published: Editorial Board of the Programming Historian 2024-11-01
Series:The Programming Historian
Online Access:https://programminghistorian.org/en/lessons/analyzing-multilingual-text-nltk-spacy-stanza
Tags: Add Tag
No Tags, Be the first to tag this record!