Data Cleaning in Natural Language Provessing
M4A•منزل الحلقة
Manage episode 311353555 series 3111581
المحتوى المقدم من Sarvesh Bhatnagar. يتم تحميل جميع محتويات البودكاست بما في ذلك الحلقات والرسومات وأوصاف البودكاست وتقديمها مباشرة بواسطة Sarvesh Bhatnagar أو شريك منصة البودكاست الخاص بهم. إذا كنت تعتقد أن شخصًا ما يستخدم عملك المحمي بحقوق الطبع والنشر دون إذنك، فيمكنك اتباع العملية الموضحة هنا https://ar.player.fm/legal.
In this episode we talk about various steps in data cleaning process in Natural Language Processing. Data cleaning is almost a given whenever you want to perform natural language processing onto the given text. Data cleaning in natural language processing involves tokenization, lowering the words, lemmatization, and so on. Aside from talking about that we also talk about how you can implement those briefly. To install codesnip mentioned in the last part open your terminal and write pip install codesnip --- Send in a voice message: https://podcasters.spotify.com/pod/show/sarvesh-bhatnagar/message
…
continue reading
22 حلقات