What do you understand by tokenization?
Tokenization is the process of breaking a string of text into smaller pieces, called tokens, such as words, keywords, phrases, and symbols.
- Tokens can be individual words, phrases, or even whole sentences. During tokenization, some characters, such as punctuation marks, are typically discarded.
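
A minimal sketch of word-level tokenization in Python, using a simple regular expression; the pattern shown is one illustrative choice among many possible tokenizers:

```python
import re

def tokenize(text: str) -> list[str]:
    """Split text into word tokens, discarding punctuation and whitespace."""
    # Keep runs of letters, digits, and apostrophes; everything else is dropped.
    return re.findall(r"[A-Za-z0-9']+", text)

sentence = "Tokenization splits text into tokens, doesn't it?"
print(tokenize(sentence))
# ['Tokenization', 'splits', 'text', 'into', 'tokens', "doesn't", 'it']
```

Note how the comma and question mark are discarded, while words (including the contraction) are kept as individual tokens.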