Guide for classifying sentences into Intents


For your xBot to be well trained, we recommend the following best practices:

  1. Classify at least 5 sentences per intent.

  2. Classify at most 30 sentences per intent.

  3. For every 5 classified sentences, add at least 1 sentence to your test set, so that you can compare the results after a classification session.

  4. Keep the number of classified sentences roughly identical among all intents.

  5. If a sentence expresses several intents, it is wise to archive it.

  6. Classify relevant, distinct sentences. Your xBot needs a clean sentence base to learn correctly. For example, if the sentence “What’s the weather in San Francisco?” is already classified, then it is better to classify “Will it be sunny tomorrow in New York?” than “What’s the weather in New York?”. This last sentence is too close to the first one and should not be classified in the same intent.

  7. Classify only sentences without spelling and grammar errors. There is no need to train on misspellings: your xBot will be smart enough to deal correctly with typos. For example, classify the phrase “if a part does not fit, can I send it back?” rather than “if a prat is not fiting, can I send it back?”.

  8. Only classify sentences that correspond to the response / scenario of this intent. No need to “confuse” your xBot in its training! 😊

  9. Avoid acronyms and abbreviations, unless they are clearly used and understood by your users.

  10. The sentences collected on the “Pending” page (Sentences awaiting classification) are unique, in the sense that any identical phrase (including the same keystrokes, spaces and punctuation) is filtered by our engine, so you will never classify the same phrase twice.

  11. It is better to archive a sentence than to delete it. Archiving ensures that the unwanted sentence never appears again as a sentence awaiting classification on the “Pending” page.
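
The numeric rules above (1 to 4) can be checked before a classification session with a small script. The sketch below is only an illustration and is not part of xBot: the function name check_intents, the constants, and the dictionary layout (intent name mapped to a list of sentences) are all hypothetical. It also assumes that the test set is counted per intent and that "roughly identical" means no intent has more than twice as many sentences as the smallest one; neither assumption comes from this guide.

```python
MIN_PER_INTENT = 5   # rule 1: at least 5 sentences per intent
MAX_PER_INTENT = 30  # rule 2: at most 30 sentences per intent
TEST_RATIO = 5       # rule 3: 1 test sentence per 5 classified sentences
BALANCE_FACTOR = 2   # rule 4: assumed tolerance, not defined by the guide


def check_intents(training, test):
    """Return a list of warnings for the given sentence sets.

    training and test are hypothetical dicts: intent name -> list of sentences.
    """
    warnings = []
    for intent, sentences in training.items():
        n = len(sentences)
        if n < MIN_PER_INTENT:
            warnings.append(f"{intent}: {n} sentences, need at least {MIN_PER_INTENT}")
        if n > MAX_PER_INTENT:
            warnings.append(f"{intent}: {n} sentences, keep at most {MAX_PER_INTENT}")
        # One test sentence is expected for each full group of 5 classified ones.
        needed = n // TEST_RATIO
        have = len(test.get(intent, []))
        if have < needed:
            warnings.append(f"{intent}: {have} test sentences, need at least {needed}")

    # Compare the largest and smallest intents to detect imbalance.
    counts = [len(s) for s in training.values()]
    if counts and max(counts) > BALANCE_FACTOR * min(counts):
        warnings.append(
            f"unbalanced intents: smallest has {min(counts)}, largest has {max(counts)}"
        )
    return warnings


if __name__ == "__main__":
    training = {"weather": ["..."] * 10, "returns": ["..."] * 3}
    test = {"weather": ["..."] * 2}
    for line in check_intents(training, test):
        print(line)
```

An empty list means none of the four rules is violated under these assumptions. The thresholds are plain constants so that they can be adjusted to match your own reading of "roughly identical".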


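Rule 6 asks for distinct sentences within an intent. One rough way to spot candidates that are too close to an already classified sentence is a character-level similarity score, sketched here with Python's standard difflib module. This is an assumption-laden illustration, not how xBot measures closeness: the function names and the 0.75 threshold are hypothetical, and a character-based score only catches near-identical wording, not sentences that mean the same thing in different words.

```python
from difflib import SequenceMatcher


def normalize(sentence):
    """Lowercase and collapse whitespace so trivial differences are ignored."""
    return " ".join(sentence.lower().split())


def similarity(a, b):
    """Character-level similarity between 0.0 and 1.0 (1.0 means identical)."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def too_close(candidate, classified, threshold=0.75):
    """Return the classified sentences that the candidate resembles too much.

    threshold is a hypothetical cut-off; tune it on your own sentences.
    """
    return [s for s in classified if similarity(candidate, s) >= threshold]


if __name__ == "__main__":
    classified = ["What's the weather in San Francisco?"]
    for candidate in (
        "What's the weather in New York?",
        "Will it be sunny tomorrow in New York?",
    ):
        matches = too_close(candidate, classified)
        verdict = "too close to an existing sentence" if matches else "distinct enough"
        print(f"{candidate} -> {verdict}")
```

With this threshold, the New York weather question from rule 6 is flagged as too close to the San Francisco one, while the "sunny tomorrow" question is not. A flagged sentence is only a hint to review it by hand before classifying or archiving it.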

Documented/reviewed 27/07/2017 7:12:55 AM