Skip to content
#

text-classification

Here are 1,166 public repositories matching this topic...

heatherbshapiro
heatherbshapiro commented Aug 14, 2019

Description

There is a readme for the repo metrics subfolder under tools. It needs general review to make sure it is accurate.

Other Comments

Principles of NLP Documentation
Each landing page at the folder level should have a ReadMe which explains -
○ Summary of what this folder offers.
○ Why and how it benefits users
○ As applicable - Documentation of using it, brief d

ClemDoum
ClemDoum commented Dec 14, 2018

Problem description

Short description

In certain conditions some CRF tags transitions can by missing after the data augmentation or can be "underrepresented".
We must ensure that all possible tags transitions are in the augmented dataset so that inference does not fail systematically on those examples

Example

Given a dataset with 1 intent and 3 slots: slot_1, slot_2, `slot

zhangsh950618
zhangsh950618 commented Aug 20, 2018

we use this code to build our project, but we found the acc dropped. So , we review the code, and find the following issues.

  1. This code did not implemented "mask" in the "AttLayer" class.
  2. we believe "Dense layer" should implemented in the class "AttLayer", instead of using "Dense" out of the class
  3. lost "Activation function" in the Dense layer

We made the above changes,and the acc

dongcy-AHU
dongcy-AHU commented Sep 18, 2019

读了你的源码,关于这些矩阵names = ['x','y','tx','ty','allx','ally','adj']分别代表什么?
比如allx => the feature vectors of both labeled and unlabeled training docs/words,你的实验数据不都是有标签的嘛,为什么会有unlabeled training docs?
你在论文中说你的节点初始化为one-hot向量,而我在代码中看到你用word嵌入的平均作为doc嵌入输入,这是为什么?the one-hot labels of the labeled training docs又代表什么?
关于这些x,y,tx,ty等等,我比较难懂,请求您抽出时间为我解答,非常感谢

EagleYing
EagleYing commented Nov 10, 2018

(tensorflow) F:\Postgraduate\KaggleLearning\multi-class-text-classification-cnn-rnn-master\multi-class-text-classification-cnn-rnn-master>python predict.py ./t
rained_results_1541818386/ ./data2/samples.csv
D:\Anaconda\anaconda\envs\tensorflow\lib\site-packages\gensim\utils.py:1212: UserWarning: detected Windows; aliasing chunkize to chunkize_serial
warnings.warn("detected Windows; aliasing c

自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子主干提取(mainpart),中文汉语短文本相似度,文本特征工程,keras-http-service调用

  • Updated Jan 22, 2020
  • Python

基于Pytorch和torchtext的自然语言处理深度学习框架,包含序列标注、文本分类、句子关系、文本生成、结构分析、五大功能模块,已实现了命名实体识别、中文分词、词性标注、语义角色标注、情感分析、关系抽取、语言模型、文本相似度、文本蕴含、依存句法分析、词向量训练、聊天机器人、机器翻译、文本摘要等功能。框架功能丰富,开箱可用,极易上手!基本都是学习他人实现然后自己修改融合到框架中,没有细致调参,且有不少Bug~

  • Updated Jan 25, 2020
  • Python

Improve this page

Add a description, image, and links to the text-classification topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the text-classification topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.