postagging

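Hello, I'm a student who has been making good use of the Noun extractor!
I'm writing because a question came up while using it.
How large does the sentence length of the doublespace txt file used as input need to be in order to cover a wide range of eojeols?
I made a few samples and tried them, and it seems that the less input data there is, the worse the nouns are extracted. (That is to be expected for a model based on unsupervised learning, though, haha.)

For example, when num sentence is about 10,000, the output reports that 50~55% of eojeols are covered.
[Noun Extractor] 54.52 % eojeols are covered

When num sentence is about 100,000, 60~65% of eojeols were covered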

Here are 47 public repositories matching this topic...

lovit / soynlp

Question about Noun extractor covered eojeols

Words are not output even when SimpleTagger has a postprocessor

smoothnlp / SmoothNLP

bastienbot / nlp-js-tools-french

MagedSaeed / farasapy

banyh / PyStanfordNLP

lancopku / SAPO

adhaamehab / arabicnlp

salsowelim / tawseem

kuangmeng / GraduationProject

made2591 / cognitive-system-postagger

StarlangSoftware / EnglishPosTagger

codekhal / Inshorts-NLP

Ricozero / pycrfsuite-segpos

jasonhavenD / DJH-Chunking-Encoding-Algorithm

StarlangSoftware / EnglishPosTagger-CPP

ajitrajasekharan / root

Kimonokimo / NLP-comment-project

saminens / HMM_Tagger

ArmandGiraud / quest2keys

sravya2694 / Natural-Language-Processing

StarlangSoftware / EnglishPosTagger-Py

lefaivre / eventExtractor

codewithus / Open-Domain-Question-Answering

sharmilathirumalai / Twitter-Sentimental-Analysis

munir-bd / Korean-POS-Tagger-LSTM

DigitalTools / hr-nlp

prachi1801 / POS-Tagging

Some1OutThere / NLP_Sentence_chunking

VhaijaiyanthishreeVenkataramanan / Natural-Language-Processing

DEEZZU / Quora-Question-Classifier
