Skip to content
#

summarization

Here are 582 public repositories matching this topic...

haystack
ZanSara
ZanSara commented Jun 13, 2022

Currently Crawler does not let JavaScript run on crawled pages. This leads to issues for websites that rely on fully dynamically loaded content that is not present in the DOM at loading time, but is fetched with JS immediately after.

We should investigate how to allow JavaScript to run on the crawled pages with Selenium and add a switch to the Crawler that allows users to do this.

Based on

type:feature good first issue Contributions wanted! journey:intermediate
DayalStrub
DayalStrub commented Jul 7, 2021

Intro

I am getting TypeError: can not serialize 'BaseTextRank' object when trying to use spaCy's multiprocessing in nlp.pipe with a textrank pipeline component.

Sorry if this a known/expected feature/limitation - I couldn't find anything by searching repo. I generally find (spaCy's) multiprocessing a bit temperamental anyhow, but this seems to just not work.

_PS. thanks for all

bug enhancement help wanted good first issue
atulkum
atulkum commented Nov 30, 2018

Need help for retraining and cross validation and see if the ROUGE score matches exactly (or better) with the numbers reported in the paper.
I just train for 500k iteration (with batch size 8) with pointer generation enabled + coverage loss disabled and next 100k iteration (with batch size 8) with pointer generation enabled + coverage loss enabled.

It would be great if someone can help re-r

help wanted good first issue

Improve this page

Add a description, image, and links to the summarization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the summarization topic, visit your repo's landing page and select "manage topics."

Learn more