# reinforcement-learning

Here are 4,234 public repositories matching this topic...

adabghi commented Jun 19, 2019

Description

I trained a transformer model for English-to-French translation. It works well when I give it a single sentence to translate. However, when I give it a whole document (or just a paragraph with many sentences), it returns a very poor translation and sometimes skips sentences.

Has anyone encountered this kind of problem?

tensorlayer
0xtyls commented Jan 3, 2020

I understand that these two Python files show two different methods of constructing a model. The original n_epoch is 500, which works perfectly for both files. But if I change n_epoch to 20, only tutorial_mnist_mlp_static.py achieves a high test accuracy (~0.97). The other file, tutorial_mnist_mlp_static_2.py, only gets 0.47.

The models built from these two files look the same to me (the s

ZenithWang commented Apr 4, 2019
    def _discount_and_norm_rewards(self):
        # discount episode rewards
        discounted_ep_rs = np.zeros_like(self.ep_rs)
        running_add = 0
        for t in reversed(range(0, len(self.ep_rs))):
            running_add = running_add * self.gamma + self.ep_rs[t]
            discounted_ep_rs[t] = running_add

        # normalize episode rewards
        discounted_ep_rs -= 
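
The excerpt above is cut off partway through the normalization step. As a rough, self-contained sketch of the general technique (discounted returns followed by mean/standard-deviation normalization), and not necessarily the code from the original issue, something like the following could be used. The function name `discount_and_normalize`, the `eps` guard, and the explicit float cast are assumptions added for illustration.

```python
import numpy as np


def discount_and_normalize(rewards, gamma=0.99, eps=1e-8):
    """Return discounted, normalized returns for one episode.

    Hypothetical standalone helper; `eps` is an assumed guard against
    division by zero when all returns are equal.
    """
    # Cast to float so integer reward lists are not truncated and the
    # in-place arithmetic below does not fail on an integer array.
    rewards = np.asarray(rewards, dtype=np.float64)
    discounted = np.zeros_like(rewards)

    # Accumulate the discounted return backwards from the last step.
    running_add = 0.0
    for t in reversed(range(len(rewards))):
        running_add = running_add * gamma + rewards[t]
        discounted[t] = running_add

    # Normalize to zero mean and (approximately) unit standard deviation.
    discounted -= discounted.mean()
    discounted /= discounted.std() + eps
    return discounted
```

One design note: `np.zeros_like` on a list of integer rewards produces an integer array, so casting to float first avoids silently truncated returns.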
BlackTentacle commented Oct 26, 2018

I tried some RNN regression learning based on the code in the "PyTorch-Tutorial/tutorial-contents/403_RNN_regressor.py" file, which did not work for me at all.

According to an accepted answer on Stack Overflow (https://stackoverflow.com/questions/52857213/recurrent-network-rnn-wont-learn-a-very-simple-function-plots-shown-in-the-q?noredirect=1#comment92916825_52857213), it turns out that the li

trax
sermolin commented Jan 6, 2020

The notebook seems to use a pre-trained model from https://github.com/awslabs/amazon-sagemaker-examples/blob/master/introduction_to_applying_machine_learning/xgboost_customer_churn/xgboost_customer_churn.ipynb. The notebook should refer to the data schema from the above example when discussing generated traffic and suggested constraints.

Cell Deploy the model to Amazon SageMaker. THIS REQU

icoxfog417 commented Jan 3, 2020

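In a nutshell

A study on correcting image distortion. Rather than working on the whole image, it crops out patches together with their surrounding regions, stitches their gradients together to build the gradient flow that would be obtained from the full image, and then generates the image by sampling from that flow.

Paper link

https://arxiv.org/abs/1909.09470

Authors / Affiliations

Xiaoyu Li, Bo Zhang, Jing Liao, Pedro V. Sander

  • Hong Kong UST
  • Microsoft Research Asia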
  • City Unive
