pytorch

Here are 8,021 public repositories matching this topic...

biggoron commented Dec 4, 2019

In the latest version of SciPy, scipy.misc.toimage is no longer available, which breaks the TensorBoard image summary. To load and save an image as PNG, we now have to use PIL.

Here is how I fixed the bug:
1. At the end of main.py, log a uint8 image:
   logger.image_summary(tag, (images * 255).astype(np.uint8), step+1)
2. In the Logger class, package the image as bytes with the PIL library (mode="L
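The second step is cut off in this preview, so the following is only a minimal sketch of what packaging a grayscale image as PNG bytes with PIL might look like. The function name `encode_png` is hypothetical and is not taken from the issue. The sketch assumes a 2-D array that is either uint8 or float in [0, 1]; Pillow infers mode "L" from a 2-D uint8 array.

```python
import io

import numpy as np
from PIL import Image


def encode_png(image):
    """Return PNG-encoded bytes for a 2-D grayscale image (hypothetical helper).

    Assumption: `image` is either uint8 or float with values in [0, 1].
    """
    arr = np.asarray(image)
    if arr.dtype != np.uint8:
        # Scale floats to 0..255, as in the uint8 conversion from step 1.
        arr = (np.clip(arr, 0.0, 1.0) * 255).astype(np.uint8)
    buf = io.BytesIO()
    # A 2-D uint8 array is interpreted by Pillow as mode "L" (8-bit grayscale).
    Image.fromarray(arr).save(buf, format="PNG")
    return buf.getvalue()
```

The returned bytes could then be handed to whatever summary object the Logger builds; that part is not visible in the preview and is left out here.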

zhou-frank commented Sep 24, 2019

Should I scale the multi-GPU learning rate to batch size * single-GPU lr?
For example, the pix2pix model's default is lr = 0.0002 (batch size 1, 1 GPU). When I use a batch size of 16 across four GPUs (4 images per GPU), I think we should set lr = 0.0002 * 16 = 0.0032.
Meanwhile, I would use WarmupScheduler to avoid the exploding-gradient problem.
Thanks for any help.
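The arithmetic in this question can be sketched in a few lines. This is only an illustration of the linear scaling heuristic the poster describes, combined with a simple linear warmup. The helper names `scaled_lr` and `warmup_lr` are hypothetical, this is not the WarmupScheduler mentioned above, and whether linear scaling is the right choice for pix2pix is exactly what the question leaves open.

```python
def scaled_lr(base_lr, base_batch_size, total_batch_size):
    """Scale the learning rate in proportion to the total batch size."""
    return base_lr * total_batch_size / base_batch_size


def warmup_lr(target_lr, step, warmup_steps):
    """Ramp linearly up to target_lr over the first warmup_steps steps.

    `step` is zero-based; after warmup the target rate is returned unchanged.
    """
    if warmup_steps <= 0 or step >= warmup_steps:
        return target_lr
    return target_lr * (step + 1) / warmup_steps


# Values from the question: lr 0.0002 at batch size 1, now 4 GPUs x 4 images.
target = scaled_lr(0.0002, base_batch_size=1, total_batch_size=4 * 4)
schedule = [warmup_lr(target, step, warmup_steps=4) for step in range(6)]
```

With these inputs `target` comes out at about 0.0032, matching the figure in the question, and `schedule` climbs to that value over the first four steps before staying flat.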

Flamefire commented Nov 13, 2019

Judging by the logic in https://github.com/horovod/horovod/blob/38e91bee84efbb5b563a4928027a75dc3974633b/setup.py#L1369, it is clear that before installing Horovod one needs to install the underlying framework(s) (TensorFlow, PyTorch, ...).

This is not mentioned in the installation instructions, which made me think I can install Horovod and then any framework I like (or switch between them) and

TMVector commented Sep 16, 2019

Support for storing large tensor values in external files was introduced in #678, but AFAICT is undocumented.

This is a pretty important feature, functionally, but it's also important for end users who may not realise that they need to move around more than just the *.onnx file.

I would suggest it should be documented in IR.md, and perhaps there are other locations from which it could be s

samsontmr commented Nov 21, 2019

Hi! I tried running generate to evaluate transformer.wmt14.en-fr on the WMT'14 test set but was only able to get a BLEU score of 35.42. I ran prepare-wmt14en2fr.sh and fairseq-preprocess on the data beforehand as well. Could you share the command for evaluating the Transformer ENFR WMT'14 model?

Here is what I'm using:

fairseq-generate data-bin/wmt14_en_fr/ \
    --path checkpoin

TWRogers commented Nov 5, 2019

🐛 Bug

nan values in loss

To Reproduce

Steps to reproduce the behavior:

  1. Run python examples/qm9_nnconv.py
Epoch: 001, LR: 0.001000, Loss: nan, Validation MAE: nan, Test MAE: nan
Epoch: 002, LR: 0.001000, Loss: nan, Validation MAE: nan, Test MAE: nan
Epoch: 003, LR: 0.001000, Loss: nan, Validation MAE: nan, Test MAE: nan
...

Expected behavior

Exp

rbavery commented Aug 14, 2019

I'm building an edited version of the tensorflow-py36-cuda90 dockerfile where I pip install some more packages

# ==================================================================
# module list
# ------------------------------------------------------------------
# python        3.6    (apt)
# tensorflow    latest (pip)
# ================================================================

bersbersbers commented Sep 11, 2019

Platform (like ubuntu 16.04/win10): Windows 10
Python version: 3.7.4, mmdnn==0.2.5

Running scripts: mmconvert -f caffe -df keras -om test

I know that this command is not supposed to run without passing an input file, but the error message is incorrect and should be improved:

mmconvert: error: argument --srcFramework/-f: invalid choice: 'None' (choose from 'caffe', 'caffe2', 'cn
