Error when running Evaluate : axis 1 is out of bounds for array of dimension 0 #230
Comments
Hi @Julmatap, thanks for the nice words. Could you please paste your directory structure (data paths) and the content of the
Sure ! (Thanks for your quick reply !) Based on yours, here's mine :
And the parameters content of my neptune.yaml looks like this:

name: mapping_challenge_baseline
parameters:
  # Data Paths
  data_dir: data_raw

Hope I understood what you asked me.
I'll try to use the exact same directory structure as you did in your example and correct the neptune.yaml accordingly to see if it resolves the problem. I didn't find where in the code this kind of structure would pose a problem, but since I'm a beginner that may be normal lmao.
Mhm, so do you have
Dear Jakub,
I changed my neptune.yaml to look like this:

# Data Paths
data_dir: data/raw

However, when I run evaluate, it still throws the exact same error.
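Before re-running evaluate, it can help to confirm that the configured paths actually exist. The snippet below is only a sketch: `missing_paths` and `EXPECTED` are hypothetical names, the paths are the ones mentioned in this thread, and it assumes they are all relative to the directory you run `main.py` from, which may not match your setup.

```python
from pathlib import Path


def missing_paths(base_dir, relative_paths):
    """Return the entries of relative_paths that do not exist under base_dir."""
    base = Path(base_dir)
    return [rel for rel in relative_paths if not (base / rel).exists()]


# Paths taken from this thread (assumption: all relative to the repo root).
# Adjust them to whatever your own neptune.yaml points at.
EXPECTED = [
    "data/raw",
    "data_experiments/mapping_challenge_baseline/transformers/unet",
    "data_experiments/mapping_challenge_baseline/transformers/scoring_model",
]

if __name__ == "__main__":
    for rel in missing_paths(".", EXPECTED):
        print("missing:", rel)
```

If nothing is printed, every listed path exists, so a wrong path is less likely to be the cause.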
Could you try and run the training pipeline first?
Could it be linked to my environment? I had to change versions and some other things for it to work (I'm on Windows 10 x64). I couldn't install Torch v0.3.1 and other packages via your environment.yml. Here is my complete env for information:
Ok, I see. I think that is the problem. Try:

pip install torch==0.3.1
I think you need to downgrade Python for this. The simplest way would be

conda env create -f environment.yml

but as I understand that isn't working. Instead, you can create a Python 3.6 environment:

conda create -n py_36_env python=3.6

activate it:

conda activate py_36_env

and then install the dependencies from
Dear @jakubczakon, thank you for your time helping me! It was indeed a problem with the Torch version. However, I still couldn't install Torch==0.3.1, even after downgrading Python and so on. I tried installing it manually, but I still had the same error as in my previous screenshot. I also tried finding a .whl, but there was none for v0.3.1 win64. Right now the evaluation has been running for more than 5 hours, and I can tell it's running from neptune and my memory usage. (But it's not using the GPU despite my GPU being CUDA compatible, is that normal?) Once it is finished and I am sure it is working, I will post my conda list, in case it helps someone else facing this issue. Again, thank you very much for your time 👍
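A quick way to see which Torch version the environment actually picked up, and whether Torch can see the GPU, is sketched below. `torch_status` is a hypothetical helper name; it uses only `torch.__version__` and `torch.cuda.is_available()`, and it reports a missing install rather than failing.

```python
def torch_status():
    """Report the installed torch version and whether CUDA is usable."""
    try:
        import torch
    except ImportError:
        # torch is not installed (or failed to import) in this environment
        return {"installed": False, "version": None, "cuda": False}
    return {
        "installed": True,
        "version": torch.__version__,
        "cuda": bool(torch.cuda.is_available()),
    }


if __name__ == "__main__":
    print(torch_status())
```

If `cuda` comes back as `False`, Torch cannot use the GPU in that environment, which would be one possible explanation for a run that uses only the CPU.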
That is awesome, thank you @Julmatap! You can see if it is running by going to the
Also, it seems that there is a message in the terminal that explains why the GPU is not logged -> sending GPU metrics was blocked by the system. This is very much unexpected.
Unfortunately there is nothing, only "No charts here" and "waiting for data". I think there's a problem: someone else launched an experiment 3 minutes ago and already has some data.
You are running the evaluation now, correct? Also, you can go to your terminal and run nvidia-smi to see if your GPU is actually doing something.
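The same check can be scripted so it is easy to repeat while the evaluation runs. This is a sketch under assumptions: `gpu_utilization` is a hypothetical helper, it relies on `nvidia-smi` being on the PATH, and it returns `None` when the tool is missing or fails.

```python
import shutil
import subprocess


def gpu_utilization():
    """Return one line per GPU (name, utilization, memory used), or None."""
    exe = shutil.which("nvidia-smi")
    if exe is None:
        return None  # nvidia-smi not found on PATH
    result = subprocess.run(
        [exe, "--query-gpu=name,utilization.gpu,memory.used",
         "--format=csv,noheader"],
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        universal_newlines=True,  # works on Python 3.6 as used in this thread
    )
    if result.returncode != 0:
        return None
    return result.stdout.strip().splitlines()


if __name__ == "__main__":
    print(gpu_utilization())
```

A utilization that stays near zero while the pipeline is running suggests the work is happening on the CPU.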
Hi Jakub, yes, I was running the evaluation. I tried running the training today, and there are still no charts. I guess I'll just start again on a Linux VM. Has anyone already succeeded in running it on Windows? I can't find any information about the operating system blocking the request.
I see @Julmatap. Unfortunately, I don't know if anyone succeeded on Windows -> everyone I know who used this repo was using Linux.
I confirm it is working on Linux. I just did an evaluation and it worked, I just had to wait a long time at "steps >>> step unet transforming...". Again, thanks for your time @jakubczakon! 👍
That is awesome! I only wish I could have been of more help, but I am proud of you for getting it done.
Hello everyone,
Firstly I would like to say thank you for this amazing project and the release of your code.
I am very new to programming and ML and I wanted to try your project.
I downloaded the data, installed the environment with the latest versions, followed the steps to reproduce the results, looked at the issues, and tried to solve the errors I got by myself as much as possible, but right now I'm stuck.
I first prepared masks & metadata, then I downloaded your best model weights, created the transformers folder in data_experiments/mapping_challenge_baseline and copied "unet" and "scoring_model" there. I then changed the values in neptune.yaml as suggested and tried to evaluate by running: "python main.py evaluate --pipeline_name unet"
Here is what I got :


I hope I gave you enough information and that you can help me resolve this issue.
Again, thank you guys !
PS: I saw that issue #228 also has this problem, but I don't have the "valid data is None" message, as you can see in the attached screenshots.
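For context, the message in the title is the text NumPy produces when an operation asks for axis 1 of a zero-dimensional array (a bare scalar rather than a 2-D array). The sketch below only reproduces that message in isolation. `row_sums` and `describe` are hypothetical names, and nothing here is taken from the project's code, so it does not show where the error arises in the pipeline.

```python
import numpy as np


def row_sums(values):
    """Sum along axis 1, which only exists for arrays with 2+ dimensions."""
    return np.asarray(values).sum(axis=1)


def describe(values):
    """Return the row sums, or the error text NumPy raises for a bad axis."""
    try:
        return row_sums(values).tolist()
    except ValueError as exc:  # NumPy's AxisError subclasses ValueError
        return str(exc)


if __name__ == "__main__":
    print(describe([[1, 2], [3, 4]]))  # a 2-D input has an axis 1
    print(describe(3.0))               # a scalar becomes a 0-d array
```

In practice this usually means a step received a scalar or empty result where it expected a batch of arrays, so inspecting the shape of the inputs at the failing line of the traceback is a reasonable first step.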