Scorer model loading #860

graemenail · 2021-04-20T08:55:50Z

Description

When creating scorers, there are two calls to loadItems:

getYamlFromModel loads the 'special' YAML from the model
When the scorer is created, a EncoderDecoder method results in a call to ExpressionGraph::load

This PR addresses these by calling io::loadItems in advance and passing the Item vector when creating scorers.

getYamlFromModel can now be called directly from an Item vector
ScorerWrapper can be constructed from an Item vector.

Amun and Nematus do some specific preprocessing before loading the graph, this PR moves that logic into a load from Items.

Closes #831

List of changes:

Adds --model-mmap as a command line option (uses mio)
Preloads model to Item vector before creating scorers
try-catch when loading model YAML is replaced with if-else
Extended IEncoderDecoder to load from Item vector
Amun and Nematus models load overload for Item vector

Added dependencies: None

How to test

Used marian-conv to test mmap loading
Loading from Items passes regression tests

Checklist

I have tested the code manually
I have run regression tests
I have read and followed CONTRIBUTING.md
I have updated CHANGELOG.md

The interface IEncoderDecoder can now call graph loads directly from an Item Vector.

Scorers are created from an Item vector

snukky

Looks good, only minor comments from me. Thanks! I will ask @emjotde to take a look too.

src/translator/translator.h

src/common/config_parser.cpp

src/translator/translator.h

snukky · 2021-04-28T11:56:02Z

Please add an entry about the model-mmap option to CHANGELOG.

snukky · 2022-01-14T11:46:09Z

This regression test fails for me with this PR:

tests/training/restarting/test_restarting_finished.sh

It fails because logs no longer have the message saying 'Loading model' when trying to restart a training that has already finished. @graemenail, Do you think this is expected now? If so, I will just update the test.

This was tested after rebasing with the current master. That's the only regression test that fails.

graemenail · 2022-01-14T11:56:28Z

This regression test fails for me with this PR:
tests/training/restarting/test_restarting_finished.sh
It fails because logs no longer have the message saying 'Loading model' when trying to restart a training that has already finished. @graemenail, Do you think this is expected now? If so, I will just update the test.

This was tested after rebasing with the current master. That's the only regression test that fails.

Thanks for looking at this @snukky. I believe this is expected, the Loading model from {} log messages contained the filename of the model loaded. In this PR, the model is loaded beforehand, and so the filename is replaced with the vector<io:Items> from that file.

That being said, I think it's a useful log message to include. I'll see where is best to reinstate it, and update this PR.

graemenail · 2022-01-17T14:31:13Z

I have reinstated the log messages for Amun and Nematus, which should fix the regression.
I have also added log messages when loading model ahead of createScorer as well.

graemenail added 15 commits April 19, 2021 13:56

Add MMAP as an option

90d4e3c

Use io::isBin

cbe0ff7

Allow getYamlFromModel from an Item vector

673eac1

ScorerWrapper can now load on to a graph from Item vector

b074fc5

The interface IEncoderDecoder can now call graph loads directly from an Item Vector.

Translator loads model before creating scorers

85882db

Scorers are created from an Item vector

Replace model-config try-catch with check using IsNull

68f064a

Prefer empty vs size

a94ec71

load by items should be pure virtual

d216927

Stepwise forward load to encdec

264a910

logging message

b6ef3d3

nematus can load from items

2f95bea

amun can load from items

8851b43

loadItems in TranslateService

1140443

Remove logging

b49c373

Remove by filename scorer functions

e6574ad

graemenail force-pushed the scorer-model-loading branch from 1a83f84 to e6574ad Compare April 26, 2021 15:11

graemenail marked this pull request as ready for review April 28, 2021 09:14

Replace by filename createScorer

d3623e5

snukky approved these changes Apr 28, 2021

View reviewed changes

src/translator/translator.h Outdated Show resolved Hide resolved

src/common/config_parser.cpp Outdated Show resolved Hide resolved

src/translator/translator.h Outdated Show resolved Hide resolved

src/translator/translator.h Outdated Show resolved Hide resolved

snukky requested a review from emjotde April 28, 2021 11:54

graemenail added 5 commits April 28, 2021 13:24

Explicitly provide default value for get model-mmap

9724900

Fix whitespace

2da869c

CLI option for model-mmap only for translation and CPU compile

400ed83

Ensure model-mmap option is CPU only

0157299

Update CHANGELOG

b0e6e24

graemenail mentioned this pull request May 17, 2021

Scorer model loading browsermt/marian-dev#42

Merged

4 tasks

graemenail and others added 2 commits May 17, 2021 18:03

Remove move on temporary object

e6098ad

Merge branch 'master' into scorer-model-loading

28657df

graemenail added 2 commits January 17, 2022 14:22

Reinstate log messages for model loading in Amun / Nematus

ae1e03c

Add log messages for model loading in scorers

7415ca8

snukky merged commit b29cc07 into marian-nmt:master Jan 18, 2022

jelmervdl mentioned this pull request Jun 15, 2022

Fix guaranteed YAML::InvalidNode when compiled with COMPILE_CPU=Off #944

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scorer model loading #860

Scorer model loading #860

graemenail commented Apr 20, 2021 •

edited

Loading

snukky left a comment

snukky commented Apr 28, 2021

snukky commented Jan 14, 2022

graemenail commented Jan 14, 2022

graemenail commented Jan 17, 2022 •

edited

Loading

Scorer model loading #860

Scorer model loading #860

Conversation

graemenail commented Apr 20, 2021 • edited Loading

Description

How to test

Checklist

snukky left a comment

Choose a reason for hiding this comment

snukky commented Apr 28, 2021

snukky commented Jan 14, 2022

graemenail commented Jan 14, 2022

graemenail commented Jan 17, 2022 • edited Loading

graemenail commented Apr 20, 2021 •

edited

Loading

graemenail commented Jan 17, 2022 •

edited

Loading