Announcement

**Blama** · 14 Sep 2018, 21:57

Hi Isomorphic,

let me modify the example in order to improve two other use cases:

lookup country "Germany" to 455 for field country_id
lookup country "Austria" to 446 for field country_id
lookup country "Switzerland" to 447 for field country_id
lookup temperature "cold" to 1 for field temperature_id
lookup temperature "warm" to 2 for field temperature_id
lookup temperature"hot" to 3 for field temperature_id
lookup product "ABC" 1001 for field product1_id
lookup product "DEF" 1002 for field product1_id
lookup product "ZYX" 1055 for field product2_id (different field also linking to product)
lookup product "WVU" 1054 for field product2_id (different field also linking to product)
....

If you ever implement the feature from this thread, it would be better if the cache is maintained as:
"uploadFieldName" - "key value" - "resulting id" (because of the potentially different results per uploadField that link to the same DataSource)

Also, if a lookup value is not found, this should be tracked as well, so that the next time the BatchUploader wants to request a value, because it does not have a stored value for it already, it does directly know that the query won't return any results. (e.g. if in all CSV entries the ISO code of a country is used instead of the name, that is stored in the DB). In this case the BatchUpload could directly add an error for the row/field in question, without issuing a query.

Best regards
Blama

**Isomorphic** · 22 Sep 2018, 05:15

This is indeed useful suggestion, thank you. It is implemented in 12.x versions and will be available for download in nightly builds since Sep 23 (tomorrow).

**Blama** · 14 Oct 2018, 10:20

Hi Isomorphic,

can you explain how the cache works (because of this thread)? I assume it is built in into 12.0p DataImport and only affects a single run of "upload", correct?
If not, is there a setting to disable the cache?

Thank you & Best regards
Blama

**Isomorphic** · 15 Oct 2018, 01:19

Yes, your assumption is correct. Values are cached in a context of single run of batchUpload. Every new upload will have its own cache built during the process.

**Blama** · 3 Jul 2019, 14:54

Hi Isomorphic,

as you might have noticed I posted a lot about BatchUploader lately. I also had a closer look at the SQL statements sent and how this improvement works.
It does reduce the amount of fetches a lot, as expected. Unfortunately, you only store the looked up ID if you find one.
A text without an entry in the parent table does not lead to this negative "key not present"-information stored in the result cache.
Instead you do this query over and over again. Of course, here the same optimization can be applied.

It will lead to great further improvements in cases of bad upload data or a wrongly configured includeFrom in the field mentioned as displayField in the ID field with importStrategy="display".

Best regards
Blama

**Blama** · 6 Apr 2020, 05:08

Hi Isomorphic,

did you have a chance to look at this optimization as well?

Thank you & Best regards
Blama

**Isomorphic** · 7 Apr 2020, 23:05

We did, although this was implemented in 12.1 only.

**Blama** · 7 Apr 2020, 23:28

Hi Isomorphic,

OK, thank you, perfect. We'll switch hopefully soon anyway.

Best regards
Blama

Announcement

Enhancement: BatchUploader lookup result cache to increase performance

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment