Ubuntu
nvidia-cuda-toolkit package

Simple file fails to compile with -O2 or -O3

Bug #1589751 reported by Michael Poole on 2016-06-07

This bug affects 1 person

Affects		Status	Importance	Assigned to	Milestone
	nvidia-cuda-toolkit (Debian)	Fix Released	Unknown	debbugs #822783
	nvidia-cuda-toolkit (Ubuntu)	Fix Released	Undecided	Unassigned

Bug Description

I have a new laptop with Ubuntu 16.04 (xenial) installed onto a new partition, plus nvidia-cuda-toolkit 7.5.18-0ubuntu1 from multiverse.

$ echo '#include <stdio.h>' > dummy.cpp
$ g++ -c -O2 dummy.cpp
$ cp dummy.cpp dummy.cu
$ nvcc -c -O2 dummy.cu
/usr/include/string.h: In function ‘void* __mempcpy_inline(void*, const void*, size_t)’:
/usr/include/string.h:652:42: error: ‘memcpy’ was not declared in this scope
return (char *) memcpy (__dest, __src, __n) + __n;
^

nvcc can compile it at -O0 and -O1, and it is a conforming translation unit (modulo the addition of at least one definition after the #include statement; including such a definition does not affect the error).
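
For reference, a translation unit that includes the definition mentioned above might look like the following sketch. The function name `dummy` is a placeholder and not from the original report; per the report, the error is the same with or without such a definition.

```shell
# Write a minimal .cu file: the include from the report plus one definition.
# The function name `dummy` is a placeholder, not from the original report.
cat > dummy.cu <<'EOF'
#include <stdio.h>
int dummy(void) { return 0; }
EOF

# Compile only if nvcc is available. On the affected setup this is
# expected to fail with the same memcpy error as shown above.
if command -v nvcc >/dev/null 2>&1; then
    nvcc -c -O2 dummy.cu || echo "nvcc -O2 failed"
else
    echo "nvcc not found; skipping compile"
fi
```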


Graham Inggs (ginggs) wrote on 2016-06-07:

Please try the following workaround:

$ nvcc -c -O2 -D_FORCE_INLINES dummy.cu

Changed in nvidia-cuda-toolkit (Ubuntu):
status:	New → Confirmed

Bug Watch Updater (bug-watch-updater) on 2016-06-07

Changed in nvidia-cuda-toolkit (Debian):
status:	Unknown → New

Graham Inggs (ginggs) wrote on 2016-06-07:

Attachment: cuda-glibc223.patch (692 bytes, text/plain)

Please also try editing /usr/include/common_functions.h and commenting out the line containing the following (as per the attached patch):

__cdecl memcpy(void*, const void*, size_t) __THROW;
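
A hedged sketch of that edit using sed, run here against a stand-in file rather than the real header. The stand-in contents and the `hdr` variable are assumptions for illustration; only the memcpy fragment is taken from the comment above, and the attached patch remains the authoritative change.

```shell
# Stand-in for /usr/include/common_functions.h. The contents are
# illustrative; only the memcpy fragment comes from the bug report.
hdr=$(mktemp)
cat > "$hdr" <<'EOF'
extern void* __cdecl memset(void*, int, size_t) __THROW;
extern void* __cdecl memcpy(void*, const void*, size_t) __THROW;
EOF

# Comment out every line containing the memcpy declaration, keeping a
# backup of the unmodified file with a .bak suffix.
sed -i.bak '/__cdecl memcpy(void\*, const void\*, size_t) __THROW;/ s|^|// |' "$hdr"

# Show the result: the memset line is untouched, the memcpy line is commented.
cat "$hdr"
```

To try this on the real header, `hdr` would point at /usr/include/common_functions.h and the sed command would need root privileges; the .bak copy allows the change to be reverted.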

Ubuntu Foundations Team Bug Bot (crichton) on 2016-06-07

tags:	added: patch

Michael Poole (mdpoole) wrote on 2016-06-08:

The workarounds (either one separately, or both) resolve the compile issue for me. Thank you for the pointers!

Graham Inggs (ginggs) wrote on 2016-06-10:

@mdpoole: thanks for confirming!

This seems to be fixed in the CUDA 8.0 release candidate.
I can't see that Nvidia changed anything related to this in common_functions.h, so I suggest using -D_FORCE_INLINES for now.
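
One way to apply the flag only to affected toolkits is sketched below. It assumes that `nvcc --version` prints a line containing `release X.Y`, and that every release before 8.0 needs the workaround; this thread only establishes that 7.5.18 is affected and that the 8.0 release candidate is not. The helper names `needs_force_inlines` and `release_of` are hypothetical.

```shell
# Succeed (exit status 0) if the given "X.Y" release is older than 8.0.
# Assumption: all pre-8.0 releases need the workaround.
needs_force_inlines() {
    major=${1%%.*}
    [ "$major" -lt 8 ]
}

# Read nvcc --version output on stdin and print the "X.Y" release number.
# Assumption: the output contains a phrase of the form "release X.Y".
release_of() {
    sed -n 's/.*release \([0-9][0-9]*\.[0-9][0-9]*\).*/\1/p'
}

NVCCFLAGS="-O2"
if command -v nvcc >/dev/null 2>&1; then
    rel=$(nvcc --version | release_of)
    if [ -n "$rel" ] && needs_force_inlines "$rel"; then
        NVCCFLAGS="$NVCCFLAGS -D_FORCE_INLINES"
    fi
fi
echo "NVCCFLAGS=$NVCCFLAGS"
```

The resulting flags could then be passed as `nvcc -c $NVCCFLAGS dummy.cu`.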

Bug Watch Updater (bug-watch-updater) on 2016-08-01

Changed in nvidia-cuda-toolkit (Debian):
status:	New → Fix Released

Graham Inggs (ginggs) on 2016-08-29

Changed in nvidia-cuda-toolkit (Ubuntu):
status:	Confirmed → Fix Released
