[PATCH] D116052: [clang] fix out of bounds access in an empty string when lexing a _Pragma with missing string token

Alex Lorenz via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Mon Dec 20 12:31:17 PST 2021


arphaman created this revision.
arphaman added reviewers: ravikandhadai, yln, bro4all.
Herald added a subscriber: ributzka.
arphaman requested review of this revision.
Herald added a project: clang.

The lexer can attempt to lex a _Pragma and crash with an out of bounds string access when it's

  lexing a _Pragma whose string token is an invalid buffer, e.g. when a module header file from which the macro

expansion for that token was deleted from the file system.


Repository:
  rG LLVM Github Monorepo

https://reviews.llvm.org/D116052

Files:
  clang/lib/Lex/Pragma.cpp
  clang/test/Preprocessor/pragma-missing-string-token.c


Index: clang/test/Preprocessor/pragma-missing-string-token.c
===================================================================
--- /dev/null
+++ clang/test/Preprocessor/pragma-missing-string-token.c
@@ -0,0 +1,27 @@
+// RUN: rm -rf %t
+// RUN: split-file %s %t
+
+// RUN: %clang_cc1 -emit-module -x c -fmodules -I %t/Inputs -fmodule-name=aa %t/Inputs/module.modulemap -o %t/aa.pcm
+// RUN: rm %t/Inputs/b.h
+// RUN: not %clang_cc1 -E -fmodules -I %t/Inputs -fmodule-file=%t/aa.pcm %s -o - -fallow-pcm-with-compiler-errors 2>&1 | FileCheck %s
+
+//--- Inputs/module.modulemap
+module aa {
+    header "a.h"
+    header "b.h"
+}
+
+//--- Inputs/a.h
+#define TEST(x) x
+
+//--- Inputs/b.h
+#define SUB "mypragma"
+
+//--- test.c
+#include "a.h"
+
+_Pragma(SUB);
+int a = TEST(SUB);
+
+// CHECK: int a
+// CHECK: 1 error generated
Index: clang/lib/Lex/Pragma.cpp
===================================================================
--- clang/lib/Lex/Pragma.cpp
+++ clang/lib/Lex/Pragma.cpp
@@ -263,7 +263,12 @@
   }
 
   SourceLocation RParenLoc = Tok.getLocation();
-  std::string StrVal = getSpelling(StrTok);
+  bool Invalid = false;
+  std::string StrVal = getSpelling(StrTok, &Invalid);
+  if (Invalid) {
+    Diag(PragmaLoc, diag::err__Pragma_malformed);
+    return;
+  }
 
   // The _Pragma is lexically sound.  Destringize according to C11 6.10.9.1:
   // "The string literal is destringized by deleting any encoding prefix,


-------------- next part --------------
A non-text attachment was scrubbed...
Name: D116052.395509.patch
Type: text/x-patch
Size: 1431 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/cfe-commits/attachments/20211220/39ccccd9/attachment.bin>


More information about the cfe-commits mailing list