# LAW OFFICES FINNEGAN, HENDERSON, FARABOW, GARRETT & DUNNER, L.L.P. 1300 | Street, N.W. Washington, DC 20005-3315

Telephone (202) 408-4000

Facsimile (202) 408-4400

# FACSIMILE TRANSMITTAL

| TO:        |                                                                        | FROM:                           |                          |  |
|------------|------------------------------------------------------------------------|---------------------------------|--------------------------|--|
| Name:      | Prof. Pinaki Mazumder                                                  | Name:                           | C. Gregory Gramenopoulos |  |
| Firm:      | Dept. of Elec. Engr. and<br>Computer Science<br>University of Michigan | Phone No.:                      | (202) 408-4263           |  |
| Fax No.:   | 1-734-763-1503                                                         | Fax <b># Verifi</b> e           | ed by:                   |  |
| Phone No.: | 1-734-763-2107                                                         | # Pages (incl. this): <u>13</u> |                          |  |
| Subject:   |                                                                        | Date: Decen                     | nber 6, 2000             |  |
|            | Confirmation C                                                         | ony to Follow                   | c                        |  |

Pinaki-

Attached is an English translation of the German's court decision (re: invalidating the Siemens patent based on your publication). Thank you once again for all of your hard work. Your input was invaluable to our victory.

Best personal regards,

Grea

If there is a problem with this transmission, notify fax room at (202) 408-4174 or the souder at the number above.

This facsimile is intended only for the individual to whom it is an iressed and may contain information that is privileged, confidential, or exempt from disclosed to under applicable law. If you have received this facsimile in even, please notify the sender immediately by telephone (collect), and return the original message by first-class mail to the above address.

202 408 4400 TO 17347631503 P.02

102/007. 2. 20001 9:25AMX +40TTC PALO ALTO

- -----

TER MEER & PARTNER

NO. 1713 P. 5 2024

## - 1 -

# +FEDERAL PATENT COURT

# IN THE NAME OF THE PEOPLE

JUDGHENT

2 Ni 26/98 (ED) (File reference)

Pronounced on 25 May 2000 Bandel Senior Court Secretary as Registrar of the Administration Office

# In the matter of patent annulment

Hyundai Micro Electronics Germany GmbH, Jakob-Kaiser-Straße 12, 47877 Willich, legally represented by its Managing Director, Jae Hyo Park, ibid.,

Plaintiff,

- Trial counsel: Patent attorneys ter Meer, Steinmeister & Partners, Mauerkircherstraße 45, 81679 Munich -

# versus

Siemens AG, Berlin and Munich, legally represented by its Board of Directors, the latter represented by the Chairman of the Board, Heinrich v. Pierer, ibid.,

Défendant,

# relating to European Patent 0 527 866 (DE 591 01 394)

the 2nd Senate (Annulment Senate) of the Federal Patent Court, on the basis of the oral proceedings on 25 May 2000, with the participation of Presiding Judge Baumgärtner and Judges Dipl.-Ing. Bertl and Dipl.-Ing. Prasch, Püschel and Dipl.-Ing. Schuster . . 02 OCT. 2. 2000 . 9:26AMAX -4TTC PALO ALTO

÷ 

P.03

NO. 1713 P 16 2015

- 2 -

TER MEER & PARTNER

has reached the following verdict:

- 1. European Patent 0 527 866 is declared to be null and void with effect for the sovereign territory of the Federal Republic of Germany within the scope of Patent Claim 1.
- 2. The Defendant shall bear the costs of the legal proceedings.
- 3. The judgment is provisionally enforceable, with respect to the costs for the Plaintiff, against security amounting to DM 40,000.00.

## Facts of the case:

The Defendant is the registered proprietor of European Patent 0 527 866 (contested patent), which was applied for on 8 May 1991 claiming the priority of European Patent Application 90108836 of 10 May 1990 and was granted with effect for the Federal Republic of Germany as well. It relates to an integrated semiconductor memory with parallel test capability and redundancy method. The contested patent, which is filed by the German Patent and Trademark Office under the number 591 01 394, comprises 80 patent claims, of which Patent Claim 1, which is the only patent claim contested by the action for annulment, has the following wording, translated from German, the language of the proceedings:

> "1. Integrated semiconductor memory having a parallel test device (PT) and U block groups  $(GP_{u=1...v})$ , in which, in a test operating mode, a plurality of groups of M memory cells (MC) can be tested simultaneously in terms of their functioning, each group being arranged along a

CCT. 2. 2000 9:264W - TTC PALO ALTO

NO. 1713 P. 17 2016

- 3 -

respective word line (WL) within a respective one of the U block groups (GP<sub>u</sub>), and in which the data read out in the process can be evaluated by the parallel test device (PT), characterized by

- the following features:
  - the parallel test device (PT) serves for writing in and evaluating data to be written to and read from the semiconductor memory,
  - the result of the evaluation, separately for each group of M memory cells (MC), is present on I/O data lines (IO1, IO2, IO3) of the semiconductor memory."

With its partial action for annulment, the Plaintiff asserts that the subject-matter of Patent Claim 1 is not patentable since it is not novel, but at any rate is revealed to the person skilled in the art in an obvious manner by the prior art. For this purpose, it refers to the following documents published before the priority date:

- 1. Data book from Toshiba Corporation "MOS Memory Products", 1989, pp. 313 to 331 (Annex D1);
- Pinaki Mazumder "Parallel Testing of Parametric Faults in a Three-Dimensional Dynamic Random-Access Memory" in IEEE Journal of Solid-State Circuits, Vol. 23, No. 4, August 1988, pp. 933 to 941 (Annex D2);
- 3. German Patent Specification 40 11 987 (Annex D3);

4. Pinaki Mazumder "Parallel Testing for Pattern-Sensitive Faults in Semiconductor Random-Access Memories" in IEEE Transactions on Computers, Vol. 38, No. 3, March 1989, pp. 394 to 407 (Annex D4);

- 5. Nasayoshi Wakane et al. "4 M-Bit DRAMS ..." in IEE, Journal of Electronic Engineering, Vol. 26, No. 265, January 1989, pp. 32 to 34 (Annex D5);
- 6. G. Finney "DRAM DA 4 MBIT AD ALTA VELOCITA'" in ELETTRONICA OGGI, No. 90, 15 November 1989, pp. 107 to 114 and English translation (Annexes D6 and D7).

"CT. 2. 2000 9: 26AM'AX +4TTC PALO ALTO

TER KEER & PARTNER

NO. 713 P. 18 20017

- 4 -

It furthermore asserts that, before the priority date of the contested patent, a memory chip 1M×4 DRAM from Toshiba Corporation bearing the designation "TC514400Z-80" was freely available, whose Circuitry relationships were analysable before the priority date of the contested patent and which anticipates the teaching of Claim 1 of the contested patent. In order to demonstrate the availability of the memory chip, it refers to Annexes D2, D5 to D7 (already presented) and additionally submits 5 invoice copies. For the analysability of the memory chip and also for the presentation of its circuits, the Plaintiff submits the following documents:

- 7. Design Analysis Reports from Semiconductor Insights Inc., Canada (Annex D8);
- Design Analysis Report from Chipworks, Canada (Annex D9) concerning the memory module TC5144002-80 from Toshiba/Japan (1Mx4 DRAM).

The Plaintiff makes its assertion that the circuits presented in Annex D9 originated from the memory chip TC5144002-80 with evidence from witnesses.

The Plaintiff requests

that European Patent 0 527 866 be declared to be null and void within the scope of Patent Claim 1 with effect for the sovereign territory of the Federal Republic of Germany.

The Defendant requests

that the action be rejected,

alternatively it defends Patent Claim 1 with the proviso that the following passage be inserted in Patent Claim 1 before the word "and" on page 19, line 45 of the European Patent Specification: "N memory cells (MC) 102'OCT. 2. 2000 3:26AMLA +41TTC PALO ALTO

- 5 -

being arranged along a word line (WL) and  $M \leq N$ ,".

It opposes the Plaintiff's submission in all points and considers the contested patent to be patentable within the scope contested, at any rate insofar as it is defended in restricted form.

During the oral proceedings, instead of Document D3, the associated published Patent Application DE 40 11 987 A1 was introduced into the proceedings.

# Grounds for the decision:

The action, which asserts the ground of annulment of lack of patentability, as provided in Art. II § 6 paragraph 1 No. 1 IntPatUG [Int. Pat. Transitory Law], Art. 138 para. 1 lit a EPC in conjunction with Art. 54 paragraphs 1, 2 and Art. 56 EPC, is completely justified.

I.

1. The contested fatent Claim 1 of the contested patent relates to an integrated semiconductor memory having a parallel test device which is divided into a plurality of block groups.

In the case of testing a conventional memory according to the prior art mentioned in the introduction to the description of the contested patent, a respective memory cell of one block group together with a respective memory cell from each of the remaining block groups are simultaneously tested jointly in terms of their functioning. If one (or more) of the jointly tested memory cells is (or are) defective, then although the position of the tested memory cells within the block groups is identified on account of the addressing data applied to the semiconductor memory, it is not known, however, whether just 02 OCT. 2. 2000 3 9: 27AV +4TTC PALO ALTO

### - 6 -

one or a plurality of the simultaneously tested memory cells is or are defective, that is to say it is also not known whether one block group or whether a plurality of block groups contain defective memory cells. The possibility of identifying the relevant defective block group(s) is just as slight. If it is desired to identify the defective memory cell(s) and/or its (their) block group(s), then one is compelled to test the semiconductor memory once again and to operate it conventionally in doing so. This means, however, no use of the parallel test possibility and, consequently, also no utilizing of the test time reduction, specified in the prior art, compared with testing without a parallel test possibility. A further disadvantage is that a defect which occurs simultaneously in all of the tested memory cells cannot be identified.

Against this background, the contested patent desires to develop such a memory in such a way that it fulfils the following provisions:

> Defects occurring on simultaneously tested memory cells should be identifiable irrespective of the number of defects,
> the position of defective memory cells should be determinable in a single test pass,
> the requisite space requirement and outlay on switching elements for the parallel test device should be kept to a minimum.

In order to solve this technical problem, Claim 1 of the contested patent proposes an integrated semiconductor memory which has the following individual features and groups of features:

- 1.) a parallel test device (PT)
- 2.) and U block groups  $(GP_{u=1...U})$ ,
- 3.) in which, in a test operating mode, a plurality of groups of M memory cells (MC)

92/10CT. 2.2000- 3:27AM +48 TTC PALO ALTO

ł

NO. 1713 2. 21 20 020

## - 7 -

can be tested simultaneously in terms of their functioning,

- (4.) each group being arranged along a respective word line (WL) within a respective one of the U block groups (GP<sub>u</sub>),
- 5.) and in which the data read out in the process can be evaluated by the parallel test device (PT),
- 6.) the parallel test device (PT) serves for writing in and evaluating data to be written to and read from the semiconductor memory,
- 7.) the result of the evaluation, separately for each group of M memory cells (MC), is present on I/O data lines (IO1, IO2, IO3) of the semiconductor memory.

A semiconductor memory having memory cells MC and having a test device PT according to Figure 1 of the contested patent contains U block groups  $GP_{1,...,G}$ , generally  $GP_u$ . In order to carry out a parallel test (= test operating mode), i.e. a test in which many memory cells MC are tested simultaneously (= within a memory cycle), which memory cells cannot be written to and/or read from simultaneously in a normal operating mode, as implemented by users of the semiconductor memory, the parallel test device PT is provided. It serves both for writing in data that are to be written to the semiconductor memory, and for reading from the semiconductor memory data that have been written to the latter, to be precise both in the normal operating mode and in the test operating mode. In the test operating mode, by means of the parallel test device FT, & plurality of groups of M memory cells MC in each case can be tested simultaneously in terms of their functioning. Each group of M memory cells MC is arranged along a respective word line WL. All of the data that are read out within a memory cycle can be evaluated in the parallel test device; the result of the evaluation is then present on I/O data lines, separately for each group of memory calls MC.

100 00 11:00 FR FINNEGAN HENDERSON 202 408 4400 TO 17347631503 UEPI aud U ; Page 9

02/0CT 2.20001 9:27AM# +49TTC PALO ALTO

T.

Į.

P.09

NO. 1713 P. 22 2021

- 8 -

TER MEER & PARTNER

2. The subject-matter of Patent Claim 1 of the contested patent in the granted version is not novel, where the examination of the novelty of the subject-matter of Patent Claim 1 with respect to the earlier application cited by the Plaintiff is to be based (by general consensus) DE 40 11 987 Al, which is associated with DE 40 11 987 C2. on

The earlier application relates, as emerges in particular from column 9, line 58 to column 10, line 22 with Figures 1, 3, 11, 20, to an integrated semiconductor memory having a parallel test device (16, 17, ML1 ... 4) corresponding to Feature 1 of Patent Claim 1 of the contested patent. The memory cells are subdivided (B1 ... 4) into four block groups (Feature 2). In a test operating mode, a plurality of groups of memory cells can be simultaneously tested in terms of their (MC) functioning (cf. in particular Patent Claim 1) (Feature 3), each group being arranged along a respective word line (WL) within a respective one of the block groups (cf. in particular column 8, lines 26 to 31) (Festure 4). The data read out in the process can be evaluated by the parallel test device (cf. in particular Patent Claim 1, column 16, lines 36 to 40) (Feature 5).

The parallel test device (PT) according to the earlier application serves for writing in and evaluating data to be written to and read from the semiconductor memory (cf. in particular Figures 20 to 22 and column 7, line 20 to column 8, line 66) (Feature 6). The result of the evaluation, separately for each group of memory cells (MC), is present on I/O data lines (31 to 34) of the semiconductor memory (cf. in particular Figure 3 and column 10, line 45 to column 11, line 22) (Feature 7).

Although the Defendant objects that, according to the exemplary embodiment as shown in Figure 2, the output of the error detector (the result of the evaluation) is not passed onto the I/O data lines, this is of no importance since Figure 3 relates to a variant which, besides Features 1 to  $\delta$ , also has Feature 7 since the test results are applied separately to PZ/OCT 2. 20001 9:27AM +49 TTC PALO ALTO TER MEER & PARTNER

- 9 -

the input/output terminals (31 to 34) via the respective matching line (ML1 to ML4) (cf. in particular column 9, line 66 to column 10, line 8).

Accordingly, the earlier application has all the features of the integrated semiconductor memory of Patent Claim 1 of the contested patent, which thus lacks novelty and which, therefore, can have no validity.

3. Although the subject-matter of Patent Claim 1 of the contested patent in the defended version in accordance with the Defendant's alternative petition is novel, it is not based on an inventive step.

a. The subject-matter of the Patent Claim 1 submitted by the Defendant in accordance with the alternative petition differs in its preamble from Claim 1 according to the main petition by the fact that after Feature 4 - in accordance with the analysis of features of Patent Claim 1 of the contested patent - the following is additionally inserted (= Feature 4.a): "N memory cells (MC) being arranged along a word line (WL) and M < N''. This is materially not a clarification but rather a restriction, since according to the main petition the group can also encompass all the memory cells along a word line, which is precluded by the formulation according to the alternative petition.

The inserted feature is disclosed both in the original documents and in the contested patent specification. Patent Claim 1 of the contested patent reads: "in which, in a test operating mode, a plurality of groups of M memory cells (MC) can be tested simultaneously in terms of their functioning". This can mean that either all the memory cells on a word line or just a subgroup are tested simultaneously. The exemplary embodiment according to Figure 3 reveals to the person skilled in the art that a plurality of groups of M memory cells are arranged on a word line, which are each selected via a bit switch (BSW). Moreover, the amendment does not extend the scope of protection of the granted patent since the teaching 92/OCT. 2.20001 9:28AM# +49TTC PALO ALTO

TER KEER & PARTNER

P.11

## - 10 -

that was initially formulated more broadly has been restricted to a narrower teaching and this feature can also be recognized as belonging to the claimed invention in the description (cf. BGH [German Federal Supreme Court] BIPMZ 1991, 188 -Bodenwalze).

b. By virtue of this additionally included feature, the subject-matter of Patent Claim 1 of the contested patent is no longer anticipated in a manner prejudicial to novelty.

Although the Plaintiff is of the opinion that the person skilled in the art gathers from Patent Claim 1 of DE 40 11 987 Al that it is also possible for just some of the memory cells arranged on a word line to be tested simultaneously, the formulation "a plurality of memory cells" chosen in the patent claim therein nonetheless does not unequivocally reveal what is meant thereby. Thus, the person skilled in the art will consult the description for further information, where he does not find advice to simultaneously test only groups of memory cells of a word line but rather only the advice to simultaneously test all the memory cells arranged on a word line (cf. e.g. column 7, lines 58 to 64 and column 8, lines 26 to 38).

The integrated semiconductor memory according to Claim 1 in accordance with the alternative petition is revealed to the person skilled in the art in an obvious manner, however, by the paper by Pinaki Mazunder "Parallel Testing of Parametric Faults in a Three-Dimensional Dynamic Random-Access Memory" in IEEE Journal of Solid-State Circuits, Vol. 23, No. 4, August 1988, pp. 933 to 941.

This reference discusses the parallel testing of memory cells in a dynamic RAM; it thus relates to an integrated semiconductor memory having a parallel test device (Feature 1). This semiconductor memory has a plurality of block groups (p subarrays) (Feature 2). In a test operating mode (cf. in particular page 935, left-hand column: "test mode"), a plurality of groups of M memory cells can be tested

PMS 5 Joor

A

## - 11 -

simultaneously in terms of their functioning (cf. particular Figure 1). The saving in test time is determined by the term on. In this case, p denotes the number of subarrays. The fact that the number of subarrays influences the test time reduction (cf. in particular page 934, left-hand column, lines 11 to 13) indicates to the person skilled in the art that the test is carried out simultaneously in the subarrays (Feature 3). In one of the test modes described, each group is arranged along a respective word line within a respective one of the block groups (cf. in particular Figure 1c and page 934, right-hand column, last paragraph, lines 1 and 2) (Feature 4). The DRAM is organized in the test mode in such a way that the M memory cells (group j) to be tested are arranged along a word line N (b in this document) and M < N, or rather j < b (cf. in particular Abstract and page 934, right-hand column, section II, lines 1 to 15) (Feature 4a). The data read out during the test can be evaluated by the parallel test device (cf. in particular page 934, left-hand column, fourth line from the bottom - 0/1 detector) (Feature 5).

The parallel test device (PT) serves for writing in and evaluating data to be written to and read from the semiconductor memory (cf. e.g. page 934, left-hand column, lines 1 to 5). Each subarray contains a detector, i.e. a test device (cf. in particular page 934, left-hand column, last four lines), which produces the test result separately for each group of memory cells. The person skilled in the art must route these results to the outside in order to evaluate them. Fast evaluation of the test necessitates that the individually produced results be made available as fast as possible, which means outputting them simultaneously, that is to say separately. For the outputting, the person skilled in the art is offered those existing terminals which are not currently required; these are the I/O data lines, so that it lies within the scope of the knowledge and ability of the average person skilled in the art to apply the result of the evaluation,

# | | 12 - '

separately for each group of M memory cells (MC), to I/O data lines (IO1, IO2, IO3) of the semiconductor memory (Feature 7). The person skilled in the art thus arrives at the subject-matter of the restricted Patent Claim 1 according to the alternative petition, without employing an inventive step, with the overall result that the action is successful.

## II.

As the defeated party, the Defendant must bear the costs of the legal proceedings pursuant to \$ 84 para. 2 PatG [German Patents Act] in conjunction with \$ 91 para. 1 clause 1 ZPO [German Code of Civil Procedure]. The decision on provisional enforceability is based on \$\$ 99 para. 1 PatG, 709 ZPO.

Baumgärtner

Bertl

Prasch

Puschel Schuster

Fa

# Parallel Testing of Parametric Faults in a Three-Dimensional Dynamic **Random-Access Memory**

## PINAKI MAZUMDER

Abstract --- This paper presents a testable design of dynamic randomaccess memory (DRAM) architecture which allows one to access multiple cells in a word line simultaneously. The technique utilizes the two-dimensional (2D) organization of the DRAM and the resulting speedup of the conventional algorithms is considerable. This paper specifically investigates the failure mechanisms in the three-dimensional (3D) DRAM with trench-type capacitor. As opposed to the earlier approaches for testing parametric faults that employed sliding diagonal-type tests with  $O(n^{3/2})$ complexity, the algorithms discussed in this paper are different and have  $O(\sqrt{n/p})$  complexity, where p is the number of subarrays within the DRAM chip. These algorithms can be applied externally from the chip and also they can be easily generated for built-in self-test (BIST) applications.

#### I. INTRODUCTION

S EMICONDUCTOR dynamic random-access memory (DRAM) is the highest beneficiary of the rapid growth of VLSI technology. As the device feature width is decreasing every year, the DRAM size is quadrupling every two to three years. Recently Nippon Telegraph and Telephone (NTT) Company has announced the development of 16-Mbit DRAM's, and by the turn of the decade it is expected that several manufacturers will fabricate 64-Mbit DRAM's employing 0.5-µm technology [1]. This enormous prospect of DRAM development cannot be economically exploited unless cost-efficient testing strategies are evolved to arrest the polynomial growth of testing cost with the increasing DRAM size. Conventional test algorithms like the sliding diagonal test and the GALDIA are employed to test the leakage currents and the faults caused by the variation of processing technology. The sliding diagonal test, which uses  $4N^{3/2}$  memory cycles, requires over 8 h to test the ac parametric faults in a 16-Mbit DRAM chip having a 100-ns memory cycle time. The main objective of this paper is to demonstrate how, by employing a new design-for-testability approach, parametric faults in a DRAM can be tested in parallel. The new algorithms test multiple cells in a memory simultaneously and thereby the

overall test complexity is reduced by more than 1000 times.

The problem of parallel testing has been addressed by other researchers in the past. In order to reduce the test time McAdams et al. [2] fabricated a 1-Mbit CMOS three-dimensional (3D) DRAM with design-for test functions. They partitioned the memory into eight subarrays and tested them concurrently. Their scheme simultaneously writes the same data on eight cells, which are identically located inside the different subarrays. During a READ operation when any of these eight cells is addressed, all eight cells are simultaneously accessed and their contents are compared by using a two-mode 8-bit parallel comparator. The resulting scheme which compares one cell from each subarray will be called in this paper inter-subarray single-cell comparison (Scheme 1). By using this scheme they reduced the test time by a factor of 5.2 times. Shah et al. [3] used a similar 16-bit parallel comparator in their 4-Mbit DRAM with trench-transistor cell and essentially reduced the test time complexity to that of a 256-kbit DRAM.

You and Hayes [4] have introduced the concept of parallel testing within the subarrays by applying simultaneously single-cell inter-subarray comparison over multiple cells within a subarray. The resulting scheme will be called here inter-subarray multiple-cell comparison (Scheme 2). They have reconfigured the memory subarray of size s bits into an s-bit cyclic shift register where the data recirculate whenever a READ operation is done. The reconfiguration was accomplished by introducing pass transistors on the bit lines which deteriorate both the sensitivity of the sense amplifiers by  $V_{\tau}$  (threshold voltage of the MOS devices) and the access time of the DRAM in normal mode of operation. In order to reduce the routing complexity, it is desirable to compare the adjacent subarrays only. Thus by this scheme the occurrence of a fault is detected by comparing only two cells which are simultaneously read. If both the cells are identically faulty, it fails to detect the fault. Moreover, the reconfiguration scheme is tailored to introduce parallelism in the sliding diagonal test and the proposed design of parallel testing cannot be adapted for a large class of functional faults, like coupling and static and dynamic pattern-sensitive faults [5].

Manuscript received December 14, 1987; revised March 23, 1988. This work was supported in part by the Army Research Office under the URI program, Contract DAAL03-87-K-0007, and by the Semiconductor Research Corporation. The author is with the Center for High-Frequency Microelectronics, December of Direction Section 14, 1987.

Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48105. IEEE Log Number 8822147.



Fig. 1. Strategies for parallel memory testing. (a) Inter-subarray singlecell comparison. (b) Inter-subarray multiple-cell comparison. (c) Intrasubarray multiple-cell comparison.

This paper proposes a design-for-testability approach which does not make any inter-subarray comparisons. It writes the same data on multiple cells in a word line of a subarray in parallel, and in READ mode it compares these simultaneously written cells within a subarray. The resulting test scheme is called intra-subarray multiple-cell comparison (Scheme 3). These three schemes are shown in Fig. 1 and compared in Table I. The advantage of the proposed technique is that it is not constrained to any specific test procedure and the test vectors can be applied externally or generated by a built-in self-testing (BIST) circuit. It can speed up any existing test procedure of O(n) test length by a factor of  $O(\sqrt{pn})$ . The paper investigates the parametric faults in a DRAM and proposes  $O(\sqrt{n/p})$  algorithms to test the different parametric faults. Unlike in [4] the proposed technique does not modify the memory plane to introduce parallelism in the diagonal test, and thereby the normal memory performance (viz., access time, senseamplifier sensitivity) does not degrade. The proposed design-for-testability technique employs very little overhead (only  $2\sqrt{pn} + p \log_2 n - p \log_2 p + 12p$  transistors if the DRAM is organized into p square subarrays each of size  $\sqrt{n/p} \times \sqrt{n/p}$ ) and needs only one transistor to fit within the  $3\lambda$  inter-celler pitch width of the vertically integrated 3D DRAM. It will be shown that the modified bit-line decoders in Scheme 3 need  $2\log_2\sqrt{n/p}$  extra transistors in each subarray to enable parallel access. Each 0/1 detector used in a subarray to test the occurrence of a fault will be implemented by  $2\sqrt{n/p}$  +12 transistors. The circuits for the modified bit-line decoders and the 0/1 detectors are designed such that each transistor can fit within the pitch width of the high-density memory.

The rest of the paper has been organized as follows. Section II proposes a new design-for-testability technique which allows multiple cells to be tested in one memory cycle. Section III enumerates the different faults which occur due to variations in processing technology in a 3D DRAM; the algorithms for testing these parametric faults are presented in Section IV. The main contribution of this work is to propose a design-for-testability technique and to demonstrate how the parametric faults in a 3D DRAM using trench-type memory cells can be tested in parallel.

#### II. DESIGN FOR TESTABILITY

The organization of the testable DRAM with augmented hardware is shown in Fig. 2. The memory is organized as a  $b \times w = n$  matrix, where b is the number of bit lines and w is the number of word lines. The normal 1-out-of-b decoder is modified to select multiple bit lines during test mode. In test mode, it divides the b bit lines into g groups such that the bit line *i* belongs to group *j*, where j = i(mod g). Thus a WRITE operation in test mode results in writing the content of the data-in buffer on all cells at the crosspoints of the selected word line and the bit lines in group j. In READ mode, the contents of the cells located at the crosspoints of the selected word-line and bit-line groups (say j) are read in parallel. Thus, a ZERO or ONE is entered in the data-out buffer if all the multiple-accessed cells contain ZERO or ONE, respectively. If the contents of all the cells are not identical, the data-out buffer may store a ZERO OF ONE. It should be noted that it is not correct to assume that the resulting operation will be a wired-or or wired-AND. On the contrary, it depends on the number of ZERO's and ONE's in the multiple-accessed cells. If almost all the multiple-accessed cells contain ONE's except a very few which contain ZERO's, then a ONE will be entered in the data-out buffer when the cells are read in parallel. A ZERO would have been entered if almost all the cells contained ZERO's and a few cells contained ONE's. To circumvent this problem, the contents of all the cells in a group are compared by a parallel comparator. In the event that all the cells do not have identical contents, the parallel comparator triggers an error latch to indicate that a fault has been detected by the test.

In contrast to the bit-line decoder, the word-line decoder is not modified and word lines are accessed one at a time. A parallel-word READ operation is not meaningful because two or more cells will be sensed by the same sense amplifier resulting in a wired-OR or a wired-AND operation. A parallel-wRITE operation through multiple word lines would require the sense amplifier to drive many cells at a time. For a moderate-size DRAM, this introduces high wRITE-cycle time delay. By increasing the physical size of the sense-amplifier driver, delay can be improved to a certain extent, however this increases power consumption, and because of its large gate capacitance, sense-amplifier

TABLE I Comparison of Three Parallel Testing Strategies

| Criterion                                                                 | Scheme 1                                  | Scheme 2                                                                                              | Scheme 3                                                           |
|---------------------------------------------------------------------------|-------------------------------------------|-------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------|
| Performance<br>Parallelism<br>Degradation                                 | p                                         | O(Vpn)                                                                                                | O(\fpn)                                                            |
| in Access Time                                                            | None                                      | Large                                                                                                 | Negligible                                                         |
| in Sense Amp Sensitivity                                                  | None                                      | Large                                                                                                 | None                                                               |
| Architecture Modification -<br>Decoder<br>Memory Plane<br>Comparator Size | Not Modified<br>Not Modified<br>One p-bit | Modified<br>Modified<br>√π/p ×p -bit                                                                  | Modified<br>Not Modified<br>One √n/p -bit<br>0/1-Detector/Subarray |
| Routing Complexity                                                        | None                                      | High                                                                                                  | Very Low                                                           |
| Reliability                                                               | Moderate                                  | Low, if only two cells<br>are mutually compared;<br>Moderate, if all p cells<br>are mutually compared | Very High                                                          |
| Fault Coverage                                                            | Punctional and<br>Parametric              | Only Parametric                                                                                       | Functional and<br>Parametric                                       |



Fig. 2. Testable RAM organization.

slew rate decreases. Parallel bit-line READ and WRITE do not suffer from this drawback.

### A. Modified DRAM Circuit

The modified CMOS decoder circuit is shown in Fig. 3. Transistors  $Q_1, \dots, Q_7$  with the transmission gate constitute a normal decoder circuit. In the clock phase  $\phi_P$ , the transistor  $Q_1$  turns on to precharge the common line connected to the address decoding transistors. If all the address bits,  $a_0, \dots, a_{k-1}$  are zeros, transistor  $Q_6$  pulls up the OUT to ONE, and the corresponding bit line is selected. The signal  $\phi_{EN}$  enables the transmission gate so that the decoder selects the bit line only after all address lines have changed. Transistors  $Q_8$  and  $Q_9$  have been added so that in the test mode the decoder output can be selected by applying SELECT = 0 independent of the input address. In the normal mode of operation, SELECT = 1 and the decoder output is selected by the address input  $a_0, \dots, a_{k-1}$ . The operation of the modified decoder is shown by the voltage waveforms in Fig. 4. The modified decoder is simulated using SPICE and the degradation in decoding time due to addition of the extra transistors has been found to be approximately 0.1 ns.

The parallel comparator, which is essentially a multibit 0/1 detector, monitors the output of sense amplifiers connected to bit lines which are selected in parallel and detects the concurrent occurrence of either ZERO's or ONE's. If a selected bit line is different from the others, it triggers the error latch indicating the occurrence of a fault. Fig. 5 shows the parallel comparator and error detector. The p-channel transistors  $T_1, \dots, T_{m-1}$  are connected in parallel and detect concurrent occurrence of ONE in the bit lines. The n-channel transistors  $P_1, \dots, P_{m-1}$  are also connected in parallel and detect concurrent occurrent occurrence of ZERO in the bit lines. Transistors  $T_0$  and  $P_0$  are the precharge transistors while transistor  $P_m$  is the discharge transistor which remains cut off during the precharge phase and turns on during discharge clock phase  $\phi 2$ . Since the bit lines are divided into g = 2 classes, pass transistors



Fig. 3. Modified decoder circuit.



Fig. 4. Operation of modified decoder circuit.

are introduced so that only the odd or even bit lines are compared simultaneously. Signals  $L_1$  and  $L_2$  select these bit lines. Transistors  $S_0$ ,  $S_1$ , and  $S_2$  form a coincidence detector. If all the selected bit lines are ZERO or ONE, then either  $S_1$  or  $S_2$  conducts and the output of the detector is ZERO. The output of the coincidence detector is connected to an error latch through the pass transistor  $S_4$  which isolates the error latch during the phase  $\phi$ 1. It may be noted that during the precharge phase the transistor  $S_0$  will be directly shorted through the error amplifier if  $S_4$  does not isolate the coincidence detector from the error amplifier. During phase  $\phi 2$ , the output of the coincidence detector is connected to the error amplifier through  $S_4$ . The error amplifier consists of transistors  $V_0, \dots, V_3$ . The error latch output is ERROR = 0, when the selected bit lines are identical. If the bit lines are not identical, then both  $S_1$ and  $S_2$  remain cut off and the detector output is ONE. This triggers the error latch to set its output to ERROR = 1. During the WRITE phase and normal mode of operation, the error latch is clamped to zero by  $V_4$ . The error detector is inhibited by the discharge transistor  $P_m$  during the start of the READ phase when the sense-amplifier outputs are not identical because of sluggish changes in some of the sense amplifiers.

The design-for-testability approach has been applied over an experimental DRAM chip of 16 kbit. The chip overhead was found to be less than 1 percent. For multimegabit DRAM's, this overhead will be less than 0.5 percent. The degradation in performance due to the modified decoder was less than 0.2 ns in memory cycle time. This is slightly larger than the value obtained from SPICE simulation of the modified decoder.

#### III. PARAMETRIC FAULTS IN A 3D DRAM

A typical configuration of the 3D trench-type memory cell [3], [6] with p<sup>+</sup> sidewall doping is shown in Fig. 6. The access transistor is a PMOS transistor located within an n-well diffused over the p<sup>+</sup> substrate. A deep trench capacitor extends from the planarized surface through the n-well into the p<sup>+</sup> substrate. A conducting strap connects the p<sup>+</sup>-doped polysilicon storage electrode inside the trench to the p<sup>+</sup> source region of the access transistor. With a thin composite insulator separating the polysilicon from the bulk silicon surrounding the trench, the storage capacitance comes primarily from the portions of the four trench sidewalls in the p<sup>+</sup> substrate region and the trench bottom. Some additional capacitance results from the four trench walls intersecting the n-well. The grounded p<sup>+</sup> substrate provides a very solid reference potential to the capacitor plate. The leakage currents due to process parameter variation have also been shown in the diagram. These currents are divided into four components: 1) weak-inversion current  $I_{W}$  from the storage area to the bit line; 2) field-inversion current  $I_F$  between the two adjacent cells; 3) gate leakage current  $I_G$  due to pin-hole defects in the gate oxide; and 4) the dark current  $I_B$ between the storage area and the p-type substrate. The weak-inversion current can degrade a stored ZERO by flow of minority carriers from the trench capacitor to the positively biased bit lines. Dark current which flows from the trench capacitor to the p<sup>+</sup> substrate can degrade stored ONE. It may also be observed that the cell forms a vertical parasitic FET device which occurs between the storage node and the substrate along the trench wall, gated by the node polysilicon as shown in Fig. 6.

The effects of the leakage currents result in parametric faults such as the bit-line voltage imbalance and the bit-line



Fig. 5. Parallel comparator with error detector (g = 2).



to word-line crosstalk. The other types of parametric faults emanate due to a wide variation of timing signals in the decoding, address buffer, and peripheral circuits, such as the sense amplifiers. Incorrect timing between decoder enable, precharge clock, and decoder address signals may cause multiple-address selection. These parametric faults are described here briefly.

### A. Bit-Line Voltage Imbalance

A typical memory array organization utilizes the differential amplifiers for sensing the signal partitioning of each array into two identical subarrays (called left and right in this paper) as shown in Fig. 7. Each bit line in the array is split into two halves and they are sensed by a differential pair of sense amplifiers. One of the cells in each half acts as a reference cell and its voltage is com-



pared with the selected cell on the same word line, but on the other half. Thus in Fig. 7 when a cell in the right half on the bit line  $B_i^R$  is selected for reading, the reference cell on left-half bit line  $B_i^L$  is utilized for comparison. The bit-line voltage in  $B_i^L$  is clamped to a reference voltage (which is close to precharge voltage  $V_p$ ) and is compared by the sense amplifier with the voltage of the bit-line voltage  $B_i^R$ , which will be near to the supply voltage if the selected cell on  $B_i^R$  contains a ONE, or near to the ground potential if the selected cell contains a ZERO. Thus if the difference of the voltages between the two bit lines is larger than a threshold value, the sense amplifier can correctly distinguish the state of the selected cell during a READ operation. When most of the cells in the left half of the memory subarray contain one type data (say, ONE) and most of the cells in the right half contain the opposite type data (ZERO), during a READ cycle the precharge voltage on the two halves of the bit lines will be different. This is illustrated in Fig. 7, where all the cells connected to the bit line  $B_i^L$  contain ONE and all the cells connected to the bit line  $B_i^R$ , except one which is connected to the word line  $W_i$ , contain ZERO. If the cell containing ONE in the right half of the memory is read, at first bit lines  $B_i^L$  and  $B_i^R$ 



Fig. 8. Degradation of precharge voltage level due to leakage currents.

will be precharged to a voltage  $V_p$ . But due to the weakinversion currents in the right-half cells, the precharge level will be degraded to

$$v_p = V_p - (1/C_B) \sum_{j=0}^{.5w-1} I_{W_j} \tau$$

where  $C_B$  is the capacitance of the bit line,  $I_{W_j}$  is the weak-inversion current in  $C_{ij}$ , and  $\tau$  is the time interval for precharge during READ operation. This is illustrated in Fig. 8. If the weak-inversion currents are sufficiently large, the degradation in precharge voltage will be sufficiently high, i.e.,  $v_p \ll V_p$ . Consequently, when the word line  $W_j$  is selected to read the content of the cell  $C_{ij}$ , which contains ONE, the bit-line voltage of  $B_i^R$  will be near to  $V_p$  and the contents of  $C_{ij}$  will be read as ZERO by the sense amplifier (because the ratio of bit-line capacitance and cell capacitance is usually greater than 15). This is illustrated in Fig. 7 where the difference in bit-line voltages is very small and thereby the sense amplifier incorrectly reads the contents of  $C_{ij}$ .

### B. Bit-Line to Word-Line Crosstalk

From Fig. 9 it can be seen that there is an overlap between the bit line and the word line, since they are orthogonal to each other. This overlapping forms a coupling between the bit lines and the word lines so that when the bit-line voltages change due to precharging and restoring operations during a READ cycle, the unselected word lines may be inadvertently turned on. This coupling is maximum if all the cells in the selected word line contain ONE and if some of the cells of the coupled unselected word line contain ZERO, which will be degraded due to the weak-inversion current of the access transistors. In the precharging phase, at first the bit-line voltage increases from ZERO to  $V_p$  which is coupled as a noise voltage  $V_{n_j}$  to a word line  $W_j$ , where  $0 \le j \ne i \le w - 1$ . If all the cells in the word line  $W_i$  contain ONE, then if  $W_i$  is selected its voltage will increase, coupling a noise voltage  $V_{v_i}$  to  $W_{i}$ . The effect of these superimposed noise voltages may generate a sufficient weak-inversion current such that a stored ZERO in a cell on the unselected word line  $W_i$  may be degraded [7].



Bit-line to word-line crosstalk. 9



Fig. 10. Single-ended WRITE.

### C. Single-Ended WRITE

In a DRAM employing single-ended wRITE technique, a single I/O line is used to write into the bit lines. In Fig. 10 it can be seen that writing on the right bit line  $B_i^R$  is controlled by the I/O line, while writing on the left bit line  $B_i^L$  is controlled by the sense amplifier. The ZERO level on  $B^{R}$  is determined by the input driver, but the ZERO level of  $B_i^L$  is determined by the sense amplifiers. Thus the level of ZERO in the two halves may be different, and this asymmetry may result in pattern-sensitive faults.

#### D. Multiple Selection

In the decoder circuit in Fig. 3, the precharge clock  $\phi_p$ and the decoder enable clock  $\phi_{EN}$  should be nonoverlapping in the sense that always  $\overline{\phi}_p$  and  $\phi_{EN} = 0$ . If due to incorrect timing they overlap, multiple selection may occur.

#### E. Transmission-Line Effect

In two-layer interconnect technology, either the bit lines or the word lines are made of metal and the other polysilicon or diffusion. Usually, the poly and diffusion lines have quadratic signal propagation delay (i.e., the signal propagation delay in an interconnect of length l is proportional to  $l^2$ , because delay is due to the product of its resistance and capacitance). Because of the high resistivity in these interconnects the cells at the periphery of the chips, away from the sense amplifiers, are delivered a weak signal and thereby may fail. By inserting the repeaters at suitable intervals, the signal strength and delay may be improved, but complexity is added to the layout. In order to check that all the cells in the array satisfy the limits of the stipulated memory cycle time, the transmission-line effect should be tested.

#### IV. TESTING STRATEGY AND ALGORITHMS

In order to test all of the above faults, it is necessary to identify the circumstances in which each of them is likely. to be maximum. It can be easily noted that the field-inversion current  $I_F$  which occurs between two adjoining storage cells is maximum if the four adjacent cells of a base cell contain opposite data to that of the base cell, i.e., a checkerboard-type pattern can test the effect of a fieldinversion current. Similarly, the effects of a dark current and gate short can be tested by the checkerboard pattern, because the presence of these leakage currents manifests in the form of cell stuck-at ZERO or ONE. Algorithm 1 is a parallel version of the checkerboard test, which tests the above three leakage currents in addition to the parametric faults due to the single-ended WRITE and the transmissionline effects. Since in Algorithm 1 each cell in a word line is tested for both ZERO and ONE data, the stuck-at ZERO and stuck-at ONE faults are also tested. Moreover, if a bit line or a word line is faulty (namely, broken, stuck to ground, or cannot be precharged), Algorithm 1 will detect the fault. It may be noted that a faulty bit line will be incorrectly compared by the parallel 0/1 detector to set ERROR = 1. A faulty word line where all the bits are identical may not be detected by the parallel 0/1 detector. But it can be easily detected by monitoring the output of the data-out buffer, if the entire memory is organized into a single array, otherwise, by mutually comparing the data-out values of all subarrays. It may be added that if the entire word line is faulty, then the expected value at the data-out buffer will be different from the obtained one if and only if the word line is tested for both ZERO and ONE data value. If a word line is completely fault free, then the expected value of data will always be the same as the obtained one. If only one or two bits are faulty in a word line, then the expected value may be different from the obtained one, but it will certainly be detected by the 0/1 detector.

#### Algorithm 1: Parametric Checkerboard Test

- Use complementing address sequence from word line W<sub>0</sub> until all word lines are scanned; write in two steps a pattern of (01)\* if the word line is even and two steps a pattern of (10)\* if the word line is odd.
- 2) Freeze the clock for the entire refresh interval  $\tau_R$  for testing static refresh.
- Use a complementing address sequence from word line W<sub>0</sub> until all word lines are scanned; compare in parallel all even and odd bit lines to check ERROR = 0.
- Read continuously any arbitrary word line for the entire refresh interval τ<sub>R</sub> to check ERROR = 0. This test checks the effect of temperature rise and tests the dynamic refresh.
- 5) Use the complementing address sequence from word line  $W_0$  until all word lines are scanned; compare in parallel all even and odd bit lines to check ERROR = 0.

- 6) Read continuously another distinct word line for the entire refresh interval  $\tau_R$  to check ERROR = 0. This test checks the effect of temperature rise and tests the dynamic refresh.
- 7) Use the complementing address sequence from word line  $W_0$  until all word lines are scanned; compare in parallel all even and odd bit lines to check ERROR = 0.
- 8) Repeat steps 1-7, with opposite data.

The effect of a weak-inversion current is maximum when all the cells in a bit line, except one, contain ZERO. If the cell which contains ONE is addressed for a READ operation, the weak-inversion currents in other cells will tend to degrade the precharge level of the bit line and thereby the cell containing ONE will be sensed as ZERO by the sense amplifier. Since the bit-line capacitance is typically 10–20 times the capacitance of an individual cell, the stored ONE may not be sufficient to replenish the degraded precharge level. The testing strategy needs to test each memory cell so that when it is ONE all its bit-line neighbors will contain ZERO.

In order to test the bit-line voltage imbalance, it is necessary to write ZERO (ONE) on the cells at the bit line on the left half of the subarray and write ONE (ZERO) on the cells at the bit line on the right half of the subarray. Thus the test to detect the weak-inversion current can be utilized to test the bit-line voltage imbalance by testing the left and right subarrays with opposite background data. It may be noted that the test also detects faults due to single-ended WRITE. Algorithm 2 tests all the above faults.

#### Algorithm 2: Parallel Parametric Walking Test

- 1) Initialize the entire memory writing ZERO in all locations.
- Select two arbitrary word lines W<sub>i</sub> and W<sub>j</sub> and read them alternately for one refresh interval. Check if the ERROR = 0 during the entire refresh interval.
- 3) For all word lines starting from  $W_0$ , compare in parallel all even and odd bit lines to check ERROR = 0.
- 4) Initialize the entire memory, writing ONE in all locations.
- 5) Select two arbitrary word lines  $W_p$  and  $W_q$  and read them alternately for one refresh interval. Check if the ERROR = 0 during the entire refresh interval.
- For all word lines starting from W<sub>0</sub>, compare in parallel all even and odd bit lines to check ERROR - 0.
- Initialize the memory such that the left subarray contains ZERO in all locations and right subarray contains ONE in all locations.
- 8) For all word lines starting from W<sub>0</sub>, do the following: i) write a pattern of (01)\* in the selected word line; ii) parallel compare and check if ERROR = 0; iii) initialize all the cells in the selected word line to ZERO if it is on left half, otherwise to ONE.
- 9) For all word lines starting from W<sub>0</sub>, do the following: i) write a pattern of (10)\* in the selected word line; ii) parallel compare and check if ERROR = 0; iii) initialize all the cells in the selected word line to ZERO if it is on left half, otherwise to ONE.
- 10) Repeat steps 7-9 with complementary bit patterns.

Algorithm 3, which runs a marching pattern of ONE at the background of ZERO and a marching pattern of ZERO at the background of ONE in each word line, will detect the multiple-access faults in the word-line decoder by comparing the READ data with the expected data. Since the algorithm employs parallel writing by accessing all the even bit lines and odd bit lines in a single memory cycle,

TABLE II Algorithms and Their Coverage of Parametric Faults

| Fault Type                     | Algorithm 1 | Algorithm 2 | Algorithm 3 |
|--------------------------------|-------------|-------------|-------------|
| Weak-Inversion Current         | No          | Yes         | Yes         |
| Field-Inversion Current        | Yes         | No          | No          |
| Dark Current                   | Yes         | No          | No          |
| Gate Short                     | Yes         | Yes         | Yes         |
| Multiple Selection             | No          | No          | Yes         |
| Single-Ended Write             | Yes         | Yes         | Yes         |
| Bit-Line Voltage Imbalance     | No          | Yes         | No          |
| Bit Line to Word Line Crostalk | No          | Yes         | No          |
| Transmission-Line Effect       | Yes         | No          | No          |

multiple access in the bit-line decoder will not be tested by Algorithm 2. A separate algorithm is needed to test the bit-line decoders and is given in Algorithm 3.

## Algorithm 3: Bit-Line and Word-Line Decoder Test

/\* Bit-Line Decoder Multiple Access Test \*/

- 1) Write in parallel ZERO in all cells on the arbitrarily selected word line  $W_j$ .
- 2) Read and compare in parallel all the cells on  $W_i$ .
- 3) Starting from the cell at the crosspoint of  $B_0$  and  $W_j$ , for each cell on  $W_j$ , at first write ONE and read the cell (one cell at a time in ascending order of the bit line).
- 4) Starting from the cell at the crosspoint of B<sub>j</sub> -1 and W<sub>j</sub>, for each cell on W<sub>j</sub>, at first write ONE and read the cell (one cell at a time in descending order of the bit line).
  - /\* Word-Line Decoder Multiple Access Test \*/
- 5) Write in parallel ZERO in all cells on the bit line  $B_i$ .
- 6) Read and compare in parallel all the cells on  $B_i$ .
- 7) Starting from the cell at the crosspoint of  $W_0$  and  $B_i$ , for each cell on  $B_i$ , at first write ONE and read the cell (one cell at a time in ascending order of the word line).
- 8) Starting from the cell at the crosspoint of  $W_{\sqrt{h}-1}$  and  $B_i$ , for each cell on  $B_i$ , at first write ONE and read the cell (one cell at a time in descending order of the word line).

It can be seen that Algorithm 1 takes altogether  $10w\tau_A + 6\tau_R$  time to complete all the steps, where  $\tau_A$  is the average memory cycle time and  $\tau_R$  is the refresh interval. Algorithm 2 takes  $20w\tau_A + 2\tau_R$  time to test the entire DRAM. Finally, Algorithm 3 takes  $(4w + 2)\tau_A$  time to test the multiple-access faults in the decoder logic. Hence, altogether  $(34w + 2)\tau_A + 8\tau_R$  time is needed to test all the parametric faults in the DRAM. Table II depicts the different types of parametric faults and how those are covered by these algorithms.

The above algorithms are tested at the rated maximum and minimum power-supply voltages. But, in addition to the worst-case measurements, it is necessary to test the memory when the supply voltage changes rapidly due to an impressed noise voltage. Noise spikes have high slew rates and they may occur during a READ or WRITE memory cycle causing the operation to fail. If the effect of the noise spike is to lower the supply voltage, the capacitive bias of the dynamic logic may be higher than the supply bias, and this may result in a failure of a READ or WRITE operation. Similarly, if the noise spike increases the supply voltage to a high value, the capacitive bias voltage may be sufficiently lower than the supply bias resulting in a faulty READ or WRITE operation. In order to test the effect of this powersupply voltage transition, the entire memory should be tested for both the cases when the supply voltage rapidly increases and again when it quickly decreases. The typical slew rate is about a few microseconds. Algorithm 4 tests the memory for the above faults.

### Algorithm 4: Power-Supply Voltage Transition Test

- Write in parallel ZERO in the entire memory at maximum supply voltage.
   For all word lines starting from W<sub>c</sub> do the following: i) write a
- 2) For all word lines starting from W<sub>0</sub>, do the following: i) write a pattern of (01)\* in the selected word line at maximum supply voltage; ii) rapidly reduce the supply voltage to minimum; iii) parallel compare and check if ERROR = 0.
- 3) For all word lines starting from W<sub>i</sub>, 1, do the following: i) write a pattern of (10)\* in the selected word line at maximum supply voltage; ii) rapidly reduce the supply voltage to minimum; iii) parallel compare and check if ERROR = 0.
- Write in parallel ZERO in the entire memory at minimum supply voltage.
- For all word lines starting from W<sub>0</sub>, do the following: i) write a pattern of (01)\* in the selected word line at minimum supply voltage; ii) rapidly increase the supply voltage to maximum; iii) parallel compare and check if ERROR = 0.
- 6) For all word lines starting from W<sub>yF-1</sub>, do the following: i) write a pattern of (10)\* in the selected word line at minimum supply voltage; ii) rapidly increase the supply voltage to maximum; iii) parallel compare and check if ERROR = 0.

#### V. CONCLUSIONS

The main objective of this paper is to propose a modified DRAM architecture which enhances the speed for testing the occurrence of parametric faults in the 3D DRAM employing trench-type capacitor. A novel testable architecture is proposed to test multiple cells in a word line simultaneously resulting in a speedup by a factor of  $O(\sqrt{n/p})$ . The testable design has been proposed to fit within the interceller pitch width of  $3\lambda$  in the 3D DRAM. It employs only an additional  $2\sqrt{pn} + p \log_2 n - p \log_2 p$ + 12p transistors and has very low overhead. Above all, the technique is not tailored to test only one specific test procedure and it can speed up the conventional test algorithms for functional faults [8]-[10].

A number of parametric faults, that manifest due to a variation of processing parameters and also due to a critical circuit design, are enumerated in this paper. Leakage currents in a 3D DRAM have been identified, and test procedures have been designed to test in parallel the faults due to these leakage currents. In addition to the faults due to leakage currents, circuit-related design weaknesses (like the multiple selection, single-ended write, etc.) are also tested by the algorithms described here. The effects of rapid supply-voltage transitions, which cause a faulty READ or WRITE due to disparity in capacitive voltages in the dynamic logic and the supply bias, are comprehensively tested in this paper. Unlike the earlier approaches [11], [12] which employed diagonal tests having  $O(n^{3/2})$  complexity, the algorithms described here have  $O(\sqrt{n/p})$  complexity, and thereby a dramatic improvement by a factor proportional to the size of the DRAM can be achieved. For a 16-Mbit DRAM organized into 16

quadrants, the diagonal test algorithm will need more than a few hours as opposed to the proposed algorithms which will need only a few milliseconds. Moreover, the fault coverage by these algorithms is far superior to that of the diagonal test procedure. It may also be noted that these algorithms can be easily generated for built-in self-test (BIST) applications.

#### ACKNOWLEDGMENT

The author wishes to thank Prof. J. H. Patel and Prof. W. K. Fuchs of the Coordinated Science Laboratory, University of Illinois, Urbana, for their helpful suggestions on the design-for-testability strategy in Section II.

#### References

- L. L. Lewyn and J. D. Meindl, "Physical limits of VLSI dRAM's," *IEEE J. Solid-State Circuits*, vol. SC-20, pp. 231-241, Feb. 1985.
   H. McAdams et al., "A 1-Mbit CMOS dynamic RAM with design-for test functions," *IEEE J. Solid-State Circuits*, vol. SC-21, pp. 635–641, Oct. 1986
- 635-641, Oct. 1986. A. H. Shah et al., "A 4-Mbit DRAM with trench-transistor cell," [3]
- [4]
- A. H. Shah et al., "A 4-Mbit DRAM with trench-transistor cell," *IEEE J. Solid-State Circuits*, vol. SC-21, pp. 618-627, Oct. 1986.
  Y. You and J. P. Hayes, "A self-testing dynamic RAM chip," *IEEE J. Solid-State Circuits*, vol. SC-20, pp. 428-435, Feb. 1985.
  M. S. Abadir and H. K. Reghbati, "Functional testing of semicon-ductor random-access memories," ACM Computing Surveys, vol. 15, no. 3, pp. 175-198, Sept. 1983.
  N. C. Lu et al., "A substrate-plate trench-capacitor (SPT) memory cell for dynamic RAM's," *IEEE J. Solid-State Circuits*, vol. SC-21, pp. 627-634, Oct. 1986. [5]
- [6] pp. 627-634, Oct. 1986.
  [7] H. Masuda *et al.*, "A 5-V-only 64K dynamic RAM based on high

S/N design," IEEE J. Solid-State Circuits, vol. SC-15, pp. 846-853,

- Oct. 1980.
   P. Mazumder, J. H. Patel, and W. K. Fuchs, "Design and algorithms for parallel testing of random-access and content-address-able memories," in *Proc. Design Automation Conf.*, July 1987, pp. 699 604 [8] 688-694.
- P. Mazumder and J. H. Patel, "An efficient built-in self-testing of random-access memory," in Proc. Int. Test Conf., Sept. 1987, pp. [9] 1072-1077.

- 1072-1077.
  [10] J. Inoue et al., "Parallel testing technology for VLSI memories," in Proc. Int. Test Conf., Sept. 1987, pp. 1066-1071.
  [11] M. A. Breuer and A. D. Friedman, Diagnosis and Reliable Design of Digital Systems. Los Angeles: Woodland Hills, 1976.
  [12] T. C. Lo and M. R. Guidry, "An integrated test concept for switched-capacitor dynamic MOS RAM's," IEEE J. Solid-State Circuits, vol. SC-12, pp. 693-703, Dec. 1977.



Pinaki Mazumder received the B.Sc. degree in physics from Gauhati University, India, the B.S.E.E. degree from the Indian Institute of Science, Bangalore, the M.Sc. degree in computer science from the University of Alberta, Canada, and the Ph.D. degree in electrical and computer engineering from the University of Illinois.

He worked two years as a Research Assistant at the Coordinated Science Laboratory, University of Illinois, and over six years at Bharat Electronics Ltd. (a collaborator of RCA-GE) in

the area of integrated circuit design and applications. During the summers of 1985 and 1986, he worked as a Member of the Technical Staff at AT&T Bell Laboratories, Indian Hill, Naperville, IL, in the area of hardware synthesis from system-level behavioral description. Presently he is working as an Assistant Professor at the Department of Electrical Engineering and Computer Science of the University of Michigan, Ann Arbor. His research interests include VLSI testing, computer-aided design, parallel architecture, and image processing.

Dr. Mazumder is a member of Phi Kappa Phi.