WO2012019997A1 - Storing/reading several data streams into/from an array of memories - Google Patents

Storing/reading several data streams into/from an array of memories Download PDF

Info

Publication number
WO2012019997A1
WO2012019997A1 PCT/EP2011/063619 EP2011063619W WO2012019997A1 WO 2012019997 A1 WO2012019997 A1 WO 2012019997A1 EP 2011063619 W EP2011063619 W EP 2011063619W WO 2012019997 A1 WO2012019997 A1 WO 2012019997A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
memory
current
bus
fifo
Prior art date
Application number
PCT/EP2011/063619
Other languages
French (fr)
Inventor
Oliver Kamphenkel
Thomas Brune
Michael Drexler
Stefan Abeling
Original Assignee
Thomson Licensing
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing filed Critical Thomson Licensing
Priority to US13/816,250 priority Critical patent/US9026722B2/en
Priority to EP11741219.7A priority patent/EP2603856A1/en
Publication of WO2012019997A1 publication Critical patent/WO2012019997A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/0223User address space allocation, e.g. contiguous or non contiguous base addressing
    • G06F12/023Free address space management
    • G06F12/0238Memory management in non-volatile memory, e.g. resistive RAM or ferroelectric memory
    • G06F12/0246Memory management in non-volatile memory, e.g. resistive RAM or ferroelectric memory in block erasable memory, e.g. flash memory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F13/00Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • G06F13/14Handling requests for interconnection or transfer
    • G06F13/16Handling requests for interconnection or transfer for access to memory bus
    • G06F13/1668Details of memory controller
    • G06F13/1684Details of memory controller using multiple buses
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/72Details relating to flash memory management
    • G06F2212/7202Allocation control and policies
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/72Details relating to flash memory management
    • G06F2212/7203Temporary buffering, e.g. using volatile buffer or dedicated buffer blocks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/72Details relating to flash memory management
    • G06F2212/7208Multiple device management, e.g. distributing data over multiple flash devices

Definitions

  • the invention relates to a method and to an apparatus for storing at least two data streams into an array of memories, or for reading at least two data streams from an array of memories, wherein the array is arranged such that the memories are accessed via multiple data or memory buses to each of which multiple memories are connected, and wherein only blocks of data can be programmed in, or read from, a memory at a time.
  • High speed mass storage devices built with non-volatile memory chips are suitable for recording and playing back concurrent data/video streams under real-time conditions, e.g. for video productions in film and broadcast studio environments.
  • Higher spatial resolutions, higher frame rates and uncompressed multiple stream recording for 3D productions are increasing the requirements for storage media bandwidth and processing power capabilities.
  • non- volatile memories like NAND flash devices are used for recording digital video.
  • Nonvolatile memory devices are organised as an array of pro- gramable and readable memory pages, comprising data blocks of some kilobytes size. If data are to be written into a NAND flash memory device, it is necessary to program a full page of some kilobytes size (PAGESIZE) . Additionally, the flash device needs a non-negligible processing time for writing such page into its internal memory array. During this programming time no other read/write commands can be executed on that flash device. Therefore the memory bus resources connecting the flash device with its controller will be unused during most of the time.
  • PAGESIZE some kilobytes size
  • current NAND flash devices may have an 8-bit memory bus as external interface that can be driven with a speed of 40MHz.
  • the memory bus resource has a full bandwidth of approximately 40MB/s, but the NAND flash device is written by programming operations of 2KB pages that may last up to 600ps. This will result in a sustained bandwidth of approximately 3,2MB/s.
  • controller 15 passes input data 10 to, or output data 10 from, Y parallel memory buses MB1 to MBY, to each of which X flash memory devices MDy .1 to MDy.X are connected.
  • controller 15 does not show the prior art but is explained below in connection with the invention.
  • This kind of flash memory arrangement has the advantage that almost unlimited bandwidths can be provided for flash storage media. But increasing bandwidth leads also to an increasing amount of data that need to be written coherently. Only after a block of Y*PAGESIZE of data is available, these data can be programmed to the corresponding NAND flash mem- ory devices on all Y parallel memory buses. To guarantee a specific minimum read or write bandwidth for input/output data 10, it is necessary to read/write the data in an interleaved manner as mentioned above. That implies the need to read/write sequential blocks of a size X*Y*PAGESIZE .
  • One solution could be to buffer data of each data stream in an independent buffer, and to write data of the different streams consecutively to the flash memory array only after one buffer contains an amount of Y*PAGESIZE data.
  • processing would scramble the memory pages of different data streams into sequential 'interleaved blocks', whereby the interleaving is guaranteed when the data streams are written to and read from the flash memory array concurrently. But in case a single data stream only shall be read from the flash array, the interleaving is not ensured.
  • consecutive flash pages in a NAND flash memory are coupled to 'Erase Blocks', and in case one separate data stream is to be erased it would be necessary to implement a wasteful procedure for preserving the data pages of the other data streams.
  • a buffer of size 2*Y*PAGESIZE * NUMBER_OF_STREAMS would be required.
  • a problem to be solved by the invention is to provide a high-speed real-time flash memory processing in which multi- pie incoming data streams can be concurrently written to, or read from, independent areas of a flash memory array using a minimum buffer size only, and wherein interleaving requirements are fulfilled for each stream independently.
  • This problem is solved by the methods disclosed in claims 1 and 3.
  • An apparatus that utilises this method is disclosed in claims 2 and 4.
  • the Y parallel memory buses are not used as strictly parallel memory buses, but as serial data lanes (i.e. memory buses) . It is not necessary to buffer data until the amount of incoming data of one data stream will suffice writing corresponding data pages on all Y memory buses. Instead, data are written in the flash memory array as soon as one of the data streams has enough data buffered for writing a full 'interleaving block' on one mem- ory bus. In combination with a smart buffer control it is possible to allocate and use a minimal number of small buffers in a flexible way. Data of the different data streams are also concurrently written to the memory buses of the flash memory array and, depending on the receiving time of the data, it is advantageously possible to handle storage, or replay, of the data streams in a more flexible and effective manner.
  • the inventive method is suited for storing or recording at least two data streams or files into an array of memories, wherein said array is arranged such that the memories are accessed via multiple memory buses to each of which multiple memories are connected, and wherein only blocks of data can be programmed in a memory at a time, said method including the steps:
  • the inventive apparatus is suited for storing or recording at least two data streams or files into an ar- ray of memories, wherein said array is arranged such that the memories are accessed via multiple memory buses to each of which multiple memories are connected, and wherein only blocks of data can be programmed in a memory at a time, said apparatus including:
  • - means being adapted for collecting data from different ones of said data streams in corresponding different buffers until the amount of collected data for a current one of said data streams in a current buffer equals the size of a current one of said data blocks, wherein the number of buffers is different than the number of data streams;
  • - means being adapted for storing the data of said current data block from that current buffer into a memory or memories connected to a current one of said memory buses, wherein the following buffered data block of the related data stream is later on stored into a memory or memories connected to a following one of said memory buses, the number of said following memory bus being increased or decreased, respectively, with respect to the number of the current memory bus,
  • the inventive method is suited for reading or replaying at least two data streams or files from an array of memories, wherein said array is arranged such that the memories are accessed via multiple memory buses to each of which multiple memories are connected, and wherein only blocks of data can be read from a memory at a time, said method including the steps : a) reading data of a current data block from a memory or memories connected to a current one of said memory buses and storing them into a current buffer, wherein the following data block of the related data stream is later on read from a memory or memories connected to a following one of said memory buses, the number of said following memory bus being increased or decreased, respectively, with respect to the number of the current memory bus;
  • the inventive apparatus is suited for reading or replaying at least two data streams or files from an array of memories, wherein said array is arranged such that the memories are accessed via multiple memory buses to each of which multiple memories are connected, and wherein only blocks of data can be read from a memory at a time, said apparatus including:
  • - means being adapted for reading data of a current data block from a memory or memories connected to a current one of said memory buses and for storing them into a current buffer, wherein the following data block of the related data stream is later on read from a memory or memories connected to a following one of said memory buses, the number of said following memory bus being increased or decreased, respec- tively, with respect to the number of the current memory bus ;
  • - means being adapted for assembling data of different ones of said data streams from corresponding different buffers, wherein the number of buffers is different than the number of data streams,
  • Fig. 1 Inventive flash memory device
  • FIG. 2 Flowchart of the memory control
  • FIG. 3 Flowchart of the memory control for multiple data streams with equal data rates.
  • the format of the multiple data streams of the data input or output can conform the Ethernet protocol whereas the data for storage in, or read from, the memory array must be addressed according to a different protocol or format.
  • the invention deviates from the known strictly parallel handling of memory buses. Instead, the parallel memory buses are regarded as representing serial memory buses (i.e. lanes) that are fed, or read, by concurrent memory bus processing engines.
  • the known parallelism is broken with respect to temporal direc- tion because the processing of the memory buses no more starts and ends at the same time instants.
  • the known paral- lelism is also broken with respect to processing location because different memory buses can access different data streams in a different or flexible way.
  • Incoming data of different data streams which data streams can have differing data rates, are buffered in an independent FIFO buffer for each data stream.
  • X*PAGESIZE size of one 'interleaving block'
  • these already stored data are written at the actual position (i.e. the current position at which the data stream is to be written or its writing is to be continued) on the current memory bus for that data stream.
  • Successive 'interleaving blocks' of that data stream are distributed in an ascending order (or, as an alternative embodiment, in de- scending order) to the Y memory buses, i.e.
  • a first data block of a particular data stream is written to memory bus 1, the second data block to e.g. memory bus 2, the third data block to e.g. memory bus 3, the Yth data block to e.g. memory bus Y, and (Y+l)th data block again to memory bus I , and so on.
  • a parallelism in processing on the memory buses for a given data stream is remaining in order to achieve the desired bandwidth for that data stream read/write access.
  • such regular structure of memory bus ac- Steps is minimising the demands on a file system that is required for arranging data streams/files on a recording storage medium. This procedure supports the parallelism of data streams because, depending on the current incoming data, different memory buses can access different data streams concurrently.
  • the memory controller 11 receives streaming data 10 of up to N different data streams.
  • a number of Z buffers of size X*PAGESIZE are arranged between memory controller 11 and an internal memory bus MBy connecting the memory bus processing engines 13, wherein the number Z of buffers is different than the number N of data streams.
  • Controller 11 forwards the incoming data from each stream to a different one of the buffers, wherein any buffer can be allocated to any stream.
  • memory controller 11 assigns that buffer (and related storage information, like physical address etc.) to an appropriate memory bus processing engine 13 for memory bus MBy, followed by loading the data block into that memory bus processing engine, and allocates the next free buffer to that data stream .
  • the processing of the received stream data in memory controller 11 is shown in Fig. 2.
  • - ac- tual_fifo(i) to its current buffer is provided by controller 11.
  • the memory bus 'y' to which the next data of a stream i must be sent to is registered in value actual_bus ( i) .
  • the stream i data are forwarded to the actual buffer data input actual_fifo(i) .datain as long as a corresponding flag actual_fifo(i) .full is not set (step 23) .
  • step 24 the buffer data are passed in step 24 to memory bus processing engine ac- tual_bus(i) and value actual_bus ( i) is incremented to ac- tual_bus ( i+1) (or decremented to actual_bus ( i-1) in a different embodiment) in step 25, in a circular manner among memory bus numbers MB1 to MBY.
  • value ac- tual_fifo(i) is set to the number z nex -
  • the current memory bus processing engine 13 number ac- tual_bus(i) programs data of buffer number actual_fifo(i) to the flash memory devices of the corresponding memory bus in the order the buffer data has arrived.
  • the data stored in the buffers can have a format or timing or order of sequence different than the format or timing or order of sequence required for storing the data in the memory array.
  • the memory bus processing engine 13 also converts the incom- ing data format or timing or order of sequence into the data format required for output. After a memory bus processing engine has programmed stream data and has emptied a buffer, the control of that buffer is handed back to memory controller 11.
  • N is the number of concurrent data streams
  • Y is the number of memory buses
  • X is the number of interleaved flash devices. That amount of buffers would be needed for processing the worst case: all data streams are to be written into the same memory buses concurrently.
  • X*PAGESIZE is required to guarantee availability of the full data rate under all conditions.
  • N data streams having equal data rates are recorded concurrently.
  • Each one of these data streams has a maximum data rate of Y/N*DATARATE_OF_ONE_FLASH_BUS .
  • the processing described in connection with Fig. 2 is extended by a register storing the value last_accessed_bus that contains the number of the last memory bus into which stream signal data have been written. To start recording different streams on shifted memory buses, the following procedure could be used:
  • a reference value z current actual_fifo(i) to its current buffer is provided by controller 11.
  • the memory bus to which the next data of a stream i must be sent to is registered in value actual_ bus(i) .
  • the stream i data are forwarded to the actual buffer data input actual_fifo(i) .datain as long as a corresponding flag actual_fifo(i) .full is not set (step 33) .
  • buffer actual_fifo(i) is full, it is checked in step 37 whether or not data stream i is accessed for the first time. If true, the value actual_bus ( i) is set in step 38 to the sum of values last_accessed_bus and Y/N.
  • step 34 The buffer data are passed in step 34 to memory bus processing engine actual_bus ( i) .
  • step 35 the value
  • step 36 value actual_fifo(i) is set to the number z nex -
  • Memory bus processing engine 13 number receives the desired data from memory bus actual_bus ( i) and converts the incoming data format or timing or order of sequence into the data format required for output to the current buffer number actual_fifo(i) .
  • controller 11 assembles the data output from the buffers to the data streams 10 in the desired format.
  • the invention facilitates read/write access for multiple data streams in real-time. An access faster than real-time to separate data streams is possible. It is still possible to read or replay the data streams with maximum bandwidth. It is still possible to delete data streams independently from each other.
  • the invention can be used in all block-based storage media that are organised in parallel data paths.

Abstract

High speed mass storage devices using NAND flash memories (MDY.X) are suitable for recording and playing back a video data stream under real-time conditions, wherein the data are handled page-wise in the flash memories and are written in parallel to multiple memory buses (MBy). However, for operating with multiple independent data streams a significant buffer size is required. According to the invention, data from different data streams are collected in corresponding different buffers (FIFO 1,..., FIFO Z) until the amount of collected data in a current buffer corresponds to a current one of the data blocks. Then, the data of the current data block from the current buffer are stored into memories connected to a current one of the memory buses, wherein the following buffered data block of the related data stream is later on stored into memories connected to a following one of the memory buses, the number of the following memory bus being increased with respect to the number of the current memory bus. These steps are repeated, also for the other ones of the data streams using other available ones of the buffers and other ones of the memory buses. In combination with a corresponding buffer control it is possible to allocate and use a minimum number of buffers in a flexible way.

Description

STORING/READING SEVERAL DATA STREAMS INTO/FROM AN ARRAY OF
MEMORIES
The invention relates to a method and to an apparatus for storing at least two data streams into an array of memories, or for reading at least two data streams from an array of memories, wherein the array is arranged such that the memories are accessed via multiple data or memory buses to each of which multiple memories are connected, and wherein only blocks of data can be programmed in, or read from, a memory at a time.
Background
High speed mass storage devices built with non-volatile memory chips are suitable for recording and playing back concurrent data/video streams under real-time conditions, e.g. for video productions in film and broadcast studio environments. Higher spatial resolutions, higher frame rates and uncompressed multiple stream recording for 3D productions are increasing the requirements for storage media bandwidth and processing power capabilities. Since some years non- volatile memories like NAND flash devices are used for recording digital video. To fulfil the special storage requirements of digital video production with off-the-shelf NAND flash devices in mobile embedded storage media, a special handling of data and of NAND flash memories is re- quired.
Writing data to state-of-the-art NAND flash memories requires a special processing of data flow caused by the internal architecture of NAND flash memories. Such nonvolatile memory devices are organised as an array of pro- gramable and readable memory pages, comprising data blocks of some kilobytes size. If data are to be written into a NAND flash memory device, it is necessary to program a full page of some kilobytes size (PAGESIZE) . Additionally, the flash device needs a non-negligible processing time for writing such page into its internal memory array. During this programming time no other read/write commands can be executed on that flash device. Therefore the memory bus resources connecting the flash device with its controller will be unused during most of the time. To optimise utilisation of memory bus resources, it is known to connect multiple NAND flash devices to the same address/memory bus and to use them in an interleaved manner: the flash devices are processed one after the other, and while the first device is busy following a programming command, the memory bus resources are used to handle the other devices sharing the same memory bus. Manufacturers of NAND flash devices are supporting such kind of processing by integrating multiple dies of a flash device in one integrated circuit (IC), sharing a common external memory bus. Therefore such interleaved processing is feasible on a single IC. Depending on the time required for the programming/reading operations, it is possible to choose the number of interleaved devices in such a way that the bandwidth of the controlling memory bus is used in an optimum manner. For example, current NAND flash devices may have an 8-bit memory bus as external interface that can be driven with a speed of 40MHz. The memory bus resource has a full bandwidth of approximately 40MB/s, but the NAND flash device is written by programming operations of 2KB pages that may last up to 600ps. This will result in a sustained bandwidth of approximately 3,2MB/s. In order to use the full 40MB/s bandwidth of the memory bus resource, it would be necessary to connect 12 NAND flash devices to the memory bus and to use them in an interleaved manner.
To provide bandwidths higher than one memory bus can handle, data are written in parallel to multiple memory buses, whereby multiple flash pages on different flash devices are programmed concurrently. A corresponding structure of flash memories is shown in Fig. 1. A known controller 15 passes input data 10 to, or output data 10 from, Y parallel memory buses MB1 to MBY, to each of which X flash memory devices MDy .1 to MDy.X are connected. However, the internal structure shown within controller 15 does not show the prior art but is explained below in connection with the invention.
This kind of flash memory arrangement has the advantage that almost unlimited bandwidths can be provided for flash storage media. But increasing bandwidth leads also to an increasing amount of data that need to be written coherently. Only after a block of Y*PAGESIZE of data is available, these data can be programmed to the corresponding NAND flash mem- ory devices on all Y parallel memory buses. To guarantee a specific minimum read or write bandwidth for input/output data 10, it is necessary to read/write the data in an interleaved manner as mentioned above. That implies the need to read/write sequential blocks of a size X*Y*PAGESIZE .
Invention
When accessing only a single data stream or file, there is no problem in ensuring a sequential read/write or recording and playback, respectively. But in case multiple concurrent data streams are to be written to, or read from, the NAND flash array, some effort is required to comply with these conditions .
One solution could be to buffer data of each data stream in an independent buffer, and to write data of the different streams consecutively to the flash memory array only after one buffer contains an amount of Y*PAGESIZE data. However, such processing would scramble the memory pages of different data streams into sequential 'interleaved blocks', whereby the interleaving is guaranteed when the data streams are written to and read from the flash memory array concurrently. But in case a single data stream only shall be read from the flash array, the interleaving is not ensured. Additionally, consecutive flash pages in a NAND flash memory are coupled to 'Erase Blocks', and in case one separate data stream is to be erased it would be necessary to implement a wasteful procedure for preserving the data pages of the other data streams. For implementing a corresponding solution using double buffering processing, a buffer of size 2*Y*PAGESIZE * NUMBER_OF_STREAMS would be required.
To avoid scrambled data in the interleaved blocks and in the erasing blocks, it is necessary to handle the different data streams as independent data blocks in the flash array. For writing incoming data streams independently in the flash array while still fulfilling the interleaving requirements, the incoming data must be buffered until the buffer of one specific data stream has collected enough data for writing a full 'interleaving block' in the flash memory array. But that solution would require a large buffer size of
2*Y*X*PAGESIZE * NUMBER_OF_STREAMS .
A problem to be solved by the invention is to provide a high-speed real-time flash memory processing in which multi- pie incoming data streams can be concurrently written to, or read from, independent areas of a flash memory array using a minimum buffer size only, and wherein interleaving requirements are fulfilled for each stream independently. This problem is solved by the methods disclosed in claims 1 and 3. An apparatus that utilises this method is disclosed in claims 2 and 4.
According to the invention, the Y parallel memory buses are not used as strictly parallel memory buses, but as serial data lanes (i.e. memory buses) . It is not necessary to buffer data until the amount of incoming data of one data stream will suffice writing corresponding data pages on all Y memory buses. Instead, data are written in the flash memory array as soon as one of the data streams has enough data buffered for writing a full 'interleaving block' on one mem- ory bus. In combination with a smart buffer control it is possible to allocate and use a minimal number of small buffers in a flexible way. Data of the different data streams are also concurrently written to the memory buses of the flash memory array and, depending on the receiving time of the data, it is advantageously possible to handle storage, or replay, of the data streams in a more flexible and effective manner.
In principle, the inventive method is suited for storing or recording at least two data streams or files into an array of memories, wherein said array is arranged such that the memories are accessed via multiple memory buses to each of which multiple memories are connected, and wherein only blocks of data can be programmed in a memory at a time, said method including the steps:
a) collecting data from different ones of said data streams in corresponding different buffers until the amount of collected data for a current one of said data streams in a current buffer equals the size of a current one of said data blocks, wherein the number of buffers is different than the number of data streams;
b) storing the data of said current data block from that current buffer into a memory or memories connected to a current one of said memory buses, wherein the following buff- ered data block of the related data stream is later on stored into a memory or memories connected to a following one of said memory buses, the number of said following memory bus being increased or decreased, respectively, with respect to the number of the current memory bus;
c) repeatedly performing steps a) and b) , also for the other ones of said data streams using other available ones of said buffers and other ones of said memory buses.
In principle the inventive apparatus is suited for storing or recording at least two data streams or files into an ar- ray of memories, wherein said array is arranged such that the memories are accessed via multiple memory buses to each of which multiple memories are connected, and wherein only blocks of data can be programmed in a memory at a time, said apparatus including:
- means being adapted for collecting data from different ones of said data streams in corresponding different buffers until the amount of collected data for a current one of said data streams in a current buffer equals the size of a current one of said data blocks, wherein the number of buffers is different than the number of data streams;
- means being adapted for storing the data of said current data block from that current buffer into a memory or memories connected to a current one of said memory buses, wherein the following buffered data block of the related data stream is later on stored into a memory or memories connected to a following one of said memory buses, the number of said following memory bus being increased or decreased, respectively, with respect to the number of the current memory bus,
wherein said collecting means and said storing means are operating repeatedly, also for the other ones of said data streams using other available ones of said buffers and other ones of said memory buses. In principle, the inventive method is suited for reading or replaying at least two data streams or files from an array of memories, wherein said array is arranged such that the memories are accessed via multiple memory buses to each of which multiple memories are connected, and wherein only blocks of data can be read from a memory at a time, said method including the steps : a) reading data of a current data block from a memory or memories connected to a current one of said memory buses and storing them into a current buffer, wherein the following data block of the related data stream is later on read from a memory or memories connected to a following one of said memory buses, the number of said following memory bus being increased or decreased, respectively, with respect to the number of the current memory bus;
b) assembling data of different ones of said data streams from corresponding different buffers, wherein the number of buffers is different than the number of data streams;
c) repeatedly performing steps a) and b) , also for the other ones of said data streams using other available ones of said buffers and other ones of said memory buses.
In principle the inventive apparatus is suited for reading or replaying at least two data streams or files from an array of memories, wherein said array is arranged such that the memories are accessed via multiple memory buses to each of which multiple memories are connected, and wherein only blocks of data can be read from a memory at a time, said apparatus including:
- means being adapted for reading data of a current data block from a memory or memories connected to a current one of said memory buses and for storing them into a current buffer, wherein the following data block of the related data stream is later on read from a memory or memories connected to a following one of said memory buses, the number of said following memory bus being increased or decreased, respec- tively, with respect to the number of the current memory bus ;
- means being adapted for assembling data of different ones of said data streams from corresponding different buffers, wherein the number of buffers is different than the number of data streams,
and wherein said reading means and said assembling means are operating repeatedly, also for the other ones of said data streams using other available ones of said buffers and other ones of said memory buses. Advantageous additional embodiments of the invention are disclosed in the respective dependent claims.
Drawings
Exemplary embodiments of the invention are described with reference to the accompanying drawings, which show in:
Fig. 1 Inventive flash memory device;
Fig. 2 Flowchart of the memory control;
Fig. 3 Flowchart of the memory control for multiple data streams with equal data rates.
Exemplary embodiments
Accessing multiple data streams or files in the flash memory array requires as much flexibility for data block access as possible. For example, the format of the multiple data streams of the data input or output can conform the Ethernet protocol whereas the data for storage in, or read from, the memory array must be addressed according to a different protocol or format. To provide high-bandwidth access to a NAND flash memory array with a minimum of buffer resources and a maximum of flexibility in data stream access, the invention deviates from the known strictly parallel handling of memory buses. Instead, the parallel memory buses are regarded as representing serial memory buses (i.e. lanes) that are fed, or read, by concurrent memory bus processing engines. The known parallelism is broken with respect to temporal direc- tion because the processing of the memory buses no more starts and ends at the same time instants. The known paral- lelism is also broken with respect to processing location because different memory buses can access different data streams in a different or flexible way.
Incoming data of different data streams, which data streams can have differing data rates, are buffered in an independent FIFO buffer for each data stream. As soon as the amount of buffered data of one data stream is beginning to exceed the size of one 'interleaving block' (X*PAGESIZE) on one memory bus, these already stored data are written at the actual position (i.e. the current position at which the data stream is to be written or its writing is to be continued) on the current memory bus for that data stream. Successive 'interleaving blocks' of that data stream are distributed in an ascending order (or, as an alternative embodiment, in de- scending order) to the Y memory buses, i.e. a first data block of a particular data stream is written to memory bus 1, the second data block to e.g. memory bus 2, the third data block to e.g. memory bus 3, the Yth data block to e.g. memory bus Y, and (Y+l)th data block again to memory bus I , and so on. Thus, because of concurrent memory bus processing, a parallelism in processing on the memory buses for a given data stream is remaining in order to achieve the desired bandwidth for that data stream read/write access. Advantageously, such regular structure of memory bus ac- cesses is minimising the demands on a file system that is required for arranging data streams/files on a recording storage medium. This procedure supports the parallelism of data streams because, depending on the current incoming data, different memory buses can access different data streams concurrently.
The inventive processing can be implemented in a device as shown in the block diagram of Fig. 1. A memory controller 11 passes input or output data 10 via Z FIFO buffer stages 12 (with running index z = 1 to Z) and Y memory bus processing engines 13 to or from, respectively, Y parallel memory buses MB1 to MBY (with running index y = 1 to Y) to each of which X flash memory devices MDy .1 to MDy.X are connected, representing a memory array of Y*X memories MD1.1 to MDY.X. The memory controller 11 receives streaming data 10 of up to N different data streams. For intermediately buffering the stream data, a number of Z buffers of size X*PAGESIZE are arranged between memory controller 11 and an internal memory bus MBy connecting the memory bus processing engines 13, wherein the number Z of buffers is different than the number N of data streams. Controller 11 forwards the incoming data from each stream to a different one of the buffers, wherein any buffer can be allocated to any stream. As soon as a buffer is full of data of one of the data streams, memory controller 11 assigns that buffer (and related storage information, like physical address etc.) to an appropriate memory bus processing engine 13 for memory bus MBy, followed by loading the data block into that memory bus processing engine, and allocates the next free buffer to that data stream . The processing of the received stream data in memory controller 11 is shown in Fig. 2. For every received data stream i (step 21) a reference value zcurren-|- = ac- tual_fifo(i) to its current buffer is provided by controller 11. The memory bus 'y' to which the next data of a stream i must be sent to is registered in value actual_bus ( i) . In step 22, the stream i data are forwarded to the actual buffer data input actual_fifo(i) .datain as long as a corresponding flag actual_fifo(i) .full is not set (step 23) . As soon as buffer actual_fifo(i) is full, the buffer data are passed in step 24 to memory bus processing engine ac- tual_bus(i) and value actual_bus ( i) is incremented to ac- tual_bus ( i+1) (or decremented to actual_bus ( i-1) in a different embodiment) in step 25, in a circular manner among memory bus numbers MB1 to MBY. In step 26, value ac- tual_fifo(i) is set to the number znex-|- of the next free buffer among buffer numbers 1 to Z, and in step 21 the fol- lowing data block of stream i is received by controller 11. The current memory bus processing engine 13 number ac- tual_bus(i) programs data of buffer number actual_fifo(i) to the flash memory devices of the corresponding memory bus in the order the buffer data has arrived. For example, the data stored in the buffers can have a format or timing or order of sequence different than the format or timing or order of sequence required for storing the data in the memory array. The memory bus processing engine 13 also converts the incom- ing data format or timing or order of sequence into the data format required for output. After a memory bus processing engine has programmed stream data and has emptied a buffer, the control of that buffer is handed back to memory controller 11.
For such processing it is assumed that data of the different streams is received randomly and is written with separate regular structures into the memory array. To minimise the required buffer size X*PAGESIZE, the special properties of streaming data can be used. Given that the flash memory fabric is designed to achieve at least the write data rate of the sum of the data rates of the incoming data streams (whereby it is assumed that one memory bus can handle 1/Y of this data rate), one would need N*Y FIFO buffers for buffer- ing the incoming data for N simultaneous streams and in addition N+l FIFO buffers for bridging the time gap until the first buffer can be re-used. So, an amount of N*Y+N+1 buffers of size X*PAGESIZE is required, wherein N is the number of concurrent data streams, Y is the number of memory buses and X is the number of interleaved flash devices. That amount of buffers would be needed for processing the worst case: all data streams are to be written into the same memory buses concurrently. However, with the above described procedure it is additionally possible to reduce the number of buffers by the minimum number of freed buffers: as soon as the content of a buffer starts to be written into the flash memory devices, an amount of Y buffers is filled, until that buffer is freed. Assuming that an amount of N-l buffers may need to wait (while another data stream is written) before their content can be written into the memory bus, an amount of ) freed buffers will result.
Figure imgf000013_0001
Therefore, when using the processing described above, a to- tal amount of Z = (N * Y) + ) buffers of size
Figure imgf000013_0002
X*PAGESIZE is required to guarantee availability of the full data rate under all conditions.
In a further embodiment, N data streams having equal data rates are recorded concurrently.
Each one of these data streams has a maximum data rate of Y/N*DATARATE_OF_ONE_FLASH_BUS . Advantageously it is possible to minimise the required number of buffers (i.e. FIFOs) to Y+N+1 when preventing that data streams are written concurrently into a single one of the memory buses. That is accomplished by starting the writing of the different data streams in every processing cycle on different memory buses. The processing described in connection with Fig. 2 is extended by a register storing the value last_accessed_bus that contains the number of the last memory bus into which stream signal data have been written. To start recording different streams on shifted memory buses, the following procedure could be used:
For every received data stream i (step 31) a reference value zcurrent = actual_fifo(i) to its current buffer is provided by controller 11. The memory bus to which the next data of a stream i must be sent to is registered in value actual_ bus(i) . In step 32, the stream i data are forwarded to the actual buffer data input actual_fifo(i) .datain as long as a corresponding flag actual_fifo(i) .full is not set (step 33) . As soon as buffer actual_fifo(i) is full, it is checked in step 37 whether or not data stream i is accessed for the first time. If true, the value actual_bus ( i) is set in step 38 to the sum of values last_accessed_bus and Y/N.
The buffer data are passed in step 34 to memory bus processing engine actual_bus ( i) . In step 35 the value
last_accessed_bus is set to the value actual_bus ( i) , and value actual_bus ( i) is incremented to actual_bus ( i+1) (or decremented to actual_bus ( i-1 ) in a different embodiment), in a circular manner among memory bus numbers MB1 to MBY . In step 36, value actual_fifo(i) is set to the number znex-|- of the next free buffer among buffer numbers 1 to Z, and in step 31 the following data block of stream i is received by controller 11.
For replay of the data stream or streams stored in memory devices MD1.1 to MDY.X, the corresponding inverse steps are carried out. Memory bus processing engine 13 number receives the desired data from memory bus actual_bus ( i) and converts the incoming data format or timing or order of sequence into the data format required for output to the current buffer number actual_fifo(i) . Besides controlling the memory bus processing engines and the buffers, controller 11 assembles the data output from the buffers to the data streams 10 in the desired format. The invention facilitates read/write access for multiple data streams in real-time. An access faster than real-time to separate data streams is possible. It is still possible to read or replay the data streams with maximum bandwidth. It is still possible to delete data streams independently from each other.
The invention can be used in all block-based storage media that are organised in parallel data paths.

Claims

Claims
1. Method for storing or recording at least two data streams or files (10) into an array of NAND flash memories
(MD1.1, MDY.X), wherein said array is arranged such that said memories are accessed via multiple memory buses (MB1, MBY) to each of which multiple memories
(MDy.l, MDy.X) are connected, and wherein only blocks (PAGESIZE) of data can be programmed in a memory at a time,
characterised by the steps :
a) collecting (21, 22; 31, 32) data from different ones of said data streams (i) in corresponding different buffers (FIFO 1, FIFO Z) until (23; 33) the amount of col- lected data for a current one of said data streams in a current buffer equals the size of a current one of said data blocks (PAGESIZE), wherein the number (Z) of buffers is different than the number (N) of data streams;
b) storing (24; 34) the data of said current data block from that current buffer ( actual_fifo ( i) ) into a memory or memories connected to a current one ( actual_bus ( i) ) of said memory buses, wherein the following buffered data block of the related data stream (i) is later on stored into a memory or memories connected to a following one of said memory buses, the number ( actual_bus ( i+1 ) , ac- tual_bus ( i-1) ) of said following memory bus being increased or decreased (25; 35), respectively, with respect to the number of the current memory bus;
c) repeatedly performing steps a) and b) , also for the other ones of said data streams using other available ones of said buffers (FIFO 1, FIFO Z) and other ones of said memory buses .
2. Apparatus for storing or recording at least two data
streams or files (10) into an array of NAND flash memories (MD1.1, MDY.X), wherein said array is arranged such that said memories are accessed via multiple memory buses (MB1, MBY) to each of which multiple memories
(MDy.l, MDy.X) are connected, and wherein only blocks (PAGESIZE) of data can be programmed in a memory at a time, said apparatus including:
- means (11) being adapted for collecting data from different ones of said data streams (i) in corresponding different buffers (FIFO 1, FIFO Z) until the amount of collected data for a current one of said data streams in a current buffer equals the size of a current one of said data blocks (PAGESIZE), wherein the number (Z) of buffers is different than the number (N) of data streams;
- means (13) being adapted for storing the data of said
current data block from that current buffer (ac- tual_fifo(i)) into a memory or memories connected to a current one ( actual_bus ( i) ) of said memory buses (MB1, ... , MBY) , wherein the following buffered data block of the related data stream (i) is later on stored into a memory or memories connected to a following one of said memory buses (MB1, MBY), the number (ac- tual_bus ( i+1) , actual_bus ( i-1 ) ) of said following memory bus being increased or decreased, respectively, with respect to the number of the current memory bus,
wherein said collecting means (11) and said storing means (13) are operating repeatedly, also for the other ones of said data streams using other available ones of said buffers (FIFO 1, ..., FIFO Z) and other ones of said memory buses. 3. Method for reading or replaying at least two data streams or files (10) from an array of NAND flash memories
(MD1.1, MDY.X), wherein said array is arranged such that said memories are accessed via multiple memory buses (MB1, MBY) to each of which multiple memories
(MDy.l, MDy.X) are connected, and wherein only blocks (PAGESIZE) of data can be read from a memory at a time,
characterised by the steps :
a) reading data of a current data block (PAGESIZE) from a memory or memories connected to a current one (ac- tual_bus(i)) of said memory buses and storing them into a current buffer { actual_fifo { i) ) , wherein the following data block of the related data stream (i) is later on read from a memory or memories connected to a following one of said memory buses, the number ( actual_bus ( i+1 ) , actual_bus ( i-1 ) ) of said following memory bus being increased or decreased (25; 35), respectively, with respect to the number of the current memory bus;
b) assembling (11) data of different ones of said data
streams (i) from corresponding different buffers (FIFO 1, FIFO Z), wherein the number (Z) of buffers is different than the number (N) of data streams;
c) repeatedly performing steps a) and b) , also for the other ones of said data streams using other available ones of said buffers (FIFO 1, FIFO Z) and other ones of said memory buses.
Apparatus for reading or replaying at least two data streams or files (10) from an array of NAND flash memories (MD1.1, MDY.X), wherein said array is arranged such that said memories are accessed via multiple memory buses (MB1, MBY) to each of which multiple memories (MDy.l, MDy.X) are connected, and wherein only blocks (PAGESIZE) of data can be read from a memory at a time, said apparatus including:
means (13) being adapted for reading data of a current data block (PAGESIZE) from a memory or memories connected to a current one ( actual_bus ( i) ) of said memory buses and for storing them into a current buffer ( actual_fifo ( i) ) , wherein the following data block of the related data stream (i) is later on read from a memory or memories connected to a following one of said memory buses, the number ( actual_bus ( i+1 ) , actual_bus ( i-1 ) ) of said following memory bus being increased or decreased, respectively, with respect to the number of the current memory bus ;
- means (11) being adapted for assembling data of different ones of said data streams (i) from corresponding different buffers (FIFO 1, FIFO Z), wherein the number (Z) of buffers is different than the number (N) of data streams ,
and wherein said reading means (11) and said assembling means (13) are operating repeatedly, also for the other ones of said data streams using other available ones of said buffers (FIFO 1, FIFO Z) and other ones of said memory buses .
5. Method according to claim 1 or 3, or apparatus according to claim 2 or 4, wherein the number Z of buffers or buff- ers, respectively, is Z = (N * Y) + N + 1 - round _ up{ _ ) and the storage capacity of each buffer is at least
X*PAGESIZE, and wherein N is the number of data streams,
Y is the number of said memory buses, X is the number of said memories per memory bus, and PAGESIZE is the size of said data blocks. 6. Method according to one of claims 1, 3 and 5, or apparatus according to one of claims 2, 4 and 5, wherein each one of said data streams (10, i) has the same data rate and the writing of the different data streams is started in every processing cycle on different ones of said mem- ory buses.
PCT/EP2011/063619 2010-08-13 2011-08-08 Storing/reading several data streams into/from an array of memories WO2012019997A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US13/816,250 US9026722B2 (en) 2010-08-13 2011-08-08 Storing/reading several data streams into/from an array of memories
EP11741219.7A EP2603856A1 (en) 2010-08-13 2011-08-08 Storing/reading several data streams into/from an array of memories

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP10305890.5 2010-08-13
EP10305890A EP2418584A1 (en) 2010-08-13 2010-08-13 Method and apparatus for storing at least two data streams into an array of memories, or for reading at least two data streams from an array of memories

Publications (1)

Publication Number Publication Date
WO2012019997A1 true WO2012019997A1 (en) 2012-02-16

Family

ID=43402150

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2011/063619 WO2012019997A1 (en) 2010-08-13 2011-08-08 Storing/reading several data streams into/from an array of memories

Country Status (3)

Country Link
US (1) US9026722B2 (en)
EP (2) EP2418584A1 (en)
WO (1) WO2012019997A1 (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9516049B2 (en) 2013-11-13 2016-12-06 ProtectWise, Inc. Packet capture and network traffic replay
US10735453B2 (en) 2013-11-13 2020-08-04 Verizon Patent And Licensing Inc. Network traffic filtering and routing for threat analysis
US9471227B2 (en) * 2014-07-15 2016-10-18 Western Digital Technologies, Inc. Implementing enhanced performance with read before write to phase change memory to avoid write cancellations
WO2016054640A1 (en) * 2014-10-03 2016-04-07 Ezuniverse Inc. Data management system
US10078582B2 (en) * 2014-12-10 2018-09-18 International Business Machines Corporation Non-volatile memory system having an increased effective number of supported heat levels
CN107347058B (en) 2016-05-06 2021-07-23 阿里巴巴集团控股有限公司 Data encryption method, data decryption method, device and system
KR102398181B1 (en) 2017-07-03 2022-05-17 삼성전자주식회사 Storage device previously managing physical address to be allocated for write data
KR102336666B1 (en) * 2017-09-15 2021-12-07 삼성전자 주식회사 Memory device and memory system comprising the same
US10600484B2 (en) 2017-12-20 2020-03-24 Silicon Storage Technology, Inc. System and method for minimizing floating gate to floating gate coupling effects during programming in flash memory
CN108153490A (en) * 2017-12-21 2018-06-12 上海禾赛光电科技有限公司 For datacycle way to play for time and device, storage medium, the terminal of SOCFPGA
CN108470005A (en) * 2018-03-13 2018-08-31 山东超越数控电子股份有限公司 A kind of NandFlash antenna array controls method
CN109450620B (en) 2018-10-12 2020-11-10 创新先进技术有限公司 Method for sharing security application in mobile terminal and mobile terminal
US11082367B2 (en) * 2019-05-10 2021-08-03 Ciena Corporation FlexE frame format using 256b/257b block encoding
JP2021033847A (en) 2019-08-28 2021-03-01 キオクシア株式会社 Memory system and control method
US11429519B2 (en) * 2019-12-23 2022-08-30 Alibaba Group Holding Limited System and method for facilitating reduction of latency and mitigation of write amplification in a multi-tenancy storage drive

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050251617A1 (en) * 2004-05-07 2005-11-10 Sinclair Alan W Hybrid non-volatile memory system
US20100088463A1 (en) * 2008-10-02 2010-04-08 Samsung Electronics Co., Ltd. Nonvolatile memory system and data processing method

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6226708B1 (en) * 1997-08-18 2001-05-01 Texas Instruments Incorporated Method and system for efficiently programming non-volatile memory
US7934074B2 (en) 1999-08-04 2011-04-26 Super Talent Electronics Flash module with plane-interleaved sequential writes to restricted-write flash chips
US7020739B2 (en) 2000-12-06 2006-03-28 Tdk Corporation Memory controller, flash memory system having memory controller and method for controlling flash memory device
US6691205B2 (en) 2001-03-05 2004-02-10 M-Systems Flash Disk Pioneers Ltd. Method for using RAM buffers with simultaneous accesses in flash based storage systems
US20040193782A1 (en) * 2003-03-26 2004-09-30 David Bordui Nonvolatile intelligent flash cache memory
US20050010717A1 (en) 2003-07-07 2005-01-13 Soo-Ching Ng Access and data management method using double parallel tracks for flash memory cells
KR101257848B1 (en) * 2005-07-13 2013-04-24 삼성전자주식회사 Data storing apparatus comprising complex memory and method of operating the same
DE602007010439D1 (en) 2006-03-31 2010-12-23 Mosaid Technologies Inc FLASH MEMORY SYSTEM CONTROL METHOD
US7620784B2 (en) 2006-06-09 2009-11-17 Microsoft Corporation High speed nonvolatile memory device using parallel writing among a plurality of interfaces
US8385133B2 (en) 2006-12-05 2013-02-26 Avnera Corporation High-speed download device using multiple memory chips
EP2264604A1 (en) * 2009-06-15 2010-12-22 Thomson Licensing Device for real-time streaming of two or more streams in parallel to a solid state memory device array

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050251617A1 (en) * 2004-05-07 2005-11-10 Sinclair Alan W Hybrid non-volatile memory system
US20100088463A1 (en) * 2008-10-02 2010-04-08 Samsung Electronics Co., Ltd. Nonvolatile memory system and data processing method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JAE-HONG KIM ET AL: "A methodology for extracting performance parameters in solid state disks (SSDs)", MASCOTS '09, 21 September 2009 (2009-09-21), IEEE, PISCATAWAY, NJ, USA, pages 1 - 10, XP031640233, ISBN: 978-1-4244-4927-9 *
YOON JAE SEONG ET AL: "Hydra: A Block-Mapped Parallel Flash Memory Solid-State Disk Architecture", IEEE TRANSACTIONS ON COMPUTERS, vol. 59, no. 7, July 2010 (2010-07-01), IEEE SERVICE CENTER, LOS ALAMITOS, CA, USA, pages 905 - 921, XP011305073, ISSN: 0018-9340 *

Also Published As

Publication number Publication date
US20130138875A1 (en) 2013-05-30
US9026722B2 (en) 2015-05-05
EP2603856A1 (en) 2013-06-19
EP2418584A1 (en) 2012-02-15

Similar Documents

Publication Publication Date Title
US9026722B2 (en) Storing/reading several data streams into/from an array of memories
US7765359B2 (en) Flash memory system and programming method performed therein
US7644224B2 (en) Flash memory device and method
US9058254B2 (en) Memory device
US7917687B2 (en) Flash memory apparatus and access method to flash memory
US11385831B2 (en) Memory controller and storage device including the same
JP5378197B2 (en) Memory controller, memory card, nonvolatile memory system
JPWO2006051780A1 (en) Nonvolatile memory device and method of accessing nonvolatile memory device
US8010746B2 (en) Data processing apparatus and shared memory accessing method
US8750679B2 (en) Information processing apparatus, recording method, and computer program
KR101581311B1 (en) Flash memory apparatus and method of controlling the same
JP4818404B2 (en) Material server and material storage method
US20120007952A1 (en) Recording control apparatus, semiconductor recording apparatus, recording system, and nonvolatile storage medium
KR101628341B1 (en) Device for real-time streaming of two or more streams in parallel to a solid state memory device array
CA2619344C (en) Content data storage device and its control method
KR20110073815A (en) Imaging device and method for processing image rotation
KR20090056839A (en) Method and device for recording of frames
JP2007249662A (en) Memory card and control method of memory card
US8385133B2 (en) High-speed download device using multiple memory chips
US8452158B2 (en) Recording apparatus, imaging and recording apparatus, recording method, and program
JP4327863B2 (en) Video storage device and control method thereof
JP2007279873A (en) Data recorder
US20090094392A1 (en) System and Method for Data Operations in Memory
JP2016009411A (en) Recording device
JP2008021335A (en) Nonvolatile storage device, writing method of nonvolatile storage memory and controller

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11741219

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 13816250

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

REEP Request for entry into the european phase

Ref document number: 2011741219

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2011741219

Country of ref document: EP