Physics 380, 2011: Lecture 5

From Ilya Nemenman: Theoretical Biophysics @ Emory
Jump to: navigation, search
Emory Logo

Back to the main Teaching page.

Back to Physics 380, 2011: Information Processing in Biology.

Today we are discussing random walks in biology, which we introduced during the previous lecture.

The book that we used to study probability has a great section on random walks: see Introduction to Probability by CM Grinstead and JL Snell. Another, more physics-like book on the subject that I recommend is "Random Walks in Biology" by H. Berg.

Warmup question

  1. Chemical signals from the outside world, such as antigens in the case of immune cells, are typically sensed by receptors on the cell surface. The binding of a signaling molecule changes the receptor's confirmational state. The receptor, still in the membrane, then meets with one or more of enzymes that diffuse in from afar on the membrane and catalyze its various additional modifications. Finally the receptor complex is cleaved, and part of its intra-cellular domain travels to other compartments of the cells, such as nucleus to initiate further signaling events (e.g., transcription). Can you explain why the opposite sequence, where a bound receptor is first cleaved and then its signal relay component is modified in the cytosol, is used much less frequently?

Main Lecture

  • (Biased) random walk
    • steps of length each, and the probability of left-right steps is not the same. For the total displacement, and .
    • Discrete time / discrete space walk is the simple, workable model of essentially all diffusive processes.
  • Properties of the walks depend on the number of dimensions
    • For a diffusive process, the radius of explored region goes as . The number of different sites in the explored region is . But the number of different visited sites is . Hence each site is explored about times. Hence in 1-d each site is explored many times, in 2-d each site is (barely) explored, and in 3-d very few sites are ever explored
  • Introducing the new model system: the E. coli lac operon. A good introduction is (Dreisigmeyer et al., 2008).
    • We write up the model.
    • two problems -- how does one activate transcription when it's shut off, and how does one stop it (or how does a TF find the binding site?)
    • both are related to properties of random walks. Let's discuss them one by one.
  • First passage times: what is a distribution of time until a random walk or diffusion reaches a particular point? (Grinstead and Snell book)
    • Connections between passage, return, and being there: the moment generating functions are all related. Typically problems for first/eventual passage/return/location analysis are solved using moment generating functions. E.g., probability of being at point at time is equal to a probability of first passing through at and then returning to in time . Hence .
    • Result for probability of return at time in 1d: . Show the plot of this.
    • Return and passage probabilities in different dimensions: mean return times diverge in all dimensions; probability of eventual return is 1 in 1-d and 2-d, and about 0.65 in 3-d.
  • Return times and Berg-von Hippel transcription factor searching for a binding site (Berg and von Hippel, 1987; Berg 1981). What is an optimal strategy for a transcription factor to search for a binding site?
    • Why 1-d search would fail? Because too much time is spent on exploration -- you always come back.
    • Why 3-d search would fail? Because very few sites are ever explored, and the TF will not come close to its needed target.
    • Why 1-d/3-d search is faster? You can move fast between patches (3-d), and then explore each patch throughly in 1-d way. Details of 1-3d search (following Slutsky and Mirny, 2004):
      • Search partitioned into 1-3d search rounds.
      • Total search time is the sum of search times in both modes: , where is the number of rounds.
      • In 3-d search the protein almost never come back to the same search patch.
      • In 1-d search the protein explores sites. Hence , where is the DNA length.
      • We get
      • for this model, where is the 1d diffusion constant. In general, we get .
      • Thus .
    • Is there an optimal time to spend on a 1-d search? Differentiating w.r.t. , we get . The transcription factor should spend the same amount of time in 1-d and 3-d search modes. Slutsky and Mirny (2004) review experimental confirmations of this.
  • Wiener process: A good model of random walk at long temporal and spatial scales is diffusion. That is . It's useful to represent such as a solution of an ordinary differential equation where is a Gaussian random variable with zero mean and the covariance . See the Homework problem No. 1 for the derivation of this. The random variable is called the Wiener process, after Norbert Wiener, who invented it.