Generalized Linear and Nonlinear Models for Correlated Data: Theory and Applications Using SAS
Ebook · 1,237 pages · 12 hours


About this ebook

Edward Vonesh's Generalized Linear and Nonlinear Models for Correlated Data: Theory and Applications Using SAS is devoted to the analysis of correlated response data using SAS, with special emphasis on applications that require the use of generalized linear models or generalized nonlinear models. Written in a clear, easy-to-understand manner, it provides applied statisticians with the necessary theory, tools, and understanding to conduct complex analyses of continuous and/or discrete correlated data in a longitudinal or clustered data setting. Using numerous and complex examples, the book emphasizes real-world applications where the underlying model requires a nonlinear rather than linear formulation and compares and contrasts the various estimation techniques for both marginal and mixed-effects models. The SAS procedures MIXED, GENMOD, GLIMMIX, and NLMIXED as well as user-specified macros will be used extensively in these applications. In addition, the book provides detailed software code with most examples so that readers can begin applying the various techniques immediately.

This book is part of the SAS Press program.

Language: English
Publisher: SAS Institute
Release date: Jul 7, 2014
ISBN: 9781629592305
Author

Edward F. Vonesh

Edward F. Vonesh, Ph.D., is self-employed as managing member of Vonesh Statistical Consulting, LLC, as well as a part-time employee of Northwestern University, where he supports research in his capacity as professor in the Department of Preventive Medicine. The author of numerous professional papers, he has published in the Journal of the American Statistical Association, Biometrika, Biometrics, and Statistics in Medicine. After twenty-nine years at Baxter Healthcare, Dr. Vonesh recently retired from his position as a senior Baxter research scientist. A SAS user since 1974, he received his B.S. and M.S. from Northern Illinois University and his Ph.D. in biostatistics from the University of Michigan and is a Fellow of the American Statistical Association.


    Book preview

    Generalized Linear and Nonlinear Models for Correlated Data - Edward F. Vonesh

    1

    Introduction

    1.1 Correlated response data

    1.2 Explanatory variables

    1.3 Types of models

    1.4 Some examples

    1.5 Summary features

    Correlated response data, whether discrete (nominal, ordinal, counts), continuous, or a combination thereof, occur in numerous disciplines and, more often than not, require the use of statistical models that are nonlinear in the parameters of interest. In this chapter we briefly describe the different types of correlated response data one encounters in practice as well as the types of explanatory variables and models used to analyze such data.

    1.1 Correlated response data

    Within SAS, there are various models, methods and procedures that are available for analyzing any of four different types of correlated response data: (1) repeated measurements including longitudinal data, (2) clustered data, (3) spatially correlated data, and (4) multivariate data. Brief descriptions of these four types of correlated data follow.

    1.1.1 Repeated measurements

    Repeated measurements may be defined as data where a response variable for each experimental unit or subject is observed on multiple occasions and possibly under different experimental conditions. Within this context, the overall correlation structure among repeated measurements should be flexible enough to reflect the serial nature in which measurements are collected per subject while also accounting for possible correlation associated with subject-specific random effects. We have included longitudinal data under the heading of repeated measurements for several reasons:

    Longitudinal data may be thought of as repeated measurements where the underlying metameter for the occasions at which measurements are taken is time. In this setting, interest generally centers on modeling and comparing trends over time. Consequently, longitudinal data will generally exhibit some form of serial correlation much as one would expect with time-series data. In addition, one may also model correlation induced by a random-effects structure that allows for within- and between-subject variability. Repeated measurements are more general than longitudinal data in that the underlying metameter may be time or it may be a set of experimental conditions (e.g., different dose levels of a drug). Within this broader context, one may be interested in modeling and comparing trends over the range of experimental conditions (e.g., dose-response curves) or simply comparing mean values across different experimental conditions (e.g., a repeated measures analysis of variance).

    Some typical settings where one is likely to encounter repeated measurements and longitudinal data include:

    • Pharmacokinetic studies

    Studies where the plasma concentration of a drug is measured at several time points for each subject with an objective of estimating various population pharmacokinetic parameters.

    • Econometrics

    Panel data entailing both cross-sectional and time-series data together in a two-dimensional array.

    • Crossover studies

    Bioavailability studies, for example, routinely employ two-period, two-treatment crossover designs (e.g., AB | BA) where each subject receives each treatment on each of two occasions.

    • Growth curve studies

    – Pediatric studies examining the growth pattern of children.

    – Agricultural studies examining the growth pattern of plants.

    To illustrate, we consider the orange tree growth curve data presented in Draper and Smith (1981, p. 524) and analyzed by Lindstrom and Bates (1990). The data consist of the trunk circumference (in millimeters) measured at 7 time points on each of five orange trees. As shown in Figure 1.1, the growth patterns exhibit ever-increasing variability over time, a pattern reflective of a random-effects structure.

    While we have lumped longitudinal data together with repeated measurements, it should be noted that event-times data, i.e., data representing time to some event, may also be classified as longitudinal even though the event time may be a single outcome measure such as patient survival. We will examine the analysis of event times data within the context of joint modeling of event times with repeated measurements/longitudinal data.

    1.1.2 Clustered data

    Clustered dependent data occur when observations are grouped in some natural fashion into clusters resulting in within-cluster data that tend to be correlated. Correlation induced by clustering is more often than not accounted for through the specification of a random-effects model in which cluster-specific random effects are used to differentiate within- and between-cluster variability. In some instances, there may be more than one level of clustering resulting in specification of a multi-level random-effects structure. Examples of clustered data include:

    • Paired data

    – Studies on twins where each pair serves as a natural cluster.

    – Ophthalmology studies where a pair of eyes serves as a cluster.

    Figure 1.1 Orange tree growth data

    • Familial or teratologic studies

    – Studies on members of a litter of animals (e.g., toxicology studies).

    – Epidemiology studies of cancer where families serve as clusters.

    • Agricultural studies

    Studies in which different plots of land serve as clusters and measurements within a plot are homogeneous.

    1.1.3 Spatially correlated data

    Spatially correlated data occur when observations include both a response variable and a location vector associated with that response variable. The location vector describes the position or location at which the measured response was obtained. The proximity of measured responses to one another determines the extent to which they are correlated. Lattice data, where measurements are linked to discrete regions (e.g., townships, counties, etc.) rather than some continuous coordinate system, are also considered spatially correlated and are usually obtained from administrative sources such as census, socio-economic, and health data. Examples of spatially correlated data include:

    • Geostatistical data

    Forestry and agronomy studies where sampling from specified (fixed) locations is used to draw inference over an entire region accounting for spatial dependencies. Mineral and petroleum exploration studies where the objective is more likely to be predictive in nature. Here one utilizes spatial variability patterns to help improve one’s ability to predict resources in unmeasured locations.

    • Epidemiological studies

    Studies aimed at describing the incidence and prevalence of a particular disease often use spatial correlation models in an attempt to smooth out region-specific counts so as to better assess potential environmental determinants and spatial patterns associated with the disease.

    • Image analysis

    Image segmentation studies where the goal is to extract information about a particular region of interest from a given image. For example, in the field of medicine, image segmentation may be required to identify tissue regions that have been stained versus not stained. In these settings, modeling spatial correlation associated with a lattice array of pixel locations can help improve digital image analysis.

    1.1.4 Multivariate data

    Historically, the concept of correlation has been closely linked with methods for analyzing multivariate data wherein two or more response variables are measured per experimental unit or individual. Such methods include multivariate analysis of variance, cluster analysis, discriminant analysis, principal components analysis, canonical correlation analysis, etc. This book does not cover those topics but instead considers applications requiring the analysis of multivariate repeated measurements or the joint modeling of repeated measurements and one or more outcome measures that are measured only once. Examples we consider include

    • Multivariate repeated measurements

    Any study where one has two or more outcome variables measured repeatedly over time.

    • Joint modeling of repeated measurements and event-times data

    Studies where the primary goal is to draw inference on serial trends associated with a set of repeated measurements while accounting for possible informative censoring due to dropout. Studies where the primary goal is to draw joint inference on patient outcomes (e.g., patient survival) and any serial trends one might observe in a potential surrogate marker of patient outcome.

    Of course one can easily imagine applications that involve two or more types of correlated data. For example, a longitudinal study may well entail both repeated measurements taken at the individual level and clustered data where groups of individuals form clusters according to some pre-determined criteria (e.g., a group randomized trial). Spatio-temporal data such as found in the mapping of disease rates over time is another example of combining two types of correlated data, namely spatially correlated data with serially correlated longitudinal data.

    The majority of applications in this book deal with the analysis of repeated measurements, longitudinal data, clustered data, and to a lesser extent, spatial data. A more thorough treatment and illustration of applications involving the analysis of spatially correlated data and panel data, for example, may be found in other texts including Cressie (1993), Littell et al. (2006), Hsiao (2003), Frees (2004) and Mátyás and Sevestre (2008). We will also examine applications that require modeling multivariate repeated measurements in which two or more response variables are measured on the same experimental unit or individual on multiple occasions. As the focus of this book is on applications requiring the use of generalized linear and nonlinear models, examples will include methods for analyzing continuous, discrete and ordinal data including logistic regression for binary data, Poisson regression for counts, and nonlinear regression for normally distributed data.

    1.2 Explanatory variables

    When analyzing correlated data, one must first consider the kind of study from which the data were obtained. In this book, we consider data arising from two types of studies: 1) experimental studies, and 2) observational studies. Experimental studies are generally interventional in nature in that two or more treatments are applied to experimental units with a goal of comparing the mean response across different treatments. Such studies may or may not entail randomization. For example, in a parallel group randomized placebo-controlled clinical trial, the experimental study may entail randomizing individuals into two groups: those who receive the placebo control versus those who receive an active ingredient. In contrast, in a simple pre-post study, a measured response is obtained on all individuals prior to receiving a planned intervention and then, following the intervention, a second response is measured. In this case, although no randomization is performed, the study is still experimental in that it does entail a planned intervention. Whether randomization is performed or not, additional explanatory variables are usually collected so as to 1) adjust for any residual confounding that may be present with or without randomization and 2) determine if interactions exist with the intervention.

    In contrast to an experimental study, an observational study entails collecting data on available individuals from some target population. Such data would include any outcome measures of interest as well as any explanatory variables thought to be associated with the outcome measures.

    In both kinds of studies, the set of explanatory variables, or covariates, used to model the mean response can be broadly classified into two categories:

    Within-unit factors or covariates (in longitudinal studies, these are often referred to as time-dependent covariates or repeated measures factors). For repeated measurements and longitudinal studies, examples include time itself, different dose levels applied to the same individual in a dose-response study, different treatment levels given to the same individual in a crossover study. For clustered data, examples include any covariates measured on individuals within a cluster.

    Between-unit factors or covariates (in longitudinal studies, these are often referred to as time-independent covariates). Examples include baseline characteristics in a longitudinal study (e.g., gender, race, baseline age), different treatment levels in a randomized prospective longitudinal study, different cluster-specific characteristics, etc.

    It is important to maintain the distinction between these two types of covariates for several reasons. One, it helps remind us that within-unit covariates model unit-specific trends while between-unit covariates model trends across units. Two, it may help in formulating an appropriate variance-covariance structure depending on the degree of heterogeneity between select groups of individuals or units. Finally, such distinctions are needed when designing a study. For example, sample size will be determined, in large part, by the primary goals of a study. When those goals focus on comparisons that involve within-unit covariates (either main effects or interactions), the number of experimental units needed will generally be smaller than when sample size is based strictly on between-unit comparisons.
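    As a quick illustration of this distinction (hypothetical long-format records, not from the book), a within-unit covariate varies across a subject's rows while a between-unit covariate is constant within each subject:

```python
# Hypothetical long-format dataset: 'time' is a within-unit covariate;
# 'gender' and 'baseline_age' are between-unit covariates.
rows = [
    {"id": 1, "time": 0, "gender": "F", "baseline_age": 52, "y": 10.1},
    {"id": 1, "time": 6, "gender": "F", "baseline_age": 52, "y": 9.4},
    {"id": 2, "time": 0, "gender": "M", "baseline_age": 47, "y": 11.3},
    {"id": 2, "time": 6, "gender": "M", "baseline_age": 47, "y": 10.8},
]

def is_between_unit(rows, var):
    """A between-unit covariate takes exactly one value per subject."""
    values = {}
    for r in rows:
        values.setdefault(r["id"], set()).add(r[var])
    return all(len(v) == 1 for v in values.values())

print(is_between_unit(rows, "gender"))  # True: constant within subject
print(is_between_unit(rows, "time"))    # False: varies within subject
```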

    1.3 Types of models

    While there are several approaches to modeling correlated response data, we will confine ourselves to two basic approaches, namely the use of 1) marginal models and 2) mixed-effects models. With marginal models, the emphasis is on population-averaged (PA) inference where one focuses on the marginal expectation of the responses. Correlation is accounted for solely through specification of a marginal variance-covariance structure. The regression parameters of marginal models describe the population mean response and are most applicable in settings where the data are used to derive public policies. In contrast, with a mixed-effects model, inference is more likely to be subject-specific (SS) or cluster-specific in scope with the focus centering on the individual's mean response. Here correlation is accounted for through specification of subject-specific random effects and possibly an intra-subject covariance structure. Unlike marginal models, the fixed-effects regression parameters of mixed-effects models describe the average individual's response and are more informative when advising individuals of their expected outcomes. When a mixed-effects model is strictly linear in the random effects, the regression parameters will have both a population-averaged and subject-specific interpretation.

    Figure 1.2 Individual and marginal mean responses under a simple negative exponential decay model with random decay rates

    1.3.1 Marginal versus mixed-effects models

    In choosing between marginal and mixed-effects models, one must carefully assess the type of inference needed for a particular application and weigh this against the complexities associated with running each type of model. For example, while a mixed-effects model may make perfect sense as far as its ability to describe heterogeneity and correlation, it may be extremely difficult to draw any population-based inference unless the model is strictly linear in the random effects. Moreover, population-based inference on, say, the marginal means may make little sense in applications where a mixed-effects model is assumed at the start. We illustrate this with the following example.

    Shown in Figure 1.2 are the individual and marginal mean responses from a randomly generated sample of 25 subjects assuming a simple negative exponential decay model with random decay rates. The model used to generate the data is given by:

    $$y_{ij} \mid b_i = \exp\{-\beta_i t_{ij}\}\,[\,1 + \varepsilon_{ij}\,] \qquad (1.1)$$
    $$\beta_i = \beta + b_i \quad \text{with } \beta = 0.20,\; b_i \sim N(0, \sigma_b^2),\; \varepsilon_{ij} \sim \text{iid } N(0, \sigma_\varepsilon^2)$$

    where yij is the response for the ith subject (i = 1,...,n) on the jth occasion, tij (j = 1,...,p), βi is the ith subject’s decay rate, β is a population parameter decay rate, and bi is a subject-specific random effect describing how far the ith individual’s decay rate deviates from the population parameter decay rate. Under this model, subject-specific (SS) inference targets the average individual’s response curve while population-averaged (PA) inference targets the average response in the population. Specifically we have:

    SS Inference - Average Subject’s Response Curve:

    $$E_{y|b}\left[\, y_{ij} \mid b_i = 0 \,\right] = E_{y|b}\left[\, y_{ij} \mid b_i = E_b(b_i) \,\right] = \exp\{-\beta t_{ij}\},$$

    PA Inference - Population-Averaged Response Curve:

    $$E_y\left[\, y_{ij} \,\right] = E_b\left[\, E_{y|b}(y_{ij} \mid b_i) \,\right] = \exp\left\{-\beta t_{ij} + \tfrac{1}{2}\sigma_b^2 t_{ij}^2\right\}.$$

    Notice that for the average subject's response curve, expectation with respect to the random effects is applied within the conditional argument (i.e., we are evaluating a conditional mean at the average random effect) while for the population-averaged response curve, expectation with respect to the random effects is applied to the conditional means (i.e., we are evaluating the average response across subjects). Hence under model (1.1), the population-averaged (i.e., marginal) mean response depends on both the first and second moments of the subject-specific parameter, βi ~ N(β, σb²).

    To contrast the SS response curves and PA response curves, we simulated data assuming 1) σb/β = 0.25 (i.e., a coefficient of variation, CV, of 25% with respect to βi), and 2) σb/β = 0.60 (CV = 60%). As indicated in Figure 1.2, when the coefficient of variation of βi increases, there is a clear divergence between the average subject's response curve and the mean response in the population. This reflects the fact that the marginal mean is a logarithmically convex function in t for all t > β/σb². It is also of interest to note that as the CV increases, the sample size required for the observed means to approximate the expected population means also increases (see Figure 1.3). One may question the validity of using simulated data where the response for some individuals actually increases over time despite the fact that the population parameter β depicts an overall decline over time (here, this will occur whenever the random effect bi < −0.20, as −βi will then be positive). However, such phenomena do occur. For example, in section 1.4, we consider a study among patients with end-stage renal disease where their remaining kidney function, as measured by the glomerular filtration rate (GFR), tends to decline exponentially over time but for a few patients, their GFR actually increases as they regain their renal function.
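    The divergence between the two curves can be checked numerically. The sketch below (an arbitrary time point; parameter values taken from model (1.1) with CV = 60%) evaluates both closed-form curves and verifies the population-averaged formula by Monte Carlo, using the normal moment-generating-function identity E[exp(−b t)] = exp(σb²t²/2):

```python
import numpy as np

beta = 0.20                 # population decay rate from model (1.1)
sigma_b = 0.60 * beta       # CV = sigma_b / beta = 0.60
t = 5.0                     # an arbitrary time point (illustrative choice)

# SS curve: conditional mean evaluated at the average random effect (b = 0)
ss_mean = np.exp(-beta * t)

# PA curve: marginal mean via the normal MGF, E[exp(-b*t)] = exp(sigma_b^2 t^2 / 2)
pa_mean = np.exp(-beta * t + 0.5 * sigma_b**2 * t**2)

# Monte Carlo check of the PA formula
rng = np.random.default_rng(1)
b = rng.normal(0.0, sigma_b, size=1_000_000)
mc_mean = np.exp(-(beta + b) * t).mean()

print(ss_mean, pa_mean, mc_mean)  # PA mean exceeds SS mean; MC agrees with PA
```

    The gap between `pa_mean` and `ss_mean` is exactly the exp{½σb²t²} factor that makes marginal inference awkward under a nonlinear mixed-effects model.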

    1.3.2 Models in SAS

    There are a variety of SAS procedures and macros available to users seeking to analyze correlated data. Depending on the type of data (discrete, ordinal, continuous, or a combination thereof), one can choose from one of four basic categories of models available in SAS: linear models, generalized linear models, nonlinear models and generalized nonlinear models. Within each basic category one can also choose to run a marginal model or a mixed-effects model resulting in the following eight classes of models available in SAS: 1) Linear Models (LM); 2) Linear Mixed-Effects Models (LME models); 3) Generalized Linear Models (GLIM); 4) Generalized Linear Mixed-Effects Models (GLME models); 5) Nonlinear Models (NLM); 6) Nonlinear Mixed-Effects Models (NLME models); 7) Generalized Nonlinear Models (GNLM); and 8) Generalized Nonlinear Mixed-Effects

    Figure 1.3 Observed and expected marginal means for different sample sizes

    Models (GNLME models). The models within any one class are determined through specification of the moments and possibly the distribution functions under which the data are generated. Moment-based specifications usually entail specifying unconditional (marginal) or conditional (mixed-effects) means and variances in terms of their dependence on covariates and/or random effects. Using the term subject to refer to an individual, subject, cluster or experimental unit, we adopt the following general notation which we use to describe marginal or conditional moments and/or likelihood functions.

    Notation

    y is a response vector for a given subject. Within a given context or unless otherwise noted, lower case lettering y (boldface for the random vector, plain for a random variable) will be used to denote either the underlying random vector (or variable) or its realization.

    β is a vector of fixed-effects parameters associated with first-order moments (i.e., marginal or conditional means).

    b is a vector of random-effects (possibly multi-level) which, unless otherwise indicated, is assumed to have a multivariate normal distribution, b ~ N(0, ψ), with variance-covariance matrix ψ = ψ(θb) that depends on a vector of between-subject variance-covariance parameters, say θb.

    X is a matrix of within- and between-subject covariates linking the fixed-effects parameter vector β to the marginal or conditional mean.

    Z is a matrix of within-subject covariates contained in X that directly link the random effects to the conditional mean.

    μ(X, β, Z, b) = μ(β, b) = E(y|b) is the conditional mean of y given random effects b.

    Table 1.1 Hierarchy of models in SAS according to mean structure and cumulative distribution function (CDF)

    Λ(X, β, Z, b, θw) = Λ(β, b, θw) = Var(y|b) is the conditional covariance matrix of y given random effects b. This matrix may depend on an additional vector, θw, of within-subject variance-covariance parameters.

    μ(X, β) = μ(β) = E(y) is the marginal mean of y except for mixed models where the marginal mean μ(X, β, Z, θb) = E(y) may depend on θb as well as β (e.g., see the PA mean response for model (1.1)).

    Σ(β, θ) = Var(y) is the marginal variance-covariance matrix of y that depends on between- and/or within-subject covariance parameters, θ = (θb, θw), and possibly on the fixed-effects regression parameters β.

    There is an inherent hierarchy to these models as suggested in Table 1.1. Specifically, linear models can be considered a special case of generalized linear models in that the latter allow for a broader class of distributions and a more general mean structure given by an inverse link function, g⁻¹(·), which is a monotonic invertible function linking the mean, E(y), to a linear predictor, Xβ, via the relationship g(E(y)) = Xβ. Likewise Gaussian-based linear models are a special case of Gaussian-based nonlinear models in that the latter allow for a more general nonlinear mean structure, f(X, β), rather than one that is strictly linear in the parameters of interest. Finally, generalized linear models are a special case of generalized nonlinear models in that the latter, in addition to allowing for a broader class of distributions, also allow for more general nonlinear mean structures of the form f(X, β). That is, generalized nonlinear models do not require a mean structure that is an invertible monotonic function of a linear predictor as is the case for generalized linear models. In like fashion, within any row of Table 1.1, one can consider the marginal model as a special case of the mixed-effects model in that the former can be obtained by merely setting the random effects of the mixed-effects model to 0. We also note that since nonlinear mixed models do not require specification of a conditional linear predictor, Xβ + Zb, the conditional mean, f(X, β, b), may be specified without reference to Z. This is because the fixed and random effects parameters, β and b, will be linked to the appropriate covariates through specification of the function f and its relationship to a design matrix X that encompasses Z.
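    To make the hierarchy concrete, the following sketch (illustrative values, not from the book) shows the defining feature of a generalized linear model: the mean is a monotone, invertible transform of a linear predictor, here with a logit link. A general nonlinear mean f(X, β) need not have this form.

```python
import numpy as np

X = np.array([[1.0, 0.0],
              [1.0, 1.0],
              [1.0, 2.0]])     # design matrix: intercept + one covariate
beta = np.array([-1.0, 0.8])   # hypothetical fixed-effects values

eta = X @ beta                          # linear predictor X*beta
mu = 1.0 / (1.0 + np.exp(-eta))         # inverse link g^{-1}(eta) for the logit

# The link recovers the linear predictor: g(mu) = log(mu / (1 - mu)) = eta
eta_back = np.log(mu / (1.0 - mu))
print(np.allclose(eta_back, eta))  # True
```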

    The generalized nonlinear mixed-effects (GNLME) model is the most general model considered in that it combines the flexibility of a generalized linear model in terms of its ability to specify non-Gaussian distributions, and the flexibility of a nonlinear model in terms of its ability to specify more general mean structures. One might wonder, then, why SAS does not develop a single procedure based on the GNLME model rather than the various procedures currently available in SAS. The answer is simple. The various procedures specific to linear (MIXED, GENMOD and GLIMMIX) and generalized linear models (GENMOD, GLIMMIX) offer specific options and computational features that take full advantage of the inherent structure of the underlying model (e.g., the linearity of all parameters, both fixed and random, in the LME model, or the monotonic transformation that links the mean function to a linear predictor in a generalized linear model). Such flexibility is next to impossible to incorporate under NLME and GNLME models, both of which can be fit to data using either the SAS procedure NLMIXED or the SAS macro %NLINMIX. A road map linking the SAS procedures and their key statements/options to these various models is presented in Table 1.2 of the summary section at the end of this chapter.

    Finally, we shall assume throughout that the design matrices, X and Z, are of full rank such that, where indicated, all matrices are invertible. In those cases where we are dealing with less than full rank design matrices and matrices of the form X′AX, for example, are not of full rank, the expression (X′AX)⁻¹ will be understood to represent a generalized inverse of X′AX.
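    A minimal numerical sketch of this convention (illustrative matrices, not from the book): the Moore-Penrose pseudoinverse is one valid choice of generalized inverse and satisfies the defining property M G M = M even when M = X′AX is singular.

```python
import numpy as np

# Rank-deficient design: column 3 = column 1 + column 2
X = np.array([[1.0, 0.0, 1.0],
              [1.0, 1.0, 2.0],
              [1.0, 2.0, 3.0]])
A = np.eye(3)
M = X.T @ A @ X                 # X'AX is singular (rank 2)
G = np.linalg.pinv(M)           # a generalized inverse of X'AX

# Defining property of a g-inverse: M G M = M
print(np.allclose(M @ G @ M, M))  # True
```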

    1.3.3 Alternative approaches

    Alternative approaches to modeling correlated response data include the use of conditional and/or transition models as well as hierarchical Bayesian models. Texts by Diggle, Liang and Zeger (1994) and Molenberghs and Verbeke (2005), for example, provide an excellent source of information on conditional and transition models for longitudinal data. Also, with the advent of Markov Chain Monte Carlo (MCMC) and other techniques for generating samples from Bayesian posterior distributions, interested practitioners can opt for a full Bayesian approach to modeling correlated data as exemplified in the texts by Carlin and Louis (2000) and Gelman et al. (2004). A number of Bayesian capabilities were made available with the release of SAS 9.2 including the MCMC procedure. With PROC MCMC, users can fit a variety of Bayesian models using a general purpose MCMC simulation procedure.

    1.4 Some examples

    In this section, we present just a few examples illustrating the different types of response data, covariates, and models that will be discussed in more detail in later chapters.

    Soybean Growth Data

    Davidian and Giltinan (1993, 1995) describe an experimental study in which the growth patterns of two genotypes of soybeans were to be compared. The essential features of the study are as follows:

    • The experimental unit or cluster is a plot of land

    • Plots were sampled on 8–10 occasions (times) within a calendar year

    • Six plants were randomly selected at each occasion and the average leaf weight per plant was calculated for a plot

    • Response variable:

    yij = average leaf weight (g) per plant for ith plot on the jth occasion (time) within a calendar year (i = 1,...,n = 48 plots; 16 plots per each of the calendar years 1988, 1989 and 1990; j = 1,...,pi with pi = 8 to 10 measurements per calendar year)

    • One within-unit covariate:

    tij = days after planting for ith plot on the jth occasion

    • Two between-unit covariates:

    – Genotype of Soybean (F=commercial, P=experimental) denoted by

    $$a_{1i} = \begin{cases} 0, & \text{if commercial (F)} \\ 1, & \text{if experimental (P)} \end{cases}$$

    Figure 1.4 Soybean growth data

    – Calendar Year (1988, 1989, 1990) denoted by two indicator variables,

    $$a_{2i} = \begin{cases} 0, & \text{if year is 1988 or 1990} \\ 1, & \text{if year is 1989} \end{cases} \qquad a_{3i} = \begin{cases} 0, & \text{if year is 1988 or 1989} \\ 1, & \text{if year is 1990} \end{cases}$$

    • Goal: compare the growth patterns of the two genotypes of soybean over the three growing seasons represented by calendar years 1988-1990.
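    The indicator coding above can be sketched as a small helper (a hypothetical function written for illustration; the coding scheme itself is as defined in the text):

```python
def soybean_covariates(genotype, year):
    """Return (a1, a2, a3) for genotype in {'F','P'} and year in {1988, 1989, 1990}."""
    a1 = 1 if genotype == "P" else 0   # 1 = experimental (P), 0 = commercial (F)
    a2 = 1 if year == 1989 else 0
    a3 = 1 if year == 1990 else 0
    return a1, a2, a3

print(soybean_covariates("F", 1988))  # (0, 0, 0): the reference cell
print(soybean_covariates("P", 1990))  # (1, 0, 1)
```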

    This is an example of an experimental study involving clustered longitudinal data in which the response variable, y = average leaf weight per plant (g), is measured over time within plots (clusters) of land. In each of the three years, 1988, 1989 and 1990, 8 different plots of land were seeded with the genotype Forrest (F) representing a commercial strain of seeds and 8 different plots were seeded with genotype Plant Introduction #416937 (P), an experimental strain of seeds. During the growing season of each calendar year, six plants were randomly sampled from each plot on a weekly basis (starting roughly two weeks after planting) and the leaves from these plants were collected and weighed yielding an average leaf weight per plant, y, per plot. A plot of the individual profiles, shown in Figure 1.4, suggests a nonlinear growth pattern which Davidian and Giltinan modeled using a nonlinear mixed-effects logistic growth curve model. One plausible form of this model might be

$$\begin{aligned}
y_{ij} &= f(x_{ij}',\beta,b_i)+\varepsilon_{ij} = f(x_{ij}',\beta_i)+\varepsilon_{ij} = \frac{\beta_{i1}}{1+\exp\{\beta_{i3}(t_{ij}-\beta_{i2})\}}+\varepsilon_{ij}\\
\beta_i &= \begin{pmatrix}\beta_{i1}\\\beta_{i2}\\\beta_{i3}\end{pmatrix} = \begin{pmatrix}\beta_{01}+\beta_{11}a_{1i}+\beta_{21}a_{2i}+\beta_{31}a_{3i}\\\beta_{02}+\beta_{12}a_{1i}+\beta_{22}a_{2i}+\beta_{32}a_{3i}\\\beta_{03}+\beta_{13}a_{1i}+\beta_{23}a_{2i}+\beta_{33}a_{3i}\end{pmatrix}+\begin{pmatrix}b_{i1}\\b_{i2}\\b_{i3}\end{pmatrix}
\end{aligned} \tag{1.2}$$

where yij is the average leaf weight per plant for the ith plot on the jth occasion, f(xij′, β, bi) = f(xij′, βi) = βi1/[1 + exp{βi3(tij − βi2)}] is the conditional mean response for the ith plot on the jth occasion, and xij′ = (1  tij  a1i  a2i  a3i) is the vector of within- and between-cluster covariates associated with the population parameter vector β′ = (β01 β11 β21 β31 β02 ... β23 β33) on the jth occasion. The first two columns of xij′, say zij′ = (1  tij), form the vector of within-cluster covariates associated with the cluster-specific random effects, bi′ = (bi1  bi2  bi3), on the jth occasion, and εij is the intra-cluster (within-plot or within-unit) error on the jth occasion. The vector of random effects is assumed to be iid N(0, Ψ) where Ψ is a 3 × 3 arbitrary positive definite covariance matrix. One should note that under this particular model, we are investigating the main effects of genotype and calendar year on the mean response over time.
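As a quick numerical illustration of the logistic growth function f above, the following NumPy sketch evaluates the conditional mean curve for one plot; the parameter values are hypothetical, chosen only to show the shape of the curve (βi3 < 0 makes the curve increase toward its asymptote):

```python
import numpy as np

def soybean_mean(t, beta_i):
    """Conditional mean leaf weight under the logistic growth model:
    f(t, beta_i) = beta_i1 / (1 + exp(beta_i3 * (t - beta_i2)))."""
    b1, b2, b3 = beta_i
    return b1 / (1.0 + np.exp(b3 * (t - b2)))

# Hypothetical plot-specific parameters: asymptote 20 g, half-life 55 days,
# growth rate -0.12 per day.
beta_i = (20.0, 55.0, -0.12)
t = np.array([14.0, 55.0, 100.0])   # days after planting
mu = soybean_mean(t, beta_i)
```

At t = βi2 (the half-life) the mean is exactly half the asymptote, which is the interpretation given to βi2 below.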

    The vector βi′=(βi1   βi2   βi3) may be regarded as a cluster-specific parameter vector that uniquely describes the mean response for the ith plot with βi1 > 0 representing the limiting growth value (asymptote), βi2 > 0 representing soybean half-life (i.e., the time at which the soybean reaches half its limiting growth value), and βi3 < 0 representing the growth rate. It may be written in terms of the linear random-effects model

    βi=Aiβ+Bibi=Aiβ+bi

    where

$$A_i=\begin{pmatrix}1&a_{1i}&a_{2i}&a_{3i}&0&0&0&0&0&0&0&0\\0&0&0&0&1&a_{1i}&a_{2i}&a_{3i}&0&0&0&0\\0&0&0&0&0&0&0&0&1&a_{1i}&a_{2i}&a_{3i}\end{pmatrix}$$

    is a between-cluster design matrix linking the between-cluster covariates, genotype and calendar year, to the population parameters β while Bi is an incidence matrix of 0’s and 1’s indicating which components of βi are random and which are strictly functions of the fixed-effect covariates. In our current example, all three components of βi are assumed random and hence Bi = I3 is simply the identity matrix. Suppose, however, that the half-life parameter, βi2, was assumed to be a function solely of the fixed-effects covariates. Then Bi would be given by

$$B_i=\begin{pmatrix}1&0\\0&0\\0&1\end{pmatrix}$$

    with bi′=(bi1   bi3). Note, too, that we can express the between-unit design matrix more conveniently as Ai = I3 ⊗ (1 a1i a2i a3i) where ⊗ is the direct product operator (or Kronecker product) linking the dimension of βi via the identity matrix I3 to the between-unit covariate vector, ai′=(1   a1i   a2i   a3i). The ⊗ operator is a useful tool which we will have recourse to use throughout the book and which is described more fully in Appendix A.
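The Kronecker-product construction Ai = I3 ⊗ ai′ can be verified numerically; a small NumPy sketch with a hypothetical covariate vector (an experimental-genotype plot, a1i = 1, grown in 1989, so a2i = 1 and a3i = 0):

```python
import numpy as np

# Between-unit covariate vector a_i' = (1, a1i, a2i, a3i).
a_i = np.array([1.0, 1.0, 1.0, 0.0])

# A_i = I_3 (x) a_i' places a copy of a_i' in each of the three block rows,
# one row per component of beta_i.
A_i = np.kron(np.eye(3), a_i)
```

Each row of A_i picks out the linear predictor for one component of βi, exactly as in the 3 × 12 matrix displayed above.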

    Finally, by assuming the intra-cluster errors εij are iid N(0,σw2), model (1.2) may be classified as a nonlinear mixed-effects model (NLME) having conditional means expressed in terms of a nonlinear function, i.e., E(yij|bi)=μ(xij′,β,zij′,bi)=f(xij′,β,bi), as represented using the general notation of Table 1.1. In this case, inference with respect to the population parameters β and θ=(θb,θw)=(vech(Ψ), σw2) will be cluster-specific in scope in that the function f, when evaluated at bi = 0, describes what the average soybean growth pattern is for a typical plot.

    Respiratory Disorder Data

Koch et al. (1990) and Stokes et al. (2000) analyzed data from a multicenter randomized controlled trial for patients with a respiratory disorder. The trial was conducted in two centers in which patients were randomly assigned to one of two treatment groups: an active treatment group or a placebo control group (0 = placebo, 1 = active). The initial outcome variable of interest was patient status, which is defined in terms of the ordinal categorical responses: 0 = terrible, 1 = poor, 2 = fair, 3 = good, 4 = excellent. This categorical response was obtained both at baseline and at each of four visits (visit 1, visit 2, visit 3, visit 4) during the course of treatment. Here we consider an analysis obtained by collapsing the data into a discrete binary outcome, with the primary focus being a comparison of the average response of patients. The essential components of the study are listed below (the SAS variables are listed in parentheses).

    • The experimental unit is a patient (identified by SAS variable ID)

    • The initial outcome variable was patient status defined by the ordinal response: 0 = terrible, 1 = poor, 2 = fair, 3 = good, 4 = excellent

    • Response variable (y):

    – The data were collapsed into a simple binary response as

$$y_{ij}=\begin{cases}0=\text{negative response},&\text{if terrible, poor, or fair}\\1=\text{positive response},&\text{if good or excellent}\end{cases}$$

    which is the ith patient’s response obtained on the jth visit

    • One within-unit covariate:

– Visit (1, 2, 3, or 4) defined here by the indicator variables (Visit)

$$v_{ij}=\begin{cases}1,&\text{if visit } j\\0,&\text{otherwise}\end{cases}$$

    • Five between-unit covariates:

    – Treatment group (Treatment) defined as ’P’ for placebo or ’A’ for active.

$$a_{1i}=\begin{cases}0,&\text{if placebo}\\1,&\text{if active}\end{cases}\quad\text{(this indicator is labeled Drug0)}$$

    – Center (Center)

$$a_{2i}=\begin{cases}0,&\text{if center 1}\\1,&\text{if center 2}\end{cases}\quad\text{(this indicator is labeled Center0)}$$

    – Gender (Gender) defined as 0 = male, 1 = female

$$a_{3i}=\begin{cases}0,&\text{if male}\\1,&\text{if female}\end{cases}\quad\text{(this indicator is labeled Sex)}$$

    – Age at baseline (Age)

    a4i = patient age

    Figure 1.5 A plot of the proportion of positive responses by visit and drug for the respiratory disorder data

    – Response at baseline (Baseline),

$$a_{5i}=\begin{cases}0,&\text{if negative}\\1,&\text{if positive}\end{cases}\quad\text{(this indicator is labeled y0)}$$

    • Goal: determine if there is a treatment effect after adjusting for center, gender, age and baseline differences.

A plot of the proportion of positive responses at each visit according to treatment group is shown in Figure 1.5. In this example, previous authors (Koch et al., 1977; Koch et al., 1990) fit the data using several different marginal models, resulting in a population-averaged approach to inference. One family of such models is the family of marginal generalized linear models (GLIMs) with working correlation structure. For example, under the assumption of a working independence structure across visits and assuming there is no visit effect, one might fit these data using a binary logistic regression model of the form

$$\begin{aligned}
E(y_{ij}) &= \mu_{ij}(x_{ij}',\beta) = g^{-1}(x_{ij}'\beta) = \frac{\exp\{x_{ij}'\beta\}}{1+\exp\{x_{ij}'\beta\}}\\
x_{ij}'\beta &= \beta_0 v_{ij} + \beta_1 a_{1i} + \beta_2 a_{2i} + \beta_3 a_{3i} + \beta_4 a_{4i} + \beta_5 a_{5i}\\
\operatorname{Var}(y_{ij}) &= \mu_{ij}(x_{ij}',\beta)\,[1-\mu_{ij}(x_{ij}',\beta)] = \mu_{ij}(1-\mu_{ij})
\end{aligned} \tag{1.3}$$

where μij=μij(xij′,β)=Pr(yij=1|xij) is the probability of a positive response on the jth visit, xij′=(vij  a1i  a2i  a3i  a4i  a5i) is the design vector of within- and between-unit covariates on the jth visit, and β′ = (β0 β1 β2 β3 β4 β5) is the parameter vector associated with the mean response. Here, we do not model separate visit effects but rather assume a common effect across visits that is reflected in the overall intercept parameter, β0 (i.e., since we always have vij = 1 on the jth visit, we can simply replace β0vij with β0). In matrix notation, model (1.3) may be written as

$$E(y_i)=\mu_i(X_i,\beta)=g^{-1}(X_i\beta), \qquad \operatorname{Var}(y_i)=\Sigma_i(\beta)=H_i(\mu_i)^{1/2}\, I_4\, H_i(\mu_i)^{1/2}$$

    where

$$y_i=\begin{pmatrix}y_{i1}\\y_{i2}\\y_{i3}\\y_{i4}\end{pmatrix},\qquad X_i=\begin{pmatrix}x_{i1}'\\x_{i2}'\\x_{i3}'\\x_{i4}'\end{pmatrix},$$

    and Hi(μi) is the 4 × 4 diagonal variance matrix with Var(yij) = μij(1 – μij) as the jth diagonal element and Hi(μi)¹/² is the square root of Hi(μi) which, for arbitrary positive definite matrices, may be obtained via the Cholesky decomposition. To accommodate possible correlation among binary responses taken on the same subject across visits, we can use a generalized estimating equation (GEE) approach in which robust standard errors are computed using the so-called empirical sandwich estimator (e.g., see §4.2 of Chapter 4). An alternative GEE approach might assume an overdispersion parameter ϕ and working correlation matrix Ri(α) such that

$$\operatorname{Var}(y_i)=\Sigma_i(\beta,\theta)=\phi\, H_i(\mu_i)^{1/2} R_i(\alpha)\, H_i(\mu_i)^{1/2}$$

    where θ = (α, ɸ). Commonly used working correlation structures include working independence [i.e., Ri(α) = I4 as above], compound symmetry and first-order autoregressive. Alternatively, one can extend the GLIM in (1.3) to a generalized linear mixed-effects model (GLME) by simply adding a random intercept term, bi, to the linear predictor, i.e.,

    xij′β+bi=β0vij+β1a1i+β2a2i+β3a3i+β4a4i+β5a5i+bi.

    This will induce a correlation structure among the repeated binary responses via the random intercepts shared by patients across their visits.
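As a concrete sketch of these working covariance structures, the following NumPy fragment builds the visit-specific means from the logistic model (1.3) and assembles Var(yi) under working independence and AR(1) correlation; the coefficients and covariate values are made up for illustration:

```python
import numpy as np

def logistic_mean(X, beta):
    """mu_ij = exp(x'beta) / (1 + exp(x'beta)), the inverse logit link."""
    return 1.0 / (1.0 + np.exp(-(X @ beta)))

def ar1(alpha, p):
    """First-order autoregressive working correlation: R[j,k] = alpha^|j-k|."""
    idx = np.arange(p)
    return alpha ** np.abs(idx[:, None] - idx[None, :])

def working_cov(mu, R, phi=1.0):
    """Var(y_i) = phi * H^{1/2} R(alpha) H^{1/2} with H = diag(mu_j(1 - mu_j))."""
    h = np.sqrt(mu * (1.0 - mu))
    return phi * np.outer(h, h) * R

# Hypothetical design for one patient across 4 visits: columns are
# (v_ij, a1i, a2i, a3i, a4i, a5i); between-unit covariates repeat by row.
X_i = np.tile([1.0, 1.0, 0.0, 1.0, 30.0, 1.0], (4, 1))
beta = np.array([-0.5, 1.2, 0.3, -0.1, 0.01, 1.5])
mu_i = logistic_mean(X_i, beta)
V_indep = working_cov(mu_i, np.eye(4))     # working independence
V_ar1 = working_cov(mu_i, ar1(0.3, 4))     # AR(1) working correlation
```

Either choice leaves the diagonal equal to the binary variance μij(1 − μij); only the implied cross-visit correlation changes.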

    In Chapters 4-5, we shall consider analyses based on both marginal and mixed-effects logistic regression thereby allowing one to contrast PA versus SS inference. With respect to marginal logistic regression, we will present results from a first-order and second-order generalized estimating equation approach.

    Epileptic Seizure Data

    Thall and Vail (1990) and Breslow and Clayton (1993) used Poisson regression to analyze epileptic seizure data from a randomized controlled trial designed to compare the effectiveness of progabide versus placebo to reduce the number of partial seizures occurring over time. The key attributes of the study are summarized below (SAS variables denoted in parentheses).

    • The experimental unit is a patient (ID)

    • The data consist of the number of partial seizures occurring over a two-week period on each of four successive visits made by patients receiving one of two treatments (progabide, placebo).

    • Response variable (y):

    yij = number of partial seizures in a two-week interval for the ith patient as recorded on the jth visit

    • One within-unit covariate:

– Visit (1, 2, 3, or 4) defined here by the indicator variables (Visit)

$$v_{ij}=\begin{cases}1,&\text{if visit } j\\0,&\text{otherwise}\end{cases}$$

    • Three between-unit covariates:

    – Treatment group (Trt)

$$a_{1i}=\begin{cases}0,&\text{if placebo}\\1,&\text{if progabide}\end{cases}$$

    – Age at baseline (Age)

    a2i = patient age

    – Baseline seizure counts (bline) normalized to a two week period

$$a_{3i}=\tfrac{1}{4}\times\text{baseline seizure count over an 8-week period } (y_0)$$

    • Goal: determine if progabide is effective in reducing the number of seizures after adjustment for relevant baseline covariates

In Chapters 4-5, we will consider several different models for analyzing these data using both a marginal and a mixed-effects approach. For example, we consider several mixed-effects models in which the count data (yij = number of seizures per 2-week interval) were fitted to a mixed-effects log-linear model with conditional means of the form

$$\begin{aligned}
E(y_{ij}\mid b_i) &= \mu_{ij}(x_{ij}',\beta,z_{ij},b_i) = g^{-1}(x_{ij}'\beta + z_{ij}b_i) = \exp\{x_{ij}'\beta + z_{ij}b_i\}\\
x_{ij}'\beta + z_{ij}b_i &= \beta_0 + \beta_1 a_{1i} + \beta_2\log(a_{2i}) + \beta_3\log(a_{3i}) + \beta_4 a_{1i}\log(a_{3i}) + \beta_5 v_{i4} + \log(2) + b_i
\end{aligned} \tag{1.4}$$

where μij=μij(xij′,β,zij,bi) is the average number of seizures per two-week period on the jth visit, xij′=(1  a1i  log(a2i)  log(a3i)  a1i log(a3i)  vi4) is the design vector of within- and between-unit covariates on the jth visit, zij = 1 for each j, bi is a subject-specific random intercept, and β′ = (β0 β1 β2 β3 β4 β5) is the parameter vector associated with the mean response. Following Thall and Vail (1990) and Breslow and Clayton (1993), this model assumes that only visit 4 has an effect on seizure counts. The term log(2) is an offset that is included to reflect that the mean count is over a two-week period. In matrix notation, the conditional means across all four visits for a given subject may be written as

    E(yi|bi)=μi(Xi,β,Zi,bi)=μi(β,bi)=g−1(Xiβ+Zibi)

where

$$y_i=\begin{pmatrix}y_{i1}\\y_{i2}\\y_{i3}\\y_{i4}\end{pmatrix},\qquad X_i=\begin{pmatrix}x_{i1}'\\x_{i2}'\\x_{i3}'\\x_{i4}'\end{pmatrix},\qquad Z_i=\begin{pmatrix}1\\1\\1\\1\end{pmatrix}.$$

    One could consider fitting this model assuming any one of three conditional variance structures, Var(yij|bi) = Λi(xij, β, zij, bi, θw) = Λi(β, bi, θw),

    Case 1: Var(yij|bi) = μij (standard Poisson variation)

    Case 2: Var(yij|bi) = ϕμij (extra-Poisson variation)

    Case 3: Var(yij|bi) = μij(1+αμij) (negative binomial variation).

In the first case, Λi(β, bi, θw) = μij(β, bi) and there is no θw parameter. In the second case, we allow for conditional overdispersion in the form of Λi(β, bi, θw) = ϕμij(β, bi) where θw = ϕ, while in the third case, overdispersion in the form of a negative binomial model with Λi(β, bi, θw) = μij(β, bi)(1 + αμij(β, bi)) is considered with θw = α. The conditional negative binomial model coincides with a conditional gamma-Poisson model, which is obtained by assuming the conditional rates within each two-week interval are further distributed conditionally as a gamma random variable. This allows for a specific form of conditional overdispersion which may or may not make much sense in this setting. Assuming bi ~ iid N(0, σb²), all three cases result in models that belong to the class of GLME models. However, the models in the first and third cases are based on a well-defined conditional distribution (Poisson in case 1 and negative binomial in case 3), which allows one to estimate the model parameters using maximum likelihood estimation. Under this same mixed-effects setting, other covariance structures could also be considered, some of which, when combined with the assumption of a conditional Poisson distribution, yield a GNLME model generally not considered in most texts.
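The conditional mean (1.4), including its log(2) offset, and the three conditional variance cases translate into a few lines of code; the coefficients and covariate values below are hypothetical, used only to show the mechanics:

```python
import numpy as np

def seizure_mean(beta, a1, a2, a3, v4, b_i):
    """Conditional mean count per two-week interval under the log-linear
    model (1.4): exp(x'beta + b_i) with offset log(2)."""
    eta = (beta[0] + beta[1]*a1 + beta[2]*np.log(a2) + beta[3]*np.log(a3)
           + beta[4]*a1*np.log(a3) + beta[5]*v4 + np.log(2.0) + b_i)
    return np.exp(eta)

# Hypothetical coefficients and covariates for one progabide patient.
beta = np.array([-1.0, -0.3, 0.5, 0.9, -0.1, -0.15])
mu = seizure_mean(beta, a1=1.0, a2=30.0, a3=7.5, v4=0.0, b_i=0.2)

# The three conditional variance cases applied to this mean:
phi, alpha = 1.5, 0.2
v_poisson = mu                        # Case 1: standard Poisson
v_extra   = phi * mu                  # Case 2: extra-Poisson
v_negbin  = mu * (1.0 + alpha * mu)   # Case 3: negative binomial
```

Note that cases 2 and 3 both inflate the Poisson variance, but only case 3 lets the inflation grow with the mean.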

    ADEMEX Data

Paniagua et al. (2002) summarized results of the ADEMEX trial, a randomized multi-center trial of 965 Mexican patients designed to compare patient outcomes (e.g., survival, hospitalization, quality of life, etc.) among end-stage renal disease patients randomized to one of two dose levels of continuous ambulatory peritoneal dialysis (CAPD). While patient survival was the primary endpoint, there were a number of secondary and exploratory endpoints investigated as well. One such endpoint was the estimation and comparison of the decline in the glomerular filtration rate (GFR) among incident patients randomized to the standard versus high dose of dialysis. There were several challenges with this objective as described below. The essential features of this example are as follows:

    • The experimental unit is an incident patient

    • Response variables:

    yij = glomerular filtration rate (GFR) of the kidney (ml/min) for ith subject on the jth occasion

    Ti = patient survival time in months

    • One within-unit covariate:

    tij = months after randomization

    • Six between-unit covariates:

    –Treatment group (standard dose, high dose)

$$a_{1i}=\begin{cases}0,&\text{if control (standard dose)}\\1,&\text{if intervention (high dose)}\end{cases}$$

    – Gender

$$a_{2i}=\begin{cases}0,&\text{if male}\\1,&\text{if female}\end{cases}$$

    – Age at baseline

    a3i = patient age

    – Presence or absence of diabetes at baseline

$$a_{4i}=\begin{cases}0,&\text{if non-diabetic}\\1,&\text{if diabetic}\end{cases}$$

    – Baseline value of albumin

    a5i = serum albumin (g/dL)

    – Baseline value of normalized protein nitrogen appearance (nPNA)

    a6i = nPNA (g/kg/day)

    Figure 1.6 SS and PA mean GFR profiles among control patients randomized to the standard dose of dialysis.

    • Goal: Estimate the rate of decline in GFR and assess whether this rate differentially affects patient survival according to dose of dialysis

    • Issues:

    1. Past studies have linked low GFR with increased mortality

    2. The analysis requires one to jointly model GFR and patient survival in order to determine if a) the rate of decline in GFR is associated with survival and b) if the rate of decline is affected by the dose of dialysis

    As shown in Figure 1.6, the decline in GFR appears to occur more rapidly early on and then gradually reaches a value of 0 provided the patient lives long enough (i.e., the patient becomes completely anuric). Such data might be reasonably fit assuming a nonlinear exponential decay model with a random intercept and random decay rate. One such model is given by

$$\begin{aligned}
y_{ij} &= f(x_{ij}',\beta,b_i)+\varepsilon_{ij} = f(x_{ij}',\beta_i)+\varepsilon_{ij} = \beta_{i1}\exp(-\beta_{i2}t_{ij})+\varepsilon_{ij}\\
\beta_i &= \begin{pmatrix}\beta_{i1}\\\beta_{i2}\end{pmatrix} = \begin{pmatrix}a_i'\beta_1\\a_i'\beta_2\end{pmatrix}+\begin{pmatrix}b_{i1}\\b_{i2}\end{pmatrix} = A_i\beta + b_i
\end{aligned} \tag{1.5}$$

    where ai′=(1   a1i   a2i   a3i   a4i   a5i   a6i) is the between-subject design vector, xij'=(tij,ai') is the vector of within- and between-subject covariates on the jth occasion,

$$\begin{aligned}
a_i'\beta_1+b_{i1} &= \beta_{01}+\beta_{11}a_{1i}+\beta_{21}a_{2i}+\beta_{31}a_{3i}+\beta_{41}a_{4i}+\beta_{51}a_{5i}+\beta_{61}a_{6i}+b_{i1}\\
a_i'\beta_2+b_{i2} &= \beta_{02}+\beta_{12}a_{1i}+\beta_{22}a_{2i}+\beta_{32}a_{3i}+\beta_{42}a_{4i}+\beta_{52}a_{5i}+\beta_{62}a_{6i}+b_{i2}
\end{aligned}$$

are linear predictors for the intercept (βi1) and decay rate (βi2) parameters, respectively, bi′=(bi1  bi2) is the vector of random intercept and decay rate effects, and β′=(β1′, β2′) is the corresponding population parameter vector with β1′=(β01  β11  β21  β31  β41  β51  β61) denoting the population intercept parameters and β2′=(β02  β12  β22  β32  β42  β52  β62) the population decay rate parameters. Here Ai = I2 ⊗ ai′ is the between-subject design matrix linking the covariates ai to β, while εij ~ iid N(0, σw²) independent of bi.

    Figure 1.6 reflects a reduced version of this model in which the subject-specific intercept and decay rate parameters are modeled as a simple linear function of the treatment group only, i.e.,

    βi1 = β01 + β11a1i + bi1

    βi2 = β02 + β12a1i + bi2.

    What is shown in Figure 1.6 is the estimated marginal mean profile (i.e., PA mean response) for the patients randomized to the control group. This PA mean response is obtained by averaging over the individual predicted response curves while the average patient’s mean profile (i.e., the SS mean response) is obtained by simply plotting the predicted response curve achieved when one sets the random effects bi1 and bi2 equal to 0. As indicated previously (page 7), this example illustrates how one can have a random-effects exponential decay model that can predict certain individuals as having an increasing response over time. In this study, for example, two patients (one from each group) had a return of renal function while others showed either a modest rise or no change in renal function. The primary challenge here is to jointly model the decline in GFR and patient survival using a generalized nonlinear mixed-effects model that allows one to account for correlation in GFR values over time as well as determine if there is any association between the rate of decline in GFR over time and patient survival.
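To see why the PA (marginal) mean and the SS (typical-patient) mean differ under this nonlinear decay model, one can average simulated subject-specific curves over the random effects. The NumPy sketch below uses hypothetical parameter values (fixed effects, variance components, and time grid are illustrative, not estimates from the ADEMEX data):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical fixed effects for one group: intercept 5 ml/min, decay 0.08/month.
beta1, beta2 = 5.0, 0.08
t = np.linspace(0.0, 24.0, 5)              # months after randomization

# SS mean profile: the typical patient's curve, random effects set to zero.
ss_mean = beta1 * np.exp(-beta2 * t)

# PA mean profile: average the subject-specific curves over simulated
# random intercepts and decay rates (taken independent here for simplicity).
b = rng.multivariate_normal([0.0, 0.0], np.diag([1.0, 0.001]), size=20000)
curves = (beta1 + b[:, 0:1]) * np.exp(-(beta2 + b[:, 1:2]) * t)
pa_mean = curves.mean(axis=0)

# Because f is nonlinear in b_i, the PA mean exceeds the SS mean at later
# times; subjects drawn with b_i2 < -beta2 even have increasing curves.
```

This mirrors the point made above: a random-effects exponential decay model can predict a rising response for particular individuals even though the typical patient's curve declines.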

    1.5 Summary features

    We summarize here a number of features associated with the analysis of correlated data. First, since the response variables exhibit some degree of dependency as measured by correlation among the responses, most analyses may be classified as being essentially multivariate in nature. With repeated measurements and clustered data, for example, the analysis requires combining cross-sectional (between-cluster, between-subject, inter-subject) methods with time-series (within-cluster, within-subject, intra-subject) methods.

    Second, the type of model one uses, marginal versus mixed, determines and/or limits the type of correlation structure one can model. In marginal models, correlation is accounted for by directly specifying an intra-subject or intra-cluster covariance structure. In mixed-effects models, a type of intraclass correlation structure is introduced through specification of subject-specific random effects. Specifically, intraclass correlation occurs as a result of having random effect variance components that are shared across measurements within subjects. Along with specifying what type of model, marginal or mixed, is needed for inferential purposes, one must also select what class of models is most appropriate based on the type of response variable being measured (e.g., continuous, ordinal, count or nominal) and its underlying mean structure. By specifying both the class of models and

    Table 1.2 A summary of the different classes of models and the type of model within a class that are available for the analysis of correlated response data. ¹. The SAS macro %NLINMIX iteratively calls the MIXED procedure when fitting nonlinear models. ². NLMIXED can be adapted to analyze marginal correlation structures (see §4.4, §4.5.2, §4.5.3, §4.5.4, §5.3.3).

    type of model within a class, an appropriate SAS procedure can then be used to analyze the data. Summarized in Table 1.2 are the three major classes of models used to analyze correlated response data and the SAS procedure(s) and corresponding procedural statements/options for conducting such an analysis.

    Another feature worth noting is that studies involving clustered data or repeated measurements generally lead to efficient within-unit comparisons but inefficient between-unit comparisons. This is evident with split-plot designs where we have efficient split-plot comparisons but inefficient whole plot comparisons. In summary, some of the features we have discussed in this chapter include:

    • Repeated measurements, clustered data and spatial data are essentially multivariate in nature due to the correlated outcome measures

    • Analysis of repeated measurements and longitudinal data often requires combining cross-sectional methods with time-series methods

    • The analysis of correlated data, especially that of repeated measurements, clustered data and spatial data can be based either on marginal or mixed-effects models. Parameters from these two types of models generally differ in that

    1) Marginal models target population-averaged (PA) inference

    2) Mixed-effects models target subject-specific (SS) inference

    • Marginal models can accommodate correlation via

    1) Direct specification of an intra-subject correlation structure

    • Mixed-effects models can accommodate correlation via

    1) Direct specification of an intra-subject correlation structure

    2) Intraclass correlation resulting from random-effects variance components that are shared across observations within subjects

    • The family of models available within SAS include linear, generalized linear, nonlinear, and generalized nonlinear models

    • Repeated measurements and clustered data lead to efficient within-unit comparisons (including interactions of within-unit and between-unit covariates) but inefficient between-unit comparisons.

    Part I

    Linear Models

    2

    Marginal Linear Models—Normal Theory

    2.1 The marginal linear model (LM)

    2.2 Examples

    2.3 Summary

In this chapter and the next, we examine various types of linear models used for the analysis of correlated response data assuming the data are normally distributed or at least approximately so. These two chapters are by no means intended to provide a comprehensive treatment of linear models for correlated response data. Indeed such texts as those by Verbeke and Molenberghs (1997), Davis (2002), Demidenko (2004), and Littell et al. (2006) provide a far more in-depth treatment of such models, especially of linear mixed-effects models. Rather, our goals are to 1) provide a basic introduction to linear models; 2) provide several examples illustrating both the basic and more advanced capabilities within SAS for analyzing linear models; and 3) provide a general modeling framework that can be extended to the generalized linear and nonlinear model settings.

    2.1 The marginal linear model (LM)

    As noted in Chapter 1, marginal or population-averaged (PA) models are defined by parameters that describe what the average response in the population is and by a variance-covariance structure that is directly specified in terms of the marginal second-order moments. This is true whether the data are correlated or not. To set the stage for linear models for correlated response data, we first consider the univariate general linear model.

    Linear models – univariate data

    For univariate data consisting of a response variable y measured across n independent experimental units and related to a set of s explanatory variables, x1,x2,...,xs, the usual general linear model may be written as

    yi = β0 + β1xi1 + ... + βsxis + εi,  i = 1,...,n

    with εi ~iid N(0,σ²). One can write this more compactly using matrix notation (see Appendix A) as

    y = Xβ + ε, ε ~ N(0, σ²In)

    where

$$y=\begin{pmatrix}y_1\\y_2\\\vdots\\y_n\end{pmatrix},\quad X=\begin{pmatrix}1&x_{11}&\cdots&x_{1s}\\1&x_{21}&\cdots&x_{2s}\\\vdots&\vdots&&\vdots\\1&x_{n1}&\cdots&x_{ns}\end{pmatrix},\quad \beta=\begin{pmatrix}\beta_0\\\beta_1\\\vdots\\\beta_s\end{pmatrix},\quad \varepsilon=\begin{pmatrix}\varepsilon_1\\\varepsilon_2\\\vdots\\\varepsilon_n\end{pmatrix}$$

    and In is the n × n identity matrix. Estimation under such models is usually carried out using ordinary least squares (OLS) which may be implemented in SAS using the ANOVA, GLM, or REG procedures.
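The OLS fit that PROC REG or PROC GLM would produce can be sketched in a few lines of NumPy; the data here are simulated purely for illustration:

```python
import numpy as np

rng = np.random.default_rng(1)

# Simulated univariate data: y = 2 + 1.5*x1 - 0.5*x2 + noise.
n = 200
X = np.column_stack([np.ones(n), rng.normal(size=n), rng.normal(size=n)])
beta_true = np.array([2.0, 1.5, -0.5])
y = X @ beta_true + rng.normal(scale=0.1, size=n)

# OLS estimate: beta_hat = (X'X)^{-1} X'y, solved without explicit inversion.
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
```

With independent homoscedastic errors this is the best linear unbiased estimator; the marginal LM below relaxes exactly that independence assumption.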

    Linear models–correlated response data

    Following the same basic scheme, the marginal linear model (LM) for correlated response data may be written as

    yi = Xiβ + εi, i = 1,...,n units     (2.1)

    where

    yi is a pi × 1 response vector, yi = (yi1 ... yipi)′, on the ith unit,

    Xi is a pi × s design matrix of within- and between-unit covariates,

    β is an s × 1 unknown parameter vector,

    εi is a pi × 1 random error vector with εi ~ ind Npi(0, Σi(θ)), i = 1,...,n,

and θ is a d × 1 vector of variance-covariance parameters that defines the variability and correlation across and within individual units. Here we differentiate experimental units, subjects, or clusters by indexing them with the subscript i. Essentially, model (2.1) mimics the structure of the univariate general linear model except that we now think of a response vector, yi, a matrix of explanatory variables, Xi, and a vector of within-unit errors, εi, for each of n independent experimental units. The key difference is that the residual vectors εi are assumed to be independent across units but within-unit residuals may be correlated, i.e., Cov(εij, εij′) ≠ 0 for j ≠ j′. If we set

$$y=\begin{pmatrix}y_1\\y_2\\\vdots\\y_n\end{pmatrix},\quad X=\begin{pmatrix}X_1\\X_2\\\vdots\\X_n\end{pmatrix},\quad \varepsilon=\begin{pmatrix}\varepsilon_1\\\varepsilon_2\\\vdots\\\varepsilon_n\end{pmatrix},\quad \Sigma(\theta)=\begin{pmatrix}\Sigma_1(\theta)&0&\cdots&0\\0&\Sigma_2(\theta)&\cdots&0\\\vdots&\vdots&\ddots&\vdots\\0&0&\cdots&\Sigma_n(\theta)\end{pmatrix}$$

    then we can write model (2.1) more succinctly as

    y = Xβ + ε, ε ~ NN(0, Σ(θ))

where y and ε are N × 1 vectors, Σ(θ) is an N × N positive definite matrix, and $N=\sum_{i=1}^n p_i$ represents the total number of observations.
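The stacked Σ(θ) is simply a block-diagonal assembly of the unit-level matrices; a minimal NumPy sketch with two hypothetical units of unequal cluster size (pi = 2 and 3):

```python
import numpy as np

def stack_block_diag(mats):
    """Assemble the N x N block-diagonal Sigma(theta) from unit-level Sigma_i."""
    N = sum(m.shape[0] for m in mats)
    out = np.zeros((N, N))
    r = 0
    for m in mats:
        k = m.shape[0]
        out[r:r+k, r:r+k] = m
        r += k
    return out

S1 = np.array([[1.0, 0.3],
               [0.3, 1.0]])      # unit 1: correlated pair
S2 = 2.0 * np.eye(3)             # unit 2: independent triple
Sigma = stack_block_diag([S1, S2])
```

The zero off-diagonal blocks encode exactly the independence-across-units assumption of model (2.1).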

    Model (2.1) is fairly general and offers users a wide variety of options for analyzing correlated data including such familiar techniques as

    • Generalized multivariate analysis of variance (GMANOVA)

    • Repeated measures analysis of variance (RM-ANOVA)

    • Repeated measures analysis of covariance (RM-ANCOVA)

    • Growth curve analysis

    • Response surface regression

    Table 2.1 Some marginal variance-covariance structures available in SAS

    associated with data from repeated measures designs, crossover designs, longitudinal studies, group randomized trials, etc. Depending on the approach one takes and the covariance structure one assumes, the marginal LM (2.1) can be fit using a least squares approach (the GLM procedure with a REPEATED statement), a generalized estimating equations (GEE) approach (the GENMOD procedure) or a likelihood approach (the MIXED procedure).

There are a number of possible variance-covariance structures one can specify under (2.1) including some of the more common ones shown in Table 2.1. The MIXED procedure offers users the greatest flexibility in terms of specifying a suitable variance-covariance structure. For instance, one can choose from 23 different specifications suitable for applications involving repeated measurements, longitudinal data and clustered data while 13 different spatial covariance structures are available for applications involving spatially correlated data. A complete listing of the available covariance structures in SAS 9.2 is available in the MIXED documentation. Included are three specifications (UN@AR(1), UN@CS, UN@UN) ideally suited for applications involving multivariate repeated measurements.

    2.1.1 Estimation

    Estimation under the marginal LM (2.1) can be carried out using a generalized least squares (GLS) approach, a generalized estimating equations (GEE) approach or a likelihood approach, which can be either maximum likelihood (ML) or restricted maximum likelihood (REML). Under normality assumptions, all three approaches are closely related. In this and the following section, we briefly summarize methods of estimation and inference under the marginal linear model (2.1). A more comprehensive treatment of the theory of estimation and inference for marginal linear models can be found in such classical texts as that of Searle (1971, 1987), Rao (1973) and Graybill (1976) as well as recent texts by Vonesh and Chinchilli (1997), Timm (2002), Muller and Stewart (2006) and Kim and Timm (2007).

    Generalized Least Squares (GLS):

    Generalized least squares is an extension of least squares in that it seeks to minimize, with respect to regression parameters β, an objective function representing a weighted sum of squared distances between observations and their mean. Consequently, the resulting GLS estimator of β may be classified as a least mean distance estimator. Unlike ordinary least squares (OLS), GLS accommodates heterogeneity and correlation via weighted least squares where the weights correspond to the inverse of the variance-covariance matrix.

    Estimation must be considered under two cases: 1) when the vector of variance-covariance parameters, θ, is known and 2) when θ is unknown.

    Case 1 θ known:

    Minimize the generalized least squares (GLS) objective function:

$$Q_{GLS}(\beta\mid\theta)=\sum_{i=1}^n (y_i-X_i\beta)'\,\Sigma_i(\theta)^{-1}(y_i-X_i\beta) \tag{2.2}$$

    where Σi(θ) = Var(yi). The GLS estimator β^(θ) is the solution to the normal estimating equations:

$$U_{GLS}(\beta\mid\theta)=\sum_{i=1}^n X_i'\Sigma_i(\theta)^{-1}(y_i-X_i\beta)=0 \tag{2.3}$$

    obtained by differentiating (2.2) with respect to β, setting the result to 0, and solving for β. The resulting solution to UGLS(β|θ) = 0 is

$$\hat\beta_{GLS}=\hat\beta(\theta)=\Bigl(\sum_{i=1}^n X_i'\Sigma_i(\theta)^{-1}X_i\Bigr)^{-1}\sum_{i=1}^n X_i'\Sigma_i(\theta)^{-1}y_i \tag{2.4}$$

    and the model-based variance-covariance matrix of β^(θ) is

$$\operatorname{Var}(\hat\beta_{GLS})=\Omega(\hat\beta_{GLS})=\Omega(\hat\beta(\theta))=\Bigl(\sum_{i=1}^n X_i'\Sigma_i(\theta)^{-1}X_i\Bigr)^{-1}. \tag{2.5}$$

    One can obtain the GLS estimator (2.4) and its corresponding covariance matrix (2.5) within the MIXED procedure of SAS by specifying the variance-covariance parameters directly using the PARMS statement along with the HOLD= or EQCONS= option and/or the NOITER option (see SAS documentation for MIXED). Rather than basing inference on the model-based variance-covariance matrix (2.5), one can estimate Var (β^(θ)) using the robust sandwich estimator (also called the empirical sandwich estimator)

$$\operatorname{Var}(\hat\beta_{GLS})=\Omega_R(\hat\beta_{GLS})=\Omega_R(\hat\beta(\theta))=\Omega(\hat\beta_{GLS})\Bigl(\sum_{i=1}^n X_i'\Sigma_i(\theta)^{-1}\hat e_i\hat e_i'\,\Sigma_i(\theta)^{-1}X_i\Bigr)\Omega(\hat\beta_{GLS}) \tag{2.6}$$

    where e^i=yi−Xiβ^(θ) is the residual vector reflecting error between the observed value yi and its predicted value, y^i=Xiβ^(θ). Use of this robust estimator provides some protection against model misspecification with respect to the assumed variance-covariance structure of yi. The empirical sandwich estimator (2.6) is implemented whenever one specifies the EMPIRICAL option of the PROC MIXED statement.
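Equations (2.4)-(2.6) translate directly into code. The following NumPy sketch computes the GLS estimator, its model-based covariance, and the empirical sandwich estimator on simulated data (the covariance structure, sample sizes, and coefficients are all hypothetical):

```python
import numpy as np

def gls(X_list, y_list, Sig_list):
    """GLS estimator (2.4) and model-based covariance (2.5) for known Sigma_i."""
    s = X_list[0].shape[1]
    A = np.zeros((s, s)); c = np.zeros(s)
    for X, y, S in zip(X_list, y_list, Sig_list):
        Si = np.linalg.inv(S)
        A += X.T @ Si @ X
        c += X.T @ Si @ y
    Omega = np.linalg.inv(A)
    return Omega @ c, Omega

def sandwich(X_list, y_list, Sig_list, beta, Omega):
    """Robust/empirical sandwich estimator (2.6)."""
    M = np.zeros_like(Omega)
    for X, y, S in zip(X_list, y_list, Sig_list):
        Si = np.linalg.inv(S)
        e = y - X @ beta
        M += X.T @ Si @ np.outer(e, e) @ Si @ X
    return Omega @ M @ Omega

# Hypothetical example: 50 units, 3 correlated observations each.
rng = np.random.default_rng(2)
S = 0.5 * np.eye(3) + 0.5               # compound-symmetry covariance
L = np.linalg.cholesky(S)
beta_true = np.array([1.0, -2.0])
X_list, y_list = [], []
for _ in range(50):
    X = np.column_stack([np.ones(3), rng.normal(size=3)])
    X_list.append(X)
    y_list.append(X @ beta_true + L @ rng.normal(size=3))

beta_hat, Omega = gls(X_list, y_list, [S] * 50)
Omega_R = sandwich(X_list, y_list, [S] * 50, beta_hat, Omega)
```

When the assumed Σi is correct, the model-based and sandwich covariances estimate the same quantity; the sandwich form is what protects inference when Σi is misspecified.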

    Case 2 θ unknown:

    For some consistent estimate, θ^ of θ, minimize the estimated generalized least squares (EGLS) objective function:

$$Q_{EGLS}(\beta\mid\hat\theta)=\sum_{i=1}^n (y_i-X_i\beta)'\,\Sigma_i(\hat\theta)^{-1}(y_i-X_i\beta) \tag{2.7}$$

    where Σi(θ^)=Var(yi) is evaluated at the initial estimate θ^. Minimizing (2.7) with respect to β corresponds to solving the EGLS estimating equations:

$$U_{EGLS}(\beta\mid\hat\theta)=\sum_{i=1}^n X_i'\Sigma_i(\hat\theta)^{-1}(y_i-X_i\beta)=0.$$

    The solution to UEGLS(β|θ^)=0 is the EGLS estimator

$$\hat\beta_{EGLS}=\hat\beta(\hat\theta)=\Bigl(\sum_{i=1}^n X_i'\Sigma_i(\hat\theta)^{-1}X_i\Bigr)^{-1}\sum_{i=1}^n X_i'\Sigma_i(\hat\theta)^{-1}y_i \tag{2.8}$$

    where we write β^EGLS=β^(θ^) to highlight its dependence on θ^. The model-based variance-covariance matrix of β^EGLS may be estimated as

$$\operatorname{Var}(\hat\beta_{EGLS})\simeq\hat\Omega(\hat\beta_{EGLS})=\hat\Omega(\hat\beta(\hat\theta))=\Bigl(\sum_{i=1}^n X_i'\Sigma_i(\hat\theta)^{-1}X_i\Bigr)^{-1}. \tag{2.9}$$

    As with the GLS estimator, one can also compute a robust sandwich estimate Ω^R(β^EGLS) of the variance-covariance of β^EGLS using (2.6) with Ω(β^GLS), Σ(θ) and β^(θ) replaced by Ω(β^EGLS), Σi(θ^) and β^(θ^), respectively.

Typically θ^=θ^(β^0), where β^0 is an initial unbiased estimate of β (e.g., the OLS estimate β^0=β^OLS) and θ^(β^0) is a noniterative method of moments (MM) type estimator that is consistent for θ (i.e., θ^(β^0) converges in probability to θ as n → ∞). One can specify the components of θ^ within the PARMS statement of the MIXED procedure and use the HOLD= or EQCONS= option and/or the NOITER option to instruct SAS to obtain an EGLS estimator of β based on the estimated value θ^. Alternatively, if one specifies the option METHOD=MIVQUE0 under PROC MIXED, SAS will compute the minimum variance quadratic unbiased estimator of the covariance parameters, which is a noniterative method of moments type estimator of θ (e.g., Rao, 1972). SAS will then produce an EGLS estimate of β using this MIVQUE0 estimate of θ.

    Generalized Estimating Equations (GEE):

    To remove inefficiencies associated with using weights Σi(θ^)−1 that are based on an initial and perhaps imprecise estimate θ^=θ^(β^0), an alternative to EGLS is iteratively reweighted generalized least squares (IRGLS), in which one updates θ^=θ^(β^) based on a current estimate β^ of β. Specifically, if for fixed β, θ^(β) is any consistent estimator of θ, then one may improve on the EGLS procedure by simply updating Σi(θ^) using θ^=θ^(β^) prior to estimating β using (2.8). This leads to the following IRGLS algorithm:

    IRGLS Algorithm:

    Step 1: Given β^(k), apply method of moments (MM) or some other technique to obtain an updated estimate θ^(β^(k)) of θ.

    Step 2: Given θ^(β^(k)), apply the EGLS estimate (2.8) to update β^(k+1).
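    The two steps above can be sketched in Python as a simple fixed-point iteration. Continuing with a hypothetical common-mean, two-variance-group model (illustrative data, not from the book), the loop below alternates the MM update of θ^(β^(k)) with the EGLS update (2.8) of β^(k+1) until successive estimates agree.

```python
# Hypothetical sketch of the IRGLS algorithm for a common-mean model
# with two variance groups: alternate the MM update of theta (Step 1)
# with the EGLS update (2.8) of beta (Step 2) until convergence.

y1 = [2.1, 1.9, 2.4, 1.6]       # group 1 responses (illustrative)
y2 = [5.0, -1.0, 6.0, -3.0]     # group 2 responses (illustrative)

beta = (sum(y1) + sum(y2)) / (len(y1) + len(y2))   # OLS start
for _ in range(50):
    # Step 1: MM update of theta_hat(beta) = (s1, s2)
    s1 = sum((yi - beta) ** 2 for yi in y1) / len(y1)
    s2 = sum((yi - beta) ** 2 for yi in y2) / len(y2)
    # Step 2: EGLS update (2.8) of beta given theta_hat
    new_beta = (sum(y1) / s1 + sum(y2) / s2) / (len(y1) / s1 + len(y2) / s2)
    if abs(new_beta - beta) < 1e-10:               # converged
        break
    beta = new_beta

print(beta)
```

At convergence, beta is a fixed point of the update: it equals the EGLS estimate (2.8) evaluated at variance estimates computed from its own residuals, which is exactly the defining property of a GEE solution (2.10).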

    The IRGLS procedure corresponds to solving the set of generalized estimating equations (GEE):

    UGEE(β,θ^(β))=∑i=1nXi′Σi(θ^(β))−1(yi−Xiβ)=0(2.10)

    where θ^(β) is any consistent estimator of θ given β. Let θ^GEE=θ^(β) denote the estimator of θ one chooses when updating Σi(θ^(β)) within the GEE (2.10). At the point of convergence, the GEE estimator β^GEE=β^(θ^GEE) will equal the EGLS estimator (2.8) evaluated at the final value of θ^GEE. The variance-covariance matrix of β^GEE may be estimated as Ω^(β^GEE)=Ω^(β^(θ^GEE)), where Ω^(β^(θ^GEE)) has the same form as that for the GLS estimator, namely (2.5), but evaluated at θ=θ^GEE. The robust or empirical sandwich estimate Ω^R(β^GEE) would likewise be given by (2.6) but with Ω(β^GLS), Σi(θ) and β^(θ) now replaced by Ω^(β^GEE), Σi(θ^GEE) and β^(θ^GEE), respectively. For a number of different marginal variance-covariance structures, one can obtain GEE-based estimates using the REPEATED statement in GENMOD or the RSIDE option of the RANDOM statement in the GLIMMIX procedure. In both procedures, inference can be based on either a model-based estimate of the variance-covariance of β^GEE or a robust/empirical sandwich estimate (see Chapter 4, §4.2 for more details).

    Maximum Likelihood (ML):

    Under normality assumptions, the joint maximum likelihood estimate (MLE) of β and θ is obtained by maximizing the log-likelihood function

    L(β,θ;y)=−12N log(2π)−12∑i=1n{ (yi−Xiβ)′Σi(θ)−1(yi−Xiβ) }−12∑i=1n{ log|Σi(θ)| }(2.11)
            =−12N log(2π)−12∑i=1nQi,GLS(β|θ)−12∑i=1n{ log|Σi(θ)| }
            =−12N log(2π)−12QGLS(β|θ)−12∑i=1n{ log|Σi(θ)| }

    where N=∑i=1npi and Qi,GLS(β|θ)=(yi−Xiβ)′Σi(θ)−1(yi−Xiβ). It is apparent from (2.11) that for known θ, the MLE of β is equivalent to the GLS estimator (2.4) since minimizing (2.2) is equivalent to maximizing (2.11) when θ is fixed and known. Similarly, when θ is unknown and must be estimated jointly with β, it can be shown that the EGLS estimate (2.8) of β will be equivalent to the MLE of β provided θ^ in (2.8) is the MLE of θ. One can use this feature to profile β out of the log-likelihood function and maximize the profile log-likelihood

    L(θ;y)=−12{ N log(2π)+∑i=1n(ri′Σi(θ)−1ri+log|Σi(θ)|) }(2.12)

    where ri=yi−Xi(∑k=1nXk′Σk(θ)−1Xk)−1∑k=1nXk′Σk(θ)−1yk=yi−Xiβ^(θ). Estimation may be carried out using an EM algorithm, a Newton-Raphson algorithm or a Fisher scoring algorithm (e.g., Jennrich and Schluchter, 1986; Laird, Lange and Stram, 1987; Lindstrom and Bates, 1988). Within the SAS procedure MIXED, the Newton-Raphson algorithm is the default; one can instead request Fisher scoring by specifying the SCORING= option in the PROC MIXED statement. If we let θ^MLE denote the MLE of θ obtained by maximizing the profile log-likelihood, L(θ; y), then the estimated variance-covariance matrix of the MLE of β is simply

    Var(β^(θ^MLE))≃Ω^(β^MLE)=Ω^(β^(θ^MLE))=(∑i=1nXi′Σi(θ^MLE)−1Xi)−1

    where we denote the MLE of β by β^MLE=β^(θ^MLE). Note that β^(θ^MLE) is simply the EGLS estimator (2.8) with θ^ replaced by θ^MLE. Similarly, a robust sandwich estimator, Ω^R(β^MLE), of Var(β^(θ^MLE)) would be given by (2.6) with θ evaluated at θ^MLE.
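    As a small illustration of the Newton-Raphson iteration mentioned above (a hypothetical Python sketch, not the ridge-stabilized implementation used by PROC MIXED), consider the iid case yi ~ N(μ, σ²) with θ = σ². Profiling out μ leaves a one-parameter log-likelihood whose score and second derivative are available in closed form, and the Newton-Raphson updates converge rapidly to the MLE SS/n.

```python
# Hypothetical sketch of Newton-Raphson for maximizing the profile
# log-likelihood in the iid N(mu, sigma^2) case, where theta = sigma^2,
# the score is U(s2) = (SS/s2^2 - n/s2)/2, and its derivative is
# U'(s2) = (-2*SS/s2^3 + n/s2^2)/2.  The iteration converges to the
# MLE SS/n.  Data are illustrative.

y = [4.0, 7.0, 6.0, 3.0, 5.0]
n = len(y)
ybar = sum(y) / n
ss = sum((yi - ybar) ** 2 for yi in y)    # SS about the profiled mean

s2 = 1.0                                  # starting value for sigma^2
for _ in range(100):
    score = 0.5 * (ss / s2**2 - n / s2)   # dL/d(sigma^2)
    hess = 0.5 * (-2 * ss / s2**3 + n / s2**2)
    step = score / hess
    s2 = s2 - step                        # Newton-Raphson update
    if abs(step) < 1e-12:
        break

print(s2)   # converges to SS/n
```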

    Restricted Maximum Likelihood (REML):

    The default method of estimation in the MIXED procedure of SAS is the restricted maximum likelihood (REML) method. ML estimation generally yields biased estimates of the variance components in small samples. Consider, for example, a simple random sample of observations, y1,...,yn, which are independent and identically distributed as N(μ, σ²). The MLE of μ is the sample mean y¯=1n∑i=1nyi, which is an unbiased estimate of μ. However, the MLE of σ² is 1n∑i=1n(yi−y¯)2, which has expectation σ²(n − 1)/n and hence underestimates σ², with a bias of −σ²/n (see, for example, Littell et al., 2006, pp. 747–750). To minimize such bias, the method of restricted maximum likelihood was pioneered by Patterson and Thompson (1971). Briefly, their approach entails maximizing a reduced log-likelihood function obtained by transforming y to y* = Ty, where the distribution of y* is independent of β. This approach adjusts for the loss in degrees of freedom due to estimating β and reduces the bias associated with ML estimation of θ. When we take the transformation Ty = (I − X(X′X)−1X′)y, it can be shown that the REML estimate of θ is obtained by maximizing the restricted profile log-likelihood, LR(θ; y)

    LR(θ;y)=−12{ (N−s)log(2π)+∑i=1n(ri′Σi(θ)−1ri+log|Σi(θ)|)+log|∑i=1nXi′Σi(θ)−1Xi| }(2.13)

    A REML-based estimate of β is then given by β^REML=β^(θ^REML) where β^(θ^REML) is the EGLS estimator (2.8) evaluated at the REML estimate θ^REML. Continuing with our process of substitution, the model-based and robust sandwich estimators of the variance-covariance of β^REML may be written as Ω^(β^REML) and Ω^R(β^REML), respectively, where Ω^(β^REML) is given by (2.5) and Ω^R(β^REML) by (2.6) but with each evaluated at θ=θ^REML.
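    To make the ML versus REML distinction concrete, the hypothetical Python sketch below evaluates the kernels of the profile log-likelihood and the restricted log-likelihood (2.13) for the simple iid N(μ, σ²) example discussed above, where the extra REML term log|∑i=1nXi′Σi(θ)−1Xi| reduces to log(n/σ²). A grid search recovers the familiar maximizers: SS/n for ML and the bias-corrected SS/(n − 1) for REML. Constants not involving σ² are dropped, and the data are illustrative.

```python
# Hypothetical sketch contrasting profile ML and REML for
# y_1,...,y_n iid N(mu, sigma^2), X_i = 1, theta = sigma^2.  Up to
# constants, the ML kernel is -(SS/s2 + n*log(s2))/2, while REML picks
# up the extra term log(n/s2), giving -(SS/s2 + (n-1)*log(s2))/2.

import math

y = [4.0, 7.0, 6.0, 3.0, 5.0]
n = len(y)
ybar = sum(y) / n
ss = sum((yi - ybar) ** 2 for yi in y)

def prof_ml(s2):    # profile ML log-likelihood kernel
    return -0.5 * (ss / s2 + n * math.log(s2))

def prof_reml(s2):  # restricted (REML) log-likelihood kernel
    return -0.5 * (ss / s2 + (n - 1) * math.log(s2))

grid = [0.01 * k for k in range(50, 501)]   # sigma^2 in [0.5, 5.0]
s2_ml = max(grid, key=prof_ml)
s2_reml = max(grid, key=prof_reml)

print(s2_ml, s2_reml)   # near SS/n and SS/(n-1), respectively
```

The REML maximizer is larger by the factor n/(n − 1), which is exactly the degrees-of-freedom correction that removes the small-sample bias of the ML variance estimate.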

    2.1.2 Inference and test statistics

    Inference under the marginal LM (2.1) is usually based on large-sample theory since the exact
