| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Henry Wellcome Laboratories for Integrative Neuroscience and Endocrinology (P.S., D.J.S., C.M.D.) and Academic Unit of Primary Health Care, Department of Community-Based Medicine (T.J.P.), University of Bristol, Bristol BS1 3NY, United Kingdom; and Research and Development Support Unit (R.G.), Bristol Royal Infirmary, Bristol BS2 8HW, United Kingdom
Address all correspondence and requests for reprints to: Dr. Colin M. Dayan, Henry Wellcome Laboratories for Integrative Neuroscience and Endocrinology, Dorothy Hodgkin Building, University of Bristol, Whitson Street, Bristol BS1 3NY, United Kingdom. E-mail: colin.dayan{at}bris.ac.uk.
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
Since the introduction of sensitive TSH assays in the 1980s, it has been recommended that thyroid hormone replacement with T4 be titrated to achieve TSH levels in the laboratory reference range (6, 7, 8). For many individuals, this change resulted in a significant reduction in T4 dose, and several studies indicate that titration to these levels results in high normal free T4 but free T3 levels in the low reference range (9, 10, 11). Anecdotally, many patients began to complain of impaired psychological well-being and symptoms suggestive of hypothyroidism when their dose was adjusted in this way (12). Carr et al. (13) reported that patients preferred doses of T4 50 µg greater than the dose that achieved a normal TSH level. In an attempt to quantify this, we used the General Health Questionnaire (GHQ) in a cross-sectional study involving 1922 subjects (14). A categorical score
3 (maximum score, 12) on this scale indicates significant psychiatric morbidity (psychiatric caseness) and is predictive of similar outcome at complex psychiatric interview (15). We identified a 6.7% absolute increase in psychiatric caseness in patients on T4 with a TSH in the normal range compared with a matched control group (14), although this was not apparent in a smaller study (16). Such an increase, if true, could account for 150,000 excess psychiatric cases in the United States alone.
In 1999, a 5-wk crossover study of 33 patients suggested that substitution of 50 µg T4 with 12.5 µg T3 significantly improves mood although not cognitive function (17), but more recently, five other trials have shown either no benefit (18, 19, 20, 21) or a worse outcome (22). However, the differences in psychological well-being potentially attributable to T4 replacement between patients and controls that we saw in our cross-sectional study, although important for the population as a whole, were relatively small and could have been missed in these studies. Here, we report a much larger and longer term study with greater power to resolve the issues raised by these studies.
| Patients and Methods |
|---|
|
|
|---|
Potentially eligible subjects were recruited from 28 family practices in the Bristol and Weston-super-Mare area, West of England, United Kingdom. Inclusion criteria were: age, 1875; T4 dose
100 µg/d; TSH level recorded in the last 15 months and the last level known to be within the local laboratory reference range; and no T4 dose adjustments in the last 3 months. Exclusion criteria were: a history of myocardial infarction, unstable angina or heart failure in the previous 3 months; thyroid cancer or secondary hypothyroidism; cholestyramine use; and use of antidepressants in the previous 3 months or amiodarone in the previous 12 months. The study was approved by the local research ethics committees.
Study protocol
The investigation was designed as a double-blind randomized controlled trial. Information about the study was sent to all the eligible patients with a stamped reply slip. On receiving the replies, interested patients were contacted by telephone to arrange appointments for the randomization visit. Patients were given opportunity to ask questions about the study, and then written, informed consent was obtained. They were then randomized by sequential allocation of study numbers (to which treatment groups had been randomly preallocated). Patients were given a trial pot, which either contained 10 µg T3 (T3 group) or matched tablet of 50 µg T4 indistinguishable from the T3 tablet (placebo/T4 alone group). The remaining T4 dose (original dose minus 50 µg) was given in open-label packs. Patients were assessed at 3 months (visit 2) and 12 months (visit 3) and reminded before these visits to take their medication at the same time of day as their appointment time for 7 d before review.
Evaluations: physical and biochemical measurements
Detailed thyroid and other medical history was taken at visit 1. The following physical measurements were taken at each visit: weight, electrocardiogram, blood pressure (twice at 10-min intervals), resting pulse rate, and body composition (Bodystat1500, Bodystat, Ltd., Isle of Man, United Kingdom). A serum sample was taken at baseline (visit 1, not timed) and 24-h postthyroid hormone dose (visits 2 and 3) for the following estimations: creatinine kinase [normal range (NR), 30150 U/liter]; total cholesterol, alkaline phosphatase (NR, 40110 IU/liter); calcium [NR, 910.5 mg/dl (2.252.7 mmol/liter)] (all Beckman LX20 analyzer, Beckman, Highwycombe, United Kingdom), free T3 [NR, 0.180.457 ng/dl (2.87.1 pmol/liter); Elecsys system 1010, Roche Diagnostics, Mannheim, Germany], free T4 [NR, 0.781.86 ng/dl (10.024.0 pmol/liter)], TSH [NR, 0.34.0 mU/liter (0.34.0 mIU/ml)], SHBG [NR, 0.272.15 µg/dl (1080 nmol/liter)], and antithyroid peroxidase (anti-TPO) antibodies (+ve if titer is
100) (all from Diagnostic Product Corporation, Los Angeles, CA). Samples for thyroid function and SHBG from all three visits were analyzed together.
Psychological measurements
At each visit, the patients psychological well-being was assessed by the following scales: the GHQ-12 (15, 23, 24), a disease-specific thyroid symptom questionnaire (TSQ) (14), the Hospital Anxiety and Depression questionnaire (HADS) (25), and 23 visual analog scales of mood, cognitive behavior, and physical symptoms used in the study of Bunevicius et al. (17). In addition, patients completed a satisfaction question on a five-point scale and devised-questions on sleep and neuromuscular symptoms. The GHQ-12 and the TSQ were scored both by the Likert method (03 per question, maximum score 36-most dissatisfied, linear method) and by the GHQ method (0, 0, 1, 1, maximum score 12-most dissatisfied) to assess caseness (using a threshold score of 3 or more, categorical method) (23, 24). The changes in the GHQ-12 scores at 3 months, controlling for baseline scores, represented the primary outcomes.
Statistical methods
A sample size of 700 patients was calculated based on our cross-sectional data to detect a 0.7-point difference between the groups on the GHQ-12 Likert scale with 80% powera difference sufficient to correct the deficit seen in our cross-sectional study (14). All the analyses were conducted in Stata version 8.0 (26). Results were analyzed by intention to treat. The last observation was carried forward to replace missing values at 3 months follow-up due to withdrawal of the patients from the study. For linear and categorical variables, linear and logistic regression analyses, respectively, were used to detect the differences between the groups at 3 months adjusting for baseline values (age, sex, type of diagnosis, T4 dose, duration of hypothyroidism, anti-TPO positivity, and baseline thyroid hormone measurements). Interactions between the baseline thyroid functions (free T3, free T4, TSH, and T3 to T4 ratio) and the treatment group were also investigated in the regression model for the two primary outcome variables (GHQ Likert and GHQ categorical scores). Repeated measures analysis by linear regression was used to detect any differences between the groups between 3 and 12 months follow-up. Median values were used as the summary statistic for TSH measurements because this parameter is not normally distributed, and statistical comparison was done using natural logarithmic values of TSH, yielding a comparison of geometric means.
| Results |
|---|
|
|
|---|
|
|
| Thyroid function |
|---|
|
|
|---|
|
|
|
At 3 months, the GHQ scores by the Likert method improved markedly in both the placebo (T4 alone) and the intervention (T3) groups compared with baseline (baseline to 3 months, T4 alone, 13.4811.13, P < 0.001; T3 group, 13.4210.67, P < 0.001) with a 39% relative improvement in psychiatric caseness in the placebo group (43.9 reducing to 26.6% caseness). These changes are consistent with a marked placebo effect, although improved compliance with medication in the placebo group as evidenced by a significant fall in the serum TSH levels (baseline to 3 months, T4 alone, 0.940.728 mIU/ml, P < 0.05) could have contributed (Fig. 2
; Table 2
). Comparisons between the groups revealed a difference of 0.47 points in the GHQ scored by the Likert method, which was smaller, than the difference used to power the trial (0.7) and did not reach significance at the P < 0.05 level (95% CI, 0.26, +1.12; P = 0.218; Fig. 4
). Using the categorical scoring methods with a threshold
3, a significantly greater reduction in psychiatric caseness was seen in the T3 group compared with T4 alone [19.2 vs. 26.6%, odds ratio (OR), 0.61; 95% CI, 0.42, 0.90; P = 0.01; Fig. 5
; Table 2
). Improvement was also seen in the HADS anxiety score at 3 months (OR, 0.55; 95% CI, 0.32, 0.95; P = 0.033). However, no difference was seen in the TSQ scores, sleep, neuromuscular symptoms, HADS depression category, or visual analog scales, and the percentage of patients reporting that they felt better on direct questioning was not different (well-being question, Table 2
). No significant differences were seen in any of the physical or biochemical measures other than a slightly lower diastolic blood pressure in the T4 alone group (Table 2
). The significance of these results was unchanged when controlled for age, sex, type of diagnosis, prestudy T4 dose, use of other chronic medication, baseline GHQ scores, anti-TPO positivity, and baseline thyroid function (free T3, free T4, TSH, and T3 to T4 ratio).
|
|
Predefined subgroup analysis demonstrated a significant interaction on GHQ scores between baseline free T3 (as a continuous variable) and study group allocation (OR, 2.2; 95% CI, 1.28, 3.98; P = 0.005). Division of free T3 values into quartiles revealed that patients with baseline free T3 in the highest quartile, approximately equivalent to the upper half of the reference range [>0.275 ng/dl (>4.27 pmol/liter), reference range, 0.180.457 ng/dl (2.87.1 pmol/liter)], responded best to the intervention (Table 3
) (only one patient in the T4 alone group, and none in the T3 group had a free T3 level above the normal range). No such effect was seen with baseline free T4 or baseline values of TSH, T3 to T4 ratio, or GHQ scores.
|
When the subjects were reassessed at 12 months, GHQ scores in the intervention group (T3) had risen (worsened, P = 0.0034), and there was now no difference between the two groups (T3 vs. T4 alone, P = 0.24) (Table 2
; Fig. 4
). Interestingly, in both groups, the free T3 to T4 ratio fell significantly (T3 group, 9% reduction; T4 alone group, 6% reduction, both P < 0.001) (Fig. 2D
) between months 3 and 12. This change was not explained by assay drift because samples from both visits were analyzed together. No change over this period was seen in TSH levels.
| Discussion |
|---|
|
|
|---|
Our protocol was designed to maximize sample size and contains several potential limitations. Firstly, the use of a fixed substitution of 50 µg T4 with 10 µg T3 resulted in a rise in TSH and a fall in T4 to the low reference range in the intervention group indicating underreplacement with T4. Because tissues derive up to 50% of their intracellular T3 directly from serum T4, this may have reduced the benefit gained in the intervention group and underlines the importance of replacing both T3 and T4 appropriately. Secondly, despite limiting study entry to patients on
100 µg of T4 (average daily dose 124.3 µg), many of the subjects with primary hypothyroidism in the trial may have had residual thyroid function. This, in combination with the use of a fixed T3 to T4 substitution ratio, might be expected to result in further variability in achieving optimal final T3 and T4 levels in subjects after intervention. Thirdly, the timing of the baseline (visit 1) blood sample was not controlled, whereas the 3- and 12-month samples were taken 24 h postdosing with thyroid hormone. This may underlie the apparent fall in FT4 levels between baseline and 3 months in the T4 group despite a fall in TSH because TSH levels are much less dependent on dose timing over the 24-h period (29). Fourthly, as with previous studies, multiple psychological scales were used, increasing the possibility of positive findings by chance in one or more or the measures. However, primary and secondary analyses were defined in advance both in terms of the parameters to be analyzed and the primary outcome time point. For this reason, formal statistical correction was not made in reporting the results of the secondary analyses.
There have now been six other reported studies of the use of T3 and T4 in combination including the original report of Buneuvicius et al. (17, 18, 19, 20, 21, 22). All used different T4 reduction/T3 substitution regimes, some using a crossover design and apart from the original report showed either no benefit (18, 19, 20, 21) or a worsening of well-being (22) even when a subgroup of dissatisfied patients was examined (22). The major difference with the current report was in the number of participants. Calculations on the basis of the differences seen in our previous cross-sectional study resulted in the current study being over 6 times larger than any of the other studies (n = 23101). Even on this basis, we were somewhat optimistic in powering the study to detect only a difference equal to or greater than the whole difference between the groups in our cross-sectional study; indeed, the observed difference was only around 60% of this (0.43 vs. 0.7 points on the GHQ Likert score). It remains possible that optimized replacement with T3 and T4 could improve well-being in a small subgroup of patients, but clear predictive markers would be required to help distinguish this group from the changes due to placebo and to guide the design of future studies. Our observation that subjects with higher baseline free T3 levels appear to benefit most is interesting in this respect, possibly suggestive of a threshold effect (Table 3
), but further evidence using different baseline parameters is required.
The large size and long duration of the current study provide information on other aspects of thyroid hormone replacement. Early studies suggested that TSH levels are more sensitive to T4 than T3 levels (29). Here, we confirm that this relationship holds true even with TSH levels in the laboratory reference range. Baseline thyroid function values showed a greater negative regression gradient between T4 and TSH than between T3 and TSH (Fig. 3
). In addition, TSH levels rose in response to a fall in T4 levels in the intervention group, even in the face of unchanged or probably higher T3 levels. This sensitivity of the pituitary/hypothalamic feedback to serum T4 and T3 explains how replacement with T4 alone can frequently achieve normal TSH levels with a combination of low T3 and high T4 levels as observed previously in smaller studies (9, 10, 11) and as seen in this study (Table 1
). In addition, over a 9-month period, a significant fall in the T3 to T4 ratio was seen with no associated change in TSH (Fig. 4
). This may reflect slow adaptation of tissue deiodinases to changes in thyroid hormone levels (T3 exposure in the intervention group and improved compliance in the T4 group) as reported in in vitro studies (30). Finally, the benefit attributed to placebo was larger (17.3% absolute reduction/39% relative reduction in psychiatric caseness; Table 2
; Fig. 5
) and more consistently maintained (no reduction after 12 months; Fig. 4
) than expected. Indeed, the estimate of the benefit specifically attributable to the intervention (7.4% less psychiatric caseness) was over 2-fold smaller than this effect, suggesting that in clinical practice, much of the improvement with T3 may reflect placebo benefit and that demonstrating a specific effect is difficult even in large clinical trials.
In conclusion, data from this large community-based study do not provide conclusive evidence of specific benefit from partial substitution of T4 by T3 in patients on T4 replacement. However, they do underline the large size and sustained nature of the placebo effect that may be obtained in studies of this nature. It remains possible that a small subgroup of individuals does benefit specifically from partial substitution, but parameters identifying such a group have yet to be clearly identified.
| Acknowledgments |
|---|
| Footnotes |
|---|
First Published Online December 7, 2004
Abbreviations: CI, Confidence interval; GHQ, General Health Questionnaire; HADS, Hospital Anxiety and Depression questionnaire; NR, normal range; OR, odds ratio; TPO, thyroid peroxidase; TSQ, thyroid symptom questionnaire.
Received August 20, 2004.
Accepted November 29, 2004.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. Jonklaas, B. Davidson, S. Bhagat, and S. J. Soldin Triiodothyronine Levels in Athyreotic Individuals During Levothyroxine Therapy JAMA, February 20, 2008; 299(7): 769 - 777. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. T. Joffe, M. Brimacombe, A. J. Levitt, and A. Stagnaro-Green Treatment of Clinical Hypothyroidism With Thyroxine and Triiodothyronine: A Literature Review and Metaanalysis Psychosomatics, October 1, 2007; 48(5): 379 - 384. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Saravanan, T. J. Visser, and C. M. Dayan Psychological Well-Being Correlates with Free Thyroxine But Not Free 3,5,3'-Triiodothyronine Levels in Patients on Thyroid Hormone Replacement J. Clin. Endocrinol. Metab., September 1, 2006; 91(9): 3389 - 3393. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Walsh, L. C. Ward, V. Burke, C. I. Bhagat, L. Shiels, D. Henley, M. J. Gillett, R. Gilbert, M. Tanner, and B. G. A. Stuckey Small Changes in Thyroxine Dosage Do Not Produce Measurable Changes in Hypothyroid Symptoms, Well-Being, or Quality of Life: Results of a Double-Blind, Randomized Clinical Trial J. Clin. Endocrinol. Metab., July 1, 2006; 91(7): 2624 - 2630. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Grozinsky-Glasberg, A. Fraser, E. Nahshoni, A. Weizman, and L. Leibovici Thyroxine-Triiodothyronine Combination Therapy Versus Thyroxine Monotherapy for Clinical Hypothyroidism: Meta-Analysis of Randomized Controlled Trials J. Clin. Endocrinol. Metab., July 1, 2006; 91(7): 2592 - 2599. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. F. Escobar-Morreale, J. I. Botella-Carretero, F. E. del Rey, and G. M. de Escobar Treatment of Hypothyroidism with Combinations of Levothyroxine plus Liothyronine J. Clin. Endocrinol. Metab., August 1, 2005; 90(8): 4946 - 4954. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Endocrinology | Endocrine Reviews | J. Clin. End. & Metab. |
| Molecular Endocrinology | Recent Prog. Horm. Res. | All Endocrine Journals |