CuGenDBv2

Sed0006434 (gene) of Chayote v1 genome

Gene ID: Sed0006434
Organism: Sechium edule (Chayote v1)
Description: Reverse transcriptase Ty1/copia-type domain-containing protein
Genome location: LG10:27710069..27713935
RNA-Seq Expression: Sed0006434
Synteny: Sed0006434
Gene Ontology terms: NA
InterPro domains: NA
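
The genome location string above can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split 'SEQID:START..END' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError("unrecognised location string: %r" % loc)
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG10:27710069..27713935")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 3867 bp on LG10.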


Homology
GenBank top hits (columns: e-value, % identity, alignment)
XP_020415542.1 uncharacterized protein LOC109948051 [Prunus persica] | e-value: 3.4e-39 | identity: 51.15%
Query:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS
        LS      DLG LTYFLGLQ+Q+ +T I VNQ +YAAD+L + GM+  + C+TP    +         L D   YRS+VG L YLTF+  DI F V  + 
Subjt:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS

Query:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
        QFMH+P     AAVKR+LRY+ G+L  GI    G + L  +SD+DWAGD  DRRSTTG VV+LG+NPISW  KK
Subjt:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

XP_021819565.1 uncharacterized protein LOC110761405 [Prunus avium] | e-value: 5.8e-39 | identity: 50.55%
Query:  TIGNGENLSNSHQGSDLGALTYFLGLQIQHKNT-CIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDI
        TIG   NL++  +  D+G LTYFLGLQI + +T  IFVNQH+YA ++L +  MS+ +AC+TP    S   T+    L D   +RS+VGAL YLTFT  DI
Subjt:  TIGNGENLSNSHQGSDLGALTYFLGLQIQHKNT-CIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDI

Query:  CFVVGPVSQFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
         + V  V Q+M +P       VKRILRYI+G+L  GI    GP+ L+ +SD+DWAGD   RRSTTGF+V+LG NP+SW  KK
Subjt:  CFVVGPVSQFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

XP_022143489.1 uncharacterized protein LOC111013365 [Momordica charantia] | e-value: 2.4e-40 | identity: 53.25%
Query:  SDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVSQFMHSPM
        +DL  L YFLGL+I + N  + ++Q +Y  D+LQRFG+ + + C TP S  S      PCS++D   YRS++G+LHYLTFT  DI F VG +SQFMH+P 
Subjt:  SDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVSQFMHSPM

Query:  MIQLAAVKRILRYIKGSLDQGIQLQ--PGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
           L A KRILRY+ GSL   I  Q     L L  FSDSDWAGD  DRRST+GFV++LGSNPIS   KK
Subjt:  MIQLAAVKRILRYIKGSLDQGIQLQ--PGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

XP_022152156.1 uncharacterized protein LOC111019945 [Momordica charantia] | e-value: 3.1e-40 | identity: 52.35%
Query:  SDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVSQFMHSPM
        +DLG L YFLGL+I + +  I VNQ +Y  DIL+RFGM + + C+TP +  + +    PCS +DA  YRS++GA +YLTF+  DI F V  +SQ MH P 
Subjt:  SDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVSQFMHSPM

Query:  MIQLAAVKRILRYIKGSLDQGIQLQPGP---LTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
        +  L A KRILRY+ G++D G+  +  P   L L  FSDSDWAGD +DRRSTTG V++LGSNPISW  KK
Subjt:  MIQLAAVKRILRYIKGSLDQGIQLQPGP---LTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

XP_030964983.1 uncharacterized protein LOC115986279, partial [Quercus lobata] | e-value: 7.6e-39 | identity: 51.15%
Query:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS
        LS +    DLG L YFLG+QI      + ++Q +YA++IL RF M N +   TP  S +  T      L D   YRSMVGAL YLTFT  D+ F V  + 
Subjt:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS

Query:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
        QFM SP    L A KR+LRY++G+L  GI    GPLTL+ F+D+DWAGD  DR+STTGF+V+LGSNPISW  KK
Subjt:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A2N9EKF8 Reverse transcriptase Ty1/copia-type domain-containing protein | e-value: 9.0e-46 | identity: 54.6%
Query:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS
        LSN+ +  DLG L YFLGLQI +K   +FV+QH+Y  D+L +F M+  +A +TP ++ S  +T+    L D   YRS+VGAL Y TFT  DI F V  V 
Subjt:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS

Query:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
        QFMH P  I  AA KRILR++KG+LD+GI  QPGPL L+ F+D+DWAGD  DRRST+G VV+LG+NPI+W+ KK
Subjt:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

A0A2N9EN11 Reverse transcriptase Ty1/copia-type domain-containing protein | e-value: 6.2e-47 | identity: 56.32%
Query:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS
        LS + +  DLG L YFLGLQI +KN   FV+Q +Y  D+L +F MS+ +A +TP ++  + +TS   SL D   YRS+VGAL Y TFT  DI F V  V 
Subjt:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS

Query:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
        QFMH+P    L A KRILRY+KGSLD+GI  QPGPLTL+ F+D+DWAGD +DRRST+G  V+LG+NPI+WM KK
Subjt:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

A0A2N9GPU6 Reverse transcriptase Ty1/copia-type domain-containing protein | e-value: 6.2e-47 | identity: 56.32%
Query:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS
        LS + +  DLG L YFLGLQI +KN   FV+Q +Y  D+L +F MS+ +A +TP ++  + +TS   SL D   YRS+VGAL Y TFT  DI F V  V 
Subjt:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS

Query:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
        QFMH+P    L A KRILRY+KGSLD+GI  QPGPLTL+ F+D+DWAGD +DRRST+G  V+LG+NPI+WM KK
Subjt:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

A0A2N9HAK4 Uncharacterized protein | e-value: 1.3e-44 | identity: 52.87%
Query:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS
        LS + +  DLG L YFLGLQI++K   +FV+QH+Y  D+L +F M+  +A  TP +     +T+    L D   YRS+VGAL Y TFT  DI F V  V 
Subjt:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS

Query:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
        QFMH P  +  AA KRILRY+KG+LD+G+  QPGPL L+ F+D+DWAGD  DRRST+G VV+LG+NPI+W+ KK
Subjt:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

A0A2N9JB37 Uncharacterized protein | e-value: 1.3e-44 | identity: 52.87%
Query:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS
        LS + +  DLG L YFLGLQI++K   +FV+QH+Y  D L +F M+  +A  TP +     +T+    L D   YRS+VGAL Y TFT  DI F V  V 
Subjt:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVS

Query:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
        QFMH P  +  AA KRILRY+KG+LD+G+  QPGPL L+ F+D+DWAGD  DRRST+G VV+LG+NPI+W+ KK
Subjt:  QFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

SwissProt top hits (columns: e-value, % identity, alignment)
P04146 Copia protein | e-value: 2.3e-14 | identity: 30.81%
Query:  SDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTF-TCLDICFVVGPVSQFMHSP
        +DL  + +F+G++I+ +   I+++Q  Y   IL +F M N  A +TP  S+ IN        D     RS++G L Y+   T  D+   V  +S++    
Subjt:  SDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTF-TCLDICFVVGPVSQFMHSP

Query:  MMIQLAAVKRILRYIKGSLDQGIQLQPG---PLTLSTFSDSDWAGDCLDRRSTTGFVVYL-GSNPISWMLKK
               +KR+LRY+KG++D  +  +        +  + DSDWAG  +DR+STTG++  +   N I W  K+
Subjt:  MMIQLAAVKRILRYIKGSLDQGIQLQPG---PLTLSTFSDSDWAGDCLDRRSTTGFVVYL-GSNPISWMLKK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 3.2e-16 | identity: 31.15%
Query:  NLSNSHQGSDLGALTYFLGLQIQHKNTC--IFVNQHRYAADILQRFGMSNVEACATPSSSQ-SINTTSYPCSLDD-----ALAYRSMVGALHY-LTFTCL
        +LS S    DLG     LG++I  + T   ++++Q +Y   +L+RF M N +  +TP +    ++    P ++++      + Y S VG+L Y +  T  
Subjt:  NLSNSHQGSDLGALTYFLGLQIQHKNTC--IFVNQHRYAADILQRFGMSNVEACATPSSSQ-SINTTSYPCSLDD-----ALAYRSMVGALHY-LTFTCL

Query:  DICFVVGPVSQFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLK
        DI   VG VS+F+ +P      AVK ILRY++G+    +        L  ++D+D AGD  +R+S+TG++       ISW  K
Subjt:  DICFVVGPVSQFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLK

P92519 Uncharacterized mitochondrial protein AtMg00810 | e-value: 6.7e-30 | identity: 41.57%
Query:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATP---SSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVG
        LS++    DLG + YFLG+QI+   + +F++Q +YA  IL   GM + +  +TP     + S++T  YP    D   +RS+VGAL YLT T  DI + V 
Subjt:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATP---SSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVG

Query:  PVSQFMHSPMMIQLAAVKRILRYIKGSLDQGIQL-QPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
         V Q MH P +     +KR+LRY+KG++  G+ + +   L +  F DSDWAG    RRSTTGF  +LG N ISW  K+
Subjt:  PVSQFMHSPMMIQLAAVKRILRYIKGSLDQGIQL-QPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e-value: 3.9e-30 | identity: 42.37%
Query:  ENLSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGP
        +NLS      D   L YFLG++ +   T + ++Q RY  D+L R  M   +   TP +     +      L D   YR +VG+L YL FT  DI + V  
Subjt:  ENLSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGP

Query:  VSQFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPG-PLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
        +SQFMH P    L A+KRILRY+ G+ + GI L+ G  L+L  +SD+DWAGD  D  ST G++VYLG +PISW  KK
Subjt:  VSQFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPG-PLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e-value: 7.4e-29 | identity: 42.33%
Query:  LTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVSQFMHSPMMIQLA
        L YFLG++ +     + ++Q RY  D+L R  M   +  ATP ++    T      L D   YR +VG+L YL FT  D+ + V  +SQ+MH P      
Subjt:  LTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVSQFMHSPMMIQLA

Query:  AVKRILRYIKGSLDQGIQLQPG-PLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
        A+KR+LRY+ G+ D GI L+ G  L+L  +SD+DWAGD  D  ST G++VYLG +PISW  KK
Subjt:  AVKRILRYIKGSLDQGIQLQPG-PLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e-value: 5.6e-24 | identity: 37.72%
Query:  DLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVSQFMHSPMM
        DLG L YFLGL+I      I + Q +YA D+L   G+   +  + P       +        DA AYR ++G L YL  T LDI F V  +SQF  +P +
Subjt:  DLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVSQFMHSPMM

Query:  IQLAAVKRILRYIKGSLDQGI-QLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
            AV +IL YIKG++ QG+       + L  FSD+ +      RRST G+ ++LG++ ISW  KK
Subjt:  IQLAAVKRILRYIKGSLDQGI-QLQPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK

ATMG00240.1 Gag-Pol-related retrotransposon family protein | e-value: 3.0e-09 | identity: 42.86%
Query:  YLTFTCLDICFVVGPVSQFMHSPMMIQLAAVKRILRYIKGSLDQGI-QLQPGPLTLSTFSDSDWAGDCLDRRSTTGF
        YLT T  D+ F V  +SQF  +    Q+ AV ++L Y+KG++ QG+       L L  F+DSDWA     RRS TGF
Subjt:  YLTFTCLDICFVVGPVSQFMHSPMMIQLAAVKRILRYIKGSLDQGI-QLQPGPLTLSTFSDSDWAGDCLDRRSTTGF

ATMG00810.1 DNA/RNA polymerases superfamily protein | e-value: 4.7e-31 | identity: 41.57%
Query:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATP---SSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVG
        LS++    DLG + YFLG+QI+   + +F++Q +YA  IL   GM + +  +TP     + S++T  YP    D   +RS+VGAL YLT T  DI + V 
Subjt:  LSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQHRYAADILQRFGMSNVEACATP---SSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVG

Query:  PVSQFMHSPMMIQLAAVKRILRYIKGSLDQGIQL-QPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
         V Q MH P +     +KR+LRY+KG++  G+ + +   L +  F DSDWAG    RRSTTGF  +LG N ISW  K+
Subjt:  PVSQFMHSPMMIQLAAVKRILRYIKGSLDQGIQL-QPGPLTLSTFSDSDWAGDCLDRRSTTGFVVYLGSNPISWMLKK


Sequences
CDS sequence
ATGAACCACTCCTTTCAAGGGCGACATCCTTCTGCAAGATTAGCCACTATGGTTGTAGCAGCAAATTCGAACTCCACATCTAATATGTCTGCTCCTTCATCTTTGCAAAC
CTCAACAATTTGGCTTTCGAATTCTAGATGCAATGCCCATCTAACTCACGATTTTGGGAATTTTAACAAATCTGAAGCTTACAATGGTGAGGAAAATGTCACAATAGGCA
ATGGAGAAAATCTTTCAAACTCCCATCAGGGCTCGGATTTGGGGGCCTTAACATATTTCCTGGGACTTCAAATTCAACATAAAAATACTTGTATTTTTGTTAATCAGCAC
AGATATGCAGCAGATATATTACAAAGATTTGGGATGTCAAATGTTGAGGCTTGTGCAACTCCTTCCTCATCTCAGTCCATCAATACAACATCTTATCCATGTTCTCTTGA
TGATGCCTTAGCCTATCGAAGCATGGTAGGTGCTCTCCATTATTTGACCTTCACTTGCCTTGACATTTGTTTTGTTGTTGGTCCTGTTTCCCAATTTATGCACTCTCCCA
TGATGATCCAACTAGCAGCTGTAAAAAGGATACTTCGTTACATCAAAGGTTCTCTTGACCAAGGAATTCAGCTTCAACCAGGTCCTTTAACATTGTCCACTTTCTCCGAC
TCTGATTGGGCTGGTGACTGCCTTGATCGAAGGTCCACAACTGGATTCGTTGTGTATCTAGGATCGAATCCAATTTCTTGGATGTTGAAAAAGTAG
mRNA sequence
AAAACATCGATTCCTATATTCTGCGTATACAAAATATTATTCATAAACTTGCCACTGCAAACGTCCAAATTGAAGATGAAGACAGTGTGATATACGCAGTCAATGGTCTA
CCCAGTATGTATAATGTTTTGAAAACCACCTTGCGAACAAGATCCGTATCCCCAACGTTTGCTGAACCTTATACTCTTCTAAAGGCAGAAGAATTGGCAATCGAAAATCA
ATCAAAGATTGAAGAACGATCAAATCTTGCAACAATGGCAATGATGGCTACGTCAAATGTTCGTGGTTCTTGGAAATGGTCCAATCGAGGTAGATGTCGTAATGGTGGTG
GTCGGGGGCGAGCGATATTCAATTCTTCATTGCCACATAATTCATTAGGTTCTTCTTCTTCCAACACATCCAATTTTCAAATCTCAAACCCTACCCAAATCCAGTTGATC
CAAATACAAATCCACACAAAGTGAATTGTCAAATTTGCAACCGATTTGGCCACAATGCCCTAGACTGCTACAATCGAATGAACCACTCCTTTCAAGGGCGACATCCTTCT
GCAAGATTAGCCACTATGGTTGTAGCAGCAAATTCGAACTCCACATCTAATATGTCTGCTCCTTCATCTTTGCAAACCTCAACAATTTGGCTTTCGAATTCTAGATGCAA
TGCCCATCTAACTCACGATTTTGGGAATTTTAACAAATCTGAAGCTTACAATGGTGAGGAAAATGTCACAATAGGCAATGGAGAAAATCTTTCAAACTCCCATCAGGGCT
CGGATTTGGGGGCCTTAACATATTTCCTGGGACTTCAAATTCAACATAAAAATACTTGTATTTTTGTTAATCAGCACAGATATGCAGCAGATATATTACAAAGATTTGGG
ATGTCAAATGTTGAGGCTTGTGCAACTCCTTCCTCATCTCAGTCCATCAATACAACATCTTATCCATGTTCTCTTGATGATGCCTTAGCCTATCGAAGCATGGTAGGTGC
TCTCCATTATTTGACCTTCACTTGCCTTGACATTTGTTTTGTTGTTGGTCCTGTTTCCCAATTTATGCACTCTCCCATGATGATCCAACTAGCAGCTGTAAAAAGGATAC
TTCGTTACATCAAAGGTTCTCTTGACCAAGGAATTCAGCTTCAACCAGGTCCTTTAACATTGTCCACTTTCTCCGACTCTGATTGGGCTGGTGACTGCCTTGATCGAAGG
TCCACAACTGGATTCGTTGTGTATCTAGGATCGAATCCAATTTCTTGGATGTTGAAAAAGTAGACAACCATGTCGCGAAGCTCCACAGAGGACGAATATCGTGCGCTCGC
TGCCACTACTGCAGAAGTCTTCTGGGTTCGACAAATCCTGAAAGAACTAAATGTTTTTCTTCACGAACCACCCACTCTATTTTGTGACAATCTTTCAGCTATTCAGCTCG
CCGTAATCCTGTCTTTCAAGGTCGAACAAAGCACGTGGAGGTCGATTTTCATTTTGTTCGAGAACGGGTTGCTGCCAAGGACATCTCTCTTCGGTTTATATCCACTCGTG
AGCAGCCAGCCGATATCTTCACTAAATCCTTAAGCACGGGTCGTCTCTATTTTCTGTGAAACAAACTCA
Protein sequence
MNHSFQGRHPSARLATMVVAANSNSTSNMSAPSSLQTSTIWLSNSRCNAHLTHDFGNFNKSEAYNGEENVTIGNGENLSNSHQGSDLGALTYFLGLQIQHKNTCIFVNQH
RYAADILQRFGMSNVEACATPSSSQSINTTSYPCSLDDALAYRSMVGALHYLTFTCLDICFVVGPVSQFMHSPMMIQLAAVKRILRYIKGSLDQGIQLQPGPLTLSTFSD
SDWAGDCLDRRSTTGFVVYLGSNPISWMLKK
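
The protein sequence appears to be a straightforward translation of the CDS. The sketch below illustrates that relationship on the first 60 and last 9 nucleotides of the CDS listed above, assuming the standard genetic code (NCBI translation table 1); the record itself does not state which table was used, and `translate` is a hypothetical helper written for this illustration.

```python
# Standard genetic code (NCBI table 1), with codons ordered T, C, A, G
# at each of the three positions. ASSUMPTION: this table applies here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 60 nt and last 9 nt of the CDS sequence in this record.
cds_start = "ATGAACCACTCCTTTCAAGGGCGACATCCTTCTGCAAGATTAGCCACTATGGTTGTAGCA"
cds_end = "AAAAAGTAG"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full CDS (with line breaks removed) to `translate` should allow the same comparison against the complete protein sequence.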