CuGenDBv2

Moc09g29090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g29090
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr9:21816980..21819632
RNA-Seq Expression: Moc09g29090
Synteny: Moc09g29090
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
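The genome location field above packs a chromosome name and two coordinates into one string. A minimal Python sketch of how such a string could be split and the gene span computed is given here. The function names `parse_location` and `span_length` are hypothetical, and the 1-based, inclusive coordinate convention is an assumption that the record itself does not state.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:21816980..21819632")
print(chrom, start, end, span_length(start, end))
# prints: chr9 21816980 21819632 2653
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention should be checked against the database documentation before relying on the number.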


Homology
GenBank top hits (e value, % identity, alignment)
XP_022157708.1 uncharacterized protein LOC111024361 [Momordica charantia] (e value 4.9e-90, 65.31% identity)
Query:  QEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA---------------------
        +EMV  FLT FFPPAKT QLRT+I SFR+YDYEQLFE WERYKELLRKCPQHG LEWLQIQMFYNGLNGQ  TILDAAA                     
Subjt:  QEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA---------------------

Query:  -------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAADTYSYYEPTIEQAQYVNNRNFGYKGNQQQSSLPTHYHPGLRTHE
               ERSNAK+VAGMYEIDE+SSLKAQVQALTN VSKLSGPGTS+S E VAA DTYSYYEPTIEQAQ                              
Subjt:  -------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAADTYSYYEPTIEQAQYVNNRNFGYKGNQQQSSLPTHYHPGLRTHE

Query:  NFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLNTMQKGKFPSDIEVNP
                        FTS PAEKKSS EDLLGAFINE RSRAS+IENQVEGMEV+LEGNTT+IKNMEVQIGQ+A TLNTMQKGKFPSDIEV P
Subjt:  NFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLNTMQKGKFPSDIEVNP

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber] (e value 1.1e-81, 50.16% identity)
Query:  LQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-
        L+DK R WL+SLQPGS+  WQ+M   FL KFFPPAKTAQLR++IG FRQ D+E L+EAWERYK+L+R CPQHG  +WLQ+QMFYNGLNGQ  TI+DAA+ 
Subjt:  LQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-

Query:  ---------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAAD-TYSYYEPTIEQAQYVNNRNFGY
                                   ER+ AK+VAG++E++  ++L AQV +L++ VS L+        E+VAA+  T    E + EQ QY+NNRN+ Y
Subjt:  ---------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAAD-TYSYYEPTIEQAQYVNNRNFGY

Query:  KGNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLN
        +GN     +P +YHPGLR HENFSY N +NVLQPPPGF SQP+EKK S ED + +F+ E+++   + ++Q++ +E         +KN+EVQIGQ+A+T+N
Subjt:  KGNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLN

Query:  TMQKGKFPSDIEVNP
          Q+G FPS+ EVNP
Subjt:  TMQKGKFPSDIEVNP

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber] (e value 9.8e-83, 50.79% identity)
Query:  LQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-
        L+DK R WL+SLQPGS+  WQ+M   FL KFFPPAKTAQLR++IG FRQ D+E L+EAWERYK+L+R CPQHG  +WLQ+QMFYNGLNGQ  TI+DAA+ 
Subjt:  LQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-

Query:  ---------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAAD-TYSYYEPTIEQAQYVNNRNFGY
                                   ER+ AK+VAG++E++  ++L AQV +L++ VS LS      S E+VAA+  T    E + EQ QY+NNRN+ Y
Subjt:  ---------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAAD-TYSYYEPTIEQAQYVNNRNFGY

Query:  KGNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLN
        +GN     +P +YHPGLR HENFSY N +NVLQPPPGF SQP+EKK S ED + +F+ E+++R  + +++++ +E         +KN+EVQIGQ+A+T+N
Subjt:  KGNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLN

Query:  TMQKGKFPSDIEVNP
          Q+G FPS+ EVNP
Subjt:  TMQKGKFPSDIEVNP

XP_023929660.1 uncharacterized protein LOC112040975 [Quercus suber] (e value 2.2e-82, 50.79% identity)
Query:  LQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-
        L+DK R WL+SLQPGS+  WQ+M   FL KFFPPAKTAQLR++IG FRQ D+E L+EAWERYK+L+R CPQHG L+WLQ+QMFYNGLNGQ  TI+DAA+ 
Subjt:  LQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-

Query:  ---------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAAD-TYSYYEPTIEQAQYVNNRNFGY
                                   ER+ AK+VAG++E++  ++L AQV +L++ VS L+      S E+VAA+  T    E + E  QY+NNRN+ Y
Subjt:  ---------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAAD-TYSYYEPTIEQAQYVNNRNFGY

Query:  KGNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLN
         GN     +P +YHPGLR HENFSY N +NVLQPPPGF SQP+EKK S ED + +F+ E+++R  + ++Q++ +E         +KN+EVQIGQ+A+T+N
Subjt:  KGNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLN

Query:  TMQKGKFPSDIEVNP
          Q+G FPS+ EVNP
Subjt:  TMQKGKFPSDIEVNP

XP_024020480.1 uncharacterized protein LOC112091333 [Morus notabilis] (e value 6.6e-79, 51.11% identity)
Query:  LQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-
        L+DK R WL SL   S+  W+EM   FL KFFPP+K +QL++++GSF Q D+E L+EAWER+K+LLRKCPQHGY EW+ I  FYNGLNGQ  TI+D+ A 
Subjt:  LQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-

Query:  ---------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAADT-YSYYEPTIEQAQYVNNRNFGY
                                   ERS  K+ AG++E+D ++SL AQV AL+N ++ L+    S S+E VA A T ++  E T EQ Q+VNNRNF Y
Subjt:  ---------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAADT-YSYYEPTIEQAQYVNNRNFGY

Query:  KGNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLN
        K NQ    LP HYHPGLR HENFSYANNRNVLQPPPGF  Q  EKK S EDLL  FI E+R R ++ E +++ +E         +K++EVQIGQ+A+T+ 
Subjt:  KGNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLN

Query:  TMQKGKFPSDIEVNP
            GKFPSD E NP
Subjt:  TMQKGKFPSDIEVNP

TrEMBL top hits (e value, % identity, alignment)
A0A2I4FP56 uncharacterized protein LOC109000837 (e value 7.4e-60, 43.39% identity)
Query:  MVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-----------------------
        M   FL KFFPPAKTAQL+++IG  +Q D+E L++AWERYK+L+R CPQHG  +WLQ+QMFY   NGQ  T +DA +                       
Subjt:  MVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-----------------------

Query:  -----ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAAD-TYSYYEPTIEQAQYVNNRNFGYKGNQQQSSLPTHYHPGLRTHEN
             +R+ AK+VAG++E++ ++++ AQV  L++ +S L       S E+VAA   T    E + EQ QY+NN+N+ Y+GN     +P H+HPGLR HEN
Subjt:  -----ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAAD-TYSYYEPTIEQAQYVNNRNFGYKGNQQQSSLPTHYHPGLRTHEN

Query:  FSYANNRNVL--QPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLNTMQKGKFPSDIEVNP
         SY N +NVL  QPP GF SQ ++KK S E+ + +F+ E+ ++  + ++Q++ +E        AIKN+EVQIGQ+A+T+N  Q+  FPS+ EVNP
Subjt:  FSYANNRNVL--QPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLNTMQKGKFPSDIEVNP

A0A2I4G4Q3 uncharacterized protein LOC109004712 (e value 2.7e-70, 45.91% identity)
Query:  LQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-
        L+D+ R WL+SLQP S+  WQ+M   F  KFFPPAKT QLR++IG F+Q D+E L+EAWE YK+L+R+CPQHG  +WLQ+QMFYNGLNG   TI+D  + 
Subjt:  LQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-

Query:  ---------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAAD-TYSYYEPTIEQAQYVNNRNFGY
                                   ER+ AK+VA ++E++ +++L AQV  L++ +S L+      S E+V A   T    E + EQ QY+NNRN+ Y
Subjt:  ---------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAAD-TYSYYEPTIEQAQYVNNRNFGY

Query:  KGNQQQSSLPTHYHPGLRTHENFSYANNRNVL--QPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAI-KNMEVQIGQMAS
         GN     +P +YHPG + HEN SY N +NVL  QPPPGF SQ +EKK S ED + +FI E+ +R  + +++++ +E        AI KN+EVQIGQ+A+
Subjt:  KGNQQQSSLPTHYHPGLRTHENFSYANNRNVL--QPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAI-KNMEVQIGQMAS

Query:  TLNTMQKGKFPSDIEVNP
        T+N  Q+G FPS+ EVNP
Subjt:  TLNTMQKGKFPSDIEVNP

A0A6J1DU19 uncharacterized protein LOC111024361 (e value 2.4e-90, 65.31% identity)
Query:  QEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA---------------------
        +EMV  FLT FFPPAKT QLRT+I SFR+YDYEQLFE WERYKELLRKCPQHG LEWLQIQMFYNGLNGQ  TILDAAA                     
Subjt:  QEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA---------------------

Query:  -------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAADTYSYYEPTIEQAQYVNNRNFGYKGNQQQSSLPTHYHPGLRTHE
               ERSNAK+VAGMYEIDE+SSLKAQVQALTN VSKLSGPGTS+S E VAA DTYSYYEPTIEQAQ                              
Subjt:  -------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAADTYSYYEPTIEQAQYVNNRNFGYKGNQQQSSLPTHYHPGLRTHE

Query:  NFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLNTMQKGKFPSDIEVNP
                        FTS PAEKKSS EDLLGAFINE RSRAS+IENQVEGMEV+LEGNTT+IKNMEVQIGQ+A TLNTMQKGKFPSDIEV P
Subjt:  NFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLNTMQKGKFPSDIEVNP

A0A6P9DWY0 uncharacterized protein LOC118344026 (e value 3.3e-52, 44.88% identity)
Query:  YKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA----------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKL
        YK+L+R+CPQHG  +WLQ QMFYNGLNGQ  TI+DAA+                            ER+  K+VAG++E++ +++L AQV +L++ +S L
Subjt:  YKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA----------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKL

Query:  SGPGTSYSKEFVAAAD-TYSYYEPTIEQAQYVNNRNFGYKGNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESR
        +      S E+VAA   T    E + EQ QY+NNRN+ Y+GN     +P +YH GLR HEN SY N +NVLQP PGF SQP+EKK S ED + +F+ E+ 
Subjt:  SGPGTSYSKEFVAAAD-TYSYYEPTIEQAQYVNNRNFGYKGNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESR

Query:  SRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLNTMQKGKFPSDIEVNP
        +R  + +++++ +E        AIKN+EVQIGQ+A+T+N  Q+G FPS+ EVNP
Subjt:  SRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLNTMQKGKFPSDIEVNP

A0A803PT47 Uncharacterized protein (e value 1.8e-58, 41.55% identity)
Query:  LQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-
        L+D+VR WL+S+QP S++ W EM   F+ KFFPP+K+AQLR++IG FR  D E  +EAWER K+LLR  PQHGY  W+Q+ +FYNGLNG   T++DAA  
Subjt:  LQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAA-

Query:  ---------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAADTYSYYEPTIEQAQYVNNRNFGYK
                                   ER+  K++AG++E+D ++++ AQ+ AL+N  + L     + + E V AA T    E +IEQAQY+  +   Y 
Subjt:  ---------------------------ERSNAKRVAGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAADTYSYYEPTIEQAQYVNNRNFGYK

Query:  GNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMAS
         N + + +P +YHPGLR HEN SY N +NVLQ P GF +Q  E K   ED+LG F+ ES+ R ++ E ++  +E  +     ++KN+EVQ+ ++A+
Subjt:  GNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKKSSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMAS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCGTGCCTGCAGGATAAGGTAAGGGATTGGTTGAAATCTTTGCAACCAGGCAGTGTTAATTATTGGCAGGAGATGGTTCATGTGTTTCTCACAAAATTTTTCCCACC
TGCCAAGACAGCTCAACTTAGAACAAAGATCGGATCATTCAGACAATATGATTATGAGCAATTGTTTGAGGCCTGGGAGAGATATAAGGAGCTTCTAAGGAAATGCCCAC
AACATGGTTATCTAGAGTGGCTGCAGATTCAGATGTTTTACAATGGACTGAATGGACAAAAAATGACTATATTGGATGCTGCAGCTGAGAGATCGAATGCCAAGAGAGTT
GCTGGAATGTATGAAATCGATGAGGTAAGTTCCCTAAAAGCTCAAGTTCAAGCTCTGACTAATGTTGTCTCTAAACTTTCAGGACCAGGAACTTCTTATTCAAAAGAGTT
TGTGGCAGCAGCAGATACATATTCTTACTATGAGCCAACCATCGAGCAAGCTCAGTATGTCAATAATAGAAATTTTGGCTACAAGGGAAATCAGCAACAAAGCTCACTAC
CAACACACTATCATCCAGGGTTGAGGACTCATGAAAATTTTTCTTATGCTAACAACAGGAATGTTTTGCAACCTCCACCAGGTTTTACATCTCAGCCAGCTGAAAAGAAA
TCCTCCTTTGAGGATCTACTTGGTGCTTTCATCAATGAATCTAGGAGTCGAGCTAGTCAGATTGAAAATCAGGTAGAAGGGATGGAAGTTAGATTGGAAGGAAACACAAC
TGCCATCAAGAACATGGAGGTGCAGATAGGACAAATGGCATCCACATTGAACACTATGCAGAAAGGGAAGTTTCCAAGTGACATTGAAGTTAACCCATGA
mRNA sequence
ATGGCGTGCCTGCAGGATAAGGTAAGGGATTGGTTGAAATCTTTGCAACCAGGCAGTGTTAATTATTGGCAGGAGATGGTTCATGTGTTTCTCACAAAATTTTTCCCACC
TGCCAAGACAGCTCAACTTAGAACAAAGATCGGATCATTCAGACAATATGATTATGAGCAATTGTTTGAGGCCTGGGAGAGATATAAGGAGCTTCTAAGGAAATGCCCAC
AACATGGTTATCTAGAGTGGCTGCAGATTCAGATGTTTTACAATGGACTGAATGGACAAAAAATGACTATATTGGATGCTGCAGCTGAGAGATCGAATGCCAAGAGAGTT
GCTGGAATGTATGAAATCGATGAGGTAAGTTCCCTAAAAGCTCAAGTTCAAGCTCTGACTAATGTTGTCTCTAAACTTTCAGGACCAGGAACTTCTTATTCAAAAGAGTT
TGTGGCAGCAGCAGATACATATTCTTACTATGAGCCAACCATCGAGCAAGCTCAGTATGTCAATAATAGAAATTTTGGCTACAAGGGAAATCAGCAACAAAGCTCACTAC
CAACACACTATCATCCAGGGTTGAGGACTCATGAAAATTTTTCTTATGCTAACAACAGGAATGTTTTGCAACCTCCACCAGGTTTTACATCTCAGCCAGCTGAAAAGAAA
TCCTCCTTTGAGGATCTACTTGGTGCTTTCATCAATGAATCTAGGAGTCGAGCTAGTCAGATTGAAAATCAGGTAGAAGGGATGGAAGTTAGATTGGAAGGAAACACAAC
TGCCATCAAGAACATGGAGGTGCAGATAGGACAAATGGCATCCACATTGAACACTATGCAGAAAGGGAAGTTTCCAAGTGACATTGAAGTTAACCCATGA
Protein sequence
MACLQDKVRDWLKSLQPGSVNYWQEMVHVFLTKFFPPAKTAQLRTKIGSFRQYDYEQLFEAWERYKELLRKCPQHGYLEWLQIQMFYNGLNGQKMTILDAAAERSNAKRV
AGMYEIDEVSSLKAQVQALTNVVSKLSGPGTSYSKEFVAAADTYSYYEPTIEQAQYVNNRNFGYKGNQQQSSLPTHYHPGLRTHENFSYANNRNVLQPPPGFTSQPAEKK
SSFEDLLGAFINESRSRASQIENQVEGMEVRLEGNTTAIKNMEVQIGQMASTLNTMQKGKFPSDIEVNP
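
The protein sequence above appears to be the conceptual translation of the listed CDS. A short Python sketch of that relationship is given here, assuming the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed in this record.
print(translate("ATGGCGTGCCTGCAGGATAAG"))  # prints: MACLQDK
print(translate("GTTAACCCATGA"))           # prints: VNP
```

Both fragments agree with the start (`MACLQDK`) and end (`VNP`) of the protein sequence shown above. Passing the full CDS, with line breaks removed, would be the natural way to check the remaining residues.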