; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g26640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g26640
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr8:19277654..19280432
RNA-Seq ExpressionMoc08g26640
SyntenyMoc08g26640
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]3.8e-7235.5Show/hide
Query:  IRKDKDAILVPFDPEIEITCKRNRKEKKKTTAEMD----PPPPVPIVKP--------------------LKPGLIQMVRENTFRGNATEDPNNHLAMFLD
        +R+ +   ++P DPEIE T +  R+ K    AE D    P      V+P                    LKP LI MV++  F G+  +DPN HLAMFL+
Subjt:  IRKDKDAILVPFDPEIEITCKRNRKEKKKTTAEMD----PPPPVPIVKP--------------------LKPGLIQMVRENTFRGNATEDPNNHLAMFLD

Query:  VCGTLKMNGVTDDAIRLRLFPFSLQDK-------------------------------------------------------------------------
        +C T+K+NGVT+D IRLRLFPFSL+DK                                                                         
Subjt:  VCGTLKMNGVTDDAIRLRLFPFSLQDK-------------------------------------------------------------------------

Query:  --IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEID-----------------------------------
          +QMFYNGLNGQTRTI+DA +GGTL+SKT E A  LLE+MA+N++QWP+ER+ AK+V G+++++                                   
Subjt:  --IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEID-----------------------------------

Query:  ------ELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQP--PPGC-TLLAEKKSSLEDLLGAFINESRSRASRIENQVEGME
              E + +Q +Y+NNRN+ Y+GN     +P +YHP LRNHEN SY + +NVLQP  PPG  +  +E+K SLED + +F+ E+ +R  + +++++ +E
Subjt:  ------ELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQP--PPGC-TLLAEKKSSLEDLLGAFINESRSRASRIENQVEGME

Query:  VKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQES--EKEKMEEPVITTEEWENKEEVVKEVTPALQADKPTSSITF
                AIKN+EVQI Q+A+T+   Q G FPS+TEVNP+E CKA+TLRS KE++ S  ++ K     +   + +NK E  + V   L+      +I+F
Subjt:  VKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQES--EKEKMEEPVITTEEWENKEEVVKEVTPALQADKPTSSITF

Query:  SPFN------SLPYPQRFQKKKIN
         P N       LPYPQRFQK+K++
Subjt:  SPFN------SLPYPQRFQKKKIN

XP_022157708.1 uncharacterized protein LOC111024361 [Momordica charantia]1.7e-11258.78Show/hide
Query:  PPPPVPIVKP---------------------------------------LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLF
        PPPPVPIV+P                                       LKPGLIQMVRENTFRGNATEDPNNHL +FLDVCGT+KMNGV DDAIRLRLF
Subjt:  PPPPVPIVKP---------------------------------------LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLF

Query:  PFSLQDK----------------------------------------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYIL
        P SLQDK                                                          IQMFYNGLNGQTRTILDA AGGTLLS+TPENAYIL
Subjt:  PFSLQDK----------------------------------------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYIL

Query:  LEDMAANSFQWPSERSNAKRVVGMYEIDELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGCTLLAEKKSSLEDLLGAF
        L+DMA NSFQWPSERSNAK+V GMYEIDEL+  +A+     N   K      S P   H         +Y++    ++     +  AEKKSSLEDLLGAF
Subjt:  LEDMAANSFQWPSERSNAKRVVGMYEIDELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGCTLLAEKKSSLEDLLGAF

Query:  INESRSRASRIENQVEGMEVKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQESEKEKMEEPVITTEEWENKEEVVK
        INE RSRASRIENQVEGMEVKLEGN T+IKNMEVQI QIA TL  MQ+GKFPSD EV PREHCKAVTLRS KELQE EK+KMEEPVITTEE ENKEEVVK
Subjt:  INESRSRASRIENQVEGMEVKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQESEKEKMEEPVITTEEWENKEEVVK

Query:  EVTPALQADKPTSSITFSPFNSLPYPQ
        E TPALQADKPTSSI  SP NSLPYPQ
Subjt:  EVTPALQADKPTSSITFSPFNSLPYPQ

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]1.2e-7340.09Show/hide
Query:  LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDK------------------------------------------
        LKPGLI MV++N F G A EDPN HL  FL++C T+KMNGVT+DAIRLRLF FSL+DK                                          
Subjt:  LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDK------------------------------------------

Query:  ---------------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEIDELTI
                                         I++FYNGLNGQTRT++DA AGG L++KT E AY LL+D+A NS+QWPSERS  K+V G++E+D +T 
Subjt:  ---------------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEIDELTI

Query:  -----------------------------------------DQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGC-TLLAEKK
                                                 +Q +YI++RN+  +G  Q +    HYHP LRNHEN SY +NRN LQPPPG  T  ++ K
Subjt:  -----------------------------------------DQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGC-TLLAEKK

Query:  SSLEDLLGAFINESRSRASRIENQVEGMEVKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQESEKEKMEEP---VI
          LED+LG FI+E+RSR ++ E +++ +E  +      +KN+EVQI Q+A+ +K  Q+GKFPSDTEVNPREHC A+TLRS K ++ES+ +K+  P   VI
Subjt:  SSLEDLLGAFINESRSRASRIENQVEGMEVKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQESEKEKMEEP---VI

Query:  TTEEWENKEEVVKEVTPALQADKPTSSITFSPFN------SLPYPQRFQKKKIN
         T+E +++ +  K      +  KP  SI+F P N       LP+PQRF KKK +
Subjt:  TTEEWENKEEVVKEVTPALQADKPTSSITFSPFN------SLPYPQRFQKKKIN

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]3.5e-7338.19Show/hide
Query:  LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDK------------------------------------------
        LKP LI MV++  F G+  +DPN HLAMFL++C T+KMNGVT+D IRLRLFPFSL+DK                                          
Subjt:  LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDK------------------------------------------

Query:  ---------------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEID----
                                         +QMFYNGLNGQTRTI+DA +GGTL+SKT E A  LLE+MA+N++QWP+ER+ AK+V G++E++    
Subjt:  ---------------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEID----

Query:  -------------------------------------ELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGC-TLLAEKK
                                             E + +Q +YINNRN+ Y+GN     +P +YHP LRNHENFSY + +NVLQPPPG  +  +EKK
Subjt:  -------------------------------------ELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGC-TLLAEKK

Query:  SSLEDLLGAFINESRSRASRIENQVEGMEVKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQESEKEKMEEPVITTE
         SLED + +F+ E+++   + ++Q++ +E         +KN+EVQI Q+A+T+   Q G FPS+TEVNP+E CKA+TLRS +E++ S  ++ E       
Subjt:  SSLEDLLGAFINESRSRASRIENQVEGMEVKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQESEKEKMEEPVITTE

Query:  EWENKEEVVKE--VTPALQADKPTSSITFSPFN------SLPYPQRFQKKKIN
          ++K +V +E  V   L+      SI+F P N       LPYPQRFQK+K++
Subjt:  EWENKEEVVKE--VTPALQADKPTSSITFSPFN------SLPYPQRFQKKKIN

XP_024032903.1 uncharacterized protein LOC112095347 [Morus notabilis]3.8e-7240.14Show/hide
Query:  LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDK------------------------------------------
        LKP LI MV+ N F G   EDPN HL+MFL+   T+K+NGVT+  IRL+LFPFSL+DK                                          
Subjt:  LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDK------------------------------------------

Query:  ---------------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEI-----
                                         I  FYNGLNGQ+RTI+D+ AGG+L+ K+   AY LLE+M+ NS+QWPSERS +K+  G++EI     
Subjt:  ---------------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEI-----

Query:  -------------------DELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGCT-LLAEKKSSLEDLLGAFINESRSR
                           DE T +Q +++NNRNF Y+ NQ    LP HYHP LRNHENFSYA+NRNVLQPP G    + EKK S+EDLL  FI E+R R
Subjt:  -------------------DELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGCT-LLAEKKSSLEDLLGAFINESRSR

Query:  ASRIENQVEGMEVKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQ--ESEKEKMEEPVITTEEWENKEEVVKEVTPA
         ++ E +++ +E         +K++EVQI Q+A+ +K    GKFPSDTE NP++HCKA+TLRS KE++  + +++K EE  + TE+       +KE    
Subjt:  ASRIENQVEGMEVKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQ--ESEKEKMEEPVITTEEWENKEEVVKEVTPA

Query:  LQADKPTSSITFSPFNSLPYPQRFQKKKINA
             P +    +P  +LPYPQRF+KKK++A
Subjt:  LQADKPTSSITFSPFNSLPYPQRFQKKKINA

TrEMBL top hitse value%identityAlignment
A0A2I4E1Q5 uncharacterized protein LOC1089854722.0e-5046.99Show/hide
Query:  LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDK------------------IQMFYNGLNGQTRTILDAVAGGTL
        LKP LI MV++  F G+  +DPN HLAMFL++C T+K+NGVT D IRLRLFPFSL+DK                  +QMFYNGLNGQTRTI+DAV+GGTL
Subjt:  LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDK------------------IQMFYNGLNGQTRTILDAVAGGTL

Query:  LSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEIDELTI-----------------------------------------DQAKYINNRNFGYKGN
        +SKT E A  LLE+M +N++QWP ERS AK+V G++E++ L                                           +Q +YINNRN+ Y+GN
Subjt:  LSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEIDELTI-----------------------------------------DQAKYINNRNFGYKGN

Query:  QQQSSLPTHYHPRLRNHENFSYAHNRNVL--QPPPGC-TLLAEKKSSLE
             +P +YH  LRNHEN SY++ +NVL  QPPPG  + L+EKK SLE
Subjt:  QQQSSLPTHYHPRLRNHENFSYAHNRNVL--QPPPGC-TLLAEKKSSLE

A0A2I4F4C8 uncharacterized protein LOC1089953734.2e-4833.72Show/hide
Query:  LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDK------------------------------------------
        LKP LI MV++  F  +  +DPN HLAMFL +C T+K+NGVT D IRLRLFPFSL+DK                                          
Subjt:  LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDK------------------------------------------

Query:  ---------------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPEN-AYILLEDMAANSFQWPSERSNAKRVVGMY------
                                         +QMFYNGLNG+TRTI+DA AGGTL+SKT E  A  LLE+M +N++QWP+E++ AK+V G++      
Subjt:  ---------------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPEN-AYILLEDMAANSFQWPSERSNAKRVVGMY------

Query:  -------------EIDELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGCTLLAEKKSSLEDLLGAFINESRSRASRIE
                       +E + +Q +YINNRN+ Y+G   Q S                                  +K  SLED + +F+ E+ +R  + +
Subjt:  -------------EIDELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGCTLLAEKKSSLEDLLGAFINESRSRASRIE

Query:  NQVEGMEVKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQES-EKEKMEEPVITTE-EWENKEEVVKEVTPALQADK
        + ++ ++        AIKN+EVQI ++A+ +   Q G FPS+TE NP+E CKA+TL+S +EL+ S  KE    P +    + +NK E  + V  AL+   
Subjt:  NQVEGMEVKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQES-EKEKMEEPVITTE-EWENKEEVVKEVTPALQADK

Query:  PTSSITF---SPFNS--LPYPQRFQKK
           +I+F    P  S  LPY QRFQK+
Subjt:  PTSSITF---SPFNS--LPYPQRFQKK

A0A2I4G4Q3 uncharacterized protein LOC1090047122.9e-5736.27Show/hide
Query:  MVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDK-------------------------------------------------
        MV++  F G+  +DPN HL MFL++C T+K+NGVT+D IRLRLFPFSL+D+                                                 
Subjt:  MVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDK-------------------------------------------------

Query:  --------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEID-----------
                                  +QMFYNGLNG TRTI+D  +GGTL+ KT E A  LLE+MA+N++QWP ER+ AK+V  ++E++           
Subjt:  --------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEID-----------

Query:  ------------------------------ELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVL--QPPPGC-TLLAEKKSSLED
                                      E++ +Q +YINNRN+ Y GN     +P +YHP  +NHEN SY + +NVL  QPPPG  +  +EKK SLED
Subjt:  ------------------------------ELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVL--QPPPGC-TLLAEKKSSLED

Query:  LLGAFINESRSRASRIENQVEGMEVKLEGNATAI-KNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQES-EKEKMEEPVITTE
         + +FI E+ +R  + +++++ +E        AI KN+EVQI Q+A+T+   Q G FPS+TEVNPRE CKA+ LRS +EL+   E E+M++  I+ +
Subjt:  LLGAFINESRSRASRIENQVEGMEVKLEGNATAI-KNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQES-EKEKMEEPVITTE

A0A6J1DU19 uncharacterized protein LOC1110243618.3e-11358.78Show/hide
Query:  PPPPVPIVKP---------------------------------------LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLF
        PPPPVPIV+P                                       LKPGLIQMVRENTFRGNATEDPNNHL +FLDVCGT+KMNGV DDAIRLRLF
Subjt:  PPPPVPIVKP---------------------------------------LKPGLIQMVRENTFRGNATEDPNNHLAMFLDVCGTLKMNGVTDDAIRLRLF

Query:  PFSLQDK----------------------------------------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYIL
        P SLQDK                                                          IQMFYNGLNGQTRTILDA AGGTLLS+TPENAYIL
Subjt:  PFSLQDK----------------------------------------------------------IQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYIL

Query:  LEDMAANSFQWPSERSNAKRVVGMYEIDELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGCTLLAEKKSSLEDLLGAF
        L+DMA NSFQWPSERSNAK+V GMYEIDEL+  +A+     N   K      S P   H         +Y++    ++     +  AEKKSSLEDLLGAF
Subjt:  LEDMAANSFQWPSERSNAKRVVGMYEIDELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGCTLLAEKKSSLEDLLGAF

Query:  INESRSRASRIENQVEGMEVKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQESEKEKMEEPVITTEEWENKEEVVK
        INE RSRASRIENQVEGMEVKLEGN T+IKNMEVQI QIA TL  MQ+GKFPSD EV PREHCKAVTLRS KELQE EK+KMEEPVITTEE ENKEEVVK
Subjt:  INESRSRASRIENQVEGMEVKLEGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQESEKEKMEEPVITTEEWENKEEVVK

Query:  EVTPALQADKPTSSITFSPFNSLPYPQ
        E TPALQADKPTSSI  SP NSLPYPQ
Subjt:  EVTPALQADKPTSSITFSPFNSLPYPQ

A0A6P9DWY0 uncharacterized protein LOC1183440269.9e-5042.8Show/hide
Query:  KIQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEID------------------------------------
        + QMFYNGLNGQT+TI+DA +GGTL+SKT E A  LLE+MA+N++QWP ER+  K+V G++E++                                    
Subjt:  KIQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEID------------------------------------

Query:  -----ELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGC-TLLAEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKL
             E + +Q +YINNRN+ Y+GN     +P +YH  LRNHEN SY + +NVLQP PG  +  +EKK SLED + +F+ E+ +R  + +++++ +E   
Subjt:  -----ELTIDQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGC-TLLAEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKL

Query:  EGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQESEKEKMEEPV
             AIKN+EVQI Q+A+T+   Q G FPS+TEVNPRE CKA+TLRS +EL  S   +   P+
Subjt:  EGNATAIKNMEVQIRQIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQESEKEKMEEPV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGCGCCGCAACACTGTATTGTAGTGCCATGGCGCTACGAACTTGCACGTCCGTCACAAATCTGGGACAGCGCCATGACACTCACTAATGGTGCCATGGCG
CTGGAGTTGATTTGTATACGCAAGGACAAAGACGCAATTTTAGTCCCTTTTGATCCTGAAATTGAAATAACCTGTAAAAGAAATCGAAAGGAGAAAAAGAAGACG
ACTGCAGAGATGGATCCACCACCACCTGTACCGATTGTTAAACCTTTGAAACCTGGCCTCATCCAGATGGTTCGAGAAAATACATTTAGGGGCAATGCCACAGAG
GATCCAAACAATCATTTGGCAATGTTTCTAGATGTTTGTGGTACTTTGAAGATGAATGGAGTAACTGATGATGCGATTCGCTTACGCCTTTTTCCTTTTTCTTTG
CAGGATAAGATTCAGATGTTTTACAATGGACTGAATGGACAAACAAGGACTATACTAGATGCTGTAGCTGGAGGCACTTTATTATCCAAAACACCTGAGAATGCT
TACATCTTATTGGAGGACATGGCAGCCAATAGTTTCCAATGGCCTAGTGAGAGATCGAATGCCAAAAGAGTTGTTGGAATGTATGAAATCGATGAGCTAACCATC
GATCAAGCTAAGTATATCAATAATAGAAATTTTGGCTACAAGGGAAATCAGCAACAGAGCTCGCTGCCAACACACTATCATCCAAGGTTGAGGAATCATGAAAAT
TTTTCTTATGCTCACAATAGAAATGTTTTGCAACCTCCACCAGGTTGTACATTGCTAGCTGAAAAGAAATCATCCCTTGAGGATCTACTTGGGGCTTTCATTAAT
GAGTCCAGAAGTCGAGCTAGTCGGATTGAAAATCAGGTAGAAGGCATGGAAGTTAAATTGGAAGGAAACGCAACTGCCATCAAGAACATGGAGGTTCAGATAAGG
CAAATAGCATCTACCTTGAAAATTATGCAGGAAGGGAAGTTTCCAAGTGACACTGAAGTTAACCCACGAGAACATTGCAAAGCCGTAACTTTGAGAAGCAGAAAG
GAACTACAGGAGTCTGAAAAGGAAAAAATGGAAGAACCAGTCATCACAACTGAGGAATGGGAAAATAAGGAGGAAGTTGTAAAGGAGGTCACTCCTGCTCTACAG
GCTGACAAGCCTACTAGTTCTATTACTTTTAGTCCTTTTAACTCTTTACCTTATCCTCAGCGTTTCCAAAAGAAAAAGATTAATGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACAGCGCCGCAACACTGTATTGTAGTGCCATGGCGCTACGAACTTGCACGTCCGTCACAAATCTGGGACAGCGCCATGACACTCACTAATGGTGCCATGGCG
CTGGAGTTGATTTGTATACGCAAGGACAAAGACGCAATTTTAGTCCCTTTTGATCCTGAAATTGAAATAACCTGTAAAAGAAATCGAAAGGAGAAAAAGAAGACG
ACTGCAGAGATGGATCCACCACCACCTGTACCGATTGTTAAACCTTTGAAACCTGGCCTCATCCAGATGGTTCGAGAAAATACATTTAGGGGCAATGCCACAGAG
GATCCAAACAATCATTTGGCAATGTTTCTAGATGTTTGTGGTACTTTGAAGATGAATGGAGTAACTGATGATGCGATTCGCTTACGCCTTTTTCCTTTTTCTTTG
CAGGATAAGATTCAGATGTTTTACAATGGACTGAATGGACAAACAAGGACTATACTAGATGCTGTAGCTGGAGGCACTTTATTATCCAAAACACCTGAGAATGCT
TACATCTTATTGGAGGACATGGCAGCCAATAGTTTCCAATGGCCTAGTGAGAGATCGAATGCCAAAAGAGTTGTTGGAATGTATGAAATCGATGAGCTAACCATC
GATCAAGCTAAGTATATCAATAATAGAAATTTTGGCTACAAGGGAAATCAGCAACAGAGCTCGCTGCCAACACACTATCATCCAAGGTTGAGGAATCATGAAAAT
TTTTCTTATGCTCACAATAGAAATGTTTTGCAACCTCCACCAGGTTGTACATTGCTAGCTGAAAAGAAATCATCCCTTGAGGATCTACTTGGGGCTTTCATTAAT
GAGTCCAGAAGTCGAGCTAGTCGGATTGAAAATCAGGTAGAAGGCATGGAAGTTAAATTGGAAGGAAACGCAACTGCCATCAAGAACATGGAGGTTCAGATAAGG
CAAATAGCATCTACCTTGAAAATTATGCAGGAAGGGAAGTTTCCAAGTGACACTGAAGTTAACCCACGAGAACATTGCAAAGCCGTAACTTTGAGAAGCAGAAAG
GAACTACAGGAGTCTGAAAAGGAAAAAATGGAAGAACCAGTCATCACAACTGAGGAATGGGAAAATAAGGAGGAAGTTGTAAAGGAGGTCACTCCTGCTCTACAG
GCTGACAAGCCTACTAGTTCTATTACTTTTAGTCCTTTTAACTCTTTACCTTATCCTCAGCGTTTCCAAAAGAAAAAGATTAATGCTTAA
Protein sequenceShow/hide protein sequence
MTAPQHCIVVPWRYELARPSQIWDSAMTLTNGAMALELICIRKDKDAILVPFDPEIEITCKRNRKEKKKTTAEMDPPPPVPIVKPLKPGLIQMVRENTFRGNATE
DPNNHLAMFLDVCGTLKMNGVTDDAIRLRLFPFSLQDKIQMFYNGLNGQTRTILDAVAGGTLLSKTPENAYILLEDMAANSFQWPSERSNAKRVVGMYEIDELTI
DQAKYINNRNFGYKGNQQQSSLPTHYHPRLRNHENFSYAHNRNVLQPPPGCTLLAEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLEGNATAIKNMEVQIR
QIASTLKIMQEGKFPSDTEVNPREHCKAVTLRSRKELQESEKEKMEEPVITTEEWENKEEVVKEVTPALQADKPTSSITFSPFNSLPYPQRFQKKKINA