; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g25550 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g25550
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr4:18552164..18557487
RNA-Seq ExpressionMoc04g25550
SyntenyMoc04g25550
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017227786.1 PREDICTED: uncharacterized protein LOC108203384 [Daucus carota subsp. sativus]7.9e-6146.57Show/hide
Query:  NKNAAGTPAKANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQ
        N  +      + ++  + +S  FC   H Y+SCPSNP+SV+Y+GN   N   PYSNTYNQ W  HPNFSWS NQG N  GTSN      K N+PP     
Subjt:  NKNAAGTPAKANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQ

Query:  GQGAGQKPPKGSFASLENLMKQYMEKNNVT-------VQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGT
          G  Q+ P+ +  SLEN++K+Y+ KN  +       VQS AASLRNLE Q GQLA +L++ P+G LPSDT   E+PK   +   K +   N K     T
Subjt:  GQGAGQKPPKGSFASLENLMKQYMEKNNVT-------VQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGT

Query:  SHAKVDE-----------KRKKIEHE--------DAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLR
        + +K D+           + K+IE++           +   P PPYP+R +K++ DVQF+KFLDVL QLH+NIPLVEA EQM  YV+F+KDIL KKR+L 
Subjt:  SHAKVDE-----------KRKKIEHE--------DAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLR

Query:  EYKTVAMTKESSNILISKIPTKIKDLGSFTIPISI
        E++TVA+TKE S+ L  K+PTK+KD GSFTIP +I
Subjt:  EYKTVAMTKESSNILISKIPTKIKDLGSFTIPISI

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]7.9e-6146.41Show/hide
Query:  NKNAAGTPAKANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQ
        N  +      + ++  + +S  FC   H Y+SCPSNP+SV+Y+GN   N   PYSNTYNQ W  HPNFSWS NQG N  GTS       K NYPP     
Subjt:  NKNAAGTPAKANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQ

Query:  GQGAGQKPPKGSFASLENLMKQYMEKNNVT-------VQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGT
          G  Q+ P+ +  SLEN++K+Y+ KN  +       VQS AASLRNLE QVGQLA +L++RP+G LPSDT   E+PK   +   K +   + K      
Subjt:  GQGAGQKPPKGSFASLENLMKQYMEKNNVT-------VQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGT

Query:  SHAKVDEKRKKIEHEDAP------------------TEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLRE
        + AK D+  +   +E+ P                  +  +P PP+P+R +K++Q+VQF+KFLDVL QLH+NIPLVEA EQM  YV+F+KDIL KKR+L E
Subjt:  SHAKVDEKRKKIEHEDAP------------------TEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLRE

Query:  YKTVAMTKESSNILISKIPTKIKDLGSFTIPISI
        ++TVA+TKE S+ L  K+PTK+KD GSFTIP +I
Subjt:  YKTVAMTKESSNILISKIPTKIKDLGSFTIPISI

XP_022157917.1 uncharacterized protein LOC111024527 [Momordica charantia]7.9e-6146.61Show/hide
Query:  SVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQGQGAGQKPPKGSFASLENLMKQYMEKNNVTVQSYAASL
        +VYYLGN  N+ NNPYSNTYN GW +HPNFSWS  Q  ++VGTSNAPA+QQK +Y P  ANQGQ   QK  +GSFASLE LMKQYM  N+V VQS AASL
Subjt:  SVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQGQGAGQKPPKGSFASLENLMKQYMEKNNVTVQSYAASL

Query:  RNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEKRKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLD
        RNLELQVGQLATDLKSRP                                                                                  
Subjt:  RNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEKRKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLD

Query:  VLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLGSFTIPISIRG---------------------------
           QLHVNI LVE  EQM  Y+RFLK+IL KKR L EY+TVAMTK  S ILISKIP K+KD GSFTIP+SI G                           
Subjt:  VLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLGSFTIPISIRG---------------------------

Query:  -------------PLLATTMVLVDVQKGEVTMLVQDQEVKFSVYDAVKYPSKSE
                     P L T  VLVDV KGE+TM VQDQEVKFSV+D++K+P++SE
Subjt:  -------------PLLATTMVLVDVQKGEVTMLVQDQEVKFSVYDAVKYPSKSE

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]2.2e-7950.13Show/hide
Query:  ANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQGQGAGQKPPK
        ANV+ IQGIS SFCEG+HHYN+CP NP+SVYYLGN  NN NN YSNTYN GW +HPNFSWS +QG ++ GTS+APA+Q K +YPP   NQGQ   ++  +
Subjt:  ANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQGQGAGQKPPK

Query:  GSFASLENLMKQYMEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEKRKKIEHEDA
        GS ASLE LMKQYM  N+ TVQS A SLRNL+LQVGQLATDLKS+P                                        +V EKRK+ EHE+A
Subjt:  GSFASLENLMKQYMEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEKRKKIEHEDA

Query:  PTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTV--AMTKESSNILISKI------------PT-
        P E+ P PPYPKRL+KKE++VQF KFLDVL QLHVNIPLVEA EQM  YVRFLK+ILIKKR L EY T+  A+    +NI +  +            PT 
Subjt:  PTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTV--AMTKESSNILISKI------------PT-

Query:  ---------------KIKDL-------------------GSFTIPISIRGPLLATTMVLVDVQKGEVTMLVQD
                       KI+D+                       +PI +  P LAT   LVDV KGE+TM VQD
Subjt:  ---------------KIKDL-------------------GSFTIPISIRGPLLATTMVLVDVQKGEVTMLVQD

XP_022158740.1 uncharacterized protein LOC111025203 [Momordica charantia]4.7e-170100Show/hide
Query:  KNAAGTPAKANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQG
        KNAAGTPAKANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQG
Subjt:  KNAAGTPAKANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQG

Query:  QGAGQKPPKGSFASLENLMKQYMEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEK
        QGAGQKPPKGSFASLENLMKQYMEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEK
Subjt:  QGAGQKPPKGSFASLENLMKQYMEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEK

Query:  RKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLG
        RKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLG
Subjt:  RKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLG

Query:  SFTIPISIRG
        SFTIPISIRG
Subjt:  SFTIPISIRG

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129452.4e-5535.39Show/hide
Query:  CEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQGQGAGQKPPKGSFASLENLMKQY
        C   H Y+ CP N +SV ++GN +   NNPYSNTYN GW +HPNFSWS N G ++      P +QQ            Q   Q P K S   LE L+ QY
Subjt:  CEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQGQGAGQKPPKGSFASLENLMKQY

Query:  MEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDS-QDI---ASKEVNPVNAKASNFGTSHA--------KVDEKRK---KIEHE
        + K +  +QS  ASLRNLE QVGQLA  + +RP G+LPSDT+++ + K+  Q I   + KE+  VN KA      H         +++ ++K   K E++
Subjt:  MEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDS-QDI---ASKEVNPVNAKASNFGTSHA--------KVDEKRK---KIEHE

Query:  DAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLGSFTIPIS
               P PP+P+RL+K++ + QF+KFL+V  +LH+NIP  EA EQM +YV+FLKDIL KKRKL E++TV +T+E S IL +K+P K+KD GSFTIP +
Subjt:  DAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLGSFTIPIS

Query:  I-------------------------------------------------RG-----------------------------------PLLATTMVLVDVQ
        I                                                 RG                                   P LAT   ++DV+
Subjt:  I-------------------------------------------------RG-----------------------------------PLLATTMVLVDVQ

Query:  KGEVTMLVQDQEVKFSVYDAVKYPSKSEECSMLKVVDE
        +G+++  V ++ V+F++++A K+PS +  C  ++++DE
Subjt:  KGEVTMLVQDQEVKFSVYDAVKYPSKSEECSMLKVVDE

A0A6J1DVS9 uncharacterized protein LOC1110245273.8e-6146.61Show/hide
Query:  SVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQGQGAGQKPPKGSFASLENLMKQYMEKNNVTVQSYAASL
        +VYYLGN  N+ NNPYSNTYN GW +HPNFSWS  Q  ++VGTSNAPA+QQK +Y P  ANQGQ   QK  +GSFASLE LMKQYM  N+V VQS AASL
Subjt:  SVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQGQGAGQKPPKGSFASLENLMKQYMEKNNVTVQSYAASL

Query:  RNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEKRKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLD
        RNLELQVGQLATDLKSRP                                                                                  
Subjt:  RNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEKRKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLD

Query:  VLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLGSFTIPISIRG---------------------------
           QLHVNI LVE  EQM  Y+RFLK+IL KKR L EY+TVAMTK  S ILISKIP K+KD GSFTIP+SI G                           
Subjt:  VLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLGSFTIPISIRG---------------------------

Query:  -------------PLLATTMVLVDVQKGEVTMLVQDQEVKFSVYDAVKYPSKSE
                     P L T  VLVDV KGE+TM VQDQEVKFSV+D++K+P++SE
Subjt:  -------------PLLATTMVLVDVQKGEVTMLVQDQEVKFSVYDAVKYPSKSE

A0A6J1DWN2 uncharacterized protein LOC1110252032.3e-170100Show/hide
Query:  KNAAGTPAKANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQG
        KNAAGTPAKANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQG
Subjt:  KNAAGTPAKANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQG

Query:  QGAGQKPPKGSFASLENLMKQYMEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEK
        QGAGQKPPKGSFASLENLMKQYMEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEK
Subjt:  QGAGQKPPKGSFASLENLMKQYMEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEK

Query:  RKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLG
        RKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLG
Subjt:  RKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLG

Query:  SFTIPISIRG
        SFTIPISIRG
Subjt:  SFTIPISIRG

A0A6J1DY39 uncharacterized protein LOC1110256533.0e-5032.11Show/hide
Query:  NKNAAGTPAKANVSHIQGISYS---FCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTS-------NAPAYQQK
        N  AA   A  N S +  I+ S   +C   H   +CPSNP S+YY+G  +    NPYSNTYN GW  HPNFSWS     N  G +         P +   
Subjt:  NKNAAGTPAKANVSHIQGISYS---FCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTS-------NAPAYQQK

Query:  GNYPPRIANQGQGAGQ-KPPKGSFASLENLMKQYMEKNNVT-----------------VQSYA----ASLRNLELQVGQLATDLKSRPYGALPSDTKVSE
          +PP      Q     +P + + +++E LMK+ + KN+ T                 V+ Y      ++R LE+Q+GQL  ++++RP G+LPS T+   
Subjt:  GNYPPRIANQGQGAGQ-KPPKGSFASLENLMKQYMEKNNVT-----------------VQSYA----ASLRNLELQVGQLATDLKSRPYGALPSDTKVSE

Query:  Q--PKDSQDIASKEVNPVNAKASNFGTSHA------------KVDEKRKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASE
        +   +    IA++             +SH+            K+ E    +      +  RP PP+P+RL +K QD  FRKFLD+L QLH+NIP VEA E
Subjt:  Q--PKDSQDIASKEVNPVNAKASNFGTSHA------------KVDEKRKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASE

Query:  QMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLGSFTIPISIRG-------------------------------------------
        QM TY +F+KDI+ +K+KL EY+TVA+T+ SSN+  SK+P K+KD GSFTIP  I G                                           
Subjt:  QMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPTKIKDLGSFTIPISIRG-------------------------------------------

Query:  -----------------------------------------PLLATTMVLVDVQKGEVTMLVQDQEVKFSVYDAVKYPSKSEECSMLKV
                                                 P LAT   L+DV+KGE+TM V DQ+V F++ DA+KY    EEC+++ +
Subjt:  -----------------------------------------PLLATTMVLVDVQKGEVTMLVQDQEVKFSVYDAVKYPSKSEECSMLKV

A0A6J1E1F3 uncharacterized protein LOC1110250651.1e-7950.13Show/hide
Query:  ANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQGQGAGQKPPK
        ANV+ IQGIS SFCEG+HHYN+CP NP+SVYYLGN  NN NN YSNTYN GW +HPNFSWS +QG ++ GTS+APA+Q K +YPP   NQGQ   ++  +
Subjt:  ANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQGRNDVGTSNAPAYQQKGNYPPRIANQGQGAGQKPPK

Query:  GSFASLENLMKQYMEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEKRKKIEHEDA
        GS ASLE LMKQYM  N+ TVQS A SLRNL+LQVGQLATDLKS+P                                        +V EKRK+ EHE+A
Subjt:  GSFASLENLMKQYMEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVNAKASNFGTSHAKVDEKRKKIEHEDA

Query:  PTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTV--AMTKESSNILISKI------------PT-
        P E+ P PPYPKRL+KKE++VQF KFLDVL QLHVNIPLVEA EQM  YVRFLK+ILIKKR L EY T+  A+    +NI +  +            PT 
Subjt:  PTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTV--AMTKESSNILISKI------------PT-

Query:  ---------------KIKDL-------------------GSFTIPISIRGPLLATTMVLVDVQKGEVTMLVQD
                       KI+D+                       +PI +  P LAT   LVDV KGE+TM VQD
Subjt:  ---------------KIKDL-------------------GSFTIPISIRGPLLATTMVLVDVQKGEVTMLVQD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACAAGGGAAAAGACAACAAAGGCTCAGCCATGATAATAGGAGTAGCAGTGGAGATGTTGTCAACTACTTATCAGCTCAAGTGTTGCTAGCTAGCCCTAAGATTAG
TACAACATCTATAGATCTGGTATCTGAGTGGGCAGTAGACAGTGCAGCCTCAGTACACATCGCTTCAGACAGATGGTTGTTTAAATCTTTCACTACAGTGAGCTTTGGTG
TAGTGAGAATGTGGAACAATAGACTCTCCAAGATCAGAGGCACTCCAGTTGTAAATCTGAAGACTGGCAATGAGCTAGTTTTAGGGGATGTCTTATATGTACCCAGTTTT
AGGAGGAATCTAATATCTGCTGAGAAGTTGGATGAAGATGGTTACAAGAGTGAGTTTGCAAGATGGGCTACAGATGGTTCAAGGCAAGACTTAAATGGACCAGCAGGAAT
GACAACCAAGATTGAGGAAGAAAATATATCATCGGTTCAAATACAACAGCTGGGAACGGGTTCGTGGATACTCAGAATGGGGTCCCCTAATCATGCTTCAAACTTGTGTT
TATTGCTTGAGTATCTAGCAACAGGAGTTGTGGATGTCCATGTTTCAGTGTCTCGGAAGGAAACTATGTGTATGAATGATTTTGAAAGCTTGGAATATATTATAGATAAC
GAGATAGAGCACACATTTCATAGGAATCAAAGAGAACAAAGGAGAACACAAGCTACAGCAAAGATGAATCCACCTAACCCACCTCCACGCCCACCTATTCCACCAAATAA
AAATGCAGCGGGCACCCCTGCTAAAGCAAATGTCAGCCACATCCAAGGGATTTCTTATTCTTTTTGCGAGGGAGAGCATCATTACAATAGTTGCCCTAGCAATCCAAAGT
CAGTGTACTATTTGGGAAATACTCATAATAATATAAACAATCCATACTCCAATACGTACAACCAAGGTTGGAGTAGTCATCCCAACTTTAGTTGGAGTAGAAATCAAGGA
CGAAATGATGTTGGAACATCCAATGCTCCAGCATATCAACAAAAAGGAAACTATCCTCCACGAATTGCTAACCAAGGTCAGGGAGCAGGACAAAAGCCACCTAAAGGATC
ATTTGCATCTTTGGAGAATCTGATGAAGCAGTATATGGAAAAGAATAATGTCACGGTACAAAGTTATGCAGCATCGTTGAGGAATCTAGAATTGCAAGTGGGCCAATTAG
CAACAGATTTGAAGAGTAGACCCTATGGAGCACTGCCTAGTGATACCAAGGTTAGTGAGCAACCGAAGGATAGTCAGGACATAGCGAGTAAAGAGGTCAACCCAGTCAAT
GCTAAAGCATCAAATTTTGGAACATCGCATGCAAAAGTGGATGAGAAAAGGAAAAAGATTGAACATGAAGATGCTCCAACTGAGTTTCGACCCACACCACCATACCCAAA
GCGGCTGAAAAAGAAAGAGCAGGATGTGCAATTTAGAAAGTTCCTTGATGTGCTGAATCAGTTGCATGTCAATATACCGCTGGTGGAAGCATCGGAGCAAATGTCGACTT
ATGTGCGGTTCCTCAAGGACATACTCATCAAGAAAAGGAAGTTGCGAGAATATAAAACTGTAGCAATGACCAAGGAGTCCAGCAACATCCTTATAAGCAAAATTCCTACT
AAAATTAAAGACCTTGGAAGCTTCACCATACCTATTTCCATCAGAGGACCATTACTTGCAACTACAATGGTGCTTGTGGATGTTCAAAAAGGCGAGGTGACAATGCTTGT
TCAAGATCAAGAAGTAAAGTTCTCAGTGTACGATGCAGTGAAATACCCATCAAAGTCGGAAGAGTGTTCAATGCTAAAAGTTGTAGATGAAGCTCTAATAGAGGAGTTAG
GAGTAGAAGCAATGCTGGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAACAAGGGAAAAGACAACAAAGGCTCAGCCATGATAATAGGAGTAGCAGTGGAGATGTTGTCAACTACTTATCAGCTCAAGTGTTGCTAGCTAGCCCTAAGATTAG
TACAACATCTATAGATCTGGTATCTGAGTGGGCAGTAGACAGTGCAGCCTCAGTACACATCGCTTCAGACAGATGGTTGTTTAAATCTTTCACTACAGTGAGCTTTGGTG
TAGTGAGAATGTGGAACAATAGACTCTCCAAGATCAGAGGCACTCCAGTTGTAAATCTGAAGACTGGCAATGAGCTAGTTTTAGGGGATGTCTTATATGTACCCAGTTTT
AGGAGGAATCTAATATCTGCTGAGAAGTTGGATGAAGATGGTTACAAGAGTGAGTTTGCAAGATGGGCTACAGATGGTTCAAGGCAAGACTTAAATGGACCAGCAGGAAT
GACAACCAAGATTGAGGAAGAAAATATATCATCGGTTCAAATACAACAGCTGGGAACGGGTTCGTGGATACTCAGAATGGGGTCCCCTAATCATGCTTCAAACTTGTGTT
TATTGCTTGAGTATCTAGCAACAGGAGTTGTGGATGTCCATGTTTCAGTGTCTCGGAAGGAAACTATGTGTATGAATGATTTTGAAAGCTTGGAATATATTATAGATAAC
GAGATAGAGCACACATTTCATAGGAATCAAAGAGAACAAAGGAGAACACAAGCTACAGCAAAGATGAATCCACCTAACCCACCTCCACGCCCACCTATTCCACCAAATAA
AAATGCAGCGGGCACCCCTGCTAAAGCAAATGTCAGCCACATCCAAGGGATTTCTTATTCTTTTTGCGAGGGAGAGCATCATTACAATAGTTGCCCTAGCAATCCAAAGT
CAGTGTACTATTTGGGAAATACTCATAATAATATAAACAATCCATACTCCAATACGTACAACCAAGGTTGGAGTAGTCATCCCAACTTTAGTTGGAGTAGAAATCAAGGA
CGAAATGATGTTGGAACATCCAATGCTCCAGCATATCAACAAAAAGGAAACTATCCTCCACGAATTGCTAACCAAGGTCAGGGAGCAGGACAAAAGCCACCTAAAGGATC
ATTTGCATCTTTGGAGAATCTGATGAAGCAGTATATGGAAAAGAATAATGTCACGGTACAAAGTTATGCAGCATCGTTGAGGAATCTAGAATTGCAAGTGGGCCAATTAG
CAACAGATTTGAAGAGTAGACCCTATGGAGCACTGCCTAGTGATACCAAGGTTAGTGAGCAACCGAAGGATAGTCAGGACATAGCGAGTAAAGAGGTCAACCCAGTCAAT
GCTAAAGCATCAAATTTTGGAACATCGCATGCAAAAGTGGATGAGAAAAGGAAAAAGATTGAACATGAAGATGCTCCAACTGAGTTTCGACCCACACCACCATACCCAAA
GCGGCTGAAAAAGAAAGAGCAGGATGTGCAATTTAGAAAGTTCCTTGATGTGCTGAATCAGTTGCATGTCAATATACCGCTGGTGGAAGCATCGGAGCAAATGTCGACTT
ATGTGCGGTTCCTCAAGGACATACTCATCAAGAAAAGGAAGTTGCGAGAATATAAAACTGTAGCAATGACCAAGGAGTCCAGCAACATCCTTATAAGCAAAATTCCTACT
AAAATTAAAGACCTTGGAAGCTTCACCATACCTATTTCCATCAGAGGACCATTACTTGCAACTACAATGGTGCTTGTGGATGTTCAAAAAGGCGAGGTGACAATGCTTGT
TCAAGATCAAGAAGTAAAGTTCTCAGTGTACGATGCAGTGAAATACCCATCAAAGTCGGAAGAGTGTTCAATGCTAAAAGTTGTAGATGAAGCTCTAATAGAGGAGTTAG
GAGTAGAAGCAATGCTGGAGTAG
Protein sequenceShow/hide protein sequence
MEQGKRQQRLSHDNRSSSGDVVNYLSAQVLLASPKISTTSIDLVSEWAVDSAASVHIASDRWLFKSFTTVSFGVVRMWNNRLSKIRGTPVVNLKTGNELVLGDVLYVPSF
RRNLISAEKLDEDGYKSEFARWATDGSRQDLNGPAGMTTKIEEENISSVQIQQLGTGSWILRMGSPNHASNLCLLLEYLATGVVDVHVSVSRKETMCMNDFESLEYIIDN
EIEHTFHRNQREQRRTQATAKMNPPNPPPRPPIPPNKNAAGTPAKANVSHIQGISYSFCEGEHHYNSCPSNPKSVYYLGNTHNNINNPYSNTYNQGWSSHPNFSWSRNQG
RNDVGTSNAPAYQQKGNYPPRIANQGQGAGQKPPKGSFASLENLMKQYMEKNNVTVQSYAASLRNLELQVGQLATDLKSRPYGALPSDTKVSEQPKDSQDIASKEVNPVN
AKASNFGTSHAKVDEKRKKIEHEDAPTEFRPTPPYPKRLKKKEQDVQFRKFLDVLNQLHVNIPLVEASEQMSTYVRFLKDILIKKRKLREYKTVAMTKESSNILISKIPT
KIKDLGSFTIPISIRGPLLATTMVLVDVQKGEVTMLVQDQEVKFSVYDAVKYPSKSEECSMLKVVDEALIEELGVEAMLE