; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g25670 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g25670
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr2:18322189..18332797
RNA-Seq ExpressionMoc02g25670
SyntenyMoc02g25670
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia]5.3e-6251.01Show/hide
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQP-------
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD  GVLALDIATSMQKEM TMNQ LKE+AL  K+    P QP       
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQP-------

Query:  -VQSDYCTPAPDMGTIGTLTHIRTPTTQVG---------------------------------GTTLISHGEVKEQY--------------------NQR
         +    C+   D        H       VG                                 G T   + + K+ Y                    NQR
Subjt:  -VQSDYCTPAPDMGTIGTLTHIRTPTTQVG---------------------------------GTTLISHGEVKEQY--------------------NQR

Query:  TQTPPVQNNNSNLENMRKEYMARTDAV-------IQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDEPTMP
        T + P  NNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+ K RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+ P MP
Subjt:  TQTPPVQNNNSNLENMRKEYMARTDAV-------IQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDEPTMP

XP_022157836.1 uncharacterized protein LOC111024449 [Momordica charantia]3.4e-6138.44Show/hide
Query:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT----
        IEHF+R  D PTKMMLN  ANG FT KT+NEI+ IL+ L  HN LWCS+RSR  PK  D  GV  LD  +SMQ ++ T+ Q +K M      P  T    
Subjt:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT----

Query:  ---PIQPVQSDYCTPAPDMGTIGTLTHIRTPTTQVGGTTLISHGEVKEQ----YN----------------QRTQTPPVQNNNSNLENMRKEYMA-----
           P+  +    C    D           T    VG  +  + G+V+ Q    YN                Q+  + PVQ   S +E+M KE M      
Subjt:  ---PIQPVQSDYCTPAPDMGTIGTLTHIRTPTTQVGGTTLISHGEVKEQ----YN----------------QRTQTPPVQNNNSNLENMRKEYMA-----

Query:  ---RTDAVIQS---------------QAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDEPTMPTTDVQIPSTEPTVKI
           R D  +Q                  A+MRN E Q+GQ+A+E KNRP+G+ P  TE PK EG+E CK +T RSGLAY+EP MP      P+ E     
Subjt:  ---RTDAVIQS---------------QAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDEPTMPTTDVQIPSTEPTVKI

Query:  PENPTTPEKKYALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCTSEAVGRPLPMKCNDPGSFTIPCSIGGKNLGDFEECSAITRLNPVMFDEFYDLLVT
         E  T P++   +E MP YAKFLKDI++RKKK+GE+E VA+T+C+S      +  K  DPGSFTIPCSIGGK++G    C     +N +    F  L + 
Subjt:  PENPTTPEKKYALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCTSEAVGRPLPMKCNDPGSFTIPCSIGGKNLGDFEECSAITRLNPVMFDEFYDLLVT

Query:  EIEEELDKIAEGPEDVTNPVKKIQ
        +       +      +T P  KI+
Subjt:  EIEEELDKIAEGPEDVTNPVKKIQ

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]5.0e-15360.56Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNNEFNHIQMTDNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARN+EFN+IQM DNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNNEFNHIQMTDNRDVAMREYAAT

Query:  AFQNFDSGI-------------------------------------------------------------------------------------------
        AFQNFDSGI                                                                                           
Subjt:  AFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPDMGTIGTLTHIRTPTTQ---VGGTTLISHGE-----
        CSQRSRAAPKKQDP GVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT AP       +     P       GG++  + G+     
Subjt:  CSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPDMGTIGTLTHIRTPTTQ---VGGTTLISHGE-----

Query:  -------------VKEQYNQRTQTPPVQNNNSNLENMRKEYMARTDAVIQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRS
                      ++QYNQRTQTPP+QNNNSNLENM KEYMARTDAVIQSQAASMRNF TQLG +ANE KNRPQGSFPGHTELP+REGKEQCKAVTLRS
Subjt:  -------------VKEQYNQRTQTPPVQNNNSNLENMRKEYMARTDAVIQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRS

Query:  GLAYDEPTMPTTDVQIPSTEPTVKIPENPTTPEKK
        GL YD PTMPTTDVQIPST+PTVKIPENPTTPEK+
Subjt:  GLAYDEPTMPTTDVQIPSTEPTVKIPENPTTPEKK

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]2.6e-5634.73Show/hide
Query:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEM---------ALGIK
        IEHFFRG D  TKMMLN AANG FT K+FNEIV+IL+ L+ HN  WCS++SR   K+ DP GVLALD  TSMQK++ T+ Q LK M         A    
Subjt:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEM---------ALGIK

Query:  NPLATPIQPVQSDYC------------------------------TPAPDMGTIGTLTHIRTPTTQVGGTTLISHGE-------------------VKEQ
        NP  +P+  +    C                               P  +    G   H     +  G +    H +                      Q
Subjt:  NPLATPIQPVQSDYC------------------------------TPAPDMGTIGTLTHIRTPTTQVGGTTLISHGE-------------------VKEQ

Query:  YN-QRTQTPPVQNNNSNLENMRKEYMARTDAVIQS---------------------QAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKA
        YN Q+    P Q N SN+E + KE + + DA ++                         ++R  E QLGQ+ NE + RPQGS P  TE P+R GKE C +
Subjt:  YN-QRTQTPPVQNNNSNLENMRKEYMARTDAVIQS---------------------QAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKA

Query:  VTLRSGLAYDEPTMPTTDVQIPS------------TEPTVKIPENPTTPEKK-----------------------------------YALEQMPNYAKFL
        +  RSGL Y+ P MP      PS             EP V +P  P     +                                    ALEQMP YAKF+
Subjt:  VTLRSGLAYDEPTMPTTDVQIPS------------TEPTVKIPENPTTPEKK-----------------------------------YALEQMPNYAKFL

Query:  KDIVSRKKKIGEHELVAMTKCTSEAVGRPLPMKCNDPGSFTIPCSIGGKNLG
        KDI++RKKK+GE+E VA+T+C+S      +P K  DPGSFTIPC IGGK++G
Subjt:  KDIVSRKKKIGEHELVAMTKCTSEAVGRPLPMKCNDPGSFTIPCSIGGKNLG

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]4.9e-10067.75Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP---
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDP GVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAP   
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP---

Query:  ----------------------------------------DMGTIGTLTHIRTPTTQVGGTTLISHGE------------------VKEQYNQRTQTPPV
                                                +    G   H        GG+   + G+                   +++YNQRTQTPPV
Subjt:  ----------------------------------------DMGTIGTLTHIRTPTTQVGGTTLISHGE------------------VKEQYNQRTQTPPV

Query:  QNNNSNLENMRKEYMARTDAVIQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDEPTMPTTDVQIPSTEPTVKIPE
        QNNNSNLENM KEYMARTDAVIQSQAASMRNFETQLGQ+ANE KNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYDEPTMPT DVQIPST PTVKIPE
Subjt:  QNNNSNLENMRKEYMARTDAVIQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDEPTMPTTDVQIPSTEPTVKIPE

Query:  NPTTPEK
        NPTTPEK
Subjt:  NPTTPEK

TrEMBL top hitse value%identityAlignment
A0A6J1DAE9 uncharacterized protein LOC1110185142.6e-6251.01Show/hide
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQP-------
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD  GVLALDIATSMQKEM TMNQ LKE+AL  K+    P QP       
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQP-------

Query:  -VQSDYCTPAPDMGTIGTLTHIRTPTTQVG---------------------------------GTTLISHGEVKEQY--------------------NQR
         +    C+   D        H       VG                                 G T   + + K+ Y                    NQR
Subjt:  -VQSDYCTPAPDMGTIGTLTHIRTPTTQVG---------------------------------GTTLISHGEVKEQY--------------------NQR

Query:  TQTPPVQNNNSNLENMRKEYMARTDAV-------IQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDEPTMP
        T + P  NNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+ K RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+ P MP
Subjt:  TQTPPVQNNNSNLENMRKEYMARTDAV-------IQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDEPTMP

A0A6J1DW02 uncharacterized protein LOC1110248972.4e-15360.56Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNNEFNHIQMTDNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARN+EFN+IQM DNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNNEFNHIQMTDNRDVAMREYAAT

Query:  AFQNFDSGI-------------------------------------------------------------------------------------------
        AFQNFDSGI                                                                                           
Subjt:  AFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPDMGTIGTLTHIRTPTTQ---VGGTTLISHGE-----
        CSQRSRAAPKKQDP GVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT AP       +     P       GG++  + G+     
Subjt:  CSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPDMGTIGTLTHIRTPTTQ---VGGTTLISHGE-----

Query:  -------------VKEQYNQRTQTPPVQNNNSNLENMRKEYMARTDAVIQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRS
                      ++QYNQRTQTPP+QNNNSNLENM KEYMARTDAVIQSQAASMRNF TQLG +ANE KNRPQGSFPGHTELP+REGKEQCKAVTLRS
Subjt:  -------------VKEQYNQRTQTPPVQNNNSNLENMRKEYMARTDAVIQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRS

Query:  GLAYDEPTMPTTDVQIPSTEPTVKIPENPTTPEKK
        GL YD PTMPTTDVQIPST+PTVKIPENPTTPEK+
Subjt:  GLAYDEPTMPTTDVQIPSTEPTVKIPENPTTPEKK

A0A6J1DY39 uncharacterized protein LOC1110256531.2e-5634.73Show/hide
Query:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEM---------ALGIK
        IEHFFRG D  TKMMLN AANG FT K+FNEIV+IL+ L+ HN  WCS++SR   K+ DP GVLALD  TSMQK++ T+ Q LK M         A    
Subjt:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEM---------ALGIK

Query:  NPLATPIQPVQSDYC------------------------------TPAPDMGTIGTLTHIRTPTTQVGGTTLISHGE-------------------VKEQ
        NP  +P+  +    C                               P  +    G   H     +  G +    H +                      Q
Subjt:  NPLATPIQPVQSDYC------------------------------TPAPDMGTIGTLTHIRTPTTQVGGTTLISHGE-------------------VKEQ

Query:  YN-QRTQTPPVQNNNSNLENMRKEYMARTDAVIQS---------------------QAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKA
        YN Q+    P Q N SN+E + KE + + DA ++                         ++R  E QLGQ+ NE + RPQGS P  TE P+R GKE C +
Subjt:  YN-QRTQTPPVQNNNSNLENMRKEYMARTDAVIQS---------------------QAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKA

Query:  VTLRSGLAYDEPTMPTTDVQIPS------------TEPTVKIPENPTTPEKK-----------------------------------YALEQMPNYAKFL
        +  RSGL Y+ P MP      PS             EP V +P  P     +                                    ALEQMP YAKF+
Subjt:  VTLRSGLAYDEPTMPTTDVQIPS------------TEPTVKIPENPTTPEKK-----------------------------------YALEQMPNYAKFL

Query:  KDIVSRKKKIGEHELVAMTKCTSEAVGRPLPMKCNDPGSFTIPCSIGGKNLG
        KDI++RKKK+GE+E VA+T+C+S      +P K  DPGSFTIPC IGGK++G
Subjt:  KDIVSRKKKIGEHELVAMTKCTSEAVGRPLPMKCNDPGSFTIPCSIGGKNLG

A0A6J1DYG0 uncharacterized protein LOC1110257642.4e-10067.75Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP---
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDP GVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAP   
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAP---

Query:  ----------------------------------------DMGTIGTLTHIRTPTTQVGGTTLISHGE------------------VKEQYNQRTQTPPV
                                                +    G   H        GG+   + G+                   +++YNQRTQTPPV
Subjt:  ----------------------------------------DMGTIGTLTHIRTPTTQVGGTTLISHGE------------------VKEQYNQRTQTPPV

Query:  QNNNSNLENMRKEYMARTDAVIQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDEPTMPTTDVQIPSTEPTVKIPE
        QNNNSNLENM KEYMARTDAVIQSQAASMRNFETQLGQ+ANE KNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYDEPTMPT DVQIPST PTVKIPE
Subjt:  QNNNSNLENMRKEYMARTDAVIQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDEPTMPTTDVQIPSTEPTVKIPE

Query:  NPTTPEK
        NPTTPEK
Subjt:  NPTTPEK

A0A6J1DZC3 uncharacterized protein LOC1110244491.7e-6138.44Show/hide
Query:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT----
        IEHF+R  D PTKMMLN  ANG FT KT+NEI+ IL+ L  HN LWCS+RSR  PK  D  GV  LD  +SMQ ++ T+ Q +K M      P  T    
Subjt:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT----

Query:  ---PIQPVQSDYCTPAPDMGTIGTLTHIRTPTTQVGGTTLISHGEVKEQ----YN----------------QRTQTPPVQNNNSNLENMRKEYMA-----
           P+  +    C    D           T    VG  +  + G+V+ Q    YN                Q+  + PVQ   S +E+M KE M      
Subjt:  ---PIQPVQSDYCTPAPDMGTIGTLTHIRTPTTQVGGTTLISHGEVKEQ----YN----------------QRTQTPPVQNNNSNLENMRKEYMA-----

Query:  ---RTDAVIQS---------------QAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDEPTMPTTDVQIPSTEPTVKI
           R D  +Q                  A+MRN E Q+GQ+A+E KNRP+G+ P  TE PK EG+E CK +T RSGLAY+EP MP      P+ E     
Subjt:  ---RTDAVIQS---------------QAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDEPTMPTTDVQIPSTEPTVKI

Query:  PENPTTPEKKYALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCTSEAVGRPLPMKCNDPGSFTIPCSIGGKNLGDFEECSAITRLNPVMFDEFYDLLVT
         E  T P++   +E MP YAKFLKDI++RKKK+GE+E VA+T+C+S      +  K  DPGSFTIPCSIGGK++G    C     +N +    F  L + 
Subjt:  PENPTTPEKKYALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCTSEAVGRPLPMKCNDPGSFTIPCSIGGKNLGDFEECSAITRLNPVMFDEFYDLLVT

Query:  EIEEELDKIAEGPEDVTNPVKKIQ
        +       +      +T P  KI+
Subjt:  EIEEELDKIAEGPEDVTNPVKKIQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
AGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCACGCAAGAAATAATGAATTCAACCATATCCAGATGACGGACAACAGAGACGTGGCCATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAATA
GAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCTAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGACATCCTAAATGA
CTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGTTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAG
AGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCGTTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGAC
ATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGCAGTACAATCAGAGAACACA
GACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGAGGAAGGAGTACATGGCTCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCG
AGACCCAATTGGGACAGATCGCAAATGAATTTAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTC
ACCCTTAGGAGTGGACTGGCATATGATGAACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAACTGTAAAGATACCAGAGAATCCAACAACACCAGA
AAAAAAATATGCTCTAGAACAGATGCCAAATTATGCTAAGTTTTTGAAAGATATAGTTTCTAGGAAGAAGAAGATAGGAGAGCATGAACTGGTAGCCATGACAAAATGTA
CTAGTGAAGCTGTAGGCAGGCCGCTACCCATGAAATGTAACGATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGGAAAAACTTAGGAGACTTTGAAGAGTGCTCT
GCTATAACTCGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCGGAAGATGTGACTAA
TCCTGTTAAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACTTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTCGG
CCACGGGGAAGAGCTCGACCTACTCCAAGTATGACCGTCGGGAGAGCATGACCTGCTTTTCAGGGAAGACCTTCAGATCGGCCTTTAGGGAAGGCCTGAGGTCGGCCATT
AGGCAAGGTCTTCAGGTCGGCCATCAGGGGAGCTCTTCAGGTCGGCCCTCAGTAGAGCTCTTTAGGTCGGCAGCCTCTAAGTTGGATCTGCCATGGTGGATGAAATATGA
TCAAACAATCGAAGTTCTTCATTTGGATGAGGATGAGCTATCTGATGGCAGCTCTTCCTCTAGCCGGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
AGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCACGCAAGAAATAATGAATTCAACCATATCCAGATGACGGACAACAGAGACGTGGCCATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAATA
GAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCTAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGACATCCTAAATGA
CTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGTTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAG
AGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCGTTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGAC
ATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGCAGTACAATCAGAGAACACA
GACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGAGGAAGGAGTACATGGCTCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCG
AGACCCAATTGGGACAGATCGCAAATGAATTTAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTC
ACCCTTAGGAGTGGACTGGCATATGATGAACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAACTGTAAAGATACCAGAGAATCCAACAACACCAGA
AAAAAAATATGCTCTAGAACAGATGCCAAATTATGCTAAGTTTTTGAAAGATATAGTTTCTAGGAAGAAGAAGATAGGAGAGCATGAACTGGTAGCCATGACAAAATGTA
CTAGTGAAGCTGTAGGCAGGCCGCTACCCATGAAATGTAACGATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGGAAAAACTTAGGAGACTTTGAAGAGTGCTCT
GCTATAACTCGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCGGAAGATGTGACTAA
TCCTGTTAAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACTTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTCGG
CCACGGGGAAGAGCTCGACCTACTCCAAGTATGACCGTCGGGAGAGCATGACCTGCTTTTCAGGGAAGACCTTCAGATCGGCCTTTAGGGAAGGCCTGAGGTCGGCCATT
AGGCAAGGTCTTCAGGTCGGCCATCAGGGGAGCTCTTCAGGTCGGCCCTCAGTAGAGCTCTTTAGGTCGGCAGCCTCTAAGTTGGATCTGCCATGGTGGATGAAATATGA
TCAAACAATCGAAGTTCTTCATTTGGATGAGGATGAGCTATCTGATGGCAGCTCTTCCTCTAGCCGGGAATGA
Protein sequenceShow/hide protein sequence
MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNNEFNHIQMTDNRDVAMREYAATAFQNFDSGII
EHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPVGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPD
MGTIGTLTHIRTPTTQVGGTTLISHGEVKEQYNQRTQTPPVQNNNSNLENMRKEYMARTDAVIQSQAASMRNFETQLGQIANEFKNRPQGSFPGHTELPKREGKEQCKAV
TLRSGLAYDEPTMPTTDVQIPSTEPTVKIPENPTTPEKKYALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCTSEAVGRPLPMKCNDPGSFTIPCSIGGKNLGDFEECS
AITRLNPVMFDEFYDLLVTEIEEELDKIAEGPEDVTNPVKKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYASATGKSSTYSKYDRRESMTCFSGKTFRSAFREGLRSAI
RQGLQVGHQGSSSGRPSVELFRSAASKLDLPWWMKYDQTIEVLHLDEDELSDGSSSSSRE