CuGenDBv2

Moc04g05920 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g05920
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Genome location: chr4:4045963..4046811
RNA-Seq Expression: Moc04g05920
Synteny: Moc04g05920
Gene Ontology terms: NA
InterPro domains: NA
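
The genome location is given in `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not something this page states), the span can be parsed and measured as below. `parse_location` and `span_length` are hypothetical helper names introduced for illustration.

```python
import re

def parse_location(location):
    """Split a string such as 'chr4:4045963..4046811' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

chrom, start, end = parse_location("chr4:4045963..4046811")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans 849 bp.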


Homology
GenBank top hits (e value, % identity, alignment)
KAG7011696.1 hypothetical protein SDJN02_26602, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 6.3e-58 | % identity: 61.02
Query:  RQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE
        R+RRSR       I+PPYPWS E +A +H+L YL+ N I+TI GDVRC +CE+ Y IEY+LM KF+EIA FIE+ +  +HDRAP  W NP   +C+ C E
Subjt:  RQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE

Query:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        ENCV P IP+ ++ N    INWLFLLLGQ+IGRLKL+ LKYFCA+T NHRTGAK+RL++LTYL LCKQLQPS  LF+
Subjt:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

XP_022135937.1 uncharacterized protein LOC111007768 [Momordica charantia] | e value: 9.1e-158 | % identity: 99.65
Query:  MKFPPFNSHNLDVELSLRPPSAVDYSAAELIMQQEEVSLPPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRA
        MKFPPFNSHNLDV+LSLRPPSAVDYSAAELIMQQEEVSLPPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRA
Subjt:  MKFPPFNSHNLDVELSLRPPSAVDYSAAELIMQQEEVSLPPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRA

Query:  REPASPVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNF
        REPASPVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNF
Subjt:  REPASPVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNF

Query:  LDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFHR
        LDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFHR
Subjt:  LDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFHR

XP_022135938.1 probable serine/threonine-protein kinase samkC [Momordica charantia] | e value: 8.7e-76 | % identity: 57.59
Query:  KFPP--FNSHNLDVELSLRPP-SAVDYSAAELIMQQEEVSL-PPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRN-SQSLRVRRPRARARVRVRARASPS
        K PP  F  H+L  E S   P S  D        Q + + L P PL+PQP+              I H STSS N SQSL+  R R  ++       S S
Subjt:  KFPP--FNSHNLDVELSLRPP-SAVDYSAAELIMQQEEVSL-PPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRN-SQSLRVRRPRARARVRVRARASPS

Query:  PRRAREPASPVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWT
         +R ++P       RR R++ K+T I+PPYPWST ++AVVHDL YL++NQILTITGDV+C +C+KQY IEYDL+TKF+EIASFIEKNK TLHDRAP SWT
Subjt:  PRRAREPASPVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWT

Query:  NPNFLDCKLCGEENCVRPTIP----EGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFHR
        NPN  +CK CG+E+C+RP IP    + D KNINWLFLLLGQMIG L L+HLKYFC YTNNHRT AK+RLVYLTYL+LCKQLQPS ELFHR
Subjt:  NPNFLDCKLCGEENCVRPTIP----EGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFHR

XP_022972400.1 uncharacterized protein KIAA0754-like [Cucurbita maxima] | e value: 3.1e-57 | % identity: 60.45
Query:  RQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE
        R+RRSR       I+PPYPWS E +A +H+L YL+ N I+ I GDVRC +CE+ Y IEY+LM KF+EIA FIE+ +  +HDRAP  W NP   +C+ C E
Subjt:  RQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE

Query:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        ENCV P IP+ ++ N    INWLFLLLGQ+IGRLKL+ LKYFCA+T NHRTGAK+RL++LTYL LCKQLQPS  LF+
Subjt:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

XP_038895979.1 junction-mediating and -regulatory protein-like [Benincasa hispida] | e value: 8.2e-58 | % identity: 47.96
Query:  DVELSLRPPSAVDYSAAELIMQQEEVSLPPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRAREPASPVVRQR
        ++ELSLR P     S   L         PPPLE  P    L  PL +TT  +    T + + +             +     S   ++  E   P  R R
Subjt:  DVELSLRPPSAVDYSAAELIMQQEEVSLPPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRAREPASPVVRQR

Query:  RSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEENC
        R R     T I+PPYPWST+ +AV+H+L YL+ N I+TI G+V+C +CE++Y +EYDLM KF EIA FIE  K ++HDRAP+ WT P   +C LC +E C
Subjt:  RSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEENC

Query:  VRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELF
        V P I E D   INWLFLLLG+ +G LKL+ LKYFCA TN HRTGAKNRL+YL YLTLC QLQPS ELF
Subjt:  VRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELF

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BHR1 uncharacterized protein LOC103489770 | e value: 9.1e-55 | % identity: 53.88
Query:  TNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRAREPAS--PVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCD
        TNQ +    S R        RP   A    RA A  + R  R   +    +R+  SR       I+PPYPWST  +A+V  LN LR +QIL ITGDVRC 
Subjt:  TNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRAREPAS--PVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCD

Query:  RCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGA
        +C+ +YTIEYD+++KFEEIASF+E+NK    DRAP SW NPN+  C+ CG EN  RP IP+ + + INWLFLLLG+M+G L L HLKYFC+YTNNHRTGA
Subjt:  RCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGA

Query:  KNRLVYLTYLTLCKQLQPS
        KNRL+YLTY+TLC Q+ PS
Subjt:  KNRLVYLTYLTLCKQLQPS

A0A6J1C462 uncharacterized protein LOC111007768 | e value: 3.4e-158 | % identity: 100
Query:  MKFPPFNSHNLDVELSLRPPSAVDYSAAELIMQQEEVSLPPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRA
        MKFPPFNSHNLDVELSLRPPSAVDYSAAELIMQQEEVSLPPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRA
Subjt:  MKFPPFNSHNLDVELSLRPPSAVDYSAAELIMQQEEVSLPPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRA

Query:  REPASPVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNF
        REPASPVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNF
Subjt:  REPASPVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNF

Query:  LDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFHR
        LDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFHR
Subjt:  LDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFHR

A0A6J1C690 probable serine/threonine-protein kinase samkC | e value: 4.2e-76 | % identity: 57.59
Query:  KFPP--FNSHNLDVELSLRPP-SAVDYSAAELIMQQEEVSL-PPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRN-SQSLRVRRPRARARVRVRARASPS
        K PP  F  H+L  E S   P S  D        Q + + L P PL+PQP+              I H STSS N SQSL+  R R  ++       S S
Subjt:  KFPP--FNSHNLDVELSLRPP-SAVDYSAAELIMQQEEVSL-PPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRN-SQSLRVRRPRARARVRVRARASPS

Query:  PRRAREPASPVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWT
         +R ++P       RR R++ K+T I+PPYPWST ++AVVHDL YL++NQILTITGDV+C +C+KQY IEYDL+TKF+EIASFIEKNK TLHDRAP SWT
Subjt:  PRRAREPASPVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWT

Query:  NPNFLDCKLCGEENCVRPTIP----EGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFHR
        NPN  +CK CG+E+C+RP IP    + D KNINWLFLLLGQMIG L L+HLKYFC YTNNHRT AK+RLVYLTYL+LCKQLQPS ELFHR
Subjt:  NPNFLDCKLCGEENCVRPTIP----EGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFHR

A0A6J1GM83 mucin-16-like | e value: 5.4e-55 | % identity: 47.83
Query:  EEVSLPPPLEPQP--ETLDLSTPLSTTTNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRAREPASPVVRQRRSRVELKNTPIKPPYPWSTEH
        E+ S  PP   Q   ETL+    +  T  +  + ST+              +A  +   +++  P+     ++   R+RRSR       I+PPYPWS E 
Subjt:  EEVSLPPPLEPQP--ETLDLSTPLSTTTNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRAREPASPVVRQRRSRVELKNTPIKPPYPWSTEH

Query:  QAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEENCVRPTIPEGDNKN----INWLF
        +A +H+L YL+ N I+TI GDVRC +CE+ Y IEY+LM KF+EIA FIE+ +  +HDRAP  W NP   +C+ C EENCV P IP+ ++ N    INWLF
Subjt:  QAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEENCVRPTIPEGDNKN----INWLF

Query:  LLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        LLLGQ+IGRLKL+ LKYFCA+T NHRTGAK+RL++LTYL LCKQLQPS  LF+
Subjt:  LLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

A0A6J1I8I0 uncharacterized protein KIAA0754-like | e value: 1.5e-57 | % identity: 60.45
Query:  RQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE
        R+RRSR       I+PPYPWS E +A +H+L YL+ N I+ I GDVRC +CE+ Y IEY+LM KF+EIA FIE+ +  +HDRAP  W NP   +C+ C E
Subjt:  RQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGE

Query:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH
        ENCV P IP+ ++ N    INWLFLLLGQ+IGRLKL+ LKYFCA+T NHRTGAK+RL++LTYL LCKQLQPS  LF+
Subjt:  ENCVRPTIPEGDNKN----INWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFH

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G49330.1 hydroxyproline-rich glycoprotein family protein | e value: 2.9e-45 | % identity: 45.99
Query:  SPSPRRAREPASPVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPE
        +P P +     S  + + RS V  K+  I PP+PW+T  +  +  L YL  NQI TITG+V+C  CEK Y + Y+L  +F E+  F    K  + DRA +
Subjt:  SPSPRRAREPASPVVRQRRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPE

Query:  SWTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELF
         W  P    C+LCG E  V+P I E  ++ INWLFLLLGQ +G   LE LK FC ++ NHRTGAK+R++YLTY+ LCK LQP  +LF
Subjt:  SWTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELF

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1) | e value: 1.2e-35 | % identity: 34.44
Query:  SHNLDVELSLR----PPSAVDYSAAELIMQQEE--------VSLPP----PLEPQPETLDLSTPLSTTTNQIMHSSTSSRNS---QSLRVRRP---RARA
        S + D++LSLR    P   V+     +  +QEE         S PP    P  PQP  +   T  +  TN ++  + +   +    ++ VR P   +   
Subjt:  SHNLDVELSLR----PPSAVDYSAAELIMQQEE--------VSLPP----PLEPQPETLDLSTPLSTTTNQIMHSSTSSRNS---QSLRVRRP---RARA

Query:  RV-------RVRARASPSPRRAREPASPVVRQRRSRV-----ELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKF
         V       +V   A  +PRR R P     R  +  V      + +  I PPYPW+T+    +     L  N I  I+G V C  C++  T+EY+L  KF
Subjt:  RV-------RVRARASPSPRRAREPASPVVRQRRSRV-----ELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKF

Query:  EEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQL
         E+  +I+ NK  +  RAP SW+ P  + C+ C  E  ++P + E   + INWLFLLLGQM+G   L+ L+YFC   + HRTG+K+R+VY+TYL+LCKQL
Subjt:  EEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQL

Query:  QP
         P
Subjt:  QP

AT2G16190.2 FUNCTIONS IN: molecular_function unknown | e value: 2.2e-21 | % identity: 31.11
Query:  SHNLDVELSLR----PPSAVDYSAAELIMQQEE--------VSLPP----PLEPQPETLDLSTPLSTTTNQIMHSSTSSRNS---QSLRVRRP---RARA
        S + D++LSLR    P   V+     +  +QEE         S PP    P  PQP  +   T  +  TN ++  + +   +    ++ VR P   +   
Subjt:  SHNLDVELSLR----PPSAVDYSAAELIMQQEE--------VSLPP----PLEPQPETLDLSTPLSTTTNQIMHSSTSSRNS---QSLRVRRP---RARA

Query:  RV-------RVRARASPSPRRAREPASPVVRQRRSRV-----ELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKF
         V       +V   A  +PRR R P     R  +  V      + +  I PPYPW+T+    +     L  N I  I+G V C  C++  T+EY+L  KF
Subjt:  RV-------RVRARASPSPRRAREPASPVVRQRRSRV-----ELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKF

Query:  EEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHL
         E+  +I+ NK  +  RAP SW+ P  + C+ C  E  ++P + E   + INWLFLLLGQM+G   L+ L
Subjt:  EEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEENCVRPTIPEGDNKNINWLFLLLGQMIGRLKLEHL
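
Each alignment above is laid out in three rows: the query, a match line, and the subject. As a sketch of how a % identity figure relates to the match line, assuming the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), identity is the count of identical columns divided by the alignment length. `percent_identity` is a hypothetical helper, and the values reported by the database may be computed over the complete alignment, not one displayed block.

```python
def percent_identity(query_row, match_row):
    """Percent of alignment columns whose match-line character is a residue letter."""
    if not query_row:
        raise ValueError("empty alignment")
    # Trailing spaces are often stripped from the match line, so pad it back.
    match_row = match_row.ljust(len(query_row))
    if len(match_row) != len(query_row):
        raise ValueError("match line is longer than the query row")
    # Letters mark identities; '+' and spaces are not counted.
    identical = sum(1 for ch in match_row if ch.isalpha())
    return 100.0 * identical / len(query_row)

# Last displayed block of the A0A1S3BHR1 alignment (query row and match line).
query = "KNRLVYLTYLTLCKQLQPS"
match = "KNRL+YLTY+TLC Q+ PS"
print(round(percent_identity(query, match), 2))
```

This covers a single 19-column block only, so its result is not expected to equal the 53.88 reported for the whole hit.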


Sequences
CDS sequence
ATGAAGTTTCCGCCGTTCAACTCTCACAATCTAGACGTCGAACTCTCTCTCCGGCCGCCGTCGGCGGTAGACTACAGTGCTGCAGAACTAATAATGCAGCAGGAAGAAGT
CTCCCTCCCTCCACCGCTGGAGCCACAGCCGGAGACTCTTGATCTTTCAACTCCGCTATCCACGACAACGAATCAGATTATGCATTCTTCCACTTCCTCTCGCAATTCAC
AATCCTTGAGAGTGAGACGCCCTAGGGCTAGGGCTAGGGTTAGGGTTAGGGCTAGGGCTAGTCCTAGTCCTAGGCGTGCTCGAGAGCCTGCATCTCCAGTTGTGCGGCAG
AGGCGATCGAGAGTCGAGCTGAAGAACACGCCGATCAAGCCACCCTATCCATGGTCCACGGAGCACCAAGCCGTGGTTCACGACCTCAACTACCTCCGCGAGAATCAAAT
CCTGACAATCACAGGTGACGTCAGATGCGATCGATGCGAGAAACAGTACACGATCGAGTACGACCTAATGACGAAATTTGAAGAGATTGCGAGTTTCATAGAGAAGAACA
AGGCTACTTTGCACGACCGAGCTCCGGAGTCGTGGACGAACCCTAATTTTCTGGACTGCAAATTGTGTGGGGAAGAAAACTGCGTGAGGCCGACAATACCGGAGGGCGAT
AACAAGAACATAAATTGGCTGTTCTTGCTTTTAGGGCAAATGATTGGACGTTTGAAACTTGAACATCTCAAATACTTCTGTGCTTACACCAATAATCATCGAACTGGGGC
TAAGAATCGTCTTGTTTATCTCACTTATCTTACTCTTTGCAAACAACTTCAGCCCTCCATGGAACTCTTCCATCGTTGA
mRNA sequence
ATGAAGTTTCCGCCGTTCAACTCTCACAATCTAGACGTCGAACTCTCTCTCCGGCCGCCGTCGGCGGTAGACTACAGTGCTGCAGAACTAATAATGCAGCAGGAAGAAGT
CTCCCTCCCTCCACCGCTGGAGCCACAGCCGGAGACTCTTGATCTTTCAACTCCGCTATCCACGACAACGAATCAGATTATGCATTCTTCCACTTCCTCTCGCAATTCAC
AATCCTTGAGAGTGAGACGCCCTAGGGCTAGGGCTAGGGTTAGGGTTAGGGCTAGGGCTAGTCCTAGTCCTAGGCGTGCTCGAGAGCCTGCATCTCCAGTTGTGCGGCAG
AGGCGATCGAGAGTCGAGCTGAAGAACACGCCGATCAAGCCACCCTATCCATGGTCCACGGAGCACCAAGCCGTGGTTCACGACCTCAACTACCTCCGCGAGAATCAAAT
CCTGACAATCACAGGTGACGTCAGATGCGATCGATGCGAGAAACAGTACACGATCGAGTACGACCTAATGACGAAATTTGAAGAGATTGCGAGTTTCATAGAGAAGAACA
AGGCTACTTTGCACGACCGAGCTCCGGAGTCGTGGACGAACCCTAATTTTCTGGACTGCAAATTGTGTGGGGAAGAAAACTGCGTGAGGCCGACAATACCGGAGGGCGAT
AACAAGAACATAAATTGGCTGTTCTTGCTTTTAGGGCAAATGATTGGACGTTTGAAACTTGAACATCTCAAATACTTCTGTGCTTACACCAATAATCATCGAACTGGGGC
TAAGAATCGTCTTGTTTATCTCACTTATCTTACTCTTTGCAAACAACTTCAGCCCTCCATGGAACTCTTCCATCGTTGA
Protein sequence
MKFPPFNSHNLDVELSLRPPSAVDYSAAELIMQQEEVSLPPPLEPQPETLDLSTPLSTTTNQIMHSSTSSRNSQSLRVRRPRARARVRVRARASPSPRRAREPASPVVRQ
RRSRVELKNTPIKPPYPWSTEHQAVVHDLNYLRENQILTITGDVRCDRCEKQYTIEYDLMTKFEEIASFIEKNKATLHDRAPESWTNPNFLDCKLCGEENCVRPTIPEGD
NKNINWLFLLLGQMIGRLKLEHLKYFCAYTNNHRTGAKNRLVYLTYLTLCKQLQPSMELFHR
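
The protein sequence corresponds to a translation of the CDS. A minimal sketch using the standard genetic code is given below. `translate` is a hypothetical helper written for illustration, not a CuGenDB tool, and the example uses only the first 45 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Step through complete codons; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAGTTTCCGCCGTTCAACTCTCACAATCTAGACGTCGAACTC"
print(translate(cds_start))
```

The printed residues can be compared with the start of the protein sequence listed above.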