; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS022068 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS022068
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationscaffold47:441930..442694
RNA-Seq ExpressionMS022068
SyntenyMS022068
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036575.1 uncharacterized protein E6C27_scaffold191G00850 [Cucumis melo var. makuwa]6.2e-6563.06Show/hide
Query:  DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHL
        DLQLSLRPP  +  S   PSPP    +       RAN++T+ RI+RN G  RRSS    RCNSRS          +T  I PPY WST RRA+V TL+ L
Subjt:  DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHL

Query:  KSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL
        +S++I+ I+G+VRCR+CQ  Y IEYD+VSKFEEIASFVE+NK+ F DRAP  WMNPNYP C+FCG ENGARPV+P E R+INWLFLLLG+MLGVL+LNHL
Subjt:  KSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL

Query:  KYFCGCTDNHRTGAKNRLLYLT
        KYFC  T+NHRTGAKNRLLYLT
Subjt:  KYFCGCTDNHRTGAKNRLLYLT

XP_008447299.1 PREDICTED: uncharacterized protein LOC103489770 [Cucumis melo]2.5e-7464.29Show/hide
Query:  DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHL
        DLQLSLRPP  +  S   PSPP    +       RAN++T+ RI+RN G  RRSS    RCNSRS          +T  I PPY WST RRA+V TL+ L
Subjt:  DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHL

Query:  KSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL
        +S++I+ I+G+VRCR+CQ  Y IEYD+VSKFEEIASFVE+NK+ F DRAP  WMNPNYP C+FCG ENGARPV+P E R+INWLFLLLG+MLGVL+LNHL
Subjt:  KSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL

Query:  KYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR
        KYFC  T+NHRTGAKNRLLYLTY+TLC+QVDPSGRF+R
Subjt:  KYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR

XP_011659748.1 uncharacterized protein LOC105436256 [Cucumis sativus]3.1e-7262.3Show/hide
Query:  ERHSP-DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVV
        ERH+  DL+LSLRPP  + SS   PS  P   +       R N++T+ R++R+ G  RRSS    RCNSRS          +T  I PPY WST RRA+V
Subjt:  ERHSP-DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVV

Query:  HTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGV
         TL+ LKSN+I+ I+G+V+CR+CQ  Y IEYD+ SKFEEIASFVE+NK+SF DRAP  WMNPNYP C+FCG ENGARPV+P + R+INWLFLLLG+MLGV
Subjt:  HTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGV

Query:  LSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR
        L+LNHLKYFC  T NHRTGAKNRLLYLTY+TLC+QVDPSGRF+R
Subjt:  LSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR

XP_022952797.1 uncharacterized protein LOC111455388 [Cucurbita moschata]1.9e-6153.59Show/hide
Query:  DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHL
        DL+LSL  P   S+++ A S          +     + ++S R   N G +R++S    + NS            +T  I PPY WST R AVVHTL +L
Subjt:  DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHL

Query:  KSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL
         SN+I+TI+GEV+C++C+R+YEIEYD+VSKF EI SFVE N +SF DRAP EWM PNYP C+FCG E G +PV+P E  +INW+FLLLG+M+G L LNHL
Subjt:  KSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL

Query:  KYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS
        KYFC  T NHRTG+K+RL+YLTY+TLC Q+DPSGRFS
Subjt:  KYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS

XP_022972401.1 uncharacterized protein LOC111470968 [Cucurbita maxima]8.4e-6258.97Show/hide
Query:  RNPGI--IRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKD
        R P I  +R++S    +CNS            +T  I PPY WST R AVVHTL +L  N+I+TI+G+V+C++C+R+YEIEY++VSKF EI SFVE N +
Subjt:  RNPGI--IRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKD

Query:  SFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR
        SF DRAP +WM PNYP C+FCG E G +PV+P E  +INW+FLLLG+M+G L LNHLKYFC  T NHRTG+K+RL+YLTY+TLC Q+DPSGRFSR
Subjt:  SFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR

TrEMBL top hitse value%identityAlignment
A0A0A0K3Q8 Uncharacterized protein1.5e-7262.3Show/hide
Query:  ERHSP-DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVV
        ERH+  DL+LSLRPP  + SS   PS  P   +       R N++T+ R++R+ G  RRSS    RCNSRS          +T  I PPY WST RRA+V
Subjt:  ERHSP-DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVV

Query:  HTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGV
         TL+ LKSN+I+ I+G+V+CR+CQ  Y IEYD+ SKFEEIASFVE+NK+SF DRAP  WMNPNYP C+FCG ENGARPV+P + R+INWLFLLLG+MLGV
Subjt:  HTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGV

Query:  LSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR
        L+LNHLKYFC  T NHRTGAKNRLLYLTY+TLC+QVDPSGRF+R
Subjt:  LSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR

A0A1S3BHR1 uncharacterized protein LOC1034897701.2e-7464.29Show/hide
Query:  DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHL
        DLQLSLRPP  +  S   PSPP    +       RAN++T+ RI+RN G  RRSS    RCNSRS          +T  I PPY WST RRA+V TL+ L
Subjt:  DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHL

Query:  KSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL
        +S++I+ I+G+VRCR+CQ  Y IEYD+VSKFEEIASFVE+NK+ F DRAP  WMNPNYP C+FCG ENGARPV+P E R+INWLFLLLG+MLGVL+LNHL
Subjt:  KSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL

Query:  KYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR
        KYFC  T+NHRTGAKNRLLYLTY+TLC+QVDPSGRF+R
Subjt:  KYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR

A0A5A7T547 Uncharacterized protein3.0e-6563.06Show/hide
Query:  DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHL
        DLQLSLRPP  +  S   PSPP    +       RAN++T+ RI+RN G  RRSS    RCNSRS          +T  I PPY WST RRA+V TL+ L
Subjt:  DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHL

Query:  KSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL
        +S++I+ I+G+VRCR+CQ  Y IEYD+VSKFEEIASFVE+NK+ F DRAP  WMNPNYP C+FCG ENGARPV+P E R+INWLFLLLG+MLGVL+LNHL
Subjt:  KSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL

Query:  KYFCGCTDNHRTGAKNRLLYLT
        KYFC  T+NHRTGAKNRLLYLT
Subjt:  KYFCGCTDNHRTGAKNRLLYLT

A0A6J1GLD4 uncharacterized protein LOC1114553889.1e-6253.59Show/hide
Query:  DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHL
        DL+LSL  P   S+++ A S          +     + ++S R   N G +R++S    + NS            +T  I PPY WST R AVVHTL +L
Subjt:  DLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHL

Query:  KSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL
         SN+I+TI+GEV+C++C+R+YEIEYD+VSKF EI SFVE N +SF DRAP EWM PNYP C+FCG E G +PV+P E  +INW+FLLLG+M+G L LNHL
Subjt:  KSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL

Query:  KYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS
        KYFC  T NHRTG+K+RL+YLTY+TLC Q+DPSGRFS
Subjt:  KYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS

A0A6J1I5V9 uncharacterized protein LOC1114709684.1e-6258.97Show/hide
Query:  RNPGI--IRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKD
        R P I  +R++S    +CNS            +T  I PPY WST R AVVHTL +L  N+I+TI+G+V+C++C+R+YEIEY++VSKF EI SFVE N +
Subjt:  RNPGI--IRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKD

Query:  SFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR
        SF DRAP +WM PNYP C+FCG E G +PV+P E  +INW+FLLLG+M+G L LNHLKYFC  T NHRTG+K+RL+YLTY+TLC Q+DPSGRFSR
Subjt:  SFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein5.7e-4036.71Show/hide
Query:  SSSSVAPSPPPP------------RRSPGR----SHIQRANSITSHRISRNPGIIRRSSPAGGRC-NSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVH
        ++ S+ P PPPP            +++P      +H    + +     +  P  ++R      R   SRS       +   +  I PP+ W+T RR  + 
Subjt:  SSSSVAPSPPPP------------RRSPGR----SHIQRANSITSHRISRNPGIIRRSSPAGGRC-NSRSAAVRAIGMEASTREIRPPYLWSTTRRAVVH

Query:  TLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVL
        +L +L+SN+I TI+GEV+CR C++VY++ Y+L  +F E+  F    K    DRA  +W  P   RC+ CG E   +PV+   + +INWLFLLLGQ LG  
Subjt:  TLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVL

Query:  SLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDP
        +L  LK FC  + NHRTGAK+R+LYLTY+ LC  + P
Subjt:  SLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)2.3e-4142.25Show/hide
Query:  RRSSPAGGRCNSRS-AAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAP
        RR  P GG+    S   V  +      REI PPY W+T +   + +   L SN I  ISG+V C+ C R   +EY+L  KF E+  +++ NK+    RAP
Subjt:  RRSSPAGGRCNSRS-AAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAP

Query:  SEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS
          W  P    C+ C  E   +PV+   +  INWLFLLLGQMLG  +L+ L+YFC     HRTG+K+R++Y+TYL+LC Q+DP G F+
Subjt:  SEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYFCGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFS

AT2G16190.2 FUNCTIONS IN: molecular_function unknown6.1e-2639.33Show/hide
Query:  RRSSPAGGRCNSRS-AAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAP
        RR  P GG+    S   V  +      REI PPY W+T +   + +   L SN I  ISG+V C+ C R   +EY+L  KF E+  +++ NK+    RAP
Subjt:  RRSSPAGGRCNSRS-AAVRAIGMEASTREIRPPYLWSTTRRAVVHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAP

Query:  SEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL
          W  P    C+ C  E   +PV+   +  INWLFLLLGQMLG  +L+ L
Subjt:  SEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GATAATAATAACAATCCCGCCGCCGCCCGAAGCCGTGAACGCCACAGTCCCGATCTCCAACTCTCGCTCCGTCCGCCGGTACAAAACAGTAGTAGTAGTGTGGCGCCGTC
TCCGCCGCCGCCTCGTCGCTCCCCCGGCCGCAGCCACATCCAGCGGGCAAATTCAATAACCTCACATAGAATTAGCCGGAATCCTGGAATAATTCGCCGATCTTCTCCGG
CCGGCGGACGCTGCAATTCGCGATCCGCGGCGGTGCGGGCGATCGGTATGGAGGCGAGTACGAGGGAGATCCGGCCACCGTACCTGTGGTCGACGACGCGCCGAGCGGTG
GTCCACACTCTGAGCCACCTAAAATCGAACCGGATCGTCACGATCAGTGGCGAGGTCCGGTGCCGGCGGTGCCAGAGAGTGTACGAGATCGAGTATGACCTGGTTTCGAA
ATTCGAAGAGATCGCGAGTTTCGTAGAGAAAAACAAGGATTCGTTTCACGACAGAGCACCGAGCGAGTGGATGAACCCAAATTATCCACGGTGCAAATTCTGCGGCCTGG
AGAACGGAGCGCGGCCGGTGGTTCCGACGGAGGAGCGGCGGATCAATTGGCTGTTCTTGCTTTTGGGACAAATGCTCGGAGTTTTGAGTCTGAATCATCTGAAATATTTC
TGCGGATGCACAGATAATCATCGAACAGGTGCCAAGAATCGTCTCCTCTATCTCACTTACCTCACTCTCTGCAACCAAGTTGATCCTTCTGGCCGATTCAGTCGC
mRNA sequenceShow/hide mRNA sequence
GATAATAATAACAATCCCGCCGCCGCCCGAAGCCGTGAACGCCACAGTCCCGATCTCCAACTCTCGCTCCGTCCGCCGGTACAAAACAGTAGTAGTAGTGTGGCGCCGTC
TCCGCCGCCGCCTCGTCGCTCCCCCGGCCGCAGCCACATCCAGCGGGCAAATTCAATAACCTCACATAGAATTAGCCGGAATCCTGGAATAATTCGCCGATCTTCTCCGG
CCGGCGGACGCTGCAATTCGCGATCCGCGGCGGTGCGGGCGATCGGTATGGAGGCGAGTACGAGGGAGATCCGGCCACCGTACCTGTGGTCGACGACGCGCCGAGCGGTG
GTCCACACTCTGAGCCACCTAAAATCGAACCGGATCGTCACGATCAGTGGCGAGGTCCGGTGCCGGCGGTGCCAGAGAGTGTACGAGATCGAGTATGACCTGGTTTCGAA
ATTCGAAGAGATCGCGAGTTTCGTAGAGAAAAACAAGGATTCGTTTCACGACAGAGCACCGAGCGAGTGGATGAACCCAAATTATCCACGGTGCAAATTCTGCGGCCTGG
AGAACGGAGCGCGGCCGGTGGTTCCGACGGAGGAGCGGCGGATCAATTGGCTGTTCTTGCTTTTGGGACAAATGCTCGGAGTTTTGAGTCTGAATCATCTGAAATATTTC
TGCGGATGCACAGATAATCATCGAACAGGTGCCAAGAATCGTCTCCTCTATCTCACTTACCTCACTCTCTGCAACCAAGTTGATCCTTCTGGCCGATTCAGTCGC
Protein sequenceShow/hide protein sequence
DNNNNPAAARSRERHSPDLQLSLRPPVQNSSSSVAPSPPPPRRSPGRSHIQRANSITSHRISRNPGIIRRSSPAGGRCNSRSAAVRAIGMEASTREIRPPYLWSTTRRAV
VHTLSHLKSNRIVTISGEVRCRRCQRVYEIEYDLVSKFEEIASFVEKNKDSFHDRAPSEWMNPNYPRCKFCGLENGARPVVPTEERRINWLFLLLGQMLGVLSLNHLKYF
CGCTDNHRTGAKNRLLYLTYLTLCNQVDPSGRFSR