CuGenDBv2

Moc04g11760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g11760
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:8824571..8827839
RNA-Seq Expression: Moc04g11760
Synteny: Moc04g11760
Gene Ontology terms: NA
InterPro domains: NA
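
The genome location above is a compact `chromosome:start..end` string. The following is a minimal Python sketch of how such a string could be split into its parts and the locus span computed. The names `Locus` and `parse_location` are hypothetical helpers introduced here for illustration, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval (hypothetical helper, not part of any database API)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'chrom:start..end' string into a Locus."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Locus(match.group(1), int(match.group(2)), int(match.group(3)))


locus = parse_location("chr4:8824571..8827839")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the reported length is the full locus span on the chromosome, which need not equal the CDS length listed further down.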


Homology
GenBank top hits | e value | % identity | Alignment
XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia] | 4.0e-62 | 61.43
Query:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD
        MMYF  GLN RNLTIEF SRPP SLN++LARARQYIDGLELWKA  ARRSSR KDRDQ+S PPKK+ +DD+S SR+A D+++R    +R  SDR GPKFD
Subjt:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD

Query:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL---------------------------------
        +FTPLNAS+AEIYA  E+TD++ALF A +KL R SGKRDKRLYCRFHKDHGH++SRCFHLKEQV+DL                                 
Subjt:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL---------------------------------

Query:  --KEGHPAVISTIHVGPSGGQSG
          KE  PAVI+TIH GPSG +SG
Subjt:  --KEGHPAVISTIHVGPSGGQSG

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia] | 1.1e-54 | 41.08
Query:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQR--ADDRSLSRRADDNKNRKHRGKRTPSDRRGPK
        M YF+ GL    LT++     P +  E+L +A++ IDG EL +  K  +    KD +   P  K +   ++ R+  RRA++   R               
Subjt:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQR--ADDRSLSRRADDNKNRKHRGKRTPSDRRGPK

Query:  FDRFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL-------------------------------
        ++RFTP    I+EI    E++ +E L    EKLR +  +R K  YCRFH++HGH+TS  + LK Q+EDL                               
Subjt:  FDRFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL-------------------------------

Query:  -KEGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALG
         +   PAVI+TI  GPSGGQSG KRK LAR A  EVC    + P  PI FD  D   VHLPHND LVIAPLIDHV VRRVLVDGGASANILS  TY ALG
Subjt:  -KEGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALG

Query:  WEMRHLKRSPTPLVGFAGEMTDLLRIVPVEILAESSIDQLEVMEVQSARLTWM
        W    LK+SPTPLVGF+GE           ++ E  ID    +     R+T M
Subjt:  WEMRHLKRSPTPLVGFAGEMTDLLRIVPVEILAESSIDQLEVMEVQSARLTWM

XP_022154797.1 uncharacterized protein LOC111021964 [Momordica charantia] | 3.3e-56 | 45.11
Query:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD
        M YF+ GL    LT++     PT+  E+L +A++ IDG EL +  K  R  R   R +     +K  AD +S  + +  +   ++R       R  P ++
Subjt:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD

Query:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL-KEGH----------------------------
        RFTP    I+EI    E++ +E      EKLR +  +R K  YCRFH++HGH+TS C+ LK Q+EDL ++G+                            
Subjt:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL-KEGH----------------------------

Query:  ---PAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALGWE
           PAVI+TI  GPSGGQSG KRK LAR A  EVC    + P  PI FD  D E VHLPHND LVIAPLIDHV VRRVLVDGGASANILS  TY ALGW 
Subjt:  ---PAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALGWE

Query:  MRHLKRSPTPLVGFAGE
           LK+SPTPLVGF+GE
Subjt:  MRHLKRSPTPLVGFAGE

XP_022157474.1 uncharacterized protein LOC111024166 [Momordica charantia] | 1.1e-54 | 42.9
Query:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD
        M YF+ GL    L ++     P +  E+L +A++ IDG EL +   +R     K  DQK    + +R + +S  +    + +R    +      R   ++
Subjt:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD

Query:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL--------------------------------K
        R+TP    I+ I    E+T +E L     KLR    K +K  YCRFH+DHGH+TS C+ LK Q+EDL                                 
Subjt:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL--------------------------------K

Query:  EGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALGWE
        +  P VI+TI  GPSGGQSG KRK LAREA  EVC    ++P   I F + D EGVHLPHND LVIAPLIDHV VRRVLVDGGASANILS  TY ALGW 
Subjt:  EGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALGWE

Query:  MRHLKRSPTPLVGFAGE
           LK+SPTPLVGF+ E
Subjt:  MRHLKRSPTPLVGFAGE

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia] | 1.0e-118 | 72.19
Query:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD
        MMYF  GLN RNLTIEFRSRPP SLNE+ ARARQYIDGLELWKAN ARRSSR +DRD KSPP KK+  DDRS SRRADD+K+R  R +R  S+RRGPKFD
Subjt:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD

Query:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL---------------------------------
        +FTPLNASIAEIYA  EDTD+E LFA+ EKLRR SGKR+KRLYCRFHKDHGHDTSRCFHLKEQVEDL                                 
Subjt:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL---------------------------------

Query:  --KEGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTAL
          KE  PAVI+TIH GPSG +SGQKRKALARE AHEVCTSYPK PVMPILFDEQD E VH+PHND LVIAPLIDHVKVRRV VDGGASANI SF TYTAL
Subjt:  --KEGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTAL

Query:  GWEMRHLKRSPTPLVGFAGE
        GWE RHLK   T LVGFA E
Subjt:  GWEMRHLKRSPTPLVGFAGE

TrEMBL top hits | e value | % identity | Alignment
A0A6J1D5T3 uncharacterized protein LOC111017548 | 1.9e-62 | 61.43
Query:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD
        MMYF  GLN RNLTIEF SRPP SLN++LARARQYIDGLELWKA  ARRSSR KDRDQ+S PPKK+ +DD+S SR+A D+++R    +R  SDR GPKFD
Subjt:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD

Query:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL---------------------------------
        +FTPLNAS+AEIYA  E+TD++ALF A +KL R SGKRDKRLYCRFHKDHGH++SRCFHLKEQV+DL                                 
Subjt:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL---------------------------------

Query:  --KEGHPAVISTIHVGPSGGQSG
          KE  PAVI+TIH GPSG +SG
Subjt:  --KEGHPAVISTIHVGPSGGQSG

A0A6J1DD03 uncharacterized protein LOC111019899 | 5.1e-55 | 41.08
Query:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQR--ADDRSLSRRADDNKNRKHRGKRTPSDRRGPK
        M YF+ GL    LT++     P +  E+L +A++ IDG EL +  K  +    KD +   P  K +   ++ R+  RRA++   R               
Subjt:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQR--ADDRSLSRRADDNKNRKHRGKRTPSDRRGPK

Query:  FDRFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL-------------------------------
        ++RFTP    I+EI    E++ +E L    EKLR +  +R K  YCRFH++HGH+TS  + LK Q+EDL                               
Subjt:  FDRFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL-------------------------------

Query:  -KEGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALG
         +   PAVI+TI  GPSGGQSG KRK LAR A  EVC    + P  PI FD  D   VHLPHND LVIAPLIDHV VRRVLVDGGASANILS  TY ALG
Subjt:  -KEGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALG

Query:  WEMRHLKRSPTPLVGFAGEMTDLLRIVPVEILAESSIDQLEVMEVQSARLTWM
        W    LK+SPTPLVGF+GE           ++ E  ID    +     R+T M
Subjt:  WEMRHLKRSPTPLVGFAGEMTDLLRIVPVEILAESSIDQLEVMEVQSARLTWM

A0A6J1DMN7 uncharacterized protein LOC111021964 | 1.6e-56 | 45.11
Query:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD
        M YF+ GL    LT++     PT+  E+L +A++ IDG EL +  K  R  R   R +     +K  AD +S  + +  +   ++R       R  P ++
Subjt:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD

Query:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL-KEGH----------------------------
        RFTP    I+EI    E++ +E      EKLR +  +R K  YCRFH++HGH+TS C+ LK Q+EDL ++G+                            
Subjt:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL-KEGH----------------------------

Query:  ---PAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALGWE
           PAVI+TI  GPSGGQSG KRK LAR A  EVC    + P  PI FD  D E VHLPHND LVIAPLIDHV VRRVLVDGGASANILS  TY ALGW 
Subjt:  ---PAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALGWE

Query:  MRHLKRSPTPLVGFAGE
           LK+SPTPLVGF+GE
Subjt:  MRHLKRSPTPLVGFAGE

A0A6J1DWK7 uncharacterized protein LOC111024166 | 5.1e-55 | 42.9
Query:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD
        M YF+ GL    L ++     P +  E+L +A++ IDG EL +   +R     K  DQK    + +R + +S  +    + +R    +      R   ++
Subjt:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD

Query:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL--------------------------------K
        R+TP    I+ I    E+T +E L     KLR    K +K  YCRFH+DHGH+TS C+ LK Q+EDL                                 
Subjt:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL--------------------------------K

Query:  EGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALGWE
        +  P VI+TI  GPSGGQSG KRK LAREA  EVC    ++P   I F + D EGVHLPHND LVIAPLIDHV VRRVLVDGGASANILS  TY ALGW 
Subjt:  EGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALGWE

Query:  MRHLKRSPTPLVGFAGE
           LK+SPTPLVGF+ E
Subjt:  MRHLKRSPTPLVGFAGE

A0A6J1E0L8 uncharacterized protein LOC111025310 | 5.0e-119 | 72.19
Query:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD
        MMYF  GLN RNLTIEFRSRPP SLNE+ ARARQYIDGLELWKAN ARRSSR +DRD KSPP KK+  DDRS SRRADD+K+R  R +R  S+RRGPKFD
Subjt:  MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFD

Query:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL---------------------------------
        +FTPLNASIAEIYA  EDTD+E LFA+ EKLRR SGKR+KRLYCRFHKDHGHDTSRCFHLKEQVEDL                                 
Subjt:  RFTPLNASIAEIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDL---------------------------------

Query:  --KEGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTAL
          KE  PAVI+TIH GPSG +SGQKRKALARE AHEVCTSYPK PVMPILFDEQD E VH+PHND LVIAPLIDHVKVRRV VDGGASANI SF TYTAL
Subjt:  --KEGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQDSEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTAL

Query:  GWEMRHLKRSPTPLVGFAGE
        GWE RHLK   T LVGFA E
Subjt:  GWEMRHLKRSPTPLVGFAGE

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGATGTATTTCATAATAGGGTTAAATGGCAGGAACCTCACGATCGAGTTCAGAAGTCGTCCGCCGACCTCACTGAACGAAATACTCGCCCGAGCTCGGCAGTACATTGA
TGGCCTGGAGTTGTGGAAGGCCAACAAAGCCAGGCGGAGCAGCCGCAGTAAAGATCGGGACCAGAAGTCCCCTCCTCCAAAGAAGCAACGTGCTGATGATAGGAGCTTGT
CTCGACGGGCCGACGACAACAAGAACCGAAAACACCGTGGTAAGAGAACCCCTTCAGACCGTCGGGGGCCGAAGTTTGATAGGTTCACCCCGCTGAATGCCTCAATCGCG
GAGATCTACGCAGCAACTGAAGATACCGACCTGGAGGCGCTGTTCGCAGCCTCAGAAAAGCTCCGCCGATCTTCGGGGAAGCGAGACAAGCGACTCTACTGCAGATTCCA
CAAGGATCACGGCCATGACACTTCTCGTTGCTTTCACTTAAAGGAGCAAGTAGAGGACCTGAAGGAAGGTCATCCCGCAGTAATAAGTACCATCCATGTGGGCCCAAGTG
GGGGACAGTCAGGGCAGAAGAGAAAAGCTCTGGCTCGGGAGGCAGCACACGAGGTCTGTACCTCGTACCCCAAGGAGCCCGTGATGCCGATCTTGTTTGATGAACAGGAC
AGTGAGGGAGTGCATCTGCCTCATAACGACGTCCTGGTGATCGCCCCACTAATAGACCACGTGAAGGTCAGAAGAGTGCTTGTTGATGGCGGAGCGTCGGCCAATATACT
GTCCTTCTTGACCTACACCGCCCTAGGATGGGAGATGAGACATTTGAAGCGTAGCCCGACGCCTTTGGTCGGCTTTGCCGGGGAGATGACCGACCTGCTGAGAATAGTTC
CTGTTGAAATACTCGCCGAGTCATCTATCGACCAGCTTGAAGTAATGGAGGTCCAGTCAGCTCGGCTCACGTGGATGGGCCCGATTAAAGACTTCCTAGTCAGTGCCTCA
GCCCCTGCTAATCCGAGCCAGGCCAGGAAGCTCCGACGTCAAGCTGCTCACTACTTGATGCAATAA
mRNA sequence
ATGATGTATTTCATAATAGGGTTAAATGGCAGGAACCTCACGATCGAGTTCAGAAGTCGTCCGCCGACCTCACTGAACGAAATACTCGCCCGAGCTCGGCAGTACATTGA
TGGCCTGGAGTTGTGGAAGGCCAACAAAGCCAGGCGGAGCAGCCGCAGTAAAGATCGGGACCAGAAGTCCCCTCCTCCAAAGAAGCAACGTGCTGATGATAGGAGCTTGT
CTCGACGGGCCGACGACAACAAGAACCGAAAACACCGTGGTAAGAGAACCCCTTCAGACCGTCGGGGGCCGAAGTTTGATAGGTTCACCCCGCTGAATGCCTCAATCGCG
GAGATCTACGCAGCAACTGAAGATACCGACCTGGAGGCGCTGTTCGCAGCCTCAGAAAAGCTCCGCCGATCTTCGGGGAAGCGAGACAAGCGACTCTACTGCAGATTCCA
CAAGGATCACGGCCATGACACTTCTCGTTGCTTTCACTTAAAGGAGCAAGTAGAGGACCTGAAGGAAGGTCATCCCGCAGTAATAAGTACCATCCATGTGGGCCCAAGTG
GGGGACAGTCAGGGCAGAAGAGAAAAGCTCTGGCTCGGGAGGCAGCACACGAGGTCTGTACCTCGTACCCCAAGGAGCCCGTGATGCCGATCTTGTTTGATGAACAGGAC
AGTGAGGGAGTGCATCTGCCTCATAACGACGTCCTGGTGATCGCCCCACTAATAGACCACGTGAAGGTCAGAAGAGTGCTTGTTGATGGCGGAGCGTCGGCCAATATACT
GTCCTTCTTGACCTACACCGCCCTAGGATGGGAGATGAGACATTTGAAGCGTAGCCCGACGCCTTTGGTCGGCTTTGCCGGGGAGATGACCGACCTGCTGAGAATAGTTC
CTGTTGAAATACTCGCCGAGTCATCTATCGACCAGCTTGAAGTAATGGAGGTCCAGTCAGCTCGGCTCACGTGGATGGGCCCGATTAAAGACTTCCTAGTCAGTGCCTCA
GCCCCTGCTAATCCGAGCCAGGCCAGGAAGCTCCGACGTCAAGCTGCTCACTACTTGATGCAATAA
Protein sequence
MMYFIIGLNGRNLTIEFRSRPPTSLNEILARARQYIDGLELWKANKARRSSRSKDRDQKSPPPKKQRADDRSLSRRADDNKNRKHRGKRTPSDRRGPKFDRFTPLNASIA
EIYAATEDTDLEALFAASEKLRRSSGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLKEGHPAVISTIHVGPSGGQSGQKRKALAREAAHEVCTSYPKEPVMPILFDEQD
SEGVHLPHNDVLVIAPLIDHVKVRRVLVDGGASANILSFLTYTALGWEMRHLKRSPTPLVGFAGEMTDLLRIVPVEILAESSIDQLEVMEVQSARLTWMGPIKDFLVSAS
APANPSQARKLRRQAAHYLMQ