; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g0487 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g0487
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPollen Ole e 1 allergen and extensin family protein
Genome locationMC03:11687184..11691227
RNA-Seq ExpressionMC03g0487
SyntenyMC03g0487
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012641.1 hypothetical protein SDJN02_25393 [Cucurbita argyrosperma subsp. argyrosperma]7.88e-11676.7Show/hide
Query:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH
        M+TR C F+         FFFFFLL   GF +VVESAE +TP+SDLLSRD+WRQ AGYGEERLSTVLVTGS+LCEACLHGDEPQ+H+WP+ GAMVGV CH
Subjt:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH

Query:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSG
        N+G+NSKSSDW HG+TDEFGDFIIDIPS  HAT+SFEKVCSIKIL+TPKN RCRPAHFAGR+QLQLSSFGGGIRTYTSG L+LQH+TS+PLQ C NKG  
Subjt:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSG

Query:  DKQTLW
        D+QTLW
Subjt:  DKQTLW

XP_004139527.1 uncharacterized protein LOC101215830 [Cucumis sativus]1.06e-11977.67Show/hide
Query:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH
        M+ R   FH SV  FW   FFF+L+ GHGFP+V+ESAE +TPVSDLLSRD WR++AGYGEERLSTVLVTGSVLCEACLHGDEPQ+H+WPI GAMVGV+CH
Subjt:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH

Query:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSG
        N G+NSKSSDW HGVTDEFGDF+IDIPSHLHATRSFE VCSIKIL+TPKN  CRPAH AGR+ LQLSSFGGGIRTYTSG L+LQH+TSRPLQ C N+G G
Subjt:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSG

Query:  DKQTLW
         +QT W
Subjt:  DKQTLW

XP_008463598.1 PREDICTED: uncharacterized protein LOC103501709 [Cucumis melo]1.68e-11676.21Show/hide
Query:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH
        MM R   FH SV  FW   FFF+L+ GHGFP+V ESAE +TPVSDLL+RD WR++AGYGEERLSTVLVTGSVLCE+CLHGDEPQ+H+WPI GAMVGV+CH
Subjt:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH

Query:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSG
        N G+NSKSSDW HGVTDEFGDF+IDIPS LHAT+SFE VCSIKIL+TPKN  CRPAH AGR+QLQLSSFGGGIRTYTSG L+LQH+TSRPLQ C N+G  
Subjt:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSG

Query:  DKQTLW
         +QT W
Subjt:  DKQTLW

XP_022142515.1 uncharacterized protein LOC111012615 [Momordica charantia]4.67e-15399.52Show/hide
Query:  MMTRFCRFHDSVKPFWFSFFFFF-LLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSC
        MMTRFCRFHDSVKPFWFSFFFFF LLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSC
Subjt:  MMTRFCRFHDSVKPFWFSFFFFF-LLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSC

Query:  HNNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGS
        HNNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGS
Subjt:  HNNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGS

Query:  GDKQTLW
        GDKQTLW
Subjt:  GDKQTLW

XP_038895694.1 uncharacterized protein LOC120083866 [Benincasa hispida]6.70e-12380Show/hide
Query:  MTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCHN
        M R    H SV  FW   FFFF L GHGFP+ VE+AE +TPVSDLLSRDDWRQ+AGYGEERLSTVLVTGSVLCEACLHGDEPQ+H+WPI GAMVGV+CHN
Subjt:  MTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCHN

Query:  NGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSGD
        NG+NSKSSDW HGVTDEFGDFIIDIPSHLHATRSFE VCSIKIL+TPKN  CRPAH AG +QLQLSSFGGGIRTYTSG L+LQH+TSRPLQ C N+G GD
Subjt:  NGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSGD

Query:  KQTLW
        +QT W
Subjt:  KQTLW

TrEMBL top hitse value%identityAlignment
A0A0A0LT20 Uncharacterized protein5.14e-12077.67Show/hide
Query:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH
        M+ R   FH SV  FW   FFF+L+ GHGFP+V+ESAE +TPVSDLLSRD WR++AGYGEERLSTVLVTGSVLCEACLHGDEPQ+H+WPI GAMVGV+CH
Subjt:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH

Query:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSG
        N G+NSKSSDW HGVTDEFGDF+IDIPSHLHATRSFE VCSIKIL+TPKN  CRPAH AGR+ LQLSSFGGGIRTYTSG L+LQH+TSRPLQ C N+G G
Subjt:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSG

Query:  DKQTLW
         +QT W
Subjt:  DKQTLW

A0A1S3CJM3 uncharacterized protein LOC1035017098.15e-11776.21Show/hide
Query:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH
        MM R   FH SV  FW   FFF+L+ GHGFP+V ESAE +TPVSDLL+RD WR++AGYGEERLSTVLVTGSVLCE+CLHGDEPQ+H+WPI GAMVGV+CH
Subjt:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH

Query:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSG
        N G+NSKSSDW HGVTDEFGDF+IDIPS LHAT+SFE VCSIKIL+TPKN  CRPAH AGR+QLQLSSFGGGIRTYTSG L+LQH+TSRPLQ C N+G  
Subjt:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSG

Query:  DKQTLW
         +QT W
Subjt:  DKQTLW

A0A5D3E5G2 Pollen_Ole_e_I domain-containing protein8.15e-11776.21Show/hide
Query:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH
        MM R   FH SV  FW   FFF+L+ GHGFP+V ESAE +TPVSDLL+RD WR++AGYGEERLSTVLVTGSVLCE+CLHGDEPQ+H+WPI GAMVGV+CH
Subjt:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH

Query:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSG
        N G+NSKSSDW HGVTDEFGDF+IDIPS LHAT+SFE VCSIKIL+TPKN  CRPAH AGR+QLQLSSFGGGIRTYTSG L+LQH+TSRPLQ C N+G  
Subjt:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSG

Query:  DKQTLW
         +QT W
Subjt:  DKQTLW

A0A6J1CL55 uncharacterized protein LOC1110126152.26e-15399.52Show/hide
Query:  MMTRFCRFHDSVKPFWFSFFFFF-LLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSC
        MMTRFCRFHDSVKPFWFSFFFFF LLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSC
Subjt:  MMTRFCRFHDSVKPFWFSFFFFF-LLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSC

Query:  HNNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGS
        HNNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGS
Subjt:  HNNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGS

Query:  GDKQTLW
        GDKQTLW
Subjt:  GDKQTLW

A0A6J1KR10 uncharacterized protein LOC1114956893.13e-11580Show/hide
Query:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH
        M+TR C FH+SV  F   FFFFFLL GHGFP+VVESAEG+TP+SDLL RDDWRQ+AGYGEERLSTVLVTGSVLCEACLHGDE Q+H+WPI GAMVGV+C 
Subjt:  MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCH

Query:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCI
        N G+NSKS +W +GVTDEFGDF+IDIPSHLHA RSFEK CSIKIL+TPKN RCRPAH AG EQLQLSSFGGG RTYTSG L+LQH+TSRPLQ CI
Subjt:  NNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G40113.1 Pollen Ole e 1 allergen and extensin family protein8.7e-2338.96Show/hide
Query:  SDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCHNNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIK
        S L S  +   MAGYGE +LS+V++TGS+LC              P+SGA V + CH   +  + S W   VT++FG+F+I +PSHLHA    EK C +K
Subjt:  SDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCHNNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIK

Query:  ILQTPKN-ARCRPAHFAG--REQLQLSSFGGGIRTYTSGALKL----QHRTSRP
         +  PK+  RC          + ++L S   G R YTSG +KL      RTS+P
Subjt:  ILQTPKN-ARCRPAHFAG--REQLQLSSFGGGIRTYTSGALKL----QHRTSRP

AT4G17215.1 Pollen Ole e 1 allergen and extensin family protein2.2e-2641.21Show/hide
Query:  FLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCHNNGRNSKSSDWAHGVTDEFGDF
        FLL+     ++V   E     SD  SRD+  +MAGYGE++LS+VL+T S+L  +          S PI GA +G  CH   R  + S W   VT+E G F
Subjt:  FLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCHNNGRNSKSSDWAHGVTDEFGDF

Query:  IIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRT
        +ID+PSHLHA    +K C IK L  PK  RC      G   +QL S   G R YT+G + LQ  T
Subjt:  IIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRT

AT5G15780.1 Pollen Ole e 1 allergen and extensin family protein5.1e-0728.81Show/hide
Query:  STVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCHNNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQT--PKNARCRPAHFAGR
        S+ +V G+V C+ C +G   +  +  ISGA+V V C +   NSK S      TD+ G+F + +P  +       K CS+K+L +  P  +    A  +  
Subjt:  STVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCHNNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQT--PKNARCRPAHFAGR

Query:  EQLQLSSFGGGIRTYTSG
        ++L+ +  G   R +++G
Subjt:  EQLQLSSFGGGIRTYTSG

AT5G47635.1 Pollen Ole e 1 allergen and extensin family protein6.9e-2841.78Show/hide
Query:  SDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCHNNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIK
        ++L +R +  +MAGYGE++LS+V++TGS+LC+       P LHS PI GA V + CH   +  + S W   VTDE G+F ID+PS LHA    E  C IK
Subjt:  SDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCHNNGRNSKSSDWAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIK

Query:  ILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSR
         +  P+  RC        + ++L S   G R YTSG ++LQ  +SR
Subjt:  ILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGACCAGATTTTGCAGATTCCATGACTCTGTCAAGCCATTTTGGTTTAGTTTCTTCTTCTTCTTCCTCCTGCTCGGCCATGGATTTCCAGTGGTAGTAGAATCTGC
AGAAGGTGATACCCCAGTATCCGACCTTTTGAGTCGAGATGATTGGAGGCAGATGGCTGGATATGGTGAGGAGAGACTGTCCACAGTTCTGGTCACAGGCTCTGTTCTTT
GTGAAGCTTGTTTGCATGGTGATGAACCTCAACTTCATTCATGGCCTATTTCAGGTGCCATGGTGGGTGTGAGCTGCCACAACAACGGAAGAAATAGCAAATCTTCTGAT
TGGGCACATGGAGTCACTGATGAATTTGGAGACTTCATTATTGATATTCCATCCCATCTTCATGCAACACGCAGCTTTGAAAAGGTTTGTTCAATCAAGATTCTTCAGAC
TCCGAAGAACGCACGCTGCCGACCTGCTCATTTCGCTGGTCGGGAGCAGCTGCAATTGTCATCGTTTGGAGGTGGCATCCGTACATATACTTCTGGCGCTCTCAAGCTGC
AGCACCGGACATCTCGACCCCTGCAAGGTTGTATAAACAAGGGAAGTGGTGACAAACAGACCTTATGGTAG
mRNA sequenceShow/hide mRNA sequence
TCACCAATCTGTACTGTACATAACTTAATTGATTAAATTATATATCATCGATCGAAAAATTAGAGGTTGGAATCCTCACTTCAACATATAAGTCAACGTCAAATCCAAAA
AGGGTATTTCTGTAAATTCAGCATAGGGAGAAACTACAAGTCATCTCCTATGGGATGCATTATATAAACTAATAATCTGTCTCTCAAATGGTGTCGATTGATTTGTGTGA
TCATATTGATAAAGTTCAGCTCCCCACAGACAAAGCAAGAAGATGATGACCAGATTTTGCAGATTCCATGACTCTGTCAAGCCATTTTGGTTTAGTTTCTTCTTCTTCTT
CCTCCTGCTCGGCCATGGATTTCCAGTGGTAGTAGAATCTGCAGAAGGTGATACCCCAGTATCCGACCTTTTGAGTCGAGATGATTGGAGGCAGATGGCTGGATATGGTG
AGGAGAGACTGTCCACAGTTCTGGTCACAGGCTCTGTTCTTTGTGAAGCTTGTTTGCATGGTGATGAACCTCAACTTCATTCATGGCCTATTTCAGGTGCCATGGTGGGT
GTGAGCTGCCACAACAACGGAAGAAATAGCAAATCTTCTGATTGGGCACATGGAGTCACTGATGAATTTGGAGACTTCATTATTGATATTCCATCCCATCTTCATGCAAC
ACGCAGCTTTGAAAAGGTTTGTTCAATCAAGATTCTTCAGACTCCGAAGAACGCACGCTGCCGACCTGCTCATTTCGCTGGTCGGGAGCAGCTGCAATTGTCATCGTTTG
GAGGTGGCATCCGTACATATACTTCTGGCGCTCTCAAGCTGCAGCACCGGACATCTCGACCCCTGCAAGGTTGTATAAACAAGGGAAGTGGTGACAAACAGACCTTATGG
TAGACAGAAACCTAGTCAAGTGAATGAATTTGGTGCAGACCATACATGGCTCATGTAGATGGCGCTGAAGTCACAACAATGAGTTATTGGCTCGTAGCCAGATCGATAAA
TCGAGCTTGCATAAATATGTATTGTTTATGTGTATTAACACCTCTAGATAATGTAATACAAAAATTTATATCATACCATACACGATGGATGCTTTGACTGCATTGTGTCA
TATTCCACAGTTGAATGTAGTCACAAGGTTGCAATTGTATTCATCAAAAAAACAGTTAGGAGCCAAAGAGATCATATGTTCTTTAACTATCATTGCATAGCATGACAGAT
AATCTAATCCCAACACAAAAAGATTAGGCAATATGACAACCAAAAAGACATTAGCTAATGGGCGTTACACGATGATAAATGACGAGAAACTGCTTATTGGTTCTCCTCTC
TCTCTTCCTCAACTTTAGAAGTTATGTATCGTGCTACATCGGCACAGCAGGTGAGCTTATCTGCCTTTTCATCTGGGATTTCGATGGAGAATTCTTGTTCGAAAGCCATA
ACAAGCTCCACCCTGTCCAAGCTGTCCAGGCTCAAATCCTTCTGAAAATCAGCCGTTTCAGTAACCTGCTCGGCTCAACACAACATTCAACAGTAAGAACTATTTTCTAT
TAAGAGGACAAATATACACTTTTGCATTTAACAGGATAGGATAAATATGAGAATCTACAAAAATCCCGAAAAGTGATATAAAAGTTCAAAGATAAGTTGTAGGATGGCCT
TGGACTAAGTACCTTTGAGGCATCAATTCTATCGAACTTCTTAACCAATCCAATCACTCGGTCTACTATTTGATCAGAGCTTGTACCTTTTGACGTGCACAAATGGCGAG
TCAGCTTCTCCATTACAGTCTCATCCTTCATAAACAACCATGTCGTAGTATTAGGTATTCTGATCCTCATGTGCTTCAGGATTGAATTACTGATGCTTTGCATTTTTTGT
CTATAGGAGCTCAAATGAACAAGTTCCAATTCCAATCATCAGTTCAGGTAACAAAGAATTGAAAAGTTTTGAAAATAATAGACAGGCATGCAGACAAAATGAGTTTGTGG
AGGGACTTATTATCAAGATCCTAACTTTTTTTTTTTTTGAAATCCGTGTGTCCGGGCCAGCTTGCACGCACCTTGACTAATCACGAGCTAACCGCCTGACCAAACATGCA
ACTTCAAGAAATTACTGAAACACATTACAATACCAGAGCCCTTGAAAGGAAATACATACATTCCTATTGAAAATAATGTGTAGTAACTTCCACGGGGATAATTACATACA
ATAGGAAGAATGAAAGTTGTTTTGTCGTACCAAAAATCTAATCATGATAACTTTTCAAATATTAACCAATCGCATTATCAGCAAGTTTCAAGGAAGATCGAGATAAAATT
ATGTGCTCAACGAGGAGGGAAATGAATTTCATTAGAAAAATGCTATACAATCAAATCTATGTTTGCGCCAGAAAGCTAAAAGATATCAAGCTCTATAAACAGCTGGCCAC
AAATTTGTTAGTAAAATCAAGAAATTTAAGGTCATATACAACTAGAATGCATGACAATTTGAATTGATCGAAAAGGAACGACTTCCCATTCATAACAAGATAGGTACAGA
GACTGAAGTCATCAACCAAAATGGAAAAAATTTAAATAAGGAAATTAAAACAGTAGGTAAATGGGTAAGAGAAAAGTTCGTAGGAGGCTATGTAAAATGAATTGCCAGGA
CGTCCTACACTAGGCATTGCATTGTAGGCTAAATAGCATTTAAAAGAATGATAATGTTCTCAATAAGAAACAAGAATATATTTTACTATAAAATAAAGATTAATGTACAA
AGTTGAAAGGGACAGGGCATCCTCCAGGAGAGACAAATTACTTCGGAAGTCTATAACATATTGAACAAAATGGAAAAAGAGATGAAATAAAGCGGCAACTTGTTTGTGCT
ACCACATAAGCAAAAATTAATATAAAAATTTGATTTTAAAAACTACGTATTCATTGCCATAAAATCACTTACCCCCGTCAACTTTTTGACTATAGCACTAACAAAGTGAT
GATTCCAAGGATCATCTAAAAGAAACTATTTAGAACTCAGCTTTCCAAAGTATTTAAGAAAATTTTTGTTTCATTTGGATTTTCTTCAAAACACTTGTTATAAAACATTG
TATTATCAGAAATATGTAAGTAGTTCTTTTAACTTCTTGTTGTGTATTTTTTTTTTTTTATCTTTATCTCATATATTAAATTAAAAATGTAAAAATTCAAATCGGTAAAA
GACACCTAAACTGTGTTCATGTCCTAATTCTTTTCTACGGTTATTTTGTCCATCTTATATCATGTTCATTTCCACTGTTATGAGTCCATGTTCGTTAGCCCCCAGCTGTC
CCTAGCTGCTAATTAACTACAATAGCTAGCATTTATCAGCAATCTTAAATCTCAAATATGTGCACAGATGTGTTCCTAATCTTGTTCAAGCAGAACCAACAGAAGGGTTT
ATTTTCCAGCCGCATTTGGTTCTTTTTCTTGGTCATCAAATCTATACCAAGGTTCCTTAGTGGAACCCTTGAATGCTATACATAAACAAGGAGTTCCATAGTTACTTTAA
GGCACTTTCTTTGGTTTTTTAGTGCTATCTCCAGGCATTGATTAATTTATTGGCTGGCAAGAAGAAACAGGTTAAGAACCAACCAGAGATAGGCTTCAGCAATTCGAGTG
CATTAGGAATGCTTTAATGTTTTTTCCAGAGGAGGATTCTTTTGGATTTGAGGCCTTTTCTCTAGGGGTGTTCTTTTGTCTAGCTATTTTTTTTGTATGTCCTTGCATAT
TTTTTCATTTCTCTCAAATGAAAGCTCGGTTTCTTATCTAAAAGAATTTGTTTGCAACGTTTTGTTCGTAGAACAATTAGTCGACCATTTTCTTTTCTTTTGATATCCTG
GTATCTAGCAGTACTGCACTGGACTGGAGAGACTTTGCAATATCCTAGATAATAGTTTTCTCTCATTAAGACACTATTTGGTAACAATTTT
Protein sequenceShow/hide protein sequence
MMTRFCRFHDSVKPFWFSFFFFFLLLGHGFPVVVESAEGDTPVSDLLSRDDWRQMAGYGEERLSTVLVTGSVLCEACLHGDEPQLHSWPISGAMVGVSCHNNGRNSKSSD
WAHGVTDEFGDFIIDIPSHLHATRSFEKVCSIKILQTPKNARCRPAHFAGREQLQLSSFGGGIRTYTSGALKLQHRTSRPLQGCINKGSGDKQTLW