CuGenDBv2

MC03g0848 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g0848
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of unknown function (DUF1218)
Genome location: MC03:15217617..15220825
RNA-Seq Expression: MC03g0848
Synteny: MC03g0848
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
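The genome location field packs a chromosome name and two coordinates into one string. A minimal sketch of splitting it apart follows; the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(loc):
    """Split a 'CHROM:START..END' string into its parts.

    Returns (chrom, start, end, span). The span assumes 1-based,
    inclusive coordinates (an assumption, not stated in the record).
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("MC03:15217617..15220825")
print(chrom, start, end, span)  # MC03 15217617 15220825 3209
```

Under that assumption the gene spans 3209 bp on MC03.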


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573220.1 hypothetical protein SDJN03_27107, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.04e-97, % identity: 83.24)
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MA+NYGFLVCILVMVMDAVAG+L IRAEKAQN+V LQS S+WVYECSRKPRDDAFSQGLA TILLGLAHAIAKVLG CI IR+ QHFQ+S+AN+RLGLLF
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP-HV
        MILSWITLAIGFSML+AGTVDNS  KNSCEISS GLFL GGIVCF HGL TVAYYVSATAA REE+RK  E  S P HV
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP-HV

XP_022139993.1 uncharacterized protein LOC111010766 [Momordica charantia] (e value: 4.80e-157, % identity: 99.55)
Query:  MIPFIYVCNVASLVFLYPIFPPSNSELPNLFLSIPNLGESSESAAEMAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDA
        MIPFIYVCNVASLVFLYPIFPPSNSELPNLFLSIPNLGESSESAAEMAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDA
Subjt:  MIPFIYVCNVASLVFLYPIFPPSNSELPNLFLSIPNLGESSESAAEMAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDA

Query:  FSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAY
        FSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGL FMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAY
Subjt:  FSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAY

Query:  YVSATAAKREEQRKRNENVSSPHV
        YVSATAAKREEQRKRNENVSSPHV
Subjt:  YVSATAAKREEQRKRNENVSSPHV
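The % identity column can be related to the alignment midline, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below counts identities in a midline string; the function name `percent_identity` is hypothetical, and it assumes the midline is padded to the full alignment width (trailing spaces not stripped). The 223-of-224 figure is a reading of the alignment above, not a value reported by the database.

```python
def percent_identity(midline):
    """Return (identical, aligned_length, percent) for a BLAST-style midline.

    Letters count as identities; '+' and spaces do not.
    Assumes the midline has the same length as the aligned region.
    """
    identical = sum(ch.isalpha() for ch in midline)
    length = len(midline)
    return identical, length, round(100 * identical / length, 2)


# Short excerpt of the midline above, containing its single mismatch.
print(percent_identity("ANKRLGL FMILSW"))  # (13, 14, 92.86)

# Over the whole alignment, 223 identities in 224 positions
# would give the reported value.
print(round(100 * 223 / 224, 2))  # 99.55
```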

XP_022954638.1 uncharacterized protein LOC111456841 [Cucurbita moschata] (e value: 1.18e-99, % identity: 83.71)
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MA+NYGFLVCILVMVMDAVAG+L IRAEKAQN+V LQS S+WVYECSRKPRDDAFSQGLA TILLGLAHAIAKVLG CI IR+ QHFQ+S+AN+RLGLLF
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV
        MILSWITLAIGFSML+AGTVDNS  KNSC+ISS GLFL GGIVCF HGLCTVAYYVSATAA REE+RK  E  S PHV
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV

XP_023542772.1 uncharacterized protein LOC111802580 [Cucurbita pepo subsp. pepo] (e value: 4.94e-99, % identity: 84.36)
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MAQNYGFLVCILVMVMDAVAG+L IRAEKAQN+V LQS S+WVYECSRKPRDDAFSQGLA TILLGLAHAIAKVLG CI IR+ QHFQ+S+AN+RLGLLF
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP-HV
        MILSWITLAIGFSML+AGTVDNS  KNSCEISS GLFL GGIVCF HGLCTVAYYVSATAA REE+RK  E  S P HV
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP-HV

XP_038894860.1 uncharacterized protein LOC120083260 [Benincasa hispida] (e value: 5.80e-102, % identity: 84.36)
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MA+NYGFLVCILVMV+DAVAGILGI+AEKAQNRVVL+SVS+WV  CSRKPRDDAFSQGLA TILLG+AH IAKVLG CICIR+KQHFQES+ANKRLGLLF
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP-HV
        MILSWITLAIG S+L+AGTVDNS WKNSCEISS GLFL GGIVCF HGLCTVAYYVSATAA REEQRK     S P HV
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP-HV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LRQ0 Uncharacterized protein (e value: 1.83e-94, % identity: 79.14)
Query:  FLSIPNLGESSESAAE--MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLG--RCIC
        F  I NL ESSESAAE  M +NYGFLVCILV+V+DAVAG+LGI AEKAQNRVVL+S+S+ + ECSRKPRDDAFS+GLA +ILLGLAH IAKVLG  +CIC
Subjt:  FLSIPNLGESSESAAE--MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLG--RCIC

Query:  IRSKQHFQESSANKRLGLLFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQR
        IR+KQ+ QE SAN+ LG LFMILSWITLAIGFS+LMA T+DNS WKNSCEISS GLFL GGIVCFFHGLCTVAYYVSATAA REEQR
Subjt:  IRSKQHFQESSANKRLGLLFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQR

A0A1S3B4I4 uncharacterized protein LOC103485708 (e value: 9.22e-85, % identity: 77.51)
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICI--RSKQHFQESSANKRLGL
        M +NYGFLVCILVMV+D VAG+LGI AEKAQNRVVL+S+S+ V ECSRKPRDDAFS+GLA  ILLGLAH IA VLG C CI   +KQ+ Q+ SAN+ LGL
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICI--RSKQHFQESSANKRLGL

Query:  LFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQR
         FMILSWITL IGFS+LMA T+DNS WKNSCEISS GLFL GGIVCF HGLCTVAYYVSATAA REEQR
Subjt:  LFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQR

A0A6J1CGZ6 uncharacterized protein LOC111010766 (e value: 2.32e-157, % identity: 99.55)
Query:  MIPFIYVCNVASLVFLYPIFPPSNSELPNLFLSIPNLGESSESAAEMAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDA
        MIPFIYVCNVASLVFLYPIFPPSNSELPNLFLSIPNLGESSESAAEMAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDA
Subjt:  MIPFIYVCNVASLVFLYPIFPPSNSELPNLFLSIPNLGESSESAAEMAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDA

Query:  FSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAY
        FSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGL FMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAY
Subjt:  FSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAY

Query:  YVSATAAKREEQRKRNENVSSPHV
        YVSATAAKREEQRKRNENVSSPHV
Subjt:  YVSATAAKREEQRKRNENVSSPHV

A0A6J1GRM8 uncharacterized protein LOC111456841 (e value: 5.69e-100, % identity: 83.71)
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MA+NYGFLVCILVMVMDAVAG+L IRAEKAQN+V LQS S+WVYECSRKPRDDAFSQGLA TILLGLAHAIAKVLG CI IR+ QHFQ+S+AN+RLGLLF
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV
        MILSWITLAIGFSML+AGTVDNS  KNSC+ISS GLFL GGIVCF HGLCTVAYYVSATAA REE+RK  E  S PHV
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSPHV

A0A6J1K198 uncharacterized protein LOC111490180 (e value: 1.60e-97, % identity: 82.68)
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MA+NYGFLVCILVMVMDAVAG+L IRAEKAQN+V LQS S+W YECSRKPRDDAFSQGLA TILLGLAHAIAKVLG CI IR+ QHFQ+S+AN+RLGLLF
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP-HV
        MILSWITLAIGFSML+AGTVDNS  KNSC+ISS GLFL GGIVCF HGLCTVAYYVSATAA REE+RK  E  S P HV
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVSSP-HV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G11500.1 Protein of unknown function (DUF1218) (e value: 2.0e-33, % identity: 42.22)
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVV----LQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRL
        M    GFLV ++++  D  A +LGI AE AQ++       Q        C R P D AF++G+A  +LL + H +A VLG C  IRSKQ F+ ++ANK L
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVV----LQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRL

Query:  GLLFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREE-QRKRNENVSS
         + F++LSWI   + +S LM GT+ NS     C +  R  FL GGI C  HG+ T AYYVSA AAK+E+ +  + EN+++
Subjt:  GLLFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREE-QRKRNENVSS

AT2G32280.1 Protein of unknown function (DUF1218) (e value: 4.8e-35, % identity: 43.4)
Query:  GFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSW
        G LVC++++ +D  A ILGI+AE AQN+V  + + +W++EC R+P  DAF  GL    +L +AH +  ++G C+CI S+  FQ SS+ +++ +  ++L+W
Subjt:  GFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSW

Query:  ITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKRE
        I  A+GF  ++ GT+ NS  ++SC  +       GGI+CF H L  VAYYVSATAAK E
Subjt:  ITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKRE

AT4G21310.1 Protein of unknown function (DUF1218) (e value: 1.1e-42, % identity: 52.12)
Query:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF
        MA+N GF +CIL++ MD  AGILGI AE AQN+V  + + +W++EC R P   AF  GLA  ILL LAH  A  LG C+C+ S+Q  ++SSANK+L +  
Subjt:  MAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLF

Query:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREE
        +I +WI LAI FSML+ GT+ NS  + +C IS   +   GGI+CF HGL  VAYY+SATA+ RE+
Subjt:  MILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREE

AT5G17210.1 Protein of unknown function (DUF1218) (e value: 1.5e-07, % identity: 25.15)
Query:  VCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSWITL
        V  L+ ++ AV   +   A + +   V  +VS  + +C+  PR  AF+ G    + L +A  I  V   C C R       S +N  + L+  ++SW T 
Subjt:  VCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSWITL

Query:  AIGFSMLMAGTVDNSNWKNS--------CEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKR
         I F +L++G   N              C I   G+F  G ++        + YY+  T+ K+
Subjt:  AIGFSMLMAGTVDNSNWKNS--------CEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKR

AT5G17210.2 Protein of unknown function (DUF1218) (e value: 5.6e-07, % identity: 26.09)
Query:  VVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSWITLAIGFSMLMAGTVDNSNWKNS-----
        +V  +VS  + +C+  PR  AF+ G    + L +A  I  V   C C R       S +N  + L+  ++SW T  I F +L++G   N           
Subjt:  VVLQSVSVWVYECSRKPRDDAFSQGLAGTILLGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSWITLAIGFSMLMAGTVDNSNWKNS-----

Query:  ---CEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKR
           C I   G+F  G ++        + YY+  T+ K+
Subjt:  ---CEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKR


Sequences
CDS sequence
ATGATTCCTTTCATATATGTATGTAATGTTGCAAGCCTGGTTTTCCTTTATCCCATCTTCCCTCCCTCCAATTCAGAGCTCCCAAACCTTTTTCTCTCAATTCCAAATCT
TGGGGAGAGCTCTGAATCTGCAGCAGAAATGGCGCAAAACTATGGCTTTCTGGTCTGTATCTTGGTCATGGTAATGGACGCTGTTGCTGGGATACTTGGAATTCGAGCGG
AAAAGGCTCAGAATCGGGTGGTACTACAATCGGTGAGCGTATGGGTGTATGAATGCAGCAGAAAGCCGAGAGATGATGCTTTTAGCCAGGGACTGGCCGGTACAATTCTG
CTTGGCCTTGCTCATGCCATTGCTAAAGTACTTGGTAGGTGCATTTGCATTCGGTCTAAGCAACATTTTCAAGAATCATCTGCTAACAAGCGATTGGGATTGCTCTTCAT
GATTCTCTCATGGATTACATTGGCTATTGGGTTCTCAATGTTGATGGCGGGGACGGTGGACAATTCCAACTGGAAGAACTCTTGCGAGATATCAAGCCGTGGCCTATTCT
TGGCAGGTGGGATTGTGTGTTTCTTTCATGGGCTCTGTACTGTCGCTTATTATGTTTCTGCAACAGCAGCTAAGAGAGAAGAACAGAGGAAACGCAATGAAAATGTTTCT
TCTCCACATGTTTAA
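As a quick consistency check, the CDS above can be translated with the standard genetic code and compared against the protein sequence in this record. The sketch below is illustrative only: the names `CODON_TABLE` and `translate` are hypothetical, the standard nuclear code is assumed, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGATTCCTTTCATATATGTATGTAATGTT"  # first 30 nt of the CDS
cds_end = "TCTCCACATGTTTAA"                   # last 15 nt of the CDS

print(translate(cds_start))  # MIPFIYVCNV
print(translate(cds_end))    # SPHV*
```

Both fragments agree with the start (MIPFIYVCNV) and end (SPHV) of the protein sequence listed for this gene, with the final TAA codon acting as the stop.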
mRNA sequence
CGGATTTCTTTTCGGAGCTCTGGTTTTTCAGGACATGAAAAGAAGAAGATGCTTACGGTTACAATCTGGCGCCGCCGATTTAAAAGAATCCGGAAACGTTTGAAAATTTC
CTCGGATTCTCTGCGTTTCTTGACGACCATGAAAAGCATCTGCATCAAAATAAAAAGAACGAATAAGGAGAACTGTAGAAAGGAAAGAAGCTTTCCTGTAAAATTCGTCA
TGGTAGATGATTCCTTTCATATATGTATGTAATGTTGCAAGCCTGGTTTTCCTTTATCCCATCTTCCCTCCCTCCAATTCAGAGCTCCCAAACCTTTTTCTCTCAATTCC
AAATCTTGGGGAGAGCTCTGAATCTGCAGCAGAAATGGCGCAAAACTATGGCTTTCTGGTCTGTATCTTGGTCATGGTAATGGACGCTGTTGCTGGGATACTTGGAATTC
GAGCGGAAAAGGCTCAGAATCGGGTGGTACTACAATCGGTGAGCGTATGGGTGTATGAATGCAGCAGAAAGCCGAGAGATGATGCTTTTAGCCAGGGACTGGCCGGTACA
ATTCTGCTTGGCCTTGCTCATGCCATTGCTAAAGTACTTGGTAGGTGCATTTGCATTCGGTCTAAGCAACATTTTCAAGAATCATCTGCTAACAAGCGATTGGGATTGCT
CTTCATGATTCTCTCATGGATTACATTGGCTATTGGGTTCTCAATGTTGATGGCGGGGACGGTGGACAATTCCAACTGGAAGAACTCTTGCGAGATATCAAGCCGTGGCC
TATTCTTGGCAGGTGGGATTGTGTGTTTCTTTCATGGGCTCTGTACTGTCGCTTATTATGTTTCTGCAACAGCAGCTAAGAGAGAAGAACAGAGGAAACGCAATGAAAAT
GTTTCTTCTCCACATGTTTAACCCAGCTGACAGCCGCTGCCTGTAATGACAATATTAGTCTCTAATTTATTAGTTATTTAAAGAAATTTTGAGTCCACAGCTTAGTTCAC
ATTGAAAACATCTCTATTGTTTTCTTCTCTCACACCAACGTAGTAACACTAAGTACGCCTTTTCCATTATTCTTTTTGGGTTTGGAACTTCTGTTTAGTACAGAATCCGT
AGAGAATGCATTGGTTAAATGTGCACCGAATCACGTTCAAACCCAATTGGTAGAGAACATAAAACATATGAAGTCTTAAAATGCTTGCGGCAAGAACTTTTTGTTCTGTA
TCTGATTATGCATTGGAGAGTGCTTAAAGAAGACTGAAATAGTATAAGATAATCTCACAATTCCACCCACAGGAAGATTGGGCGAGACTAGTAAACCCAGTTAACGATTG
AGAAATCATTACACAATTGGTAATGGACTAGAGACAATTTCACCATGATCTATAGATCCAAGTGAATTTCTTCTTAGTTTGATTGTTATGATCATGCTTTGGGTTTATTT
CTTAGACTAAGCATCTAATTGTGGCTTCAACTTAGGAAAATATATTTTCTAACTATCACTGCATTGGTATTAATGTATTCCAAATGTAACTAGTCCATATGGTAGGGCAA
CTGACCAAAATAAGCAATGCAATAAGAAGTTGAAAAGAGATGAGTTGAGGGTTGAAACCAAATTTCCGTTGATTATAACATTGGTTCAAACTGGAGGGTCTCGCATATGA
TCTTCACAAATAATCTTACATCACAGTAACAGAAAACGCTCCTCCTTCTAACCGAGCTTGGCCAGGACATCGTTGAGGGTAGACACGGTCTGAGCATAATACTTCTCTGC
TTCTGAGCTGCTCTTAATCTTTGCTGCATAGTCCAACTGCAAGTTTCCAATTGACCACAGAAAAAGCAAACGTCAGGAACAAGAACAGTAGATAACACTCACCTTGTTGA
TGAAATAACACTCACCTTGTTGATGTCTTGGAAGAGCTTTCCCGTGAGGCTCTTAAGTTCTTCCTTCTCTCCCTTGGGCTTGGCAGAAATGATTGTCTTGAGATCATACC
GGAGATACCCCGCCCTGAGACGGAGGTCGTCTCGAACGAACGGCCAAGCCTTCTTGTCTATCGACCCCTTTACATTAATGATGTCCTTGGCTGACTCCTTCGCCCTTGCC
ACCGCCAACTCTGGACTTTGTGGCTGCAGAAAGAACCTGTCTTTCAGTGGCAGGTCCAGGTCTCTTGGCTCATCCGAGTTAAGAGTCCCAGCTGCCAAATTCATCATTCC
AATCATCAAATCAAGAACAGAAATAAGATAGATAGATATAAGTATTTGAATTGAATTCAATATATAATATGGTCACTTACGGAGTCCGCCGGAAGGAGGGGGCGGGGGGC
CGACCTTGATGGGCTTAGCTTCAGCAAGAACGGCTTGAACAAAAGAGCCGGAGGCGAGACCGGCGGCGACCAGGCAGAGAACGGCCCTACGGCTGGTCTCTGGCTCTGCA
GCCACTTGCTGGGCCCTGATACTCAACCCAGAGCGGGGGACGGAGGCCCTGCTGCTGCCGGCTACACTCAAGCGGCTACAGCCACTGATTTGAAGGCTACCCTCCAACAC
AGCCTGAGAGCAACCACACAGCCCTGCCATTGATGCCATAGCTACGATTTTTGCAATCGGTTTTCCAAGTTAAAGGCTTTTTCTAGTGGGAAATCGAGAATTATATTATT
GCTGCAGAGTTTATGCTGAGATCTCTATTTGTTTATGTGACGCAGAGGCGACGGGGGCGCCCACCTTATCTCCCCATTTTCTACTTGGATAACTTGGATATGGTTTCAAC
CAATTGCATTTCAACACGTGTTAAGTGATTCTTCTTCATTGGACAAGTTGGATAGGTTCCAGCGTGGGCTACTAGAAGTTCGCCACGTGGAGACGTTGCGAGATCATGAT
ATTTCTAACATCCCGTAGTTGTCTCCGGTTTCATCTCCTCGGTGTGTTACTATGGCCA
Protein sequence
MIPFIYVCNVASLVFLYPIFPPSNSELPNLFLSIPNLGESSESAAEMAQNYGFLVCILVMVMDAVAGILGIRAEKAQNRVVLQSVSVWVYECSRKPRDDAFSQGLAGTIL
LGLAHAIAKVLGRCICIRSKQHFQESSANKRLGLLFMILSWITLAIGFSMLMAGTVDNSNWKNSCEISSRGLFLAGGIVCFFHGLCTVAYYVSATAAKREEQRKRNENVS
SPHV