CuGenDBv2

MC10g0958 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC10g0958
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Unknown protein
Genome location: MC10:8453578..8457946
RNA-Seq Expression: MC10g0958
Synteny: MC10g0958
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
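
The genome location field above can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); the function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    seqid, start, end = parse_location("MC10:8453578..8457946")
    # Prints: MC10 8453578 8457946 4369
    print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 4369 bp on MC10. If the coordinates were 0-based and half-open instead, the length would be `end - start`.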


Homology
GenBank top hits | e value | % identity
XP_004149291.1 uncharacterized protein LOC101205846 [Cucumis sativus] | 5.58e-138 | 82.93
Query:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRG-LIVARFNQGFGFNGGGGGGG-----G
        MTS IRFSTFNILH++FC KPT+F+PLP  KV FC  +G RLR YGWRL F GG   ++ +FGK  GFKG+RG LIVARFNQGFGFNGGGG GG     G
Subjt:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRG-LIVARFNQGFGFNGGGGGGG-----G

Query:  DDGATARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKF
        DDGATAR++GNIALA GLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQ  CPNCGN+FQIFKSTLNEELQLCPFCSQPFSVVDDKF
Subjt:  DDGATARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKF

Query:  VRDSVKFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD
        VRDSVKFSNKTSSTFGQAFS+FTSP+KGKETS AVVDIEAEVKDVD
Subjt:  VRDSVKFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD

XP_022142897.1 uncharacterized protein LOC111012900 [Momordica charantia] | 9.32e-172 | 100
Query:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGATA
        MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGATA
Subjt:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGATA

Query:  RILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVK
        RILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVK
Subjt:  RILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVK

Query:  FSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD
        FSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD
Subjt:  FSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD

XP_022985259.1 uncharacterized protein LOC111483302 [Cucurbita maxima] | 1.99e-139 | 84.65
Query:  MTSIIRFSTFNIL-HDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGAT
        MTS IRFS+FN+L HD+F KKPTRF+PLP  KV  C+RNGFRLR  G +L FL G   N+G+ GK   FKG+RGLIVARFNQGFGFNGGGGGGGGDDGAT
Subjt:  MTSIIRFSTFNIL-HDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGAT

Query:  ARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
        AR+LGNIALA GLTYLSVTGQLGW+LDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQ TCPNCGN+FQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
Subjt:  ARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV

Query:  KFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD
         FSNKTSSTFGQAFS+FTSP+KGKETS AVVDIEAEVKDVD
Subjt:  KFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD

XP_023552443.1 uncharacterized protein LOC111810103 [Cucurbita pepo subsp. pepo] | 1.15e-138 | 84.65
Query:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGG-GGGGGDDGAT
        MTS IRFS+FN+LHD+F  KPTRF+PLP  KV  C+RNGFRLR  G +L FL G  GN G+ GK   FKGRRGLIVARF+QGFGFNGGG GGGGGDDGAT
Subjt:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGG-GGGGGDDGAT

Query:  ARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
        AR+LGNIALA GLTYLSVTGQLGW+LDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQ TCPNCGN+FQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
Subjt:  ARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV

Query:  KFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD
         FSNKTSSTFGQAFS+FTSP+KGKETS AVVDIEAEVKDVD
Subjt:  KFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD

XP_038905451.1 uncharacterized protein LOC120091477 isoform X2 [Benincasa hispida] | 1.70e-140 | 86.31
Query:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGG-DDGAT
        MTS IRFST NILHD FC KPT+F+PLPA KV   +RNGFRLR YG RL+FL    GN  +FGK  G KG+RGLIVARFNQGFGFNGGGGGGGG DDGAT
Subjt:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGG-DDGAT

Query:  ARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
        AR+LGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQ TCPNCGN+FQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
Subjt:  ARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV

Query:  KFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD
        KFSNKTSSTFGQAF +FTSP+KGKETS AVVDIEAEVKDVD
Subjt:  KFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD

TrEMBL top hits | e value | % identity
A0A0A0L5A4 Uncharacterized protein | 2.70e-138 | 82.93
Query:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRG-LIVARFNQGFGFNGGGGGGG-----G
        MTS IRFSTFNILH++FC KPT+F+PLP  KV FC  +G RLR YGWRL F GG   ++ +FGK  GFKG+RG LIVARFNQGFGFNGGGG GG     G
Subjt:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRG-LIVARFNQGFGFNGGGGGGG-----G

Query:  DDGATARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKF
        DDGATAR++GNIALA GLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQ  CPNCGN+FQIFKSTLNEELQLCPFCSQPFSVVDDKF
Subjt:  DDGATARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKF

Query:  VRDSVKFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD
        VRDSVKFSNKTSSTFGQAFS+FTSP+KGKETS AVVDIEAEVKDVD
Subjt:  VRDSVKFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD

A0A5A7TVG5 Uncharacterized protein | 6.95e-135 | 83.4
Query:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRG-LIVARFNQGFGFNGGGGGGGGDDGAT
        MTS IRFSTFNILH++FC KPT+F+PLP  KV  CN +G RLR YG RL F GG    + +FGK  GFKG+RG LIVARFNQGFGFNGGGG GG DDGAT
Subjt:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRG-LIVARFNQGFGFNGGGGGGGGDDGAT

Query:  ARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
        AR+LGNIALA GLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQ  CPNCGN+FQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
Subjt:  ARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV

Query:  KFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD
         FS KTSSTFGQAFS+FTSP+KGKETS AVVDIEAEVKDVD
Subjt:  KFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD

A0A6J1CP68 uncharacterized protein LOC111012900 | 4.51e-172 | 100
Query:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGATA
        MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGATA
Subjt:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGATA

Query:  RILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVK
        RILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVK
Subjt:  RILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVK

Query:  FSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD
        FSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD
Subjt:  FSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD

A0A6J1EMN9 uncharacterized protein LOC111435981 | 4.56e-138 | 84.23
Query:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGG-GGGGGDDGAT
        MTS IRFS+FN+LHD+F  KPTRF+PLP  KV  C+RNGFRLR    +L FL G  GN G+ GK   FKG+RGLIVARFNQGFGFNGGG GGGGGDDGAT
Subjt:  MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGG-GGGGGDDGAT

Query:  ARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
        AR+LGNIALA GLTYLSVTGQLGW+LDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQ TCPNCGN+FQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
Subjt:  ARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV

Query:  KFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD
         FSNKTSSTFGQAFS+FTSP+KGKETS AVVDIEAEVKDVD
Subjt:  KFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD

A0A6J1JCT5 uncharacterized protein LOC111483302 | 9.61e-140 | 84.65
Query:  MTSIIRFSTFNIL-HDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGAT
        MTS IRFS+FN+L HD+F KKPTRF+PLP  KV  C+RNGFRLR  G +L FL G   N+G+ GK   FKG+RGLIVARFNQGFGFNGGGGGGGGDDGAT
Subjt:  MTSIIRFSTFNIL-HDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGAT

Query:  ARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
        AR+LGNIALA GLTYLSVTGQLGW+LDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQ TCPNCGN+FQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV
Subjt:  ARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSV

Query:  KFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD
         FSNKTSSTFGQAFS+FTSP+KGKETS AVVDIEAEVKDVD
Subjt:  KFSNKTSSTFGQAFSNFTSPKKGKETSAAVVDIEAEVKDVD

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT4G27390.1 unknown protein | 1.5e-55 | 60.54
Query:  GNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGATARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQ
        G+ G+   +     +R ++  R   G  FNGG   G G      RILGN+ALA+GLTYLS+TGQLGW+LDAIVS+WL+ V+VPI+G+ AF WWA RDIVQ
Subjt:  GNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGATARILGNIALAVGLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQ

Query:  GTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVKFSNKTSSTFGQAFSNFTS-PKKGKETSAAVVDIEAEVKDVD
          CPNCGN+FQIFKS +++E+QLCPFC+QPFSVVDDKFV++ VKFSN+T++ FGQ  + F+S PKKGK +S AVVDIEAEV D D
Subjt:  GTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVKFSNKTSSTFGQAFSNFTS-PKKGKETSAAVVDIEAEVKDVD


Sequences
CDS sequence
ATGACAAGTATCATCAGATTCTCTACCTTCAACATCCTGCACGATCATTTCTGCAAGAAGCCCACCAGATTCAGTCCTCTTCCGGCTCGTAAGGTTGGCTTCTGCAATCG
TAATGGGTTCCGCTTGAGGGCTTACGGATGGAGATTGACATTTCTGGGCGGGAGTTGTGGTAACGAAGGGATTTTTGGAAAGGATTTAGGGTTTAAGGGAAGGAGAGGGT
TAATTGTGGCAAGGTTTAATCAGGGTTTTGGGTTTAACGGTGGCGGCGGTGGTGGCGGTGGAGACGATGGGGCAACCGCAAGGATTTTGGGTAATATTGCTTTGGCTGTT
GGGTTAACTTATCTTTCGGTAACAGGGCAGCTTGGCTGGGTTTTGGATGCCATTGTTTCCATTTGGCTTGTTGCAGTTCTCGTACCAATTGTCGGAGTGGCCGCTTTTAT
CTGGTGGGCGGGACGAGATATAGTCCAAGGCACTTGCCCTAACTGTGGAAATGATTTTCAAATATTCAAATCAACTCTCAATGAAGAGCTGCAACTGTGCCCTTTCTGTA
GCCAACCTTTCTCTGTGGTAGATGACAAGTTTGTGAGGGATTCTGTGAAGTTCTCCAACAAAACTTCTTCCACCTTTGGACAGGCTTTCAGTAACTTCACTTCTCCTAAA
AAAGGGAAGGAAACCTCTGCAGCAGTGGTTGACATAGAAGCAGAAGTAAAAGATGTAGACTGA
mRNA sequence
ATGGAATGGACCAACAAGAGCCACTGGATAACGAGGCAGCCGTATACGGAACAGCAAAGAAATACGCCCACCTCACTCTCACACCAGATAATCCACCATGAATTTCTTCT
GGAGAGACAATTGGGTTCCGGGGTAATCCGCACGAGAGATTAGATTTTGATCAAAACCCATTAAGAAAAAGCTGATTCGTGACATTTCTTTAATTCCCAAATGACAAGTA
TCATCAGATTCTCTACCTTCAACATCCTGCACGATCATTTCTGCAAGAAGCCCACCAGATTCAGTCCTCTTCCGGCTCGTAAGGTTGGCTTCTGCAATCGTAATGGGTTC
CGCTTGAGGGCTTACGGATGGAGATTGACATTTCTGGGCGGGAGTTGTGGTAACGAAGGGATTTTTGGAAAGGATTTAGGGTTTAAGGGAAGGAGAGGGTTAATTGTGGC
AAGGTTTAATCAGGGTTTTGGGTTTAACGGTGGCGGCGGTGGTGGCGGTGGAGACGATGGGGCAACCGCAAGGATTTTGGGTAATATTGCTTTGGCTGTTGGGTTAACTT
ATCTTTCGGTAACAGGGCAGCTTGGCTGGGTTTTGGATGCCATTGTTTCCATTTGGCTTGTTGCAGTTCTCGTACCAATTGTCGGAGTGGCCGCTTTTATCTGGTGGGCG
GGACGAGATATAGTCCAAGGCACTTGCCCTAACTGTGGAAATGATTTTCAAATATTCAAATCAACTCTCAATGAAGAGCTGCAACTGTGCCCTTTCTGTAGCCAACCTTT
CTCTGTGGTAGATGACAAGTTTGTGAGGGATTCTGTGAAGTTCTCCAACAAAACTTCTTCCACCTTTGGACAGGCTTTCAGTAACTTCACTTCTCCTAAAAAAGGGAAGG
AAACCTCTGCAGCAGTGGTTGACATAGAAGCAGAAGTAAAAGATGTAGACTGAAGTCTTACCGAGTTACTGGCCGTTTCACTCTGGAAGCATACCTCAGAAGGCCAACTA
ATATTGGGAAGAATCAGTTGGTTCAATTGGCTTTGCCAGTTTAAAGTAAGTTTTGACACTGCAAATCTTTGTATATTTGTGTTATCTTTCATAGCCAGAAATTTCCAATT
CTGAAACAACTGTGCCTCCACTTTGCATACCTTTTTTTTTTATATATACAAATATGTCTTATGTACAATCTTTACTCTGTAACTTATCAAGACTAATGGACTAGAACATG
AACTACTCTACTTGTTTTCCTCCACATATCGGAAAAGAGACGATTCTGGATTGTTTTATTTTTAGATACTTCCAGGAATGGTCTCGAAAACTTGAGAGATTAGCGTGCTC
CACCTCCAACTTTCTGCAGGAGAGAACAGAGGCAAGTTAGAGCGCCCATGTGTATGGCATTTAGCAGTAATCTGAAAAATAGAACACCATTGATGTTGTCGGAACAGAAC
ACATTTTCTGCAAACATGTTGGAACTTTACCGATCTTCATCGCGTAGCTAGTAGCTCCTAAAGCTAATAATCACATCCAATCAATGTTGATCTCAATATCCAATTATCCA
TAAAGAAGCCATACCTCTTCCTTCTTGCCTCTGGTGAAGAACTCGTAGCCCCCATAAAGAACTAAACCCCCTCCAGTCAAAGACACAGTCACAAACTGGAAAATGTACAT
ATATCTTTAGTAAGATGTGCAGTGATTCTCAAAGACAATGCAAGTATTCTCCATGTGTCTAATAATTATTTCGAAAATATTTCGAACGGCTGATTAATATCTTTAACATG
TATTTGGTGACACGTGGTGAAACAAGTGTTAGAGTGTCCGACACGTGTTAGACATGCAAACTAAACAGTTCGTGCTTCGTAGAAAGATACGAATTAATCAGAGTTCAAGT
GACATGAGTAGAATGAGTAACTAGCATTTGAAATTTAAACAGGGACGGATGATACTCAATTTCAAAAACTAAAAAAATGCTCACATGTTCTCTTTTCCATTTGGTTGGAT
TAAGAGGATCCTCCCAAAAGTTCACCTTCGGGGATCCATGATGTTCTGCAAGATTACAAAAGCAGAACAAGCAAATTACTCGCAGATTCAATTCAAGTTCATCAATTACT
TCACCACGAGAGTTTGATAACCAGATATGGCCAACATTTAAACATGATGAATCAGAAGTTCATGAAGCTGCTTGAATTTAGAAAAACTCATTCCGAGTTTGAATTCACAA
AAATGAATAGGTTATAAAAATCAGTTTATGTAACTTGTGTTAAATTCATAATATGGGATAAACATAGCAATCTCAGATACTATTGTAAACTTCGAATTACACGAGTCTAA
TTAGTGAGCGAATCATCCCTAAATGATCCAGTTTAGGCGCGAATTGCTCATCAGTTAGGGAAGGAGTAACGAGATAAGAAACAAACTATGAACATAGAAAGAGCGGGAGG
AAATACCGCCACCGCCGGCGAGACCGCGGTTTAGGGTTAGAGAGGCGATTTGAGGCGTAGAAACTGATTTGTGAGAGGAAACCATACGGCGAGCCGCGGCCGTCCACACC
GCCATAGTTCGGTGCTCTGCTCTTCGAGATTGTGCCCACAGAAGAGAGGACGGAAGCTTCCA
Protein sequence
MTSIIRFSTFNILHDHFCKKPTRFSPLPARKVGFCNRNGFRLRAYGWRLTFLGGSCGNEGIFGKDLGFKGRRGLIVARFNQGFGFNGGGGGGGGDDGATARILGNIALAV
GLTYLSVTGQLGWVLDAIVSIWLVAVLVPIVGVAAFIWWAGRDIVQGTCPNCGNDFQIFKSTLNEELQLCPFCSQPFSVVDDKFVRDSVKFSNKTSSTFGQAFSNFTSPK
KGKETSAAVVDIEAEVKDVD
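
The CDS and protein sequences in this record can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame; the helper names `translate` and `matches_protein` are hypothetical, and the short inputs in the usage lines are the first 12 and last 9 nucleotides of the CDS shown above.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS; stop codons are rendered as '*'.

    Whitespace (including line wraps) is removed first. Codons that
    contain characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def matches_protein(cds: str, protein: str) -> bool:
    """True if the translated CDS, minus one trailing stop, equals the protein."""
    translated = translate(cds)
    if translated.endswith("*"):
        translated = translated[:-1]
    return translated == "".join(protein.split()).upper()


if __name__ == "__main__":
    print(translate("ATGACAAGTATC"))  # MTSI (start of the protein above)
    print(translate("GTAGACTGA"))     # VD* (end of the protein plus stop)
```

Passing the full wrapped CDS and protein strings from this record to `matches_protein` would check the whole sequence in the same way.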