; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g1458 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g1458
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionFUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages;
Genome locationMC10:17817760..17825589
RNA-Seq ExpressionMC10g1458
SyntenyMC10g1458
Gene Ontology termsGO:0048767 - root hair elongation (biological process)
GO:0071816 - tail-anchored membrane protein insertion into ER membrane (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0043529 - GET complex (cellular component)
GO:0043621 - protein self-association (molecular function)
InterPro domainsIPR028945 - Get1 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448478.1 PREDICTED: uncharacterized protein LOC103490650 [Cucumis melo]1.89e-10085.39Show/hide
Query:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT
        ME EAEGIVEH SS AAP IF +VI FQFLARWLE  K+ GSN+ VE+ELRKSIKQLL+EAS LSQPSTFAQAAKLRRLAAAKEKELANYQESR+KE+KT
Subjt:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT

Query:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV
        SYGLYSQVLL SKV+IYI LV WFWRASVATVPHHLVQPFGK LSW+AGGTVNDYVKVG+IPWLILSTRVSK+VCQVV
Subjt:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV

XP_022145254.1 uncharacterized protein LOC111014752 [Momordica charantia]2.76e-118100Show/hide
Query:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT
        MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT
Subjt:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT

Query:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVVP
        SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVVP
Subjt:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVVP

XP_022923433.1 uncharacterized protein LOC111431128 [Cucurbita moschata]4.65e-10185.39Show/hide
Query:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT
        MEAEAEGIVEHRSS AAP IFLVVI FQFLARWLE  K+ GSN+ VEMELRKSIKQLL+EAS LSQPSTFAQAAKLRRLAAAKEKELANYQESR+KE+KT
Subjt:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT

Query:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV
        SYGLYS+VLL SKV +YIAL+ WFWR SVATVPHHLVQPFG++LSW+AGG VNDYVKVG+IPWLILSTRVSK+VCQVV
Subjt:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV

XP_023007643.1 uncharacterized protein LOC111500212 [Cucurbita maxima]1.17e-9984.75Show/hide
Query:  EAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKTS
        EAEAEGIVEHRSS AAP IFL+VI FQFLARWLE  K+ GSN+ VEMELRKSIKQLL+EAS LSQPSTFAQAAKLRRLAAAKEKELANYQESR+KE+KTS
Subjt:  EAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKTS

Query:  YGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV
        YGLYS+VLL SKV +YI LV WFWR SVATVPHHLVQPFG++LSW+AGG VNDYVKVG+IPWLILSTRVSK+VCQVV
Subjt:  YGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV

XP_023552250.1 uncharacterized protein LOC111809975 [Cucurbita pepo subsp. pepo]3.27e-10185.96Show/hide
Query:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT
        MEAEAEGIVEHRSS AAP IFLVVI FQFLARWLE  K+ GSN+ VEMELRKSIKQLL+EAS LSQPSTFAQAAKLRRLAAAKEKELANYQESR+KE+KT
Subjt:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT

Query:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV
        SYGLYS+VLL SKV +YIALV WFWR SVATVPHHLVQPFG++LSW+AGG VNDYVKVG+IPWLILSTRVSKYVC+VV
Subjt:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV

TrEMBL top hitse value%identityAlignment
A0A0A0L4T9 Uncharacterized protein1.35e-9785.14Show/hide
Query:  EAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKTSYG
        E EGIVEHRSS AAP IF +VI FQFLA+WLE  K+ GSN+ VEMELRKSIKQLLKEAS LSQPSTFAQAAKLRRLAAAKEKELANYQESR+KE+KTSYG
Subjt:  EAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKTSYG

Query:  LYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV
        LYSQVLL SKV+I+I LV WFWRASVATVPHHLVQPFGK LSWRAGGTVNDYVKVG+IPWLILSTRVSK+V +VV
Subjt:  LYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV

A0A1S3BKE1 uncharacterized protein LOC1034906509.15e-10185.39Show/hide
Query:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT
        ME EAEGIVEH SS AAP IF +VI FQFLARWLE  K+ GSN+ VE+ELRKSIKQLL+EAS LSQPSTFAQAAKLRRLAAAKEKELANYQESR+KE+KT
Subjt:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT

Query:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV
        SYGLYSQVLL SKV+IYI LV WFWRASVATVPHHLVQPFGK LSW+AGGTVNDYVKVG+IPWLILSTRVSK+VCQVV
Subjt:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV

A0A6J1CUP1 uncharacterized protein LOC1110147521.34e-118100Show/hide
Query:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT
        MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT
Subjt:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT

Query:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVVP
        SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVVP
Subjt:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVVP

A0A6J1E9M6 uncharacterized protein LOC1114311282.25e-10185.39Show/hide
Query:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT
        MEAEAEGIVEHRSS AAP IFLVVI FQFLARWLE  K+ GSN+ VEMELRKSIKQLL+EAS LSQPSTFAQAAKLRRLAAAKEKELANYQESR+KE+KT
Subjt:  MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKT

Query:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV
        SYGLYS+VLL SKV +YIAL+ WFWR SVATVPHHLVQPFG++LSW+AGG VNDYVKVG+IPWLILSTRVSK+VCQVV
Subjt:  SYGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV

A0A6J1L884 uncharacterized protein LOC1115002125.67e-10084.75Show/hide
Query:  EAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKTS
        EAEAEGIVEHRSS AAP IFL+VI FQFLARWLE  K+ GSN+ VEMELRKSIKQLL+EAS LSQPSTFAQAAKLRRLAAAKEKELANYQESR+KE+KTS
Subjt:  EAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKTS

Query:  YGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV
        YGLYS+VLL SKV +YI LV WFWR SVATVPHHLVQPFG++LSW+AGG VNDYVKVG+IPWLILSTRVSK+VCQVV
Subjt:  YGLYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV

SwissProt top hitse value%identityAlignment
Q1H5D2 Protein GET17.5e-5261.14Show/hide
Query:  EAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKTSYG
        E E ++E R   AAP  F+VV+ FQ L++WL++ K+ GS N  E ELR  IKQLL+EASALSQP+TFAQAAKLRR AA KEKELA Y E   KE+K SY 
Subjt:  EAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKTSYG

Query:  LYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV
        +Y + LL SKVVIY+ LVL FWR  +A +   LVQPFG LLSW  GG +  +V VG+IPWLILS RVSKYVC+ V
Subjt:  LYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV

Arabidopsis top hitse value%identityAlignment
AT4G16444.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: CHD5-like protein (InterPro:IPR007514); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).5.4e-5361.14Show/hide
Query:  EAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKTSYG
        E E ++E R   AAP  F+VV+ FQ L++WL++ K+ GS N  E ELR  IKQLL+EASALSQP+TFAQAAKLRR AA KEKELA Y E   KE+K SY 
Subjt:  EAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKTSYG

Query:  LYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV
        +Y + LL SKVVIY+ LVL FWR  +A +   LVQPFG LLSW  GG +  +V VG+IPWLILS RVSKYVC+ V
Subjt:  LYSQVLLTSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCGGAAGCGGAAGGAATCGTAGAGCATCGAAGCTCGTTTGCGGCTCCGTCCATATTTTTGGTTGTTATTACCTTTCAGTTTCTCGCTAGATGGCTCGAACGCAC
GAAGAGGAGCGGTTCCAACAACGATGTGGAAATGGAGTTGCGCAAATCTATAAAGCAACTTCTGAAGGAGGCAAGCGCCTTATCTCAACCATCTACATTTGCACAAGCTG
CAAAACTTCGGAGGTTGGCAGCCGCTAAGGAGAAGGAACTGGCAAATTATCAAGAATCACGTAGTAAGGAGATGAAGACGTCATATGGTTTATATAGTCAAGTACTACTG
ACGTCAAAGGTTGTGATCTATATTGCTCTGGTTCTCTGGTTTTGGAGAGCTTCTGTTGCTACTGTACCTCATCACCTTGTTCAGCCGTTTGGAAAACTTTTGTCTTGGAG
GGCTGGAGGTACCGTAAATGATTATGTGAAGGTTGGAGTTATACCTTGGTTGATACTGTCCACAAGGGTTAGCAAATATGTATGTCAAGTCGTCCCGTGA
mRNA sequenceShow/hide mRNA sequence
GGGATGCATTCGAACACTGAGACACAAATATATCTGAAATTCTTGTTAATTTTCAATTATGTCAAAGGCAAAGAAGAATACACATCCATGGATTGGACAACACCATAATT
CAAGATTTTAAAGGCAGGTCATATATAGCATATGGATACTTCAATAGACTTGGACACTTTAAAGAAGGAAAGAAGAAGAAGAAGCCCAATTCCCTCCTCCGTCTCTCTTA
TGAACTATGTGTTAAAACATACAATTCATTTTACTATCTAATTGACCAATCTTCGCCGTGTTTGCTTTATTAAATAGATTGACATGTGACCATGTGGATGCCAATTCAAA
GCTGACCACTCTTACATTTAAAAAAGGGTTAAATGCATGCAAAATAAAGAAAACTTAGTGCAGTACCCATCGAGCAGAAGAATTAAGAGGGGAGCAACATTACAGCGATG
AAGTTTGGAAATCATCATAATTGAAAACAAGTCCTCAAGGCATAGGAAACTCGAATTAGTTCAGGTAGCATTGCAGCTTAGATGAGAAATCAAGCAAAGGCAAAGGCAAT
CGCAAGAAATTGCATTACAAAATAAGGAAATCACGAACAAAATCTCCCAAGAATCCTACAGAATTGAAAGAAAATAAAGTAACGAGGAGAGAGAGAGAGAGAGAGAGGTA
AGGACCTGGATACAGAGGTGGAGGCGAGGGGAGCAGTTCGGCCACCTAAGGTAACGACCTCAAAGCCAACACGGTGGCTTCCTTGCCTGATTTCTCCTGCAACAACGGAA
CACCCACAGCAAAATCGACGGCGTAGAAAGAAAAGAAGAAGAAATGTATATGCTGTGCTTACGAGTGTAGAAGCCTTGGATCTTCAAGTTAGGGTTGGAGGATTTAAGCA
TCTCAAAAACCCGAACGATTAGAGTAGTTTTGCCCACGCCCTGCGCACAGCACAAACCGACACAGTCATTCACATGAACATGAATCCATCCATTTCGAATTCCATTGTTA
CGATCCCCGCCGGAGAAAAGCTTACCGGCGGCCCCGTAACCAGCAGGCACCGTCCTACTCCTGGTCCTACCATTCTCACTTCTCACTTCTCACGCTTTTCAGACGTACCA
ACACGCCGCTTTCATTCTTGTTAGTGTTTTCATCATGGGACCGAACCCAAATTTTTTTTTCTTTTAATTTTTTTTTAAAATTTTTTTAACACGGTTAAAGAGGTGATTTT
CTTTTTCTACACAGTGAGAATTTAATTTTTGAACGGAAATAGGATCTTAATCAATTGGAGAAAGAATTGAATTTTTTTTAATAAATAAAACACAGTAAAAGCATAGGATT
TGAATCTTGACTCGAAATAAAAAAGTCAAAGAATCAATATTTGAATAGTGTTTCATCTATAGATATAACCAGTAGAGATGAAAAGACAAGTATATTTTAAAGTAAAGAGT
TCTATCTTTCAAATATTTTTTACAGATTCATAAAATATAAATATCTAAAAAATATCAAAAGTGTAAAAATGAGCTCAATGATGACAGGCCTATCAGGTCTTTACTGGATC
ACAATATTTACTACTGTGTTGAGTATAATAACGTAACACAGTTTTTAGATACAACCACAGATTGGTTAAAGATCAGAGCAACAAAGCAAAGGGCCTAATCAGGCAGAAGT
CTACATGAATGAGCTCAGAATTACAAAAGATGGAAATACAATTTGACTTTTAGCAGAATTAATTAACCATAACTCAAAATTCCCAAATTTGGTCATTTTTCAACTCAGGT
GAGAAATCATAACAGGAAAAACAAATAGTAAAAAAAATATATATATATTTTCAATGATAGTCGAAAAGTTGAGAAGTCAACCAACACCATCATTTCATCCACCAGTGCCA
AAGCTCAAAACAAGGTTGGGGAATCAAACAAACTTTCAAACTCAAGTCCTCCTGTCAGAAAGAAAAACCAGACAAAACTCACCCAACAATTTATTAAGCTCACAAAGTAT
CTCATGCATTTCCATGGAGAATGGTATGCTGATAGGTGAACAGGGGAATGGAATTAGCAATTTTTTGTTGATTCCAAGCTTCCCAGCACGAGGAAAGGCAATACACAAAA
TTCAAGAAGATCAGTTAGAATAATCAGCTAAAACTGAATGCTTGTGCCATTACTAACAGATTTTGCTTTCAAATACAGAACCATGTCAACAACTCAAGGAATCTAACATT
GCACCTGATACTTCTTTGTTTCAATTTCTTTTCTCGCATCTATCCTACTTACCCAAAAAAGGGGAAAAAAAGGTAGCTATTTATCCGAGACATAAGAAAAATCTAAAGTG
CTGAGTCTTTATGCTCCAAAGCTATAAAGGAGTTAAATGCTTGTTCTTCCACCGCCACACCTCCCTCGTCCCTAACATTTACTCATCGTCCTCCTCCTGCAATGTAAAAA
ATCAAATATTATCATCCACTTACTACGTTTTTTATTTGCTATGATTACAAAGTTCAGGCATAAGTTGAATCTGAACCTCTCCACTCTCATCATCCTCATCATCGTCGTCG
TTGACTTCTGATTTGGACTTGTCTGATTCTTCTTCTTCAGCTCCATTTTCTCCTTCCGCCTAATGGAATGGTGCATGATTATACAGATTAGAATTAATGATTGTGTAACA
AAGACATTGAGAATGAAAATAGGATAATAGGGCATACAAGTCTCTTATTATAAGCCTGCATGCTCTTATTGTACTCAGTCTTCCTTTTCTCAGCCTTGTTTATGTAAGGA
GCTTTCTCCTGTAATATTAGGGAACATATGAGCAACAAGTAATTTGATTCCATTTTGTTAAAATATAATCAGTTTCCATTAAAATACTTACAGCTTCTGACATTGACTTC
CATTTATCACCTCCAGCTTTTCCAACCTGTTTATTCACCAAAATAGCACGCGTTTACAAATCAAAGAATTTCCTAAAAATTGACGGAAAAAAAATCAGCGCGTAGATCTG
AAATCACTTACAGCAGCCACGGACTTGTTATTAGGATGCTCCTTCTTGTACTGTTTCCTGAATTCCTCCCTATATATTCATCAACAAAAACATCGAAACAGTAAAAATCC
GATACAAAATAAAAATAACCTAAACGATAAAAAAACGCGGATCTGAAAAAGAAAAATTCAAGACAAGCTCGAGAGAGATACATGAAAACGAAGAAAGCACTGGCAGGCCT
CTTCGGCTTGTTAGGATCCTTCGCAGCCTTCGCCGATTTCTTACTAGCTCCAACACTTTTGCTCTTCAACCTAGCAAAATCAAACAACAAAACACGAAAAAAAAAAAGGA
AAAAAACTGAGAACAGAACAAACAAATAAAACGCATAGGCAGGATCCATCCAACCGCCTACTCCGAAACACACTTCCACACAACACATCAAAATCAAAAGCAACCGATAT
ACTCACTTAGTGTCGGTCTTTTTAGGCGCCGCTTTGGATTTCCCGCCCTTCATGACGAATCCTGCAGAAAAACGAGCAAGAAAAAAGGTAAAAAAAAAAAAAAAAACCGT
TGTAGAGATGATGAGCAAGCAACGGAGCGAGCGGCGCAGAGATTCGGAGATTTACCTGCAAACTCTAGGGTTTGAGAAGAAGATAAGGAGAGAGAGATGTGGGGGGTTGC
GTTTGGTTCGTGTGATGCGTTTGCGTTGGGCTTTGGGTTTGGGTTTGGCTTTGTGAGAGTGTGTGTGATGAGAGTGTTCTGTGGAAAGGTTTTTGAGGGCGAAGAGCGAG
AGAGGAGAAGCAGTCGGTTCGTTTCTGAGAGAGAGAGAAATCGCGATATCAATTGAAGTCCGGGTTTAGGCGGGATTTCAAAAATCTTTTGCCACGTCATCGATCCGCGT
CCTCGCTTCTCATTGGCTTCCCTTCCATTCCTTCCCCTCTTTTGTGTTTCCTTTGTGTGTTGTTTGCAATTTCTTCGCTGTGAATGCTTCCCTCTTGCCAGATCCGGACA
CGTGAGTTAAAGCCCGTTGTTTCGCCGCGAAATTACGCCTTTGCCCTTCCCTCTTACGACTCGTGCCTTTCCTTTGTTCGCAACTTATCAGAATAAACGTGATAAAACGA
TTAACGCGTCGTCCTGGTAATCGCTCGAGGATATTTTCGTAACTTCACTTCTCTTCTCTTCTCTCCGTCAGATATTTTCTCTCCTTCGACTAACACGGTGCGTTTTCTTC
CTCCATGACTTGGGCTGGGCTTCGGTTTTGGGCCTTGGATTTGAAGACCTTCTTCGGAAGCCCATTTGTTGATGTCCGGAGCCCAATGTCCCGCCCAACTGCTTCCTGAA
TAACCTCCGTCGTATTGCGACGACGAACGGAGTGTTCGGTTTGATTAAGTTACTGTGGTGGCAGAAAAATTCTCCGGAAATGGAAGCGGAAGCGGAAGGAATCGTAGAGC
ATCGAAGCTCGTTTGCGGCTCCGTCCATATTTTTGGTTGTTATTACCTTTCAGTTTCTCGCTAGATGGCTCGAACGCACGAAGAGGAGCGGTTCCAACAACGATGTGGAA
ATGGAGTTGCGCAAATCTATAAAGCAACTTCTGAAGGAGGCAAGCGCCTTATCTCAACCATCTACATTTGCACAAGCTGCAAAACTTCGGAGGTTGGCAGCCGCTAAGGA
GAAGGAACTGGCAAATTATCAAGAATCACGTAGTAAGGAGATGAAGACGTCATATGGTTTATATAGTCAAGTACTACTGACGTCAAAGGTTGTGATCTATATTGCTCTGG
TTCTCTGGTTTTGGAGAGCTTCTGTTGCTACTGTACCTCATCACCTTGTTCAGCCGTTTGGAAAACTTTTGTCTTGGAGGGCTGGAGGTACCGTAAATGATTATGTGAAG
GTTGGAGTTATACCTTGGTTGATACTGTCCACAAGGGTTAGCAAATATGTATGTCAAGTCGTCCCGTGAAGAATCATTGAAGGTAAATGTGATGTAGATGATACATCGCA
TGATGCTGTGTAAAATATTTGGCAACCAATATTTTCCTGAACTTCGGTGAGGGGAGGCTGGAGAAGTTTTTACAGCCCCACCACTTTTTCCTCTTATATTCAAATATGGG
GTTCAGCATTGCTATTGTTTTTTTTTTTTTTTCCTTCTTTTTTCTTCTTCTTTTGCCTGCAAATGTTTTCCCTTTCCCTTTCCTTTTTTTCCCCCTTGTTGGTGAACTCT
TGCTCTTAGGCATAATTTATTTTTTGTTATTTATATATAGAATTAATATATCCTTTATATAGCTTGATTGCTTGTTCATAAGTTAAGGGTCATTCTTTTTTTCTTTCCTT
CCTTTGGGTAGGGTTGGAAATATTTTTATACGTAAGATGCTGTCTTATGGCATGGTTTATTTTTAGAAATCTTTCTATATTAATTTAATGTAAATAGGAGTTAGTAAATT
GAACATGCTTTTATATTAAACATTGTATA
Protein sequenceShow/hide protein sequence
MEAEAEGIVEHRSSFAAPSIFLVVITFQFLARWLERTKRSGSNNDVEMELRKSIKQLLKEASALSQPSTFAQAAKLRRLAAAKEKELANYQESRSKEMKTSYGLYSQVLL
TSKVVIYIALVLWFWRASVATVPHHLVQPFGKLLSWRAGGTVNDYVKVGVIPWLILSTRVSKYVCQVVP