CuGenDBv2

MS018618 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS018618
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Unknown protein
Genome location: scaffold313:477516..484791
RNA-Seq Expression: MS018618
Synteny: MS018618
Gene Ontology terms: NA
InterPro domains: NA
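The genome location above follows a `scaffold:start..end` pattern. The following is a minimal Python sketch for splitting it into its parts. The function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("scaffold313:477516..484791")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the gene model spans 7276 positions on scaffold313.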


Homology
GenBank top hits (e value, % identity, alignment)
KAG6579038.1 hypothetical protein SDJN03_23486, partial [Cucurbita argyrosperma subsp. sororia]; e value: 9.7e-255; % identity: 78.74
Query:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI
        GH  D+Y+E ARLN E        FENPTPPEV D VRVEST ILSGTL  GVDNF  AGVAVTKVKNEMFDDFNEDLDHV+ IERLRMLLSR  + L +
Subjt:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI

Query:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
           + GGSG  SGD +QCFLKQK KSMF++ E     NVLHD++G  AP   SPSVVCSP AT+SGS FSSN SLNK TESGNDMELKE D+IC SEKV 
Subjt:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT

Query:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
        TELGSR LTNH P+ANL  STKVKDEPYDH +GC+++GKDMNNV S  LS+KSETTMPDEPYENKVDDM LQDRMKFFSSRK  G TS DYEHPKPSDPG
Subjt:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG

Query:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP-IRC
        CS LVSEP +  N KRRRK K+TATNSIETALEEDAPGLLQILV+KG+ VDEIKLYGE ESDDDLDES SE+SF ELE VI+RLF QR SFLKFP  IRC
Subjt:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP-IRC

Query:  TKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPL
        TKASR+SYCLACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYATYFFELV+ LPI WQIKRLVIAMKLTNCSRISLLENRPL
Subjt:  TKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPL

Query:  LVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        LVGEDLTEGEA VL SYGW+ NSGLG+MLNY  RVVHDR++EDISEWRSKIGKLL+DGYNGGALV EN PK+VAEY SSQ TQ+KLEL
Subjt:  LVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

KAG7016560.1 hypothetical protein SDJN02_21670 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 3.7e-254; % identity: 78.57
Query:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI
        GH  D+Y+E ARLN E        FENPTPPEV D VRVEST ILSGTL  GVDNF  AGVAVTKVKNEMFDDFNEDLDHV+ IERLRMLLSR  + L +
Subjt:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI

Query:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
           + GGSG  SGD +QCFLKQK KSMF++ E     NVLHD++G  AP   SPSVVCSP AT+SGS FSSN SLNK TESGNDMELKE D+IC SEKV 
Subjt:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT

Query:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
        TELGSR LTNH P+ANL  STKVKDEPYDH +GC+++GKDMNNV S  LS+KSETTMP+EPYENKVDDM LQDRMKFFSSRK  G TS DYEHPKPSDPG
Subjt:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG

Query:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP-IRC
        CS LVSEP +  N KRRRK K+TATNSIETALEEDAPGLLQILV+KG+ VDEIKLYGE ESDDDLDES SE+SF ELE VI+RLF QR SFLKFP  IRC
Subjt:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP-IRC

Query:  TKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPL
        TKASR+SYCLACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYATYFFELV+ LPI WQIKRLVIAMKLTNCSRISLLENRPL
Subjt:  TKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPL

Query:  LVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        LVGEDLTEGEA VL SYGW+ NSGLG+MLNY  RVVHDR++EDISEWRSKIGKLL+DGYNGGALV EN PK+VAEY SSQ TQ+KLEL
Subjt:  LVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

XP_022141523.1 uncharacterized protein LOC111011878 isoform X1 [Momordica charantia]; e value: 0.0e+00; % identity: 96.79
Query:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI
        GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSR  +   +
Subjt:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI

Query:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
           + GGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
Subjt:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT

Query:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
        TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
Subjt:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG

Query:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLL----QILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP
        CSSLVSEPASLMN+KRRRKWKRTATNSIETALEEDAPGLL    QILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP
Subjt:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLL----QILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP

Query:  IRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLEN
        IRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELV+FLPIKWQIKRLVIA+KLTNCSRISLLEN
Subjt:  IRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLEN

Query:  RPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        RPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
Subjt:  RPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

XP_022141525.1 uncharacterized protein LOC111011878 isoform X2 [Momordica charantia]; e value: 0.0e+00; % identity: 97.44
Query:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI
        GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSR  +   +
Subjt:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI

Query:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
           + GGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
Subjt:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT

Query:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
        TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
Subjt:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG

Query:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCT
        CSSLVSEPASLMN+KRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCT
Subjt:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCT

Query:  KASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPLL
        KASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELV+FLPIKWQIKRLVIA+KLTNCSRISLLENRPLL
Subjt:  KASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPLL

Query:  VGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        VGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
Subjt:  VGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

XP_022141526.1 uncharacterized protein LOC111011878 isoform X3 [Momordica charantia]; e value: 0.0e+00; % identity: 96.79
Query:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI
        GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSR  +   +
Subjt:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI

Query:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
           + GGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
Subjt:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT

Query:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
        TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
Subjt:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG

Query:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLL----QILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP
        CSSLVSEPASLMN+KRRRKWKRTATNSIETALEEDAPGLL    QILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP
Subjt:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLL----QILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP

Query:  IRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLEN
        IRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELV+FLPIKWQIKRLVIA+KLTNCSRISLLEN
Subjt:  IRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLEN

Query:  RPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        RPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
Subjt:  RPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CIB2 uncharacterized protein LOC111011878 isoform X2; e value: 0.0e+00; % identity: 97.44
Query:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI
        GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSR  +   +
Subjt:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI

Query:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
           + GGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
Subjt:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT

Query:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
        TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
Subjt:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG

Query:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCT
        CSSLVSEPASLMN+KRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCT
Subjt:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCT

Query:  KASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPLL
        KASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELV+FLPIKWQIKRLVIA+KLTNCSRISLLENRPLL
Subjt:  KASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPLL

Query:  VGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        VGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
Subjt:  VGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

A0A6J1CJF8 uncharacterized protein LOC111011878 isoform X3; e value: 0.0e+00; % identity: 96.79
Query:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI
        GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSR  +   +
Subjt:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI

Query:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
           + GGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
Subjt:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT

Query:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
        TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
Subjt:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG

Query:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLL----QILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP
        CSSLVSEPASLMN+KRRRKWKRTATNSIETALEEDAPGLL    QILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP
Subjt:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLL----QILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP

Query:  IRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLEN
        IRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELV+FLPIKWQIKRLVIA+KLTNCSRISLLEN
Subjt:  IRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLEN

Query:  RPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        RPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
Subjt:  RPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

A0A6J1CKR0 uncharacterized protein LOC111011878 isoform X1; e value: 0.0e+00; % identity: 96.79
Query:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI
        GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSR  +   +
Subjt:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI

Query:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
           + GGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
Subjt:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT

Query:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
        TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
Subjt:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG

Query:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLL----QILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP
        CSSLVSEPASLMN+KRRRKWKRTATNSIETALEEDAPGLL    QILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP
Subjt:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLL----QILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP

Query:  IRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLEN
        IRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELV+FLPIKWQIKRLVIA+KLTNCSRISLLEN
Subjt:  IRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLEN

Query:  RPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        RPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
Subjt:  RPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

A0A6J1FLT1 uncharacterized protein LOC111445382 isoform X1; e value: 4.4e-253; % identity: 78.23
Query:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI
        GH  D+Y+E ARLN E        FENPTPPEV D VRVEST ILSGTL  GVDNF  AGVAVTKVKNEMFDDF+EDLDHV+ IERLRMLLSR  + L +
Subjt:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI

Query:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
           + GGSG  SGD +QCFLKQK KSMF++ E     NVLHD++G  AP   SPSVVCSP AT+SGS FSSN SLNK TESGNDMELKE D+IC SEKV 
Subjt:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT

Query:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
        TELGSR LTNH P+ NL  STKVKDEPYDH +GC+++GKDMNNV S  LS+KSETTMPDEPYENKVDDM LQDRMKFFSSRK  G TS DYEHPKPSDPG
Subjt:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG

Query:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP-IRC
        CS LVSEP +  N KRRRK K+TATNSIETALEEDAPGLLQILV+KG+ VDEIKLYGE ESDDDLDES SE+SF ELE VI+RLF QR SFLKFP  IRC
Subjt:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP-IRC

Query:  TKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPL
         KASR+SYCLACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYATYFFELV+ LPI WQIKRLVIAMKLTNCSRISLLENRPL
Subjt:  TKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPL

Query:  LVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        LVGEDLTEGEA VL SYGW+ NSGLG+MLNY  RVVHDR++EDISEWRSKIGKLL+DGYNGGALV EN PK+VAEY SSQ TQ+KLEL
Subjt:  LVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

A0A6J1FMV3 uncharacterized protein LOC111445382 isoform X2; e value: 4.4e-253; % identity: 78.23
Query:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI
        GH  D+Y+E ARLN E        FENPTPPEV D VRVEST ILSGTL  GVDNF  AGVAVTKVKNEMFDDF+EDLDHV+ IERLRMLLSR  + L +
Subjt:  GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVI

Query:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
           + GGSG  SGD +QCFLKQK KSMF++ E     NVLHD++G  AP   SPSVVCSP AT+SGS FSSN SLNK TESGNDMELKE D+IC SEKV 
Subjt:  IFSLSGGSGASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT

Query:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG
        TELGSR LTNH P+ NL  STKVKDEPYDH +GC+++GKDMNNV S  LS+KSETTMPDEPYENKVDDM LQDRMKFFSSRK  G TS DYEHPKPSDPG
Subjt:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPG

Query:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP-IRC
        CS LVSEP +  N KRRRK K+TATNSIETALEEDAPGLLQILV+KG+ VDEIKLYGE ESDDDLDES SE+SF ELE VI+RLF QR SFLKFP  IRC
Subjt:  CSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPP-IRC

Query:  TKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPL
         KASR+SYCLACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYATYFFELV+ LPI WQIKRLVIAMKLTNCSRISLLENRPL
Subjt:  TKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPL

Query:  LVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        LVGEDLTEGEA VL SYGW+ NSGLG+MLNY  RVVHDR++EDISEWRSKIGKLL+DGYNGGALV EN PK+VAEY SSQ TQ+KLEL
Subjt:  LVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G16610.1 unknown protein; e value: 1.7e-95; % identity: 57.91
Query:  VKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGS-TSRDYEHPKPSDPGCSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVL
        +K+E     E  E+ +D M+L DR+K    R   GS    D   P      C+S   E      + R  K K+TAT+SIETALEEDAPGLLQ+L+ +GV 
Subjt:  VKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGS-TSRDYEHPKPSDPGCSSLVSEPASLMNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVL

Query:  VDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKR
        VDE++LYG    D   D+S   ESF ELE VIS+LF +R++  K      +KASR+SYCL CL SLIEQ RYL FR WPVEWGWCRDLQSFIFVFERH R
Subjt:  VDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKR

Query:  IVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDR-NHEDISEWRS
        IV+ERPEYGYATYFFEL +   I+WQ+KRLV+AMKL +C R  L+EN+PLLVGED+T GEA VL  YGW+ N+GLG+MLNY DRV HDR   +  SEWRS
Subjt:  IVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDR-NHEDISEWRS

Query:  KIGKLLVDGYNGGALV
        KI +LL+DGYN G +V
Subjt:  KIGKLLVDGYNGGALV

AT5G16610.2 unknown protein; e value: 6.8e-97; % identity: 41.34
Query:  GTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFL--------------------VIIFSLSGGSGASSGDPMQCFLKQKGKS
        G   + V+NF   G            + ++DL+H+   ER +MLL R  I L                        S   G  +SSG     FL++    
Subjt:  GTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFL--------------------VIIFSLSGGSGASSGDPMQCFLKQKGKS

Query:  MFSNVEL-TGTRNVLHDQNGCDAPHLGSPSVVCSPIAT---ISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLFYSTK
        +  N  + + + + L      D P     S   SP A+   +S SN +  + + ++    N + L E++      KV       +  N            
Subjt:  MFSNVEL-TGTRNVLHDQNGCDAPHLGSPSVVCSPIAT---ISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLFYSTK

Query:  VKDEPYDHVDGCNLHGKDMNNVCSRI----------LSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGS-TSRDYEHPKPSDPGCSSLVSEPASL
        VK E   H +  + +  D   +  R+            +K+E     E  E+ +D M+L DR+K    R   GS    D   P      C+S   E    
Subjt:  VKDEPYDHVDGCNLHGKDMNNVCSRI----------LSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGS-TSRDYEHPKPSDPGCSSLVSEPASL

Query:  MNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRSSYCLAC
          + R  K K+TAT+SIETALEEDAPGLLQ+L+ +GV VDE++LYG    D   D+S   ESF ELE VIS+LF +R++  K      +KASR+SYCL C
Subjt:  MNIKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRSSYCLAC

Query:  LVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPLLVGEDLTEGEAG
        L SLIEQ RYL FR WPVEWGWCRDLQSFIFVFERH RIV+ERPEYGYATYFFEL +   I+WQ+KRLV+AMKL +C R  L+EN+PLLVGED+T GEA 
Subjt:  LVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPLLVGEDLTEGEAG

Query:  VLSSYGWLPNSGLGSMLNYCDRVVHDR-NHEDISEWRSKIGKLLVDGYNGGALV
        VL  YGW+ N+GLG+MLNY DRV HDR   +  SEWRSKI +LL+DGYN G +V
Subjt:  VLSSYGWLPNSGLGSMLNYCDRVVHDR-NHEDISEWRSKIGKLLVDGYNGGALV
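In BLAST-style text alignments such as those above, the middle line of each block marks identical residues with the residue letter and conservative substitutions with `+`. The following is a small Python sketch of counting both from a midline string. The function name `count_midline` is hypothetical, the ten-column fragment is retyped by hand from the start of the first GenBank alignment, and dividing by the fragment length is an assumption about the denominator. The % identity values listed for each hit cover the full alignment, so a fragment will not reproduce them. Column spacing in this page's text may also have been altered by extraction.

```python
def count_midline(midline):
    """Count identities (letters) and conservative substitutions ('+')."""
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = sum(1 for ch in midline if ch == "+")
    return identities, positives

# First ten columns of the first GenBank alignment, retyped by hand.
query = "GHLEDSYTEE"
midline = "GH  D+Y+E "

identities, positives = count_midline(midline)
# Assumption: identity is expressed relative to the number of aligned columns.
percent_identity = 100.0 * identities / len(query)
print(identities, positives, percent_identity)
```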


Sequences
CDS sequence
GGTCATCTTGAGGACAGCTACACTGAGGAGGCTCGATTAAATACAGAGAAGCAGATATCATGCACTATGGAATTTGAGAATCCAACACCACCTGAGGTTCCTGATTGGGT
AAGGGTGGAATCTACAGGCATCTTATCAGGTACCTTGGAAGATGGTGTTGATAACTTTACTTCTGCTGGTGTGGCAGTAACTAAGGTTAAAAATGAGATGTTCGACGACT
TCAATGAAGATCTTGATCATGTTGTATTTATTGAGCGTCTGAGGATGCTGCTATCAAGGTTCCCCATTTTCTTGGTGATCATATTTTCTCTTTCTGGTGGTTCTGGTGCG
TCATCAGGAGATCCTATGCAATGCTTCTTGAAACAGAAGGGAAAGTCTATGTTTTCTAATGTAGAACTGACGGGAACTAGAAATGTGTTGCATGATCAAAATGGATGTGA
TGCTCCTCATCTTGGCAGCCCTTCAGTAGTTTGTTCACCTATTGCAACCATTTCTGGATCCAATTTCTCAAGCAATCGTTCTTTGAATAAATTAACTGAATCAGGCAATG
ATATGGAACTTAAAGAAGATGATAGGATCTGCTTGTCCGAGAAGGTGACCACAGAATTAGGTTCACGACTTTTGACTAATCATGCCCCTGAAGCAAATTTATTTTATTCC
ACAAAAGTGAAGGATGAACCCTATGATCATGTTGACGGCTGCAACTTACATGGTAAGGATATGAATAATGTCTGCAGCAGAATTCTGTCGGTAAAGAGTGAAACAACCAT
GCCTGATGAACCTTATGAAAACAAGGTAGATGATATGCGACTGCAAGATCGAATGAAGTTCTTCTCATCTCGAAAGGTTTTTGGTTCTACATCTAGAGATTACGAGCATC
CAAAACCTTCTGACCCCGGATGTAGTTCTCTTGTTTCAGAACCTGCTAGTTTAATGAACATTAAACGTCGACGCAAATGGAAAAGGACTGCCACGAATTCAATTGAAACA
GCACTCGAGGAAGATGCTCCTGGCCTTCTCCAGATCCTAGTTGACAAAGGTGTTCTAGTTGATGAAATTAAGCTTTATGGAGAGATGGAAAGTGATGATGATTTGGATGA
GTCCTTTAGTGAAGAAAGCTTTGGTGAGCTTGAAGCTGTGATATCGAGGCTTTTTTCTCAACGCGATTCCTTTCTGAAGTTTCCTCCTATAAGATGCACAAAAGCTTCAA
GATCAAGCTATTGCTTAGCTTGTCTAGTTTCACTTATTGAGCAGACAAGATATCTTCATTTCCGAAGTTGGCCTGTCGAATGGGGGTGGTGCCGTGATCTCCAGTCTTTT
ATATTTGTTTTTGAAAGACATAAAAGAATAGTGCTAGAACGTCCCGAGTATGGGTATGCTACGTATTTCTTTGAGCTTGTGGATTTCTTACCCATCAAGTGGCAGATAAA
GCGGTTGGTGATTGCTATGAAGCTGACTAATTGTAGCAGAATTTCACTACTTGAGAACAGACCATTATTGGTTGGGGAAGACTTGACCGAAGGTGAAGCGGGGGTTTTAT
CGAGCTATGGATGGCTGCCAAATAGTGGCTTGGGTTCAATGCTGAACTACTGTGACAGAGTCGTCCACGACCGAAATCACGAGGACATCTCGGAATGGAGATCAAAAATA
GGGAAGCTACTGGTTGACGGTTATAATGGCGGGGCTCTAGTTCAAGAAAATATTCCAAAGCAGGTTGCAGAGTACAGAAGTTCCCAAACCACACAACTTAAGCTGGAACT
C
mRNA sequence
GGTCATCTTGAGGACAGCTACACTGAGGAGGCTCGATTAAATACAGAGAAGCAGATATCATGCACTATGGAATTTGAGAATCCAACACCACCTGAGGTTCCTGATTGGGT
AAGGGTGGAATCTACAGGCATCTTATCAGGTACCTTGGAAGATGGTGTTGATAACTTTACTTCTGCTGGTGTGGCAGTAACTAAGGTTAAAAATGAGATGTTCGACGACT
TCAATGAAGATCTTGATCATGTTGTATTTATTGAGCGTCTGAGGATGCTGCTATCAAGGTTCCCCATTTTCTTGGTGATCATATTTTCTCTTTCTGGTGGTTCTGGTGCG
TCATCAGGAGATCCTATGCAATGCTTCTTGAAACAGAAGGGAAAGTCTATGTTTTCTAATGTAGAACTGACGGGAACTAGAAATGTGTTGCATGATCAAAATGGATGTGA
TGCTCCTCATCTTGGCAGCCCTTCAGTAGTTTGTTCACCTATTGCAACCATTTCTGGATCCAATTTCTCAAGCAATCGTTCTTTGAATAAATTAACTGAATCAGGCAATG
ATATGGAACTTAAAGAAGATGATAGGATCTGCTTGTCCGAGAAGGTGACCACAGAATTAGGTTCACGACTTTTGACTAATCATGCCCCTGAAGCAAATTTATTTTATTCC
ACAAAAGTGAAGGATGAACCCTATGATCATGTTGACGGCTGCAACTTACATGGTAAGGATATGAATAATGTCTGCAGCAGAATTCTGTCGGTAAAGAGTGAAACAACCAT
GCCTGATGAACCTTATGAAAACAAGGTAGATGATATGCGACTGCAAGATCGAATGAAGTTCTTCTCATCTCGAAAGGTTTTTGGTTCTACATCTAGAGATTACGAGCATC
CAAAACCTTCTGACCCCGGATGTAGTTCTCTTGTTTCAGAACCTGCTAGTTTAATGAACATTAAACGTCGACGCAAATGGAAAAGGACTGCCACGAATTCAATTGAAACA
GCACTCGAGGAAGATGCTCCTGGCCTTCTCCAGATCCTAGTTGACAAAGGTGTTCTAGTTGATGAAATTAAGCTTTATGGAGAGATGGAAAGTGATGATGATTTGGATGA
GTCCTTTAGTGAAGAAAGCTTTGGTGAGCTTGAAGCTGTGATATCGAGGCTTTTTTCTCAACGCGATTCCTTTCTGAAGTTTCCTCCTATAAGATGCACAAAAGCTTCAA
GATCAAGCTATTGCTTAGCTTGTCTAGTTTCACTTATTGAGCAGACAAGATATCTTCATTTCCGAAGTTGGCCTGTCGAATGGGGGTGGTGCCGTGATCTCCAGTCTTTT
ATATTTGTTTTTGAAAGACATAAAAGAATAGTGCTAGAACGTCCCGAGTATGGGTATGCTACGTATTTCTTTGAGCTTGTGGATTTCTTACCCATCAAGTGGCAGATAAA
GCGGTTGGTGATTGCTATGAAGCTGACTAATTGTAGCAGAATTTCACTACTTGAGAACAGACCATTATTGGTTGGGGAAGACTTGACCGAAGGTGAAGCGGGGGTTTTAT
CGAGCTATGGATGGCTGCCAAATAGTGGCTTGGGTTCAATGCTGAACTACTGTGACAGAGTCGTCCACGACCGAAATCACGAGGACATCTCGGAATGGAGATCAAAAATA
GGGAAGCTACTGGTTGACGGTTATAATGGCGGGGCTCTAGTTCAAGAAAATATTCCAAAGCAGGTTGCAGAGTACAGAAGTTCCCAAACCACACAACTTAAGCTGGAACT
C
Protein sequence
GHLEDSYTEEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRFPIFLVIIFSLSGGSGA
SSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLFYS
TKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNIKRRRKWKRTATNSIET
ALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSF
IFVFERHKRIVLERPEYGYATYFFELVDFLPIKWQIKRLVIAMKLTNCSRISLLENRPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKI
GKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
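The protein sequence above corresponds to the CDS read codon by codon. The following is a minimal Python sketch of that translation, assuming the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative and not part of the database. The check uses only the first six codons of the CDS listed above.

```python
# Standard genetic code, with bases ordered T, C, A, G at every codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First six codons of the CDS sequence listed above.
print(translate("GGTCATCTTGAGGACAGC"))
```

The six codons give `GHLEDS`, which matches the first six residues of the protein sequence.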