; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10005027 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10005027
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationChr08:22377998..22380111
RNA-Seq ExpressionHG10005027
SyntenyHG10005027
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029277.1 hypothetical protein SDJN02_07615, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-20480.05Show/hide
Query:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK
        MGKRF     +  MLV+TL V+VCG VEGGSLSKQKSL    +MN LRKQA KSI+SEDGDIIDC++IYDQPAFDHPAL+NHTIQ+APT+DPT+D+HSKK
Subjt:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK

Query:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY
        ATA+REGKEEK+SMVVKQ WRKSGSCP+GTIPIRRIRK VLLKA+SIERYGRK+PM S E AQL  +RSSH LL N S           KAFLL  GNNY
Subjt:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL
        NAAKGDIKVCNP+VEFDDEYSTSQVALLTGP+ ++EA+ESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQTSNEIALGSAIYPISTP GL
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL

Query:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV
        PYEI MFL+RD KT+NWWVQYG SINIGYWP ELF AL  TAETVQWGGEVYSTK+GRPPHT T MG+GRFPDFISG SGWVK++RV+DNSM+L FP WV
Subjt:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIRDYL+DPELYYGGPGKNP+CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

XP_004143037.3 uncharacterized protein LOC101203978 [Cucumis sativus]3.2e-21082.34Show/hide
Query:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK
        MGK   CNGLMFKMLV  LTV+VCGVVE GSLSK K+    KKM+ LRKQATKSIQSEDGDIIDC++IYDQPAFDHPAL+NHTIQMAPT+DPT+DKHSKK
Subjt:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK

Query:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY
        ATA+ EG  EK SM VKQ WRKSGSCPK TIPIRRIRK V LKA+S+  YG+KRP P LEIAQLSNSRSSHFLLKNHS           KA LLA+G+N+
Subjt:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL
        N AKGDIKVCNP VEFDDEYSTSQVALLTGP+ +YEA+ESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPIST T L
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL

Query:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV
        P+EITMFL+RD +TNNWWVQYG SINIGYWPSELFKALK+TAETVQWGGEVYSTKLG PPHTGTGMGNG+FPD+ISG+SGWVK+IRV+DNSMILKFPN+V
Subjt:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIR+YL+DPELYYGGPGKN +CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

XP_008444496.1 PREDICTED: uncharacterized protein LOC103487798 [Cucumis melo]3.1e-20580.73Show/hide
Query:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK
        MGK    +GLMFKMLV+ LTV+VCGVVE GSLSK +S    KKMN LRKQATKSIQSEDGDIIDC++IYDQPAFDHPAL+NHTIQMAPT+DPT+DKHSKK
Subjt:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK

Query:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY
        ATA++EG  EK+SM VKQ WR SGSCPK TIPIRRIRK     A+S+  YG+KRP P LEIAQLSNSRSSHFLLKNHS           KA LLA+GNN+
Subjt:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL
        N AKGDIKVCNP VEFDDEYSTSQVALLTGP+ +YEA+ESGWAVNPGVYGDRQTRLFVYWT+DASHKTGCFDLTCPGFVQTSNEIALGSAIYPIST   L
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL

Query:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV
        P+EITMFL+RD +TNNWWVQYG SINIGYWPSELFKALK+TAETVQWGGEVYSTKLG PPHT TGMGNG+FPD+ISG+SGWVK+IRV+DNSMILKFPN+V
Subjt:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIR+YL+DPELYYGGPGKN +CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

XP_023546232.1 uncharacterized protein LOC111805388 isoform X2 [Cucurbita pepo subsp. pepo]9.0e-20580.28Show/hide
Query:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK
        MGKRF     +  MLV+TLTV+VCG VEGGSLSKQKSL    +MN LRKQA KSI+SEDGDIIDC++IYDQPAFDHPAL+NHTIQ+APT+DPT+D+HSKK
Subjt:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK

Query:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY
        ATA+REGKEEK+SMVVKQ WRKSGSCP+GTIPIRRIRK VLLKA+SIERYGRK+PM S E AQL  +RSSH LL N S           KAFLL  GNNY
Subjt:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL
        NAAKGDIKVCNP+VEFDDEYSTSQVALLTGP+ ++EA+ESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQTSNEIALGSAIYPISTP GL
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL

Query:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV
        PYEI MFL+RD KT+NWWVQYG SINIGYWP ELF AL  TAETVQWGGEVYSTK+GRPPHT T MG+GRFPDFISG SGWVK++RV+DNSM+L FP WV
Subjt:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIRDYL+DPELYYGGPGKNP+CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

XP_038886336.1 uncharacterized protein LOC120076548 [Benincasa hispida]1.9e-22386.18Show/hide
Query:  KRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKAT
        KRFSCN LMFKMLV+TLTV+VCG+VEGGSL+KQKSL +EKKMN LRKQATKSIQS+DGDIIDCINIYDQPAFDHPAL+NHTIQMAPT+DPT+D+HSKKAT
Subjt:  KRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKAT

Query:  AKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNYNA
        A+REG E KDSMVVKQ WRKSGSCPKGTIPIRRI+K +L KADS+E YGRKRP  S EIAQLSNSRSSH+LLKNHS           KA LLALGNNYN 
Subjt:  AKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNYNA

Query:  AKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPY
        AKGDIKVCNPKVEFDDEYSTSQVALLTGP+ +YEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPIST TGLPY
Subjt:  AKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPY

Query:  EITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEH
        EITMFL+RDLKTNNWWVQYG SI+IGYWPSELF ALK+TAETVQWGGEVYSTKLG PPHT TGMGNG+FPD+ISG+SGWVK+IRV+DNSM+LKFP+WVEH
Subjt:  EITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEH

Query:  YSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        YSDEYDCYD+DFIRDYL+DPELYYGGPGKNPKCP
Subjt:  YSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

TrEMBL top hitse value%identityAlignment
A0A0A0LQZ6 Uncharacterized protein1.5e-21082.34Show/hide
Query:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK
        MGK   CNGLMFKMLV  LTV+VCGVVE GSLSK K+    KKM+ LRKQATKSIQSEDGDIIDC++IYDQPAFDHPAL+NHTIQMAPT+DPT+DKHSKK
Subjt:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK

Query:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY
        ATA+ EG  EK SM VKQ WRKSGSCPK TIPIRRIRK V LKA+S+  YG+KRP P LEIAQLSNSRSSHFLLKNHS           KA LLA+G+N+
Subjt:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL
        N AKGDIKVCNP VEFDDEYSTSQVALLTGP+ +YEA+ESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPIST T L
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL

Query:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV
        P+EITMFL+RD +TNNWWVQYG SINIGYWPSELFKALK+TAETVQWGGEVYSTKLG PPHTGTGMGNG+FPD+ISG+SGWVK+IRV+DNSMILKFPN+V
Subjt:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIR+YL+DPELYYGGPGKN +CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

A0A1S3BAI4 uncharacterized protein LOC1034877981.5e-20580.73Show/hide
Query:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK
        MGK    +GLMFKMLV+ LTV+VCGVVE GSLSK +S    KKMN LRKQATKSIQSEDGDIIDC++IYDQPAFDHPAL+NHTIQMAPT+DPT+DKHSKK
Subjt:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK

Query:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY
        ATA++EG  EK+SM VKQ WR SGSCPK TIPIRRIRK     A+S+  YG+KRP P LEIAQLSNSRSSHFLLKNHS           KA LLA+GNN+
Subjt:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL
        N AKGDIKVCNP VEFDDEYSTSQVALLTGP+ +YEA+ESGWAVNPGVYGDRQTRLFVYWT+DASHKTGCFDLTCPGFVQTSNEIALGSAIYPIST   L
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL

Query:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV
        P+EITMFL+RD +TNNWWVQYG SINIGYWPSELFKALK+TAETVQWGGEVYSTKLG PPHT TGMGNG+FPD+ISG+SGWVK+IRV+DNSMILKFPN+V
Subjt:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIR+YL+DPELYYGGPGKN +CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

A0A5A7V3A1 Uncharacterized protein1.5e-20580.73Show/hide
Query:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK
        MGK    +GLMFKMLV+ LTV+VCGVVE GSLSK +S    KKMN LRKQATKSIQSEDGDIIDC++IYDQPAFDHPAL+NHTIQMAPT+DPT+DKHSKK
Subjt:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK

Query:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY
        ATA++EG  EK+SM VKQ WR SGSCPK TIPIRRIRK     A+S+  YG+KRP P LEIAQLSNSRSSHFLLKNHS           KA LLA+GNN+
Subjt:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL
        N AKGDIKVCNP VEFDDEYSTSQVALLTGP+ +YEA+ESGWAVNPGVYGDRQTRLFVYWT+DASHKTGCFDLTCPGFVQTSNEIALGSAIYPIST   L
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL

Query:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV
        P+EITMFL+RD +TNNWWVQYG SINIGYWPSELFKALK+TAETVQWGGEVYSTKLG PPHT TGMGNG+FPD+ISG+SGWVK+IRV+DNSMILKFPN+V
Subjt:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIR+YL+DPELYYGGPGKN +CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

A0A6J1HDC5 uncharacterized protein LOC111462577 isoform X14.8e-20479.59Show/hide
Query:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK
        MGKRF     +  ML++TLTV+VCG VEGGSLSKQKS+    +MN LRKQA KSI+SEDGDIIDC++IYDQPAFDHPAL+NHTIQ+APT+DPT+D+HSKK
Subjt:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK

Query:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY
        ATA+REGKEEK+SMVVKQ WRKSGSCP+GTIPIRRIRK VLLKA SIERYGRK+PM S E AQL  +RSSH LL N S           KAFLL  GNNY
Subjt:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL
        NAAKGDIKVCNP+VEFDDEYSTSQVALLTGP+ ++EA+ESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQTSNEIALGSAIYPISTP GL
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL

Query:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV
        PYEI MFL+RD KT+NWWVQYG SINIGYWP ELF AL  TAETVQWGGEVYST +GRPPHT T MG+GRFPDFISG SGWVK++RV+DNSM+L FP WV
Subjt:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIRDYL+DPELYYGGPGKNP+CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

A0A6J1K820 uncharacterized protein LOC111492515 isoform X12.2e-20480.05Show/hide
Query:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK
        MGKRF     +  MLV+TLTV+VCG VEGGSLSKQKSL    +MN LRKQA KSI+SEDGDIIDC++IYDQPAFDHPAL NHTIQ+APT+DPT+D+HSKK
Subjt:  MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKK

Query:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY
        ATA+REGKEEK+SMVVKQ WRKSGSCP+GTIPIRRIRK VLLKA+S+ERYGRK+PM S E AQL  +RSSH LL N S           KAFLL  GNNY
Subjt:  ATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL
        NAAKGDIKVCNP+VEFDDEYSTSQVALLTGP+ ++EA+ESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQTSNEIALGSAIYPISTP GL
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGL

Query:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV
        PYEI MFL+RD KT+NWWVQYG SINIGYWP ELF AL  TAETVQWGGEVYSTK+GRPPHT T MG+GRFPDFISG SGWVK++RV+DNSM+L FP WV
Subjt:  PYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIRDYL+DPELYYGGPGKNP+CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)9.3e-8339.46Show/hide
Query:  LSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKATAKREGKEEKDSMVVKQRWRKSGSCPKGTI
        +SKQK  E++K +N L K A KSIQS DGD+IDC+ I  QPAFDHP LK+H IQM P + P       K +A +  ++E     + Q W + G C +GTI
Subjt:  LSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKATAKREGKEEKDSMVVKQRWRKSGSCPKGTI

Query:  PIRRIRKRVLLKADSIERYGRK--RPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNYNAAKGDIKVCNPKVEFDDEYSTSQVALLT
        P+RR ++  +L+A S++RYG+K  R +P                L   +   L++    Q A     G+ Y  AK  I V  PK++  +E+S SQ+ LL 
Subjt:  PIRRIRKRVLLKADSIERYGRK--RPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNYNAAKGDIKVCNPKVEFDDEYSTSQVALLT

Query:  GPH-NSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPYEITMFLYRDLKTNNWWVQYGGSINIG
        G       ++E+GW V+P +YGD  TRLF YWT DA   TGC++L C GF+Q +++IA+G++I P+S      Y+I++ +++D K  +WW+Q+G    +G
Subjt:  GPH-NSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPYEITMFLYRDLKTNNWWVQYGGSINIG

Query:  YWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEHYSDEYDCYDVDFIRDYLEDPELYYGG
        YWPS LF  L  +A  ++WGGEV +++     HT T MG+G+FP+     + + + I+V D S  LK P  +  ++++ +CYDV    +       YYGG
Subjt:  YWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEHYSDEYDCYDVDFIRDYLEDPELYYGG

Query:  PGKNPKCP
        PGKN KCP
Subjt:  PGKNPKCP

AT2G44210.1 Protein of Unknown Function (DUF239)6.4e-8439.2Show/hide
Query:  FKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKATAKREGKEEK
        F  LVMT+ +L   VV G   +    L++   +  L K A KSI+S DGD+IDC+ I DQPAF HP L NHT+QM P+ +P       K ++K + ++  
Subjt:  FKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKATAKREGKEEK

Query:  DSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNYNAAKGDIKVCN
         S  + Q W  +G CPK TIPIRR R++ L +A S+E YG K       I +  +S   + L +N            Q A +      +  AK  I V  
Subjt:  DSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNYNAAKGDIKVCN

Query:  PKVEFDDEYSTSQVALLTGPHNS-YEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPYEITMFLYR
        P VE  +E+S +Q+ +L G  NS   ++E+GW V+P +YGD +TRLF YWT DA   TGC++L C GFVQ + EIA+G +I P+S      Y+IT+ +++
Subjt:  PKVEFDDEYSTSQVALLTGPHNS-YEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPYEITMFLYR

Query:  DLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEHYSDEYDCY
        D K  +WW+Q+G    IGYWP+ LF  L  +A  ++WGGEV +++     HT T MG+GRF +   G + + K ++V D S  L+ P  ++ ++D+ +CY
Subjt:  DLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEHYSDEYDCY

Query:  DVDFIRDYLEDPELYYGGPGKNPKCP
        +V            YYGGPG+NP CP
Subjt:  DVDFIRDYLEDPELYYGGPGKNPKCP

AT3G13510.1 Protein of Unknown Function (DUF239)6.0e-8238.55Show/hide
Query:  FKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKATAKREGKEEK
        F + +  +  L C     GS S+QK  E++K +N L K   K+IQS DGDIIDCI I  QPAFDHP LK+H IQM P++ P       K +A+ +GKE  
Subjt:  FKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKATAKREGKEEK

Query:  DSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRK--RPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNYNAAKGDIKV
            + Q W + G C +GTIP+RR R+  +L+A S++RYG+K  R +P                +   +   L++    Q A     G+ Y  AK  + V
Subjt:  DSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRK--RPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNYNAAKGDIKV

Query:  CNPKVEFDDEYSTSQVALLTGPH-NSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPYEITMFL
          PK++  +E+S SQ+ LL G       ++E+GW V+P +YGD  TRLF YWT DA   TGC++L C GF+Q +++IA+G++I P+S      Y+I++ +
Subjt:  CNPKVEFDDEYSTSQVALLTGPH-NSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPYEITMFL

Query:  YRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEHYSDEYD
        ++D K  +WW+Q+G    +GYWPS LF  L  +A  ++WGGEV +++     HT T MG+G FP+     + + + I+V D S  LK P  +  ++++ +
Subjt:  YRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEHYSDEYD

Query:  CYDVDFIRDYLEDPELYYGGPGKNPKCP
        CYDV    +       YYGGPGKN  CP
Subjt:  CYDVDFIRDYLEDPELYYGGPGKNPKCP

AT5G25950.1 Protein of Unknown Function (DUF239)3.5e-9040.97Show/hide
Query:  MLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKATAKREGKEEKDS
        M    + V++CG         + SL+++ K+  L K A K+I+SEDGDIIDCI+IY Q AFDHPALKNH IQM     P+V   +KK T    G  E   
Subjt:  MLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKATAKREGKEEKDS

Query:  MVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLD---NGA----KQKAFLLALGNNYNAAKGD
         +  Q W KSG CP GTIP+RR+ +  + +A S   +GRK P    + + L N+      L++  ++++     N A    + +AF++ALG N+  A+ D
Subjt:  MVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLD---NGA----KQKAFLLALGNNYNAAKGD

Query:  IKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPYEITM
        I + NP      +YST+Q+ L+ G   ++E+VE GW VNP V+GD +TRLF+ WT D   KTGC +L C GFVQTS + ALG+ + P+S+ +   Y IT+
Subjt:  IKVCNPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPYEITM

Query:  FLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLG-RPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEHYSD
         ++ D  + NWW+    ++ +GYWP  LF  LK +A  VQWGGEV+S  +  + PHT T MG+G++  +I   + +   +R++D SM LK+P ++  Y+D
Subjt:  FLYRDLKTNNWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLG-RPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEHYSD

Query:  EYDCYDVDFIR-DYLEDPELYYGGPGKNPKCP
        EY+CY     R  Y+ +P  Y+GGPG+N +CP
Subjt:  EYDCYDVDFIR-DYLEDPELYYGGPGKNPKCP

AT5G25960.1 Protein of Unknown Function (DUF239)5.4e-8340.4Show/hide
Query:  SLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRI
        SL+++ K+  L K + K+I+SEDGDIIDCI+IY Q AFDHPAL+NH IQM     P+VD  +KK T    G  E+   +  Q W KSG+CPKGTIP    
Subjt:  SLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKATAKREGKEEKDSMVVKQRWRKSGSCPKGTIPIRRI

Query:  RKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNYNAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYE
                                                             +A L+ALG N+  A+ DI V NP      +YS++Q+ LL G  +++E
Subjt:  RKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNYNAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHNSYE

Query:  AVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFK
        ++E+GWAVNP V+GD +TRLF YWT D   KTGC +L C GFVQT+ + ALG+AI P+ST +   + IT     D  + NWW+    ++ IGYWP  LF 
Subjt:  AVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPYEITMFLYRDLKTNNWWVQYGGSINIGYWPSELFK

Query:  ALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEHYSDEYDCYDVDFIR-DYLEDPELYYGGPGKNPKC
         LK +A  VQ GGEV+S  +G+ PHT T MG+G++  ++   + +   IR++D S+ +K+P ++  Y+DEY+CY     R  Y+ +P  Y+GGPG+N +C
Subjt:  ALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEHYSDEYDCYDVDFIR-DYLEDPELYYGGPGKNPKC

Query:  P
        P
Subjt:  P


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGAGATTCTCTTGCAATGGTTTGATGTTCAAAATGTTGGTTATGACATTGACAGTCTTAGTTTGTGGAGTTGTGGAGGGTGGTTCACTTTCTAAACAG
AAGAGTTTGGAACTTGAGAAGAAAATGAACTTTCTTAGGAAACAAGCAACAAAAAGCATTCAGAGCGAAGATGGTGACATCATTGATTGCATTAACATCTACGAC
CAGCCTGCTTTTGATCATCCTGCCCTGAAGAATCACACCATCCAGATGGCACCCACTTTTGATCCCACCGTGGATAAACATTCAAAGAAAGCAACAGCAAAAAGG
GAAGGGAAGGAGGAAAAGGATTCCATGGTTGTGAAACAAAGATGGAGGAAAAGTGGCAGTTGTCCTAAAGGAACAATACCAATTCGAAGGATCCGAAAACGTGTC
CTACTCAAAGCTGATTCGATAGAACGCTATGGAAGAAAGAGACCGATGCCTTCGCTAGAAATCGCACAGCTTTCCAACAGCCGAAGTTCCCACTTTCTACTGAAG
AATCATTCGTCTTACCATCTACTTGACAATGGTGCAAAACAGAAGGCATTTCTGCTTGCTTTAGGAAACAACTACAATGCAGCAAAAGGAGACATTAAAGTATGC
AACCCCAAGGTCGAATTTGATGATGAATACAGTACTTCCCAAGTGGCTCTATTAACCGGCCCTCACAATAGTTACGAGGCTGTCGAATCTGGATGGGCAGTAAAT
CCAGGTGTTTACGGGGATCGGCAGACTCGACTGTTCGTGTATTGGACTGTTGATGCTTCCCACAAAACAGGCTGCTTTGATCTCACTTGCCCTGGTTTTGTTCAA
ACAAGCAATGAAATTGCTCTTGGTTCTGCTATTTATCCAATCTCAACTCCAACTGGGCTTCCATATGAAATAACTATGTTCCTTTATAGAGATTTAAAGACAAAT
AATTGGTGGGTACAATATGGTGGAAGCATTAACATAGGTTATTGGCCTTCTGAGTTATTCAAAGCTCTAAAATTTACTGCAGAAACGGTACAATGGGGAGGGGAA
GTTTACAGCACAAAGTTAGGAAGACCTCCTCACACTGGAACAGGTATGGGCAATGGAAGGTTTCCTGACTTCATTTCTGGTAATTCAGGTTGGGTAAAACAGATA
AGGGTTCAAGACAACTCGATGATTTTGAAGTTTCCTAATTGGGTTGAGCACTACTCCGATGAATATGATTGTTACGATGTCGATTTTATCCGAGATTATTTAGAA
GATCCCGAACTATACTATGGAGGTCCTGGTAAAAATCCAAAGTGCCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAGAGATTCTCTTGCAATGGTTTGATGTTCAAAATGTTGGTTATGACATTGACAGTCTTAGTTTGTGGAGTTGTGGAGGGTGGTTCACTTTCTAAACAG
AAGAGTTTGGAACTTGAGAAGAAAATGAACTTTCTTAGGAAACAAGCAACAAAAAGCATTCAGAGCGAAGATGGTGACATCATTGATTGCATTAACATCTACGAC
CAGCCTGCTTTTGATCATCCTGCCCTGAAGAATCACACCATCCAGATGGCACCCACTTTTGATCCCACCGTGGATAAACATTCAAAGAAAGCAACAGCAAAAAGG
GAAGGGAAGGAGGAAAAGGATTCCATGGTTGTGAAACAAAGATGGAGGAAAAGTGGCAGTTGTCCTAAAGGAACAATACCAATTCGAAGGATCCGAAAACGTGTC
CTACTCAAAGCTGATTCGATAGAACGCTATGGAAGAAAGAGACCGATGCCTTCGCTAGAAATCGCACAGCTTTCCAACAGCCGAAGTTCCCACTTTCTACTGAAG
AATCATTCGTCTTACCATCTACTTGACAATGGTGCAAAACAGAAGGCATTTCTGCTTGCTTTAGGAAACAACTACAATGCAGCAAAAGGAGACATTAAAGTATGC
AACCCCAAGGTCGAATTTGATGATGAATACAGTACTTCCCAAGTGGCTCTATTAACCGGCCCTCACAATAGTTACGAGGCTGTCGAATCTGGATGGGCAGTAAAT
CCAGGTGTTTACGGGGATCGGCAGACTCGACTGTTCGTGTATTGGACTGTTGATGCTTCCCACAAAACAGGCTGCTTTGATCTCACTTGCCCTGGTTTTGTTCAA
ACAAGCAATGAAATTGCTCTTGGTTCTGCTATTTATCCAATCTCAACTCCAACTGGGCTTCCATATGAAATAACTATGTTCCTTTATAGAGATTTAAAGACAAAT
AATTGGTGGGTACAATATGGTGGAAGCATTAACATAGGTTATTGGCCTTCTGAGTTATTCAAAGCTCTAAAATTTACTGCAGAAACGGTACAATGGGGAGGGGAA
GTTTACAGCACAAAGTTAGGAAGACCTCCTCACACTGGAACAGGTATGGGCAATGGAAGGTTTCCTGACTTCATTTCTGGTAATTCAGGTTGGGTAAAACAGATA
AGGGTTCAAGACAACTCGATGATTTTGAAGTTTCCTAATTGGGTTGAGCACTACTCCGATGAATATGATTGTTACGATGTCGATTTTATCCGAGATTATTTAGAA
GATCCCGAACTATACTATGGAGGTCCTGGTAAAAATCCAAAGTGCCCCTAA
Protein sequenceShow/hide protein sequence
MGKRFSCNGLMFKMLVMTLTVLVCGVVEGGSLSKQKSLELEKKMNFLRKQATKSIQSEDGDIIDCINIYDQPAFDHPALKNHTIQMAPTFDPTVDKHSKKATAKR
EGKEEKDSMVVKQRWRKSGSCPKGTIPIRRIRKRVLLKADSIERYGRKRPMPSLEIAQLSNSRSSHFLLKNHSSYHLLDNGAKQKAFLLALGNNYNAAKGDIKVC
NPKVEFDDEYSTSQVALLTGPHNSYEAVESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTPTGLPYEITMFLYRDLKTN
NWWVQYGGSINIGYWPSELFKALKFTAETVQWGGEVYSTKLGRPPHTGTGMGNGRFPDFISGNSGWVKQIRVQDNSMILKFPNWVEHYSDEYDCYDVDFIRDYLE
DPELYYGGPGKNPKCP