CuGenDBv2

MC04g1193 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g1193
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Unknown protein
Genome location: MC04:19953735..19961177
RNA-Seq Expression: MC04g1193
Synteny: MC04g1193
Gene Ontology terms: NA
InterPro domains: NA
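The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the string can be split into its parts and the genomic span computed. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("MC04:19953735..19961177")
# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene region covers 7443 bases, which is longer than the mRNA listed further down and would be consistent with the presence of introns.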


Homology
GenBank top hits (e value, % identity, alignment)
KAG6579038.1 hypothetical protein SDJN03_23486, partial [Cucurbita argyrosperma subsp. sororia]; e value: 0.0; % identity: 80.48
Query:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
        E ARLN E        FENPTPPEV D VRVEST ILSGTL  GVDNF  AGVAVTKVKNEMFDDFNEDLDHV+ IERLRMLLSR+ALG MNQHVEGGSG
Subjt:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG

Query:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
          SGD +QCFLKQK KSMF++ E     NVLHD++G  AP   SPSVVCSP AT+SGS FSSN SLNK TESGNDMELKED +IC SEKV TELGSR LT
Subjt:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT

Query:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
        NH P+ANL  STKVKDEPYDH +GC+++GKDMNNV S  LS+KSETTMPDEPYENKVDDM LQDRMKFFSSRK  G TS DYEHPKPSDPGCS LVSEP 
Subjt:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA

Query:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPI-RCTKASRSSYC
        +  N KRRRK K+TATNSIETALEEDAPGLLQILV+KG+ VDEIKLYGE ESDDDLDES SE+SF ELE VI+RLF QR SFLKFP I RCTKASR+SYC
Subjt:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPI-RCTKASRSSYC

Query:  LACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEG
        LACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYATYFFELV  LPI WQIKRLVIA+KLTNCSRISLLENRPLLVGEDLTEG
Subjt:  LACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEG

Query:  EAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        EA VL SYGW+ NSGLG+MLNY  RVVHDR++EDISEWRSKIGKLL+DGYNGGALV EN PK+VAEY SSQ TQ+KLEL
Subjt:  EAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

KAG7016560.1 hypothetical protein SDJN02_21670 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 0.0; % identity: 80.31
Query:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
        E ARLN E        FENPTPPEV D VRVEST ILSGTL  GVDNF  AGVAVTKVKNEMFDDFNEDLDHV+ IERLRMLLSR+ALG MNQHVEGGSG
Subjt:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG

Query:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
          SGD +QCFLKQK KSMF++ E     NVLHD++G  AP   SPSVVCSP AT+SGS FSSN SLNK TESGNDMELKED +IC SEKV TELGSR LT
Subjt:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT

Query:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
        NH P+ANL  STKVKDEPYDH +GC+++GKDMNNV S  LS+KSETTMP+EPYENKVDDM LQDRMKFFSSRK  G TS DYEHPKPSDPGCS LVSEP 
Subjt:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA

Query:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPI-RCTKASRSSYC
        +  N KRRRK K+TATNSIETALEEDAPGLLQILV+KG+ VDEIKLYGE ESDDDLDES SE+SF ELE VI+RLF QR SFLKFP I RCTKASR+SYC
Subjt:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPI-RCTKASRSSYC

Query:  LACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEG
        LACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYATYFFELV  LPI WQIKRLVIA+KLTNCSRISLLENRPLLVGEDLTEG
Subjt:  LACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEG

Query:  EAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        EA VL SYGW+ NSGLG+MLNY  RVVHDR++EDISEWRSKIGKLL+DGYNGGALV EN PK+VAEY SSQ TQ+KLEL
Subjt:  EAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

XP_022141523.1 uncharacterized protein LOC111011878 isoform X1 [Momordica charantia]; e value: 0.0; % identity: 99.31
Query:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
        EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
Subjt:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG

Query:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
        ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
Subjt:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT

Query:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
        NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
Subjt:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA

Query:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQ----ILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRS
        SLMNVKRRRKWKRTATNSIETALEEDAPGLLQ    ILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRS
Subjt:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQ----ILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRS

Query:  SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDL
        SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDL
Subjt:  SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDL

Query:  TEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        TEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
Subjt:  TEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

XP_022141525.1 uncharacterized protein LOC111011878 isoform X2 [Momordica charantia]; e value: 0.0; % identity: 100
Query:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
        EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
Subjt:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG

Query:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
        ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
Subjt:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT

Query:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
        NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
Subjt:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA

Query:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRSSYCL
        SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRSSYCL
Subjt:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRSSYCL

Query:  ACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEGE

Query:  AGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        AGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
Subjt:  AGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

XP_022141526.1 uncharacterized protein LOC111011878 isoform X3 [Momordica charantia]; e value: 0.0; % identity: 99.31
Query:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
        EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
Subjt:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG

Query:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
        ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
Subjt:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT

Query:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
        NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
Subjt:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA

Query:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQ----ILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRS
        SLMNVKRRRKWKRTATNSIETALEEDAPGLLQ    ILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRS
Subjt:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQ----ILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRS

Query:  SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDL
        SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDL
Subjt:  SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDL

Query:  TEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        TEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
Subjt:  TEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CIB2 uncharacterized protein LOC111011878 isoform X2; e value: 0.0; % identity: 100
Query:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
        EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
Subjt:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG

Query:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
        ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
Subjt:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT

Query:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
        NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
Subjt:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA

Query:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRSSYCL
        SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRSSYCL
Subjt:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRSSYCL

Query:  ACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEGE

Query:  AGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        AGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
Subjt:  AGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

A0A6J1CJF8 uncharacterized protein LOC111011878 isoform X3; e value: 0.0; % identity: 99.31
Query:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
        EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
Subjt:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG

Query:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
        ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
Subjt:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT

Query:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
        NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
Subjt:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA

Query:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQ----ILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRS
        SLMNVKRRRKWKRTATNSIETALEEDAPGLLQ    ILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRS
Subjt:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQ----ILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRS

Query:  SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDL
        SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDL
Subjt:  SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDL

Query:  TEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        TEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
Subjt:  TEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

A0A6J1CKR0 uncharacterized protein LOC111011878 isoform X1; e value: 0.0; % identity: 99.31
Query:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
        EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
Subjt:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG

Query:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
        ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
Subjt:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT

Query:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
        NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
Subjt:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA

Query:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQ----ILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRS
        SLMNVKRRRKWKRTATNSIETALEEDAPGLLQ    ILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRS
Subjt:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQ----ILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRS

Query:  SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDL
        SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDL
Subjt:  SYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDL

Query:  TEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        TEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
Subjt:  TEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

A0A6J1FLT1 uncharacterized protein LOC111445382 isoform X1; e value: 0.0; % identity: 79.97
Query:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
        E ARLN E        FENPTPPEV D VRVEST ILSGTL  GVDNF  AGVAVTKVKNEMFDDF+EDLDHV+ IERLRMLLSR+ALG MNQHVEGGSG
Subjt:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG

Query:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
          SGD +QCFLKQK KSMF++ E     NVLHD++G  AP   SPSVVCSP AT+SGS FSSN SLNK TESGNDMELKED +IC SEKV TELGSR LT
Subjt:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT

Query:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
        NH P+ NL  STKVKDEPYDH +GC+++GKDMNNV S  LS+KSETTMPDEPYENKVDDM LQDRMKFFSSRK  G TS DYEHPKPSDPGCS LVSEP 
Subjt:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA

Query:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPI-RCTKASRSSYC
        +  N KRRRK K+TATNSIETALEEDAPGLLQILV+KG+ VDEIKLYGE ESDDDLDES SE+SF ELE VI+RLF QR SFLKFP I RC KASR+SYC
Subjt:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPI-RCTKASRSSYC

Query:  LACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEG
        LACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYATYFFELV  LPI WQIKRLVIA+KLTNCSRISLLENRPLLVGEDLTEG
Subjt:  LACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEG

Query:  EAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        EA VL SYGW+ NSGLG+MLNY  RVVHDR++EDISEWRSKIGKLL+DGYNGGALV EN PK+VAEY SSQ TQ+KLEL
Subjt:  EAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

A0A6J1FMV3 uncharacterized protein LOC111445382 isoform X2; e value: 0.0; % identity: 79.97
Query:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG
        E ARLN E        FENPTPPEV D VRVEST ILSGTL  GVDNF  AGVAVTKVKNEMFDDF+EDLDHV+ IERLRMLLSR+ALG MNQHVEGGSG
Subjt:  EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSG

Query:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT
          SGD +QCFLKQK KSMF++ E     NVLHD++G  AP   SPSVVCSP AT+SGS FSSN SLNK TESGNDMELKED +IC SEKV TELGSR LT
Subjt:  ASSGDPMQCFLKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLT

Query:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA
        NH P+ NL  STKVKDEPYDH +GC+++GKDMNNV S  LS+KSETTMPDEPYENKVDDM LQDRMKFFSSRK  G TS DYEHPKPSDPGCS LVSEP 
Subjt:  NHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPA

Query:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPI-RCTKASRSSYC
        +  N KRRRK K+TATNSIETALEEDAPGLLQILV+KG+ VDEIKLYGE ESDDDLDES SE+SF ELE VI+RLF QR SFLKFP I RC KASR+SYC
Subjt:  SLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPI-RCTKASRSSYC

Query:  LACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEG
        LACLVSLIEQTRYLHFR+WPVEWGWCRDLQSFIFVFERHKRIV+ERPEYGYATYFFELV  LPI WQIKRLVIA+KLTNCSRISLLENRPLLVGEDLTEG
Subjt:  LACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEG

Query:  EAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL
        EA VL SYGW+ NSGLG+MLNY  RVVHDR++EDISEWRSKIGKLL+DGYNGGALV EN PK+VAEY SSQ TQ+KLEL
Subjt:  EAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYNGGALVQENIPKQVAEYRSSQTTQLKLEL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G16610.1 unknown protein; e value: 1.7e-95; % identity: 44.96
Query:  EGGSGASSGDPMQCFLKQKGKSMFSNVEL-TGTRNVLHDQNGCDAPHLGSPSVVCSPIAT---ISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
        E G  +SSG     FL++    +  N  + + + + L      D P     S   SP A+   +S SN +  + + ++    N + L E++      KV 
Subjt:  EGGSGASSGDPMQCFLKQKGKSMFSNVEL-TGTRNVLHDQNGCDAPHLGSPSVVCSPIAT---ISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT

Query:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRI----------LSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGS-TSR
              +  N            VK E   H +  + +  D   +  R+            +K+E     E  E+ +D M+L DR+K    R   GS    
Subjt:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRI----------LSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGS-TSR

Query:  DYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRD
        D   P      C+S   E      V R  K K+TAT+SIETALEEDAPGLLQ+L+ +GV VDE++LYG    D   D+S   ESF ELE VIS+LF +R+
Subjt:  DYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRD

Query:  SFLKFPPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCS
        +  K      +KASR+SYCL CL SLIEQ RYL FR WPVEWGWCRDLQSFIFVFERH RIV+ERPEYGYATYFFEL N   I+WQ+KRLV+A+KL +C 
Subjt:  SFLKFPPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCS

Query:  RISLLENRPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDR-NHEDISEWRSKIGKLLVDGYNGGALV
        R  L+EN+PLLVGED+T GEA VL  YGW+ N+GLG+MLNY DRV HDR   +  SEWRSKI +LL+DGYN G +V
Subjt:  RISLLENRPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDR-NHEDISEWRSKIGKLLVDGYNGGALV

AT5G16610.2 unknown protein; e value: 1.7e-95; % identity: 44.96
Query:  EGGSGASSGDPMQCFLKQKGKSMFSNVEL-TGTRNVLHDQNGCDAPHLGSPSVVCSPIAT---ISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT
        E G  +SSG     FL++    +  N  + + + + L      D P     S   SP A+   +S SN +  + + ++    N + L E++      KV 
Subjt:  EGGSGASSGDPMQCFLKQKGKSMFSNVEL-TGTRNVLHDQNGCDAPHLGSPSVVCSPIAT---ISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVT

Query:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRI----------LSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGS-TSR
              +  N            VK E   H +  + +  D   +  R+            +K+E     E  E+ +D M+L DR+K    R   GS    
Subjt:  TELGSRLLTNHAPEANLFYSTKVKDEPYDHVDGCNLHGKDMNNVCSRI----------LSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGS-TSR

Query:  DYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRD
        D   P      C+S   E      V R  K K+TAT+SIETALEEDAPGLLQ+L+ +GV VDE++LYG    D   D+S   ESF ELE VIS+LF +R+
Subjt:  DYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPGLLQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRD

Query:  SFLKFPPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCS
        +  K      +KASR+SYCL CL SLIEQ RYL FR WPVEWGWCRDLQSFIFVFERH RIV+ERPEYGYATYFFEL N   I+WQ+KRLV+A+KL +C 
Subjt:  SFLKFPPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKRIVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCS

Query:  RISLLENRPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDR-NHEDISEWRSKIGKLLVDGYNGGALV
        R  L+EN+PLLVGED+T GEA VL  YGW+ N+GLG+MLNY DRV HDR   +  SEWRSKI +LL+DGYN G +V
Subjt:  RISLLENRPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDR-NHEDISEWRSKIGKLLVDGYNGGALV
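
In the alignment blocks above, the unlabeled middle line appears to follow the usual BLAST pairwise convention: a letter where query and subject are identical, `+` for a conservative substitution, and a blank otherwise. As a rough sketch under that assumption, percent identity for a stretch of alignment can be estimated from the query line and the middle line. The function name `percent_identity` is hypothetical, and the value for a short excerpt will not match the whole-alignment figures listed for each hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate % identity from a query line and its alignment midline.

    Assumes letters in the midline mark identical columns, while '+' and
    blanks mark non-identical ones. Gap columns in the query ('-') count
    toward the alignment length.
    """
    if not query:
        raise ValueError("query line is empty")
    # Trailing blanks of the midline are often stripped; pad them back.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline[: len(query)] if ch.isalpha())
    return 100.0 * identical / len(query)


# First ten columns of the first GenBank alignment shown above.
print(percent_identity("EEARLNTEKQ", "E ARLN E"))
```

To reproduce a reported value, the query and middle lines of every block belonging to one hit would need to be concatenated before calling the function.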


Sequences
CDS sequence
GAGGAGGCTCGATTAAATACAGAGAAGCAGATATCATGCACTATGGAATTTGAGAATCCAACACCACCTGAGGTTCCTGATTGGGTAAGGGTGGAATCTACAGGCATCTT
ATCAGGTACCTTGGAAGATGGTGTTGATAACTTTACTTCTGCTGGTGTGGCAGTAACTAAGGTTAAAAATGAGATGTTCGACGACTTCAATGAAGATCTTGATCATGTTG
TATTTATTGAGCGTCTGAGGATGCTGCTATCAAGGAAAGCATTGGGTTCCATGAATCAACATGTGGAGGGTGGTTCTGGTGCGTCATCAGGAGATCCTATGCAATGCTTC
TTGAAACAGAAGGGAAAGTCTATGTTTTCTAATGTAGAACTGACGGGAACTAGAAATGTGTTGCATGATCAAAATGGATGTGATGCTCCTCATCTTGGCAGCCCTTCAGT
AGTTTGTTCACCTATTGCAACCATTTCTGGATCCAATTTCTCAAGCAATCGTTCTTTGAATAAATTAACTGAATCAGGCAATGATATGGAACTTAAAGAAGATGATAGGA
TCTGCTTGTCCGAGAAGGTGACCACAGAATTAGGTTCACGACTTTTGACTAATCATGCCCCTGAAGCAAATTTATTTTATTCCACAAAAGTGAAGGATGAACCCTATGAT
CATGTTGACGGCTGCAACTTACATGGTAAGGATATGAATAATGTCTGCAGCAGAATTCTGTCGGTAAAGAGTGAAACAACCATGCCTGATGAACCTTATGAAAACAAGGT
AGATGATATGCGACTGCAAGATCGAATGAAGTTCTTCTCATCTCGAAAGGTTTTTGGTTCTACATCTAGAGATTACGAGCATCCAAAACCTTCTGACCCCGGATGTAGTT
CTCTTGTTTCAGAACCTGCTAGTTTAATGAACGTTAAACGTCGACGCAAATGGAAAAGGACTGCCACGAATTCAATTGAAACAGCACTCGAGGAAGATGCTCCTGGCCTT
CTCCAGATCCTAGTTGACAAAGGTGTTCTAGTTGATGAAATTAAGCTTTATGGAGAGATGGAAAGTGATGATGATTTGGATGAGTCCTTTAGTGAAGAAAGCTTTGGTGA
GCTTGAAGCTGTGATATCGAGGCTTTTTTCTCAACGCGATTCCTTTCTGAAGTTTCCTCCTATAAGATGCACAAAAGCTTCAAGATCAAGCTATTGCTTAGCTTGTCTAG
TTTCACTTATTGAGCAGACAAGATATCTTCATTTCCGAAGTTGGCCTGTCGAATGGGGGTGGTGCCGTGATCTCCAGTCTTTTATATTTGTTTTTGAAAGACATAAAAGA
ATAGTGCTAGAACGTCCCGAGTATGGGTATGCTACGTATTTCTTTGAGCTTGTGAATTTCTTACCCATCAAGTGGCAGATAAAGCGGTTGGTGATTGCTTTGAAGCTGAC
TAATTGTAGCAGAATTTCACTACTTGAGAACAGACCATTATTGGTTGGGGAAGACTTGACCGAAGGTGAAGCGGGGGTTTTATCGAGCTATGGATGGCTGCCAAATAGTG
GCTTGGGTTCAATGCTGAACTACTGTGACAGAGTCGTCCACGACCGAAATCACGAGGACATCTCGGAATGGAGATCAAAAATAGGGAAGCTACTGGTTGACGGTTATAAT
GGCGGGGCTCTAGTTCAAGAAAATATTCCAAAGCAGGTTGCAGAGTACAGAAGTTCCCAAACCACACAACTTAAGCTGGAACTCTGA
mRNA sequence
GAGGAGGCTCGATTAAATACAGAGAAGCAGATATCATGCACTATGGAATTTGAGAATCCAACACCACCTGAGGTTCCTGATTGGGTAAGGGTGGAATCTACAGGCATCTT
ATCAGGTACCTTGGAAGATGGTGTTGATAACTTTACTTCTGCTGGTGTGGCAGTAACTAAGGTTAAAAATGAGATGTTCGACGACTTCAATGAAGATCTTGATCATGTTG
TATTTATTGAGCGTCTGAGGATGCTGCTATCAAGGAAAGCATTGGGTTCCATGAATCAACATGTGGAGGGTGGTTCTGGTGCGTCATCAGGAGATCCTATGCAATGCTTC
TTGAAACAGAAGGGAAAGTCTATGTTTTCTAATGTAGAACTGACGGGAACTAGAAATGTGTTGCATGATCAAAATGGATGTGATGCTCCTCATCTTGGCAGCCCTTCAGT
AGTTTGTTCACCTATTGCAACCATTTCTGGATCCAATTTCTCAAGCAATCGTTCTTTGAATAAATTAACTGAATCAGGCAATGATATGGAACTTAAAGAAGATGATAGGA
TCTGCTTGTCCGAGAAGGTGACCACAGAATTAGGTTCACGACTTTTGACTAATCATGCCCCTGAAGCAAATTTATTTTATTCCACAAAAGTGAAGGATGAACCCTATGAT
CATGTTGACGGCTGCAACTTACATGGTAAGGATATGAATAATGTCTGCAGCAGAATTCTGTCGGTAAAGAGTGAAACAACCATGCCTGATGAACCTTATGAAAACAAGGT
AGATGATATGCGACTGCAAGATCGAATGAAGTTCTTCTCATCTCGAAAGGTTTTTGGTTCTACATCTAGAGATTACGAGCATCCAAAACCTTCTGACCCCGGATGTAGTT
CTCTTGTTTCAGAACCTGCTAGTTTAATGAACGTTAAACGTCGACGCAAATGGAAAAGGACTGCCACGAATTCAATTGAAACAGCACTCGAGGAAGATGCTCCTGGCCTT
CTCCAGATCCTAGTTGACAAAGGTGTTCTAGTTGATGAAATTAAGCTTTATGGAGAGATGGAAAGTGATGATGATTTGGATGAGTCCTTTAGTGAAGAAAGCTTTGGTGA
GCTTGAAGCTGTGATATCGAGGCTTTTTTCTCAACGCGATTCCTTTCTGAAGTTTCCTCCTATAAGATGCACAAAAGCTTCAAGATCAAGCTATTGCTTAGCTTGTCTAG
TTTCACTTATTGAGCAGACAAGATATCTTCATTTCCGAAGTTGGCCTGTCGAATGGGGGTGGTGCCGTGATCTCCAGTCTTTTATATTTGTTTTTGAAAGACATAAAAGA
ATAGTGCTAGAACGTCCCGAGTATGGGTATGCTACGTATTTCTTTGAGCTTGTGAATTTCTTACCCATCAAGTGGCAGATAAAGCGGTTGGTGATTGCTTTGAAGCTGAC
TAATTGTAGCAGAATTTCACTACTTGAGAACAGACCATTATTGGTTGGGGAAGACTTGACCGAAGGTGAAGCGGGGGTTTTATCGAGCTATGGATGGCTGCCAAATAGTG
GCTTGGGTTCAATGCTGAACTACTGTGACAGAGTCGTCCACGACCGAAATCACGAGGACATCTCGGAATGGAGATCAAAAATAGGGAAGCTACTGGTTGACGGTTATAAT
GGCGGGGCTCTAGTTCAAGAAAATATTCCAAAGCAGGTTGCAGAGTACAGAAGTTCCCAAACCACACAACTTAAGCTGGAACTCTGATTGCTGTTAGTTTCTCATAGTTT
ACTGTTTTAATACAATTTGTATAATTTATTTGCTGTCAAAGTTTAGTATAATTTCAAGCTGTTCAAGCTTATTGTTCATTAATATGAAGTAGAGAATATTTGGCTCTCAT
TTGTACATGCATAAAGGACTTCGACTATATTGTTGGAAGTGACTATTCTTTTATGTTATTTTATTTATAAATGCCCATTCTAACACAATTCTCTTATCTAGATTTGGACC
CTTTTC
Protein sequence
EEARLNTEKQISCTMEFENPTPPEVPDWVRVESTGILSGTLEDGVDNFTSAGVAVTKVKNEMFDDFNEDLDHVVFIERLRMLLSRKALGSMNQHVEGGSGASSGDPMQCF
LKQKGKSMFSNVELTGTRNVLHDQNGCDAPHLGSPSVVCSPIATISGSNFSSNRSLNKLTESGNDMELKEDDRICLSEKVTTELGSRLLTNHAPEANLFYSTKVKDEPYD
HVDGCNLHGKDMNNVCSRILSVKSETTMPDEPYENKVDDMRLQDRMKFFSSRKVFGSTSRDYEHPKPSDPGCSSLVSEPASLMNVKRRRKWKRTATNSIETALEEDAPGL
LQILVDKGVLVDEIKLYGEMESDDDLDESFSEESFGELEAVISRLFSQRDSFLKFPPIRCTKASRSSYCLACLVSLIEQTRYLHFRSWPVEWGWCRDLQSFIFVFERHKR
IVLERPEYGYATYFFELVNFLPIKWQIKRLVIALKLTNCSRISLLENRPLLVGEDLTEGEAGVLSSYGWLPNSGLGSMLNYCDRVVHDRNHEDISEWRSKIGKLLVDGYN
GGALVQENIPKQVAEYRSSQTTQLKLEL
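
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code and that the CDS is in frame from its first base; `translate` is a hypothetical helper, and only the first 30 bases of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("GAGGAGGCTCGATTAAATACAGAGAAGCAG"))
```

The printed peptide, EEARLNTEKQ, matches the first ten residues of the protein sequence. Passing the full CDS (with its line breaks) should give the complete protein if the assumptions hold.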