CuGenDBv2

Sgr023206 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023206
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Sequence-specific DNA binding transcription factor
Genome location: tig00000892:982903..992626
RNA-Seq Expression: Sgr023206
Synteny: Sgr023206
Gene Ontology terms: NA
InterPro domains:
IPR001841 - Zinc finger, RING-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR044822 - Myb/SANT-like DNA-binding domain 4


Homology
GenBank top hits | e value | % identity | Alignment
KAG6605907.1 hypothetical protein SDJN03_03224, partial [Cucurbita argyrosperma subsp. sororia] | 8.5e-208 | 82.35
Query:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE
        MDSSGLGGGFLS NGGLLDLESPIRRHQQTQL+N +LTH+HHL MM+T EGDHQ +G++DTK L  KDLSMTFTKGKAIA G  TNN+NTSEEDEPSFTE
Subjt:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC+EFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG KRKSGILQKKGKWK VSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLW
        SCRVVENPALMDSMPHLS+K K+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP VN SKGNNESEEADDS S+ DESD+EDDHYP EN LW
Subjt:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLW

Query:  VAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKL
         AESRGRD+ SADDGPLWS + AQNE                         +MLQLQEQC+SF AQ+ ELEKQRFKWLRYCSKKSRDLER RLENERMK+
Subjt:  VAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKL

Query:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREQIDLDASLVELLVLKNNVEAL
        DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG         LVELLVL NNVEAL
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREQIDLDASLVELLVLKNNVEAL

XP_022957960.1 uncharacterized protein LOC111459338 [Cucurbita moschata] | 2.0e-204 | 83.94
Query:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE
        MDSSGLGGGFLS NGGLLDLESPIRRHQQTQL+N +LTH+HHL MM+T EGDHQS+G++DTK L  KDLSMTFTKGKAIA G  TNN+NTSEEDEPSFTE
Subjt:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC+EFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG KRKSGILQKKGKWK VSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLW
        SCRVVENPALMDSMPHLS+K K+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP VN SKGNNESEEADDS S+ DESD+EDDHYP EN LW
Subjt:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLW

Query:  VAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKL
         AESRGRD+ SADDGPLWS + AQNE                         +MLQLQEQC+SF AQ+ ELEKQRFKWLRYCSKKSRDLER RLENERMK+
Subjt:  VAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKL

Query:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG

XP_022995089.1 uncharacterized protein LOC111490737 [Cucurbita maxima] | 6.7e-205 | 83.72
Query:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE
        MDSSGLGGGFLS NGGLLDLESPIRRHQQTQL+N +LTH+HHL MM+T EGDHQS+G++DTK +  KDLSMTFTKGKAIA G  TNN+NTSEEDEPSFTE
Subjt:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC+EFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG KRKSGILQKKGKWK VSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLW
        SCRVVENPALMDSMPHLS+K K+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP VN S+GNNESEEADDS S+ DESD+EDDHYP EN LW
Subjt:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLW

Query:  VAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKL
         A+SRGRD+ SADDGPLWSN+ AQNEL                        +MLQLQEQC+SF AQ+ ELEKQRFKWLRYCSKKSRDLER RLENERMK+
Subjt:  VAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKL

Query:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG

XP_023534092.1 uncharacterized protein LOC111795758 [Cucurbita pepo subsp. pepo] | 2.2e-203 | 83.49
Query:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE
        MDSSGLGGGFLS NGGLLDLESPIRRHQQTQL+N +LTH+HHL MM+T E DHQS+G++DTK L  KDLSMTFTKGKAIA G  TNN+NTSEEDEPSFTE
Subjt:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC++FLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG KRKSGILQKKGKWK VSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLW
        SCRVVENPALMDSMPHLS+K K+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP VN SKGNNESEEADDS S+ DESD+EDDHYP EN LW
Subjt:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLW

Query:  VAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKL
         AESRGRD+ SADDGPLWS + AQNE                         +MLQLQEQC+SF AQ+ ELEKQRFKWLRYCSKKSRDLER RLENERMK+
Subjt:  VAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKL

Query:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG

XP_038901508.1 uncharacterized protein LOC120088355 [Benincasa hispida] | 5.3e-202 | 82.21
Query:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE
        MDSSGLGGGFLS NGGL+DLESPIRR Q+TQLVNP+LTH+HHLNMMSTFEGDH S+G VDTK+L QKDL M F KGKAIA G  TNNN TSEEDEPSFTE
Subjt:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAG G KRKSGILQKKGKWKT+SKIM+SKGCHVSPQQCEDKFNDLNKRYKRLNDI+G+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDD--ESDDEDDHYPVENG
        SCRVVENPALMDSMPHLS+KAK+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP  N SKGNNES+EA+DS S+ D  ESD+EDDH PVEN 
Subjt:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDD--ESDDEDDHYPVENG

Query:  LWVAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERM
        LW +ESRGRD+VSADDGPLWSNSVA+NE                         +MLQLQEQC +F AQ+VELEKQRFKWLRYCSKK+RDLERARLENERM
Subjt:  LWVAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERM

Query:  KLDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREQIDL
        KLDNERRVLQLKQKEMELE KR DSSFGPTLGIDRIQGREQ+DL
Subjt:  KLDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREQIDL

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BM36 uncharacterized protein LOC103491522 | 2.8e-196 | 80.14
Query:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIAGSATNNNNTSEEDEPSFTED
        MDSSGLGGGFLS NGGLLDLESPIRR Q+TQLVNP+LT +H LNMMS FEGDHQSIG++D+K+L QKDL M F +GKAIA +   NN TSEEDEPS+TED
Subjt:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIAGSATNNNNTSEEDEPSFTED

Query:  GECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS
        GECSEFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMG KRKSGIL KKGKW+TVSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS

Query:  CRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDD--ESDDEDDHYPVENGL
        CRVVENPALMDSMPHLS+KAK+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPA N SKGNNESEEA+DS S+ D  ESD+EDDH P EN L
Subjt:  CRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDD--ESDDEDDHYPVENGL

Query:  WVAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMK
        W +ESRGRD+VSADDGPLWSNSV +NE                         +MLQLQEQC SF AQ+VELEKQRFKWLRYCSKK+RDLERARLENERMK
Subjt:  WVAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMK

Query:  LDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREQIDL
        LDNE+RVLQLK+KEMELE KRSDS+ GP L  DRIQGREQ+DL
Subjt:  LDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREQIDL

A0A5A7TE21 Stress response protein nst1 isoform X1 | 2.8e-196 | 80.14
Query:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIAGSATNNNNTSEEDEPSFTED
        MDSSGLGGGFLS NGGLLDLESPIRR Q+TQLVNP+LT +H LNMMS FEGDHQSIG++D+K+L QKDL M F +GKAIA +   NN TSEEDEPS+TED
Subjt:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIAGSATNNNNTSEEDEPSFTED

Query:  GECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS
        GECSEFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMG KRKSGIL KKGKW+TVSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS

Query:  CRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDD--ESDDEDDHYPVENGL
        CRVVENPALMDSMPHLS+KAK+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPA N SKGNNESEEA+DS S+ D  ESD+EDDH P EN L
Subjt:  CRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDD--ESDDEDDHYPVENGL

Query:  WVAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMK
        W +ESRGRD+VSADDGPLWSNSV +NE                         +MLQLQEQC SF AQ+VELEKQRFKWLRYCSKK+RDLERARLENERMK
Subjt:  WVAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMK

Query:  LDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREQIDL
        LDNE+RVLQLK+KEMELE KRSDS+ GP L  DRIQGREQ+DL
Subjt:  LDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREQIDL

A0A5D3BB81 Stress response protein nst1 isoform X1 | 1.2e-196 | 80.36
Query:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIAGSATNNNNTSEEDEPSFTED
        MDSSGLGGGFLS NGGLLDLESPIRR Q+TQLVNP+LT +H LNMMS FEGDHQSIG++D+K+L QKDL M F +GKAIA +   NN TSEEDEPS+TED
Subjt:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIAGSATNNNNTSEEDEPSFTED

Query:  GECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS
        GECSEFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMG KRKSGIL KKGKWKTVSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTS

Query:  CRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDD--ESDDEDDHYPVENGL
        CRVVENPALMDSMPHLS+KAK+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPA N SKGNNESEEA+DS S+ D  ESD+EDDH P EN L
Subjt:  CRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDD--ESDDEDDHYPVENGL

Query:  WVAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMK
        W +ESRGRD+VSADDGPLWSNSV +NE                         +MLQLQEQC SF AQ+VELEKQRFKWLRYCSKK+RDLERARLENERMK
Subjt:  WVAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMK

Query:  LDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREQIDL
        LDNE+RVLQLK+KEMELE KRSDS+ GP L  DRIQGREQ+DL
Subjt:  LDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREQIDL

A0A6J1H0P0 uncharacterized protein LOC111459338 | 9.5e-205 | 83.94
Query:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE
        MDSSGLGGGFLS NGGLLDLESPIRRHQQTQL+N +LTH+HHL MM+T EGDHQS+G++DTK L  KDLSMTFTKGKAIA G  TNN+NTSEEDEPSFTE
Subjt:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC+EFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG KRKSGILQKKGKWK VSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLW
        SCRVVENPALMDSMPHLS+K K+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP VN SKGNNESEEADDS S+ DESD+EDDHYP EN LW
Subjt:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLW

Query:  VAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKL
         AESRGRD+ SADDGPLWS + AQNE                         +MLQLQEQC+SF AQ+ ELEKQRFKWLRYCSKKSRDLER RLENERMK+
Subjt:  VAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKL

Query:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG

A0A6J1K4Q0 uncharacterized protein LOC111490737 | 3.3e-205 | 83.72
Query:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE
        MDSSGLGGGFLS NGGLLDLESPIRRHQQTQL+N +LTH+HHL MM+T EGDHQS+G++DTK +  KDLSMTFTKGKAIA G  TNN+NTSEEDEPSFTE
Subjt:  MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIA-GSATNNNNTSEEDEPSFTE

Query:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
        DGEC+EFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG KRKSGILQKKGKWK VSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGT

Query:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLW
        SCRVVENPALMDSMPHLS+K K+DVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP VN S+GNNESEEADDS S+ DESD+EDDHYP EN LW
Subjt:  SCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLW

Query:  VAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKL
         A+SRGRD+ SADDGPLWSN+ AQNEL                        +MLQLQEQC+SF AQ+ ELEKQRFKWLRYCSKKSRDLER RLENERMK+
Subjt:  VAESRGRDRVSADDGPLWSNSVAQNEL------------------------KMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKL

Query:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
        DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG
Subjt:  DNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQG

SwissProt top hits | e value | % identity | Alignment
O49500 E3 ubiquitin-protein ligase MBR2 | 4.8e-12 | 41.86
Query:  VDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSI
        +D DNM+YEELL LGE +G  S GLS+E+I  +          +      E C +CQ EY  GD   TL C H +HT C  +WL +
Subjt:  VDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSI

Q7XTV7 Probable E3 ubiquitin-protein ligase HIP1 | 4.1e-11 | 41.86
Query:  VDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSI
        +D DNM+YEELL L E +G  S GLS+E +  L   +    +        E C ICQ EY  GD   TL C H +H GC  +WL +
Subjt:  VDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSI

Q8L649 E3 ubiquitin-protein ligase BIG BROTHER | 4.5e-34 | 43.09
Query:  EETVYPSTNSNYYKFGHSDSWSTSYFDAQSFEVQGHESTIDEHRRLQDFSTIPN-----EQSVGNRVWEENANPIMSGHSMECPRRHPNYHEYQTIWQDI
        ++ +Y + N+N YKFG S S + S++   S+++  H S +   R   D+  + N     E +V   V  +  +      + EC     +    Q  WQD 
Subjt:  EETVYPSTNSNYYKFGHSDSWSTSYFDAQSFEVQGHESTIDEHRRLQDFSTIPN-----EQSVGNRVWEENANPIMSGHSMECPRRHPNYHEYQTIWQDI

Query:  VDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSINK
        +DPD MTYEEL++LGE VGT+SRGLSQELI  LP  KYK G    +K   ERCVICQ++YK G++++ LPCKH YH+ C +KWLSINK
Subjt:  VDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSINK

Q9LT17 E3 ubiquitin ligase BIG BROTHER-related | 4.3e-21 | 51.02
Query:  HEYQTIWQDIVDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSINK
        H  Q  W D +DPD ++YEELL LG+ VGT+SRGLS + IA LP  +YK G    +   NE CVIC+++Y+  +  I LPCKH YH+ C   WL INK
Subjt:  HEYQTIWQDIVDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSINK

Q9ZQF9 E3 ubiquitin-protein ligase MBR1 | 2.6e-10 | 43.33
Query:  VDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKK----SRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSI
        +D DNM+YEELL LGE +G  S GLS+E+I L  + ++K    S          E C ICQ EY  GD   TL C H +H  C  +W+ I
Subjt:  VDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKK----SRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSI

Arabidopsis top hits | e value | % identity | Alignment
AT1G21200.1 sequence-specific DNA binding transcription factors | 1.2e-74 | 40.31
Query:  MDSSGLGGGFL---SANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIAGSATNNNNTSEEDEPSF
        MD +   GG +   +++ G  DL+  +R H Q  +      H+H+ N     EG   ++    T +  Q        + KA        N+ S++DEPSF
Subjt:  MDSSGLGGGFL---SANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIAGSATNNNNTSEEDEPSF

Query:  TE---DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDI
        TE   DG  +E  +  KGSPWQR+KWTD +V+LLI  V+ +GDD       +RK  +LQKKGKWK+VSK+M  +G HVSPQQCEDKFNDLNKRYK+LND+
Subjt:  TE---DGECSEFLKGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDI

Query:  LGRGTSCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAV-------NCSKGNNESEEADD---SGSEDDE
        LGRGTSC+VVENPAL+DS+ +L++K K+DVRKI+SSKHLFY+EMC+YHNG  +    D+  Q  +  A+       N     ++ E+ DD    G  D+ 
Subjt:  LGRGTSCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAV-------NCSKGNNESEEADD---SGSEDDE

Query:  SDDEDDHYPVEN-------GLWVAESRGRDRVSADDG--PLWSNSVAQN--------------------------------ELKMLQLQEQCISFHAQAV
         + E+ HY   +       G      + R  +S +DG  P   NS+  N                                E + LQL+EQ +    + +
Subjt:  SDDEDDHYPVEN-------GLWVAESRGRDRVSADDG--PLWSNSVAQN--------------------------------ELKMLQLQEQCISFHAQAV

Query:  ELEKQRFKWLRYCSKKSRDLERARLENERMKLDNERRVLQLKQKEMELE
        ELEKQRF+W R+  K+ ++LER R+ENERMKL+N+R  L+LKQ+E+ +E
Subjt:  ELEKQRFKWLRYCSKKSRDLERARLENERMKLDNERRVLQLKQKEMELE

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1) | 9.2e-59 | 37.5
Query:  NLTQKDLSMTFTKGKAIAGSATNNNNTSEEDEPSFTEDGECSEFL-----KGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGK
        N  QK       +      +    +N  +  + S +ED E          K K+ SPWQR+KW D +V+L+I  ++ +G+D     G  +K  +LQKKGK
Subjt:  NLTQKDLSMTFTKGKAIAGSATNNNNTSEEDEPSFTEDGECSEFL-----KGKKGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGK

Query:  WKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQG
        W++VSK+M  +G HVSPQQCEDKFNDLNKRYK+LN++LGRGTSC VVENP+L+D + +L+ K K++VR+I+SSKHLFY+EMC+YHNG  +    D     
Subjt:  WKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQG

Query:  KILPAVNCS-----KGNNESEEADDSGSEDDESDDEDDHYPVENGLWVAE------------------SRGRD-------------RVSADDGPLWSNSV
           PAV  S      G+ +  + D+ G   +E  D+DD Y  ++   +++                  ++G D              +S D         
Subjt:  KILPAVNCS-----KGNNESEEADDSGSEDDESDDEDDHYPVENGLWVAE------------------SRGRD-------------RVSADDGPLWSNSV

Query:  AQNELKMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKLDNERRVLQLKQKEM
         Q E K L+L+ + +   A+ +ELE+Q+FKW  +  ++ + L + R+ENERMKL+NER  L+LK+ E+
Subjt:  AQNELKMLQLQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKLDNERRVLQLKQKEM

AT3G10040.1 sequence-specific DNA binding transcription factors | 5.4e-59 | 41.73
Query:  SATNNNNTSEEDEPSFTEDG---ECSEFLKGK-KGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG----PKRKS----------GILQKKGKWKTVSK
        S  +     +ED  S +  G   E S    GK K S W RMKWTD +VRLLI  V  +GD  EAG+      K+K+          G+LQKKGKWK+VS+
Subjt:  SATNNNNTSEEDEPSFTEDG---ECSEFLKGK-KGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMG----PKRKS----------GILQKKGKWKTVSK

Query:  IMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHN------------------GQ
         M+ KG  VSPQQCEDKFNDLNKRYKR+NDILG+G +CRVVEN  L++SM HL+ K K++V+K+L+SKHLF++EMCAYHN                    
Subjt:  IMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSNKAKNDVRKILSSKHLFYKEMCAYHN------------------GQ

Query:  TIPGCQDVDF-------QGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLWVAESRGRDRVSA---DDGPLWSNSVAQNELKMLQLQEQC
         IP  Q   F         +I   V   +   ES+ A+DS SE +ES++E+     +  +  A  R R+  ++   D G            KML+++E+ 
Subjt:  TIPGCQDVDF-------QGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLWVAESRGRDRVSA---DDGPLWSNSVAQNELKMLQLQEQC

Query:  ISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKLDNERRVLQLKQKEMEL-EFKRSDSSFGPT
        I +  + VE+EKQR KW+RY SKK R++E+A+L+N+R +L+ ER +L L++ E+EL E + S +   P+
Subjt:  ISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKLDNERRVLQLKQKEMEL-EFKRSDSSFGPT

AT3G63530.1 RING/U-box superfamily protein | 3.2e-35 | 43.09
Query:  EETVYPSTNSNYYKFGHSDSWSTSYFDAQSFEVQGHESTIDEHRRLQDFSTIPN-----EQSVGNRVWEENANPIMSGHSMECPRRHPNYHEYQTIWQDI
        ++ +Y + N+N YKFG S S + S++   S+++  H S +   R   D+  + N     E +V   V  +  +      + EC     +    Q  WQD 
Subjt:  EETVYPSTNSNYYKFGHSDSWSTSYFDAQSFEVQGHESTIDEHRRLQDFSTIPN-----EQSVGNRVWEENANPIMSGHSMECPRRHPNYHEYQTIWQDI

Query:  VDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSINK
        +DPD MTYEEL++LGE VGT+SRGLSQELI  LP  KYK G    +K   ERCVICQ++YK G++++ LPCKH YH+ C +KWLSINK
Subjt:  VDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSINK

AT3G63530.2 RING/U-box superfamily protein | 3.2e-35 | 43.09
Query:  EETVYPSTNSNYYKFGHSDSWSTSYFDAQSFEVQGHESTIDEHRRLQDFSTIPN-----EQSVGNRVWEENANPIMSGHSMECPRRHPNYHEYQTIWQDI
        ++ +Y + N+N YKFG S S + S++   S+++  H S +   R   D+  + N     E +V   V  +  +      + EC     +    Q  WQD 
Subjt:  EETVYPSTNSNYYKFGHSDSWSTSYFDAQSFEVQGHESTIDEHRRLQDFSTIPN-----EQSVGNRVWEENANPIMSGHSMECPRRHPNYHEYQTIWQDI

Query:  VDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSINK
        +DPD MTYEEL++LGE VGT+SRGLSQELI  LP  KYK G    +K   ERCVICQ++YK G++++ LPCKH YH+ C +KWLSINK
Subjt:  VDPDNMTYEELLDLGETVGTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSINK


Sequences
CDS sequence
ATGGATAGTTCAGGTCTGGGTGGTGGATTTCTGTCAGCAAATGGGGGGCTATTAGATCTGGAATCTCCTATCCGAAGACATCAACAGACCCAATTGGTCAATCCCGCGTT
GACACACCAACATCACTTGAACATGATGAGTACTTTTGAAGGTGATCACCAGTCCATTGGGCTTGTGGACACGAAAAACTTGACGCAGAAAGATTTATCAATGACCTTCA
CTAAAGGGAAAGCTATTGCCGGTAGCGCAACAAACAACAATAATACAAGTGAAGAAGATGAGCCGAGTTTTACAGAGGATGGTGAGTGCTCTGAATTTCTGAAGGGTAAA
AAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGATGACATTGTAAGGCTTCTCATAGCTGTGGTTGCTTGTGTGGGTGATGACGGTGAGGCTGGGATGGGCCCCAAGAG
AAAATCTGGGATTTTGCAAAAGAAGGGCAAATGGAAAACAGTGTCAAAGATTATGATAAGTAAGGGGTGTCATGTATCACCGCAGCAGTGTGAGGATAAATTTAACGACT
TAAACAAGAGGTACAAGAGATTGAACGATATTCTTGGGAGGGGAACCAGTTGTAGGGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCAAATAAAGCC
AAGAATGATGTCAGAAAAATATTAAGCTCAAAACACTTGTTTTACAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAGGATGTTGATTTCCAAGG
TAAAATTTTGCCTGCTGTGAATTGCTCCAAAGGAAATAATGAGTCAGAAGAGGCTGATGACAGTGGCAGTGAGGATGATGAATCAGATGATGAAGATGATCACTATCCTG
TTGAAAATGGATTATGGGTGGCTGAATCGCGTGGCAGGGATAGAGTGAGTGCAGATGATGGTCCTCTGTGGTCAAACTCTGTTGCACAAAATGAATTGAAGATGCTACAA
CTTCAGGAGCAATGTATCAGCTTCCATGCTCAAGCTGTTGAACTTGAGAAACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAGAGTAGGGATTTGGAGAGAGCGAG
GCTTGAAAATGAGAGGATGAAACTAGATAATGAGCGGAGAGTATTGCAACTGAAGCAGAAGGAAATGGAATTGGAATTCAAAAGGTCTGATTCATCCTTTGGTCCAACCC
TTGGCATTGATAGAATTCAAGGAAGAGAGCAAATAGATTTGGATGCCAGTCTGGTTGAACTACTAGTTCTGAAGAACAATGTTGAAGCCCTTACACTCCATTTGTGGATC
ATCTCTGATTCTCAATTTTGGAGGTTGAAAGACAGCGTGTTAGAGAGAGAGAGAGAGAGCGTTTTCTGTTATAAGCAGCTCAACTTATTCCTCACCAGTTTCCATTTCCA
CTCTCAGATTATACAGAGAGAACCAGAAGCAGAAAGAGGAAAGTTCAACTGCACAATCTCTTCGCAGAAGCCATCAATGTCGATTAGTGATCCGCTCACTCCATCTCCTC
CCTCCCCTTCTCTCTCTCTTTCTCCACTTTTCCGTGAAAATTTGGCTGCAAGATCGCCGAAACTGCGTGTTTCAGCGTTGCGAGTGAGCATAGAGAGGAGAGAGGAGACT
GTTTATCCATCGACTAATTCAAATTACTACAAGTTTGGGCATTCTGATTCTTGGAGCACGTCATACTTCGATGCTCAATCATTTGAGGTTCAAGGTCATGAATCCACTAT
TGATGAACATAGGAGGCTGCAGGACTTCTCGACAATCCCAAATGAACAGAGTGTAGGAAATAGAGTGTGGGAAGAAAATGCCAATCCCATTATGTCCGGCCACAGCATGG
AATGCCCTCGGAGGCATCCAAATTATCATGAGTATCAGACTATTTGGCAAGATATTGTTGATCCTGATAACATGACTTATGAGGAATTACTAGATTTAGGCGAGACCGTT
GGAACTCAAAGCCGAGGCCTTTCACAAGAACTGATTGCATTGCTTCCAGTATCAAAGTATAAATGTGGGTTTTTCTCAAGGAAGAAATCACGAAATGAAAGGTGTGTGAT
ATGCCAGATGGAGTATAAACGCGGAGATCAAAGGATCACTCTACCTTGCAAACACAGGTACCATACCGGTTGCGGGACCAAGTGGCTTAGCATAAACAAG
mRNA sequence
ATGGATAGTTCAGGTCTGGGTGGTGGATTTCTGTCAGCAAATGGGGGGCTATTAGATCTGGAATCTCCTATCCGAAGACATCAACAGACCCAATTGGTCAATCCCGCGTT
GACACACCAACATCACTTGAACATGATGAGTACTTTTGAAGGTGATCACCAGTCCATTGGGCTTGTGGACACGAAAAACTTGACGCAGAAAGATTTATCAATGACCTTCA
CTAAAGGGAAAGCTATTGCCGGTAGCGCAACAAACAACAATAATACAAGTGAAGAAGATGAGCCGAGTTTTACAGAGGATGGTGAGTGCTCTGAATTTCTGAAGGGTAAA
AAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGATGACATTGTAAGGCTTCTCATAGCTGTGGTTGCTTGTGTGGGTGATGACGGTGAGGCTGGGATGGGCCCCAAGAG
AAAATCTGGGATTTTGCAAAAGAAGGGCAAATGGAAAACAGTGTCAAAGATTATGATAAGTAAGGGGTGTCATGTATCACCGCAGCAGTGTGAGGATAAATTTAACGACT
TAAACAAGAGGTACAAGAGATTGAACGATATTCTTGGGAGGGGAACCAGTTGTAGGGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCAAATAAAGCC
AAGAATGATGTCAGAAAAATATTAAGCTCAAAACACTTGTTTTACAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAGGATGTTGATTTCCAAGG
TAAAATTTTGCCTGCTGTGAATTGCTCCAAAGGAAATAATGAGTCAGAAGAGGCTGATGACAGTGGCAGTGAGGATGATGAATCAGATGATGAAGATGATCACTATCCTG
TTGAAAATGGATTATGGGTGGCTGAATCGCGTGGCAGGGATAGAGTGAGTGCAGATGATGGTCCTCTGTGGTCAAACTCTGTTGCACAAAATGAATTGAAGATGCTACAA
CTTCAGGAGCAATGTATCAGCTTCCATGCTCAAGCTGTTGAACTTGAGAAACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAGAGTAGGGATTTGGAGAGAGCGAG
GCTTGAAAATGAGAGGATGAAACTAGATAATGAGCGGAGAGTATTGCAACTGAAGCAGAAGGAAATGGAATTGGAATTCAAAAGGTCTGATTCATCCTTTGGTCCAACCC
TTGGCATTGATAGAATTCAAGGAAGAGAGCAAATAGATTTGGATGCCAGTCTGGTTGAACTACTAGTTCTGAAGAACAATGTTGAAGCCCTTACACTCCATTTGTGGATC
ATCTCTGATTCTCAATTTTGGAGGTTGAAAGACAGCGTGTTAGAGAGAGAGAGAGAGAGCGTTTTCTGTTATAAGCAGCTCAACTTATTCCTCACCAGTTTCCATTTCCA
CTCTCAGATTATACAGAGAGAACCAGAAGCAGAAAGAGGAAAGTTCAACTGCACAATCTCTTCGCAGAAGCCATCAATGTCGATTAGTGATCCGCTCACTCCATCTCCTC
CCTCCCCTTCTCTCTCTCTTTCTCCACTTTTCCGTGAAAATTTGGCTGCAAGATCGCCGAAACTGCGTGTTTCAGCGTTGCGAGTGAGCATAGAGAGGAGAGAGGAGACT
GTTTATCCATCGACTAATTCAAATTACTACAAGTTTGGGCATTCTGATTCTTGGAGCACGTCATACTTCGATGCTCAATCATTTGAGGTTCAAGGTCATGAATCCACTAT
TGATGAACATAGGAGGCTGCAGGACTTCTCGACAATCCCAAATGAACAGAGTGTAGGAAATAGAGTGTGGGAAGAAAATGCCAATCCCATTATGTCCGGCCACAGCATGG
AATGCCCTCGGAGGCATCCAAATTATCATGAGTATCAGACTATTTGGCAAGATATTGTTGATCCTGATAACATGACTTATGAGGAATTACTAGATTTAGGCGAGACCGTT
GGAACTCAAAGCCGAGGCCTTTCACAAGAACTGATTGCATTGCTTCCAGTATCAAAGTATAAATGTGGGTTTTTCTCAAGGAAGAAATCACGAAATGAAAGGTGTGTGAT
ATGCCAGATGGAGTATAAACGCGGAGATCAAAGGATCACTCTACCTTGCAAACACAGGTACCATACCGGTTGCGGGACCAAGTGGCTTAGCATAAACAAG
Protein sequence
MDSSGLGGGFLSANGGLLDLESPIRRHQQTQLVNPALTHQHHLNMMSTFEGDHQSIGLVDTKNLTQKDLSMTFTKGKAIAGSATNNNNTSEEDEPSFTEDGECSEFLKGK
KGSPWQRMKWTDDIVRLLIAVVACVGDDGEAGMGPKRKSGILQKKGKWKTVSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSNKA
KNDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAVNCSKGNNESEEADDSGSEDDESDDEDDHYPVENGLWVAESRGRDRVSADDGPLWSNSVAQNELKMLQ
LQEQCISFHAQAVELEKQRFKWLRYCSKKSRDLERARLENERMKLDNERRVLQLKQKEMELEFKRSDSSFGPTLGIDRIQGREQIDLDASLVELLVLKNNVEALTLHLWI
ISDSQFWRLKDSVLERERESVFCYKQLNLFLTSFHFHSQIIQREPEAERGKFNCTISSQKPSMSISDPLTPSPPSPSLSLSPLFRENLAARSPKLRVSALRVSIERREET
VYPSTNSNYYKFGHSDSWSTSYFDAQSFEVQGHESTIDEHRRLQDFSTIPNEQSVGNRVWEENANPIMSGHSMECPRRHPNYHEYQTIWQDIVDPDNMTYEELLDLGETV
GTQSRGLSQELIALLPVSKYKCGFFSRKKSRNERCVICQMEYKRGDQRITLPCKHRYHTGCGTKWLSINK
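
Consistency check (illustrative sketch, not part of the CuGenDB record): the CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The snippet below is a minimal Python sketch under that assumption; the helper name translate and the variables cds_start and protein_start are hypothetical, and only the first 30 nt of the CDS and the first 10 residues of the protein are used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed here), laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, ignoring whitespace and case."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 nt of the CDS and first 10 residues of the protein from this record.
cds_start = "ATGGATAGTTCAGGTCTGGGTGGTGGATTT"
protein_start = "MDSSGLGGGF"

print(translate(cds_start) == protein_start)
```

To check the whole record, paste the full CDS and protein blocks into the two variables; translate strips line breaks, so the wrapped lines can be pasted as they appear.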