CuGenDBv2

Spg029616 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg029616
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: SAP domain-containing protein
Genome location: scaffold2:21548534..21557166
RNA-Seq Expression: Spg029616
Synteny: Spg029616
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domains:
IPR003034 - SAP domain
IPR036361 - SAP domain superfamily
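
The genome location above uses a `scaffold:start..end` notation. As a minimal sketch, it can be split into its parts and the genomic span computed as follows. The helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

seqid, start, end = parse_location("scaffold2:21548534..21557166")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 8633 bp on scaffold2.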


Homology

GenBank top hits
XP_008443746.1 PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo] (e value: 3.6e-262, % identity: 89.28)
Query:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
        L +L   GRV++LLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPM
Subjt:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM

Query:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
        ETSFKQRCLEDWKMYHRKILKTLQNEGL AL DASEADY RV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
Subjt:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN

Query:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG
        RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSED+KS+ LDSLDDVD +EDVAKEIEEEEA EEEEVEQTENQDG
Subjt:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG

Query:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK
        ERVIKKEVEAKKPLQMIGVQLLKDVDQ T TSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPR   +
Subjt:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK

Query:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS
         W           V     ++ +VIELGGTPTIGDCAMILRAAI+APLPSAFLKILQTTHGLGYVFGSPLY E+ITLCLDLGELDAAIAIVADLETTGI 
Subjt:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS

Query:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS
        VPDETLDRVIS RQTNDA PKPDSAIDTT+NDHS A DEAS
Subjt:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS

XP_011660243.1 uncharacterized protein LOC101209618 isoform X1 [Cucumis sativus] (e value: 2.1e-262, % identity: 89.83)
Query:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
        L +L   GRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPM
Subjt:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM

Query:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
        ETSFKQRCLEDWKMYHRKILKTLQNEGL AL DASEADY RV ERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
Subjt:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN

Query:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG
        RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGL SNNVKPSED+KS+PLDSLDDVD +EDVAKEIEEEEA EEEEVEQTENQDG
Subjt:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG

Query:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK
        ERVIKKEVEAKKPLQMIGVQLLKDVDQ TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPR   +
Subjt:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK

Query:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS
         W           V     ++ +VIELGG PTIGDCAMILRAAI+APLPSAFLKILQTTHGLGYVFGSPLY E+ITLCLDLGELDAAIAIVADLETTGI 
Subjt:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS

Query:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS
        V DETLDRVISARQTNDA PKPDSAIDTTLNDHS A DEAS
Subjt:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS

XP_022151680.1 uncharacterized protein LOC111019595 [Momordica charantia] (e value: 1.1e-266, % identity: 90.74)
Query:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
        L +L   GRVVELLEALEAMARDNQQIP RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
Subjt:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM

Query:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
        ETSFKQRCLEDWKMYHRKILKTLQNEGL ALGDASEADY RVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
Subjt:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN

Query:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEVEQTENQDGE
        RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTE+WKRRFLGEGLD+N+VKPSED+KSEPLDSLDDVDIVED AKEIEEEE EEEEVEQTENQDGE
Subjt:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEVEQTENQDGE

Query:  RVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIKI
        RVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+ELRKR+VFDVSDMYTIADVWGWTWERELKNRPPR   + 
Subjt:  RVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIKI

Query:  WALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGISV
        W           V     ++ +VIELGGTPTIGDCAMILRAAIR+PLPSAFLKILQTTH LGYVFGSPLY E+ITLCLDLGELDAAIAIVADLETTGI V
Subjt:  WALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGISV

Query:  PDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS
        PDETLDRVISARQTNDA PKPD+AIDTTLNDHS A DEAS
Subjt:  PDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS

XP_031740953.1 uncharacterized protein LOC101209618 isoform X2 [Cucumis sativus] (e value: 2.1e-262, % identity: 89.83)
Query:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
        L +L   GRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPM
Subjt:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM

Query:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
        ETSFKQRCLEDWKMYHRKILKTLQNEGL AL DASEADY RV ERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
Subjt:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN

Query:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG
        RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGL SNNVKPSED+KS+PLDSLDDVD +EDVAKEIEEEEA EEEEVEQTENQDG
Subjt:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG

Query:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK
        ERVIKKEVEAKKPLQMIGVQLLKDVDQ TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPR   +
Subjt:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK

Query:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS
         W           V     ++ +VIELGG PTIGDCAMILRAAI+APLPSAFLKILQTTHGLGYVFGSPLY E+ITLCLDLGELDAAIAIVADLETTGI 
Subjt:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS

Query:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS
        V DETLDRVISARQTNDA PKPDSAIDTTLNDHS A DEAS
Subjt:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida] (e value: 3.2e-263, % identity: 90.39)
Query:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
        L +L   GRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
Subjt:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM

Query:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
        ETSFKQRCLEDWKM+HRKILKTLQNEGLAALG ASEADY RV ERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
Subjt:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN

Query:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG
        RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSED+KSEPLDSLDDVD VEDVAKEIEEEEA EEEEVEQTENQDG
Subjt:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG

Query:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK
        ERVIKKEVEAKKPLQMIGVQLLKDVDQ  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+ELRKRKVFDVSDMYTIADVWGWTWERELKNRPPR   +
Subjt:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK

Query:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS
         W           V     ++ +VIELGGTPTIGDCAMILRAAI+APLPS+FLKILQTTHGLGY FGSPLY E+ITLCLDLGELDAAIAIVADLETTGI 
Subjt:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS

Query:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS
        VPDETLDRVIS RQTND+ PKPDSAIDTTLNDHS A DEAS
Subjt:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS

TrEMBL top hits
A0A0A0M091 SAP domain-containing protein (e value: 1.0e-262, % identity: 89.83)
Query:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
        L +L   GRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPM
Subjt:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM

Query:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
        ETSFKQRCLEDWKMYHRKILKTLQNEGL AL DASEADY RV ERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
Subjt:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN

Query:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG
        RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGL SNNVKPSED+KS+PLDSLDDVD +EDVAKEIEEEEA EEEEVEQTENQDG
Subjt:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG

Query:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK
        ERVIKKEVEAKKPLQMIGVQLLKDVDQ TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPR   +
Subjt:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK

Query:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS
         W           V     ++ +VIELGG PTIGDCAMILRAAI+APLPSAFLKILQTTHGLGYVFGSPLY E+ITLCLDLGELDAAIAIVADLETTGI 
Subjt:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS

Query:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS
        V DETLDRVISARQTNDA PKPDSAIDTTLNDHS A DEAS
Subjt:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS

A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X1 (e value: 1.7e-262, % identity: 89.28)
Query:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
        L +L   GRV++LLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPM
Subjt:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM

Query:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
        ETSFKQRCLEDWKMYHRKILKTLQNEGL AL DASEADY RV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
Subjt:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN

Query:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG
        RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSED+KS+ LDSLDDVD +EDVAKEIEEEEA EEEEVEQTENQDG
Subjt:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG

Query:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK
        ERVIKKEVEAKKPLQMIGVQLLKDVDQ T TSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPR   +
Subjt:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK

Query:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS
         W           V     ++ +VIELGGTPTIGDCAMILRAAI+APLPSAFLKILQTTHGLGYVFGSPLY E+ITLCLDLGELDAAIAIVADLETTGI 
Subjt:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS

Query:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS
        VPDETLDRVIS RQTNDA PKPDSAIDTT+NDHS A DEAS
Subjt:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS

A0A6J1DBV3 uncharacterized protein LOC111019595 (e value: 5.2e-267, % identity: 90.74)
Query:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
        L +L   GRVVELLEALEAMARDNQQIP RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
Subjt:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM

Query:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
        ETSFKQRCLEDWKMYHRKILKTLQNEGL ALGDASEADY RVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
Subjt:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN

Query:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEVEQTENQDGE
        RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTE+WKRRFLGEGLD+N+VKPSED+KSEPLDSLDDVDIVED AKEIEEEE EEEEVEQTENQDGE
Subjt:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEVEQTENQDGE

Query:  RVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIKI
        RVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+ELRKR+VFDVSDMYTIADVWGWTWERELKNRPPR   + 
Subjt:  RVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIKI

Query:  WALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGISV
        W           V     ++ +VIELGGTPTIGDCAMILRAAIR+PLPSAFLKILQTTH LGYVFGSPLY E+ITLCLDLGELDAAIAIVADLETTGI V
Subjt:  WALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGISV

Query:  PDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS
        PDETLDRVISARQTNDA PKPD+AIDTTLNDHS A DEAS
Subjt:  PDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS

A0A6J1KW34 uncharacterized protein LOC111499221 isoform X2 (e value: 9.5e-261, % identity: 88.54)
Query:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
        L +L   GRVVELLEALEAMARDNQQIP RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
Subjt:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM

Query:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
        ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
Subjt:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN

Query:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG
        RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSED++SEPLDSLDDVD+VEDVAKEI+EEEA EEEEVE TENQDG
Subjt:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG

Query:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK
        ERVIKKEVEAKKP QMIGVQLLKDVDQT+TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKRK+FD SDMYTIADVWGWTWERELKNRPPR   +
Subjt:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK

Query:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS
         W           V     ++ +VIELGG PTIGDCAMILRAAI+APLPSAF KILQTTH LGYVFGSPLY E+ITLCLDLGELDAAIAIVADLETTGIS
Subjt:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS

Query:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS
        VPDETLDR+ISARQTNDA PK DS ID TLNDHS A DE S
Subjt:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X1 (e value: 9.5e-261, % identity: 88.54)
Query:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
        L +L   GRVVELLEALEAMARDNQQIP RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
Subjt:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM

Query:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
        ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
Subjt:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN

Query:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG
        RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSED++SEPLDSLDDVD+VEDVAKEI+EEEA EEEEVE TENQDG
Subjt:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEA-EEEEVEQTENQDG

Query:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK
        ERVIKKEVEAKKP QMIGVQLLKDVDQT+TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKRK+FD SDMYTIADVWGWTWERELKNRPPR   +
Subjt:  ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIK

Query:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS
         W           V     ++ +VIELGG PTIGDCAMILRAAI+APLPSAF KILQTTH LGYVFGSPLY E+ITLCLDLGELDAAIAIVADLETTGIS
Subjt:  IWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGIS

Query:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS
        VPDETLDR+ISARQTNDA PK DS ID TLNDHS A DE S
Subjt:  VPDETLDRVISARQTNDATPKPDSAIDTTLNDHS-AIDEAS

SwissProt top hits
No hits found

Arabidopsis top hits
AT3G04260.1 plastid transcriptionally active 3 (e value: 1.1e-203, % identity: 69.59)
Query:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM
        L +L   GR+VEL++AL+AM +DNQ IPPRAMI+SRKYR+LVSSWIEPLQEEAE GYEIDY+ARYIEEGGLTGERKRWVPRRGKTPLDPDA GFIYSNP+
Subjt:  LISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPM

Query:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
        ETSFKQRCLEDWK++HRK+L+TLQ+EGL  LGDASE+DY+RV ERL+ IIKGP  N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN
Subjt:  ETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRIN

Query:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEAEEEE----------
        +SRGRPLWVPP+EEEEEEVDEE+D+LI RIKLHEG+TEFWKRRFLGEGL   +V+    E +E + + +    +ED++KE + EE ++EE          
Subjt:  RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEAEEEE----------

Query:  -------VEQTENQ-DGERVIK-KEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVW
               V +TEN+ +GE ++K K  +AKK LQMIGVQLLK+ D+   T KK  +R+SR +LEDD DEDWFPE+ FEAF+E+R+RKVFDV+DMYTIADVW
Subjt:  -------VEQTENQ-DGERVIK-KEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVW

Query:  GWTWERELKNRPPRSSIKIWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLG
        GWTWE++ KN+ PR   + W           V     L+ +VIELGG PTIGDCA+ILRAA+RAP+PSAFLKILQTTH LGY FGSPLY EIITLCLDLG
Subjt:  GWTWERELKNRPPRSSIKIWALSSLNFLLSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLG

Query:  ELDAAIAIVADLETTGISVPDETLDRVISARQTNDA
        ELDAAIAIVAD+ETTGI+VPD+TLD+VISARQ+N++
Subjt:  ELDAAIAIVADLETTGISVPDETLDRVISARQTNDA
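
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A rough sketch of tallying one block from that line is given below. The function name `midline_stats` is hypothetical, and the sketch assumes the middle line has been copied with its spaces intact, since trimmed trailing spaces would shorten the counted length.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one BLAST midline."""
    # Letters on the midline mark columns where query and subject agree.
    identities = sum(ch.isalpha() for ch in midline)
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Midline of the final block of the AT3G04260.1 alignment.
last_block = "ELDAAIAIVAD+ETTGI+VPD+TLD+VISARQ+N++"
identities, positives, columns = midline_stats(last_block)
print(identities, positives, columns)
```

Summing these counts over every block of a hit and dividing identities by total columns gives a figure comparable to the reported % identity, although the database may compute it over a different alignment length.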


Sequences

CDS sequence
ATGCGAGCGCGGCACTGTCTGTGTGCGAGGAAAGATCGTCTGGACACTAGGGCAGCGGCGGCGGCGTCGTTGAGTGCTAGGAAAATCTCTGAACTGATATCCCTTGTTTA
TGCAGGTAGGGTAGTAGAGCTCTTAGAAGCGTTAGAAGCTATGGCTAGAGATAACCAACAGATTCCTCCAAGAGCCATGATCTTGAGCAGAAAATATAGATCTCTGGTGA
GCTCATGGATTGAACCTTTACAGGAAGAAGCTGAACATGGATATGAGATAGACTACATTGCAAGATACATTGAAGAGGGTGGACTCACTGGAGAACGCAAGAGATGGGTC
CCTCGAAGAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATTTATTCAAATCCTATGGAAACATCCTTTAAGCAACGATGTCTAGAAGATTGGAAGATGTACCA
CCGAAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGCAGCTCTTGGGGATGCATCTGAAGCTGATTATCTTAGAGTCGAGGAGAGATTGAAGAAAATTATAAAGGGTC
CTGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAATTAGAAGCACAAGGTTTACCAATTGACGGAACTAGAAATGTTCTT
TACCAGCGTGTTCAAAAAGCAAGGAGAATAAATCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTAGAGGAGGAGGAAGAAGAGGTTGATGAAGAGCTGGATGAACT
AATTTCACGAATAAAGCTACATGAAGGAAATACAGAGTTCTGGAAACGTCGCTTTCTTGGAGAAGGCTTGGACAGCAATAATGTTAAACCTTCTGAAGATGAAAAATCAG
AACCTCTTGATTCTTTGGATGATGTTGACATTGTCGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCCGAGGAAGAAGAGGTAGAACAAACTGAGAATCAAGATGGT
GAAAGAGTTATCAAGAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATAGGTGTCCAATTGCTAAAAGATGTTGACCAAACTACAACAACATCGAAAAAGTCAAGGAG
GAGAAGTTCTAGAGCATCACTCGAGGACGATCGTGATGAAGACTGGTTTCCTGAAGATATATTCGAGGCATTTGAAGAGTTGCGAAAGAGGAAAGTCTTTGATGTATCTG
ACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAAAGAGAACTTAAGAACAGACCTCCCAGGAGTAGCATTAAAATTTGGGCCCTTTCATCTTTAAATTTTCTA
TTGTCTAATGTTCATTCAATGTTCCACTTACTATTACAGGTGATAGAATTGGGTGGAACACCAACGATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAGGGCTCC
TCTACCGTCTGCTTTTTTGAAAATTTTGCAGACAACACATGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGTTGAGATTATTACCCTGTGTCTTGATCTTGGGGAAC
TGGATGCTGCCATTGCAATTGTAGCAGACCTGGAAACCACAGGAATCTCAGTTCCTGATGAAACACTCGATCGGGTAATCTCCGCAAGACAGACAAACGATGCTACCCCC
AAGCCTGATTCAGCCATTGATACTACACTCAATGATCATAGTGCCATTGATGAAGCATCATAA

mRNA sequence
ATGCGAGCGCGGCACTGTCTGTGTGCGAGGAAAGATCGTCTGGACACTAGGGCAGCGGCGGCGGCGTCGTTGAGTGCTAGGAAAATCTCTGAACTGATATCCCTTGTTTA
TGCAGGTAGGGTAGTAGAGCTCTTAGAAGCGTTAGAAGCTATGGCTAGAGATAACCAACAGATTCCTCCAAGAGCCATGATCTTGAGCAGAAAATATAGATCTCTGGTGA
GCTCATGGATTGAACCTTTACAGGAAGAAGCTGAACATGGATATGAGATAGACTACATTGCAAGATACATTGAAGAGGGTGGACTCACTGGAGAACGCAAGAGATGGGTC
CCTCGAAGAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATTTATTCAAATCCTATGGAAACATCCTTTAAGCAACGATGTCTAGAAGATTGGAAGATGTACCA
CCGAAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGCAGCTCTTGGGGATGCATCTGAAGCTGATTATCTTAGAGTCGAGGAGAGATTGAAGAAAATTATAAAGGGTC
CTGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAATTAGAAGCACAAGGTTTACCAATTGACGGAACTAGAAATGTTCTT
TACCAGCGTGTTCAAAAAGCAAGGAGAATAAATCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTAGAGGAGGAGGAAGAAGAGGTTGATGAAGAGCTGGATGAACT
AATTTCACGAATAAAGCTACATGAAGGAAATACAGAGTTCTGGAAACGTCGCTTTCTTGGAGAAGGCTTGGACAGCAATAATGTTAAACCTTCTGAAGATGAAAAATCAG
AACCTCTTGATTCTTTGGATGATGTTGACATTGTCGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCCGAGGAAGAAGAGGTAGAACAAACTGAGAATCAAGATGGT
GAAAGAGTTATCAAGAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATAGGTGTCCAATTGCTAAAAGATGTTGACCAAACTACAACAACATCGAAAAAGTCAAGGAG
GAGAAGTTCTAGAGCATCACTCGAGGACGATCGTGATGAAGACTGGTTTCCTGAAGATATATTCGAGGCATTTGAAGAGTTGCGAAAGAGGAAAGTCTTTGATGTATCTG
ACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAAAGAGAACTTAAGAACAGACCTCCCAGGAGTAGCATTAAAATTTGGGCCCTTTCATCTTTAAATTTTCTA
TTGTCTAATGTTCATTCAATGTTCCACTTACTATTACAGGTGATAGAATTGGGTGGAACACCAACGATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAGGGCTCC
TCTACCGTCTGCTTTTTTGAAAATTTTGCAGACAACACATGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGTTGAGATTATTACCCTGTGTCTTGATCTTGGGGAAC
TGGATGCTGCCATTGCAATTGTAGCAGACCTGGAAACCACAGGAATCTCAGTTCCTGATGAAACACTCGATCGGGTAATCTCCGCAAGACAGACAAACGATGCTACCCCC
AAGCCTGATTCAGCCATTGATACTACACTCAATGATCATAGTGCCATTGATGAAGCATCATAA

Protein sequence
MRARHCLCARKDRLDTRAAAAASLSARKISELISLVYAGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWV
PRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVL
YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDEKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEVEQTENQDG
ERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRSSIKIWALSSLNFL
LSNVHSMFHLLLQVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYVEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDATP
KPDSAIDTTLNDHSAIDEAS
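
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (the usual choice for a nuclear plant gene, but not stated on this page) and uses a hypothetical helper, `translate`. Only the first 30 and the last 12 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCGAGCGCGGCACTGTCTGTGTGCGAGG"  # first 30 nt of the CDS
cds_end = "GAAGCATCATAA"                      # last 12 nt of the CDS
print(translate(cds_start))  # MRARHCLCAR
print(translate(cds_end))    # EAS
```

Both fragments agree with the listed protein, which begins with MRARHCLCAR and ends with EAS.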