CuGenDBv2

Lag0035391 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035391
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: SAP domain-containing protein
Genome location: chr3:20765093..20770009
RNA-Seq Expression: Lag0035391
Synteny: Lag0035391
Gene Ontology terms:
  GO:0006807 - nitrogen compound metabolic process (biological process)
  GO:0006979 - response to oxidative stress (biological process)
  GO:0044238 - primary metabolic process (biological process)
  GO:0044260 - cellular macromolecule metabolic process (biological process)
  GO:0098869 - cellular oxidant detoxification (biological process)
  GO:0004601 - peroxidase activity (molecular function)
  GO:0020037 - heme binding (molecular function)
InterPro domains:
  IPR003034 - SAP domain
  IPR036361 - SAP domain superfamily
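
The genome location above is given as chromosome:start..end. As a rough illustration, the sketch below parses that string and computes the length of the gene span. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {location!r}")
    return chrom, start, end


def span_length(location: str) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    loc = "chr3:20765093..20770009"
    print(parse_location(loc), span_length(loc))
```

Under the inclusive-coordinate assumption the span is 4917 bp, longer than the 1725 nt CDS listed under Sequences, so the span covers more than coding sequence.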


Homology
GenBank top hits (e value, % identity, alignment)
XP_008443746.1 PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo]  e value: 2.8e-275  % identity: 94.65
Query:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
        RV++LLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRC
Subjt:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC

Query:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
        LEDWKMYHRKILKTLQNEGL AL DASEADY RV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
Subjt:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW

Query:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
        VPPVEEEEEEVDE+LDELISR+KLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKS+ LDSLDDVD +EDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
Subjt:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV

Query:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
        EAKKPLQMIGVQLLKDVDQ T TSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
Subjt:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI

Query:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA
        KIMHKVIELGGTPTIGDCAMILRAAI+APLPSAFLKILQTTHGLGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDRVIS RQTNDA
Subjt:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA

Query:  TPKPDSDIDTTLNDHS-AIDEAS
         PKPDS IDTT+NDHS A DEAS
Subjt:  TPKPDSDIDTTLNDHS-AIDEAS

XP_011660243.1 uncharacterized protein LOC101209618 isoform X1 [Cucumis sativus]  e value: 1.7e-275  % identity: 95.22
Query:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
        RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRC
Subjt:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC

Query:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
        LEDWKMYHRKILKTLQNEGL AL DASEADY RV ERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
Subjt:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW

Query:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
        VPPVEEEEEEVDE+LDELISR+KLHEGNTEFWKRRFLGEGL SNNVKPSEDDKS+PLDSLDDVD +EDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
Subjt:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV

Query:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
        EAKKPLQMIGVQLLKDVDQ TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
Subjt:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI

Query:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA
        KIMHKVIELGG PTIGDCAMILRAAI+APLPSAFLKILQTTHGLGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI V DETLDRVISARQTNDA
Subjt:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA

Query:  TPKPDSDIDTTLNDHS-AIDEAS
         PKPDS IDTTLNDHS A DEAS
Subjt:  TPKPDSDIDTTLNDHS-AIDEAS

XP_022151680.1 uncharacterized protein LOC111019595 [Momordica charantia]  e value: 1.3e-275  % identity: 95.41
Query:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
        RVVELLEALEAMARDNQQIP RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
Subjt:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC

Query:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
        LEDWKMYHRKILKTLQNEGL ALGDASEADY RVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
Subjt:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW

Query:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
        VPPVEEEEEEVDE+LDELISR+KLHEGNTE+WKRRFLGEGLD+N+VKPSEDDKSEPLDSLDDVDIVED AKEIEEEE  EEEEVEQTENQDGERVIKKEV
Subjt:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV

Query:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
        EAKKPLQMIGVQLLKDVDQ TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+ELRKR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA 
Subjt:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI

Query:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA
        KIMHKVIELGGTPTIGDCAMILRAAIR+PLPSAFLKILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDRVISARQTNDA
Subjt:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA

Query:  TPKPDSDIDTTLNDHS-AIDEAS
         PKPD+ IDTTLNDHS A DEAS
Subjt:  TPKPDSDIDTTLNDHS-AIDEAS

XP_031740953.1 uncharacterized protein LOC101209618 isoform X2 [Cucumis sativus]  e value: 1.7e-275  % identity: 95.22
Query:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
        RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRC
Subjt:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC

Query:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
        LEDWKMYHRKILKTLQNEGL AL DASEADY RV ERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
Subjt:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW

Query:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
        VPPVEEEEEEVDE+LDELISR+KLHEGNTEFWKRRFLGEGL SNNVKPSEDDKS+PLDSLDDVD +EDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
Subjt:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV

Query:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
        EAKKPLQMIGVQLLKDVDQ TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
Subjt:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI

Query:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA
        KIMHKVIELGG PTIGDCAMILRAAI+APLPSAFLKILQTTHGLGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI V DETLDRVISARQTNDA
Subjt:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA

Query:  TPKPDSDIDTTLNDHS-AIDEAS
         PKPDS IDTTLNDHS A DEAS
Subjt:  TPKPDSDIDTTLNDHS-AIDEAS

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida]  e value: 5.5e-279  % identity: 95.98
Query:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
        RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
Subjt:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC

Query:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
        LEDWKM+HRKILKTLQNEGLAALG ASEADY RV ERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
Subjt:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW

Query:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
        VPPVEEEEEEVDE+LDELISR+KLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVD VEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
Subjt:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV

Query:  EAKKPLQMIGVQLLKDVDQTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIK
        EAKKPLQMIGVQLLKDVDQ TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+ELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIK
Subjt:  EAKKPLQMIGVQLLKDVDQTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIK

Query:  IMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDAT
        IMHKVIELGGTPTIGDCAMILRAAI+APLPS+FLKILQTTHGLGY FGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDRVIS RQTND+ 
Subjt:  IMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDAT

Query:  PKPDSDIDTTLNDHS-AIDEAS
        PKPDS IDTTLNDHS A DEAS
Subjt:  PKPDSDIDTTLNDHS-AIDEAS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0M091 SAP domain-containing protein  e value: 8.1e-276  % identity: 95.22
Query:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
        RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRC
Subjt:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC

Query:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
        LEDWKMYHRKILKTLQNEGL AL DASEADY RV ERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
Subjt:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW

Query:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
        VPPVEEEEEEVDE+LDELISR+KLHEGNTEFWKRRFLGEGL SNNVKPSEDDKS+PLDSLDDVD +EDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
Subjt:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV

Query:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
        EAKKPLQMIGVQLLKDVDQ TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
Subjt:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI

Query:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA
        KIMHKVIELGG PTIGDCAMILRAAI+APLPSAFLKILQTTHGLGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI V DETLDRVISARQTNDA
Subjt:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA

Query:  TPKPDSDIDTTLNDHS-AIDEAS
         PKPDS IDTTLNDHS A DEAS
Subjt:  TPKPDSDIDTTLNDHS-AIDEAS

A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X1  e value: 1.4e-275  % identity: 94.65
Query:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
        RV++LLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRC
Subjt:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC

Query:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
        LEDWKMYHRKILKTLQNEGL AL DASEADY RV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
Subjt:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW

Query:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
        VPPVEEEEEEVDE+LDELISR+KLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKS+ LDSLDDVD +EDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
Subjt:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV

Query:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
        EAKKPLQMIGVQLLKDVDQ T TSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
Subjt:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI

Query:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA
        KIMHKVIELGGTPTIGDCAMILRAAI+APLPSAFLKILQTTHGLGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDRVIS RQTNDA
Subjt:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA

Query:  TPKPDSDIDTTLNDHS-AIDEAS
         PKPDS IDTT+NDHS A DEAS
Subjt:  TPKPDSDIDTTLNDHS-AIDEAS

A0A6J1DBV3 uncharacterized protein LOC111019595  e value: 6.2e-276  % identity: 95.41
Query:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
        RVVELLEALEAMARDNQQIP RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
Subjt:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC

Query:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
        LEDWKMYHRKILKTLQNEGL ALGDASEADY RVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
Subjt:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW

Query:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
        VPPVEEEEEEVDE+LDELISR+KLHEGNTE+WKRRFLGEGLD+N+VKPSEDDKSEPLDSLDDVDIVED AKEIEEEE  EEEEVEQTENQDGERVIKKEV
Subjt:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV

Query:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
        EAKKPLQMIGVQLLKDVDQ TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+ELRKR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA 
Subjt:  EAKKPLQMIGVQLLKDVDQ-TTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI

Query:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA
        KIMHKVIELGGTPTIGDCAMILRAAIR+PLPSAFLKILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDRVISARQTNDA
Subjt:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA

Query:  TPKPDSDIDTTLNDHS-AIDEAS
         PKPD+ IDTTLNDHS A DEAS
Subjt:  TPKPDSDIDTTLNDHS-AIDEAS

A0A6J1KW34 uncharacterized protein LOC111499221 isoform X2  e value: 2.6e-274  % identity: 94.07
Query:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
        RVVELLEALEAMARDNQQIP RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
Subjt:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC

Query:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
        LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
Subjt:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW

Query:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
        VPPVEEEEEEVDE+LDELISR+KLHEGNTEFWKRRFLGEGLDSNNVKPSEDD+SEPLDSLDDVD+VEDVAKEI+EEEAEEEEEVE TENQDGERVIKKEV
Subjt:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV

Query:  EAKKPLQMIGVQLLKDVDQT-TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
        EAKKP QMIGVQLLKDVDQT TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKRK+FD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
Subjt:  EAKKPLQMIGVQLLKDVDQT-TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI

Query:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA
        KIMHKVIELGG PTIGDCAMILRAAI+APLPSAF KILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGISVPDETLDR+ISARQTNDA
Subjt:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA

Query:  TPKPDSDIDTTLNDHS-AIDEAS
         PK DS ID TLNDHS A DE S
Subjt:  TPKPDSDIDTTLNDHS-AIDEAS

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X1  e value: 2.6e-274  % identity: 94.07
Query:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
        RVVELLEALEAMARDNQQIP RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
Subjt:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC

Query:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
        LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
Subjt:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW

Query:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV
        VPPVEEEEEEVDE+LDELISR+KLHEGNTEFWKRRFLGEGLDSNNVKPSEDD+SEPLDSLDDVD+VEDVAKEI+EEEAEEEEEVE TENQDGERVIKKEV
Subjt:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEV

Query:  EAKKPLQMIGVQLLKDVDQT-TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
        EAKKP QMIGVQLLKDVDQT TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKRK+FD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI
Subjt:  EAKKPLQMIGVQLLKDVDQT-TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAI

Query:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA
        KIMHKVIELGG PTIGDCAMILRAAI+APLPSAF KILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGISVPDETLDR+ISARQTNDA
Subjt:  KIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTNDA

Query:  TPKPDSDIDTTLNDHS-AIDEAS
         PK DS ID TLNDHS A DE S
Subjt:  TPKPDSDIDTTLNDHS-AIDEAS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G04260.1 plastid transcriptionally active 3  e value: 1.6e-215  % identity: 73.79
Query:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC
        R+VEL++AL+AM +DNQ IPPRAMI+SRKYR+LVSSWIEPLQEEAE GYEIDY+ARYIEEGGLTGERKRWVPRRGKTPLDPDA GFIYSNP+ETSFKQRC
Subjt:  RVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRC

Query:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW
        LEDWK++HRK+L+TLQ+EGL  LGDASE+DY+RV ERL+ IIKGP  N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN+SRGRPLW
Subjt:  LEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLW

Query:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSE--------------DDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQ
        VPP+EEEEEEVDE++D+LI R+KLHEG+TEFWKRRFLGEGL   +V+  E              +D S+  D+ +D D  E    E ++E  EEE  V +
Subjt:  VPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSE--------------DDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQ

Query:  TENQ-DGERVIK-KEVEAKKPLQMIGVQLLKDVDQTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNR
        TEN+ +GE ++K K  +AKK LQMIGVQLLK+ D+   +KK  +R+SR +LEDD DEDWFPE+ FEAF+E+R+RKVFDV+DMYTIADVWGWTWE++ KN+
Subjt:  TENQ-DGERVIK-KEVEAKKPLQMIGVQLLKDVDQTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNR

Query:  PPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPD
         PR+WSQEWEVELAI +M KVIELGG PTIGDCA+ILRAA+RAP+PSAFLKILQTTH LGY FGSPLYDEIITLCLDLGELDAAIAIVAD+ETTGI+VPD
Subjt:  PPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPD

Query:  ETLDRVISARQTNDA
        +TLD+VISARQ+N++
Subjt:  ETLDRVISARQTNDA


Sequences
CDS sequence
ATGTCGGAAGGCGGAGGAAGATCTGGATCACATTCTGTGGAGTTGTGCCTATGTGCGTGCAGCGTGGGATCGGTTTGCCCAGTTTTTGGGTTGCAGGGGATGTCTTTTGT
TGACCATAGGGAGATGATTGAGAAGTTCCTCCTCCATCCGCCTTTTCGTAGGGTAGTAGAGCTCTTAGAAGCGTTAGAAGCTATGGCTAGAGATAACCAACAGATTCCTC
CAAGAGCCATGATCTTGAGCAGAAAGTATAGATCACTGGTGAGCTCATGGATTGAACCTTTACAGGAAGAAGCTGAACATGGATATGAGATAGACTACATTGCAAGATAC
ATTGAAGAGGGTGGACTCACTGGAGAACGCAAGAGATGGGTCCCTCGAAGAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATTTATTCAAATCCTATGGAAAC
ATCCTTTAAGCAACGATGTCTAGAAGATTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCAGAATGAAGGGCTTGCAGCTCTTGGGGATGCATCTGAAGCTGATT
ATCTTAGAGTCGAGGAGAGATTGAAGAAAATTATTAAGGGTCCTGACCAAAATGTTTTAAAGCCGAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAATTA
GAAGCACAAGGTTTACCAATTGACGGAACTAGAAATGTTCTTTACCAACGTGTTCAAAAAGCAAGGAGAATAAATCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGT
AGAGGAGGAGGAAGAAGAGGTTGATGAAGACCTGGATGAACTAATTTCACGAGTAAAGCTACATGAAGGAAATACAGAGTTCTGGAAACGTCGCTTTCTTGGAGAAGGCT
TGGACAGCAATAATGTTAAACCTTCTGAAGATGATAAATCAGAACCTCTTGATTCTTTGGATGATGTTGACATTGTCGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAA
GCCGAGGAGGAAGAAGAGGTAGAACAAACTGAGAATCAAGATGGTGAAAGAGTTATCAAGAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATAGGTGTCCAATTGCT
GAAAGATGTTGACCAAACTACAACATCGAAAAAGTCAAGGAGGAGAAGTTCTAGAGCATCACTCGAGGACGATCGTGATGAAGACTGGTTTCCTGAAGATATATTCGAGG
CATTTGAAGAGTTGCGAAAGAGGAAAGTCTTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAAAGAGAACTTAAGAACAGACCTCCCAGG
AGGTGGTCACAGGAATGGGAAGTGGAGTTGGCTATTAAAATTATGCACAAGGTGATAGAATTGGGTGGAACACCAACGATTGGCGACTGTGCCATGATCTTGCGAGCTGC
CATCAGGGCTCCTCTACCGTCTGCTTTTTTGAAAATTTTGCAGACAACACATGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGATGAGATTATTACCCTGTGTCTTG
ATCTTGGGGAACTGGATGCAGCCATTGCAATTGTAGCAGACCTGGAAACCACAGGAATCTCAGTTCCTGATGAAACACTCGATCGGGTAATCTCCGCAAGACAGACGAAC
GATGCTACGCCCAAGCCTGATTCAGACATTGATACTACACTCAATGATCATAGTGCCATTGATGAAGCATCATAA
mRNA sequence
ATGTCGGAAGGCGGAGGAAGATCTGGATCACATTCTGTGGAGTTGTGCCTATGTGCGTGCAGCGTGGGATCGGTTTGCCCAGTTTTTGGGTTGCAGGGGATGTCTTTTGT
TGACCATAGGGAGATGATTGAGAAGTTCCTCCTCCATCCGCCTTTTCGTAGGGTAGTAGAGCTCTTAGAAGCGTTAGAAGCTATGGCTAGAGATAACCAACAGATTCCTC
CAAGAGCCATGATCTTGAGCAGAAAGTATAGATCACTGGTGAGCTCATGGATTGAACCTTTACAGGAAGAAGCTGAACATGGATATGAGATAGACTACATTGCAAGATAC
ATTGAAGAGGGTGGACTCACTGGAGAACGCAAGAGATGGGTCCCTCGAAGAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATTTATTCAAATCCTATGGAAAC
ATCCTTTAAGCAACGATGTCTAGAAGATTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCAGAATGAAGGGCTTGCAGCTCTTGGGGATGCATCTGAAGCTGATT
ATCTTAGAGTCGAGGAGAGATTGAAGAAAATTATTAAGGGTCCTGACCAAAATGTTTTAAAGCCGAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAATTA
GAAGCACAAGGTTTACCAATTGACGGAACTAGAAATGTTCTTTACCAACGTGTTCAAAAAGCAAGGAGAATAAATCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGT
AGAGGAGGAGGAAGAAGAGGTTGATGAAGACCTGGATGAACTAATTTCACGAGTAAAGCTACATGAAGGAAATACAGAGTTCTGGAAACGTCGCTTTCTTGGAGAAGGCT
TGGACAGCAATAATGTTAAACCTTCTGAAGATGATAAATCAGAACCTCTTGATTCTTTGGATGATGTTGACATTGTCGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAA
GCCGAGGAGGAAGAAGAGGTAGAACAAACTGAGAATCAAGATGGTGAAAGAGTTATCAAGAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATAGGTGTCCAATTGCT
GAAAGATGTTGACCAAACTACAACATCGAAAAAGTCAAGGAGGAGAAGTTCTAGAGCATCACTCGAGGACGATCGTGATGAAGACTGGTTTCCTGAAGATATATTCGAGG
CATTTGAAGAGTTGCGAAAGAGGAAAGTCTTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAAAGAGAACTTAAGAACAGACCTCCCAGG
AGGTGGTCACAGGAATGGGAAGTGGAGTTGGCTATTAAAATTATGCACAAGGTGATAGAATTGGGTGGAACACCAACGATTGGCGACTGTGCCATGATCTTGCGAGCTGC
CATCAGGGCTCCTCTACCGTCTGCTTTTTTGAAAATTTTGCAGACAACACATGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGATGAGATTATTACCCTGTGTCTTG
ATCTTGGGGAACTGGATGCAGCCATTGCAATTGTAGCAGACCTGGAAACCACAGGAATCTCAGTTCCTGATGAAACACTCGATCGGGTAATCTCCGCAAGACAGACGAAC
GATGCTACGCCCAAGCCTGATTCAGACATTGATACTACACTCAATGATCATAGTGCCATTGATGAAGCATCATAA
Protein sequence
MSEGGGRSGSHSVELCLCACSVGSVCPVFGLQGMSFVDHREMIEKFLLHPPFRRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARY
IEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEEL
EAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEDLDELISRVKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSEPLDSLDDVDIVEDVAKEIEEEE
AEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPR
RWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAIRAPLPSAFLKILQTTHGLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRVISARQTN
DATPKPDSDIDTTLNDHSAIDEAS
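
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The sketch below is a minimal illustration, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for this example, not a function provided by the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order:
# the first base varies slowest and the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Codons that are not in the
    table (for example ones containing N) are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 36 nt and last 15 nt of the CDS listed in this record.
    print(translate("ATGTCGGAAGGCGGAGGAAGATCTGGATCACATTCT"))  # MSEGGGRSGSHS
    print(translate("GATGAAGCATCATAA"))  # DEAS
```

The two fragments translate to the first twelve and last four residues of the protein sequence above. Passing the full CDS (line breaks included) to `translate` allows the same comparison over the whole sequence.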