; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019893 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019893
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSAP domain-containing protein
Genome locationtig00153424:1071138..1088436
RNA-Seq ExpressionSgr019893
SyntenySgr019893
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR003034 - SAP domain
IPR036361 - SAP domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443746.1 PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo]4.8e-28594.51Show/hide
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAM+RDNQQIPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPD +GFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGN EFWKRRFLGEGLD+NNVKPSEDDKS+ LDSLDD D +EDVAKEIEEEEA+EEEEVEQTENQDGERV KKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI
         TSKKSRRR+SRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEV+LAIKIM KVIELGGTPTIGDCAMI
Subjt:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI

Query:  LRSAIRAPLPSAFLKILQTTHGLGYVFG
        LR+AI+APLPSAFLKILQTTHGLGYVFG
Subjt:  LRSAIRAPLPSAFLKILQTTHGLGYVFG

XP_008443747.1 PREDICTED: uncharacterized protein LOC103487261 isoform X2 [Cucumis melo]4.8e-28594.51Show/hide
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAM+RDNQQIPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPD +GFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGN EFWKRRFLGEGLD+NNVKPSEDDKS+ LDSLDD D +EDVAKEIEEEEA+EEEEVEQTENQDGERV KKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI
         TSKKSRRR+SRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEV+LAIKIM KVIELGGTPTIGDCAMI
Subjt:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI

Query:  LRSAIRAPLPSAFLKILQTTHGLGYVFG
        LR+AI+APLPSAFLKILQTTHGLGYVFG
Subjt:  LRSAIRAPLPSAFLKILQTTHGLGYVFG

XP_022151680.1 uncharacterized protein LOC111019595 [Momordica charantia]8.7e-28795.83Show/hide
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAM+RDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
        RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPD +GFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGN E+WKRRFLGEGLDNN+VKPSEDDKSEPLDSLDD DIVED AKEIEEEE  EEEEVEQTENQDGERV KKEVEAKKPLQMIGVQLLKDVDQTT
Subjt:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI
        TTSKKSRRR+SRASLEDDRDEDWFPEDIFEAF+ELRKR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEV+LA KIM KVIELGGTPTIGDCAMI
Subjt:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI

Query:  LRSAIRAPLPSAFLKILQTTHGLGYVFG
        LR+AIR+PLPSAFLKILQTTH LGYVFG
Subjt:  LRSAIRAPLPSAFLKILQTTHGLGYVFG

XP_031740953.1 uncharacterized protein LOC101209618 isoform X2 [Cucumis sativus]4.0e-28494.32Show/hide
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGT ALHNEGNFGDPLSLYLRALCREGRVVELLEALEAM+RDNQQIPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPD +GFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV ERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGN EFWKRRFLGEGL +NNVKPSEDDKS+PLDSLDD D +EDVAKEIEEEEA+EEEEVEQTENQDGERV KKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI
        TTSKKSRRR+SRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEV+LAIKIM KVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI

Query:  LRSAIRAPLPSAFLKILQTTHGLGYVFG
        LR+AI+APLPSAFLKILQTTHGLGYVFG
Subjt:  LRSAIRAPLPSAFLKILQTTHGLGYVFG

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida]3.3e-28695.64Show/hide
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAM+RDNQQIPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPRRGKTPLDPD +GFIYSNPMETSFKQRCLEDWKM+HRKILKTLQNEGLA
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALG ASEADYHRV ERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGN EFWKRRFLGEGLD+NNVKPSEDDKSEPLDSLDD D VEDVAKEIEEEEA+EEEEVEQTENQDGERV KKEVEAKKPLQMIGVQLLKDVDQ  
Subjt:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI
        TTSKKSRRR+SRASLEDDRDEDWFPEDIFEAF+ELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEV+LAIKIM KVIELGGTPTIGDCAMI
Subjt:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI

Query:  LRSAIRAPLPSAFLKILQTTHGLGYVFG
        LR+AI+APLPS+FLKILQTTHGLGY FG
Subjt:  LRSAIRAPLPSAFLKILQTTHGLGYVFG

TrEMBL top hitse value%identityAlignment
A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X12.3e-28594.51Show/hide
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAM+RDNQQIPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPD +GFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGN EFWKRRFLGEGLD+NNVKPSEDDKS+ LDSLDD D +EDVAKEIEEEEA+EEEEVEQTENQDGERV KKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI
         TSKKSRRR+SRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEV+LAIKIM KVIELGGTPTIGDCAMI
Subjt:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI

Query:  LRSAIRAPLPSAFLKILQTTHGLGYVFG
        LR+AI+APLPSAFLKILQTTHGLGYVFG
Subjt:  LRSAIRAPLPSAFLKILQTTHGLGYVFG

A0A1S3B9H7 uncharacterized protein LOC103487261 isoform X22.3e-28594.51Show/hide
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAM+RDNQQIPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPD +GFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGN EFWKRRFLGEGLD+NNVKPSEDDKS+ LDSLDD D +EDVAKEIEEEEA+EEEEVEQTENQDGERV KKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI
         TSKKSRRR+SRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEV+LAIKIM KVIELGGTPTIGDCAMI
Subjt:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI

Query:  LRSAIRAPLPSAFLKILQTTHGLGYVFG
        LR+AI+APLPSAFLKILQTTHGLGYVFG
Subjt:  LRSAIRAPLPSAFLKILQTTHGLGYVFG

A0A6J1DBV3 uncharacterized protein LOC1110195954.2e-28795.83Show/hide
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAM+RDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
        RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPD +GFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGN E+WKRRFLGEGLDNN+VKPSEDDKSEPLDSLDD DIVED AKEIEEEE  EEEEVEQTENQDGERV KKEVEAKKPLQMIGVQLLKDVDQTT
Subjt:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI
        TTSKKSRRR+SRASLEDDRDEDWFPEDIFEAF+ELRKR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEV+LA KIM KVIELGGTPTIGDCAMI
Subjt:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI

Query:  LRSAIRAPLPSAFLKILQTTHGLGYVFG
        LR+AIR+PLPSAFLKILQTTH LGYVFG
Subjt:  LRSAIRAPLPSAFLKILQTTHGLGYVFG

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X15.3e-28293.75Show/hide
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTF GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAM+RDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
        RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPD +GFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALGDASEADY RVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGN EFWKRRFLGEGLD+NNVKPSEDD+SEPLDSLDD DIVEDVAKEI+EEEA+EEEEVE TENQDGERV KKEVEAKKP QMIGVQLLKDVDQT+
Subjt:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI
        TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEV+LAIKIM KVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI

Query:  LRSAIRAPLPSAFLKILQTTHGLGYVFG
        LR+AI+APLPSAF KILQTTH LGYVFG
Subjt:  LRSAIRAPLPSAFLKILQTTHGLGYVFG

A0A6J1EWQ7 uncharacterized protein LOC111436825 isoform X25.3e-28293.75Show/hide
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTF GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAM+RDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
        RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPD +GFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALGDASEADY RVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGN EFWKRRFLGEGLD+NNVKPSEDD+SEPLDSLDD DIVEDVAKEI+EEEA+EEEEVE TENQDGERV KKEVEAKKP QMIGVQLLKDVDQT+
Subjt:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERV-KKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI
        TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEV+LAIKIM KVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMI

Query:  LRSAIRAPLPSAFLKILQTTHGLGYVFG
        LR+AI+APLPSAF KILQTTH LGYVFG
Subjt:  LRSAIRAPLPSAFLKILQTTHGLGYVFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G04260.1 plastid transcriptionally active 34.7e-23073.63Show/hide
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP
        VQDVAELLGMMVEDHKR+QPN++TYALLVECFTKYCV++EAIRHFRALK F+GGT  LHN GNF DPLSLYLRALCREGR+VEL++AL+AM +DNQ IPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
        RAMI+SRKYR+LVSSWIEPLQEEAE GYEIDY+ARYIEEGGLTGERKRWVPRRGKTPLDPD  GFIYSNP+ETSFKQRCLEDWK++HRK+L+TLQ+EGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
         LGDASE+DY RV ERL+ IIKGP  N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN+SRGRPLWVPP+EEEEEEVDEE+D+LI R
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEE----------------VEQTENQ-DGERV--KKEVEAK
        IKLHEG+ EFWKRRFLGEGL   +V+  E   +E + + +    +ED++KE + EE D+EEE                V +TEN+ +GE +   K  +AK
Subjt:  IKLHEGNIEFWKRRFLGEGLDNNNVKPSEDDKSEPLDSLDDGDIVEDVAKEIEEEEADEEEE----------------VEQTENQ-DGERV--KKEVEAK

Query:  KPLQMIGVQLLKDVDQTTTTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIM
        K LQMIGVQLLK+ D+   T KK  +R SR +LEDD DEDWFPE+ FEAF+E+R+RKVFDV+DMYTIADVWGWTWE++ KN+ PR+WSQEWEV+LAI +M
Subjt:  KPLQMIGVQLLKDVDQTTTTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIM

Query:  QKVIELGGTPTIGDCAMILRSAIRAPLPSAFLKILQTTHGLGYVFG
         KVIELGG PTIGDCA+ILR+A+RAP+PSAFLKILQTTH LGY FG
Subjt:  QKVIELGGTPTIGDCAMILRSAIRAPLPSAFLKILQTTHGLGYVFG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGTGCAAGATGTTGCTGAGTTACTTGGCATGATGGTTGAAGACCACAAGCGTCTACAGCCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACTAAGTATTGTG
TTATACGAGAAGCCATTAGGCATTTTCGTGCACTGAAAACCTTTCAAGGTGGAACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCTTTATATCTTCGA
GCTTTATGTAGAGAAGGTAGAGTTGTAGAGCTCTTAGAAGCGTTAGAAGCTATGTCTAGAGATAACCAACAGATACCTCCAAGAGCCATGATCTTGAGCAGAAAGTATCG
GTCACTGGTGAGCTCATGGATTGAACCTTTACAGGAAGAAGCTGAACATGGATACGAAATAGACTACATTGCAAGATACATTGAAGAGGGTGGACTCACTGGAGAACGTA
AGAGATGGGTCCCTCGAAGAGGAAAAACTCCTCTAGATCCTGATGTCGAGGGATTTATCTATTCAAATCCGATGGAAACATCCTTTAAGCAACGATGTCTAGAAGATTGG
AAGATGTACCACCGAAAGATTCTGAAAACTTTGCAGAATGAAGGCCTTGCAGCTCTTGGGGATGCATCTGAAGCTGATTATCATAGAGTCGAGGAGAGATTGAAGAAAAT
TATAAAGGGTCCAGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAATTAGAAGCACAAGGTTTACCGATTGATGGAACTA
GAAATGTTCTTTACCAGCGTGTTCAAAAAGCAAGGAGAATAAATCGGTCACGTGGTAGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAAGAAGAAGAGGTTGATGAAGAG
CTGGATGAACTAATTTCACGAATAAAGCTACATGAAGGAAATATAGAGTTCTGGAAACGCCGCTTTCTTGGAGAAGGCTTGGACAATAATAATGTTAAACCTTCTGAGGA
TGATAAATCAGAACCTCTTGATTCTTTGGATGACGGCGACATTGTAGAAGATGTTGCAAAGGAGATTGAAGAAGAAGAAGCCGATGAAGAAGAGGAGGTAGAACAAACTG
AGAATCAAGATGGTGAAAGAGTTAAGAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATAGGTGTCCAATTGTTAAAAGATGTCGACCAAACCACAACAACATCCAAA
AAGTCAAGGAGGAGAACTTCTCGAGCATCACTTGAGGACGATCGTGATGAAGACTGGTTTCCTGAAGATATATTCGAGGCATTTAGAGAGTTGCGAAAGAGGAAAGTATT
TGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAATAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGACTTGG
CTATTAAAATTATGCAGAAGGTGATAGAATTGGGTGGAACGCCAACAATTGGGGACTGTGCCATGATCTTGCGATCTGCCATCAGGGCTCCTCTACCTTCTGCCTTTTTG
AAGATCTTGCAGACAACACACGGTCTTGGCTATGTTTTTGGGAG
mRNA sequenceShow/hide mRNA sequence
GGTGCAAGATGTTGCTGAGTTACTTGGCATGATGGTTGAAGACCACAAGCGTCTACAGCCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACTAAGTATTGTG
TTATACGAGAAGCCATTAGGCATTTTCGTGCACTGAAAACCTTTCAAGGTGGAACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCTTTATATCTTCGA
GCTTTATGTAGAGAAGGTAGAGTTGTAGAGCTCTTAGAAGCGTTAGAAGCTATGTCTAGAGATAACCAACAGATACCTCCAAGAGCCATGATCTTGAGCAGAAAGTATCG
GTCACTGGTGAGCTCATGGATTGAACCTTTACAGGAAGAAGCTGAACATGGATACGAAATAGACTACATTGCAAGATACATTGAAGAGGGTGGACTCACTGGAGAACGTA
AGAGATGGGTCCCTCGAAGAGGAAAAACTCCTCTAGATCCTGATGTCGAGGGATTTATCTATTCAAATCCGATGGAAACATCCTTTAAGCAACGATGTCTAGAAGATTGG
AAGATGTACCACCGAAAGATTCTGAAAACTTTGCAGAATGAAGGCCTTGCAGCTCTTGGGGATGCATCTGAAGCTGATTATCATAGAGTCGAGGAGAGATTGAAGAAAAT
TATAAAGGGTCCAGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAATTAGAAGCACAAGGTTTACCGATTGATGGAACTA
GAAATGTTCTTTACCAGCGTGTTCAAAAAGCAAGGAGAATAAATCGGTCACGTGGTAGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAAGAAGAAGAGGTTGATGAAGAG
CTGGATGAACTAATTTCACGAATAAAGCTACATGAAGGAAATATAGAGTTCTGGAAACGCCGCTTTCTTGGAGAAGGCTTGGACAATAATAATGTTAAACCTTCTGAGGA
TGATAAATCAGAACCTCTTGATTCTTTGGATGACGGCGACATTGTAGAAGATGTTGCAAAGGAGATTGAAGAAGAAGAAGCCGATGAAGAAGAGGAGGTAGAACAAACTG
AGAATCAAGATGGTGAAAGAGTTAAGAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATAGGTGTCCAATTGTTAAAAGATGTCGACCAAACCACAACAACATCCAAA
AAGTCAAGGAGGAGAACTTCTCGAGCATCACTTGAGGACGATCGTGATGAAGACTGGTTTCCTGAAGATATATTCGAGGCATTTAGAGAGTTGCGAAAGAGGAAAGTATT
TGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAATAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGACTTGG
CTATTAAAATTATGCAGAAGGTGATAGAATTGGGTGGAACGCCAACAATTGGGGACTGTGCCATGATCTTGCGATCTGCCATCAGGGCTCCTCTACCTTCTGCCTTTTTG
AAGATCTTGCAGACAACACACGGTCTTGGCTATGTTTTTGGGAG
Protein sequenceShow/hide protein sequence
VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMSRDNQQIPPRAMILSRKYR
SLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVEERLKKI
IKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNIEFWKRRFLGEGLDNNNVKPSED
DKSEPLDSLDDGDIVEDVAKEIEEEEADEEEEVEQTENQDGERVKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRTSRASLEDDRDEDWFPEDIFEAFRELRKRKVF
DVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVDLAIKIMQKVIELGGTPTIGDCAMILRSAIRAPLPSAFLKILQTTHGLGYVFGX