; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg16461 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg16461
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionSAP domain-containing protein
Genome locationCarg_Chr12:7627148..7633864
RNA-Seq ExpressionCarg16461
SyntenyCarg16461
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR003034 - SAP domain
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR036361 - SAP domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588298.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.3e-29397.72Show/hide
Query:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
        P + F TF+ + +  DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
Subjt:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN

Query:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
        EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
Subjt:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP

Query:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
        DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
Subjt:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL

Query:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE
        YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE
Subjt:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE

Query:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
        EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
Subjt:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE

Query:  LKNRPPRRWSQEWEVELAIKIMHKAI
        LKNRPPRRWSQEWEVELAIKIMHK I
Subjt:  LKNRPPRRWSQEWEVELAIKIMHKAI

KAG7020857.1 hypothetical protein SDJN02_17545, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MVWTNFQLVLLYVQFVTELCSYSPWLKFDTFQLNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVI
        MVWTNFQLVLLYVQFVTELCSYSPWLKFDTFQLNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVI
Subjt:  MVWTNFQLVLLYVQFVTELCSYSPWLKFDTFQLNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVI

Query:  REAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIE
        REAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIE
Subjt:  REAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIE

Query:  EGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKML
        EGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKML
Subjt:  EGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKML

Query:  VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDS
        VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDS
Subjt:  VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDS

Query:  LDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKR
        LDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKR
Subjt:  LDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKR

Query:  KVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKAI
        KVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKAI
Subjt:  KVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKAI

XP_022930357.1 uncharacterized protein LOC111436825 isoform X1 [Cucurbita moschata]4.9e-29397.34Show/hide
Query:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
        P + F TF+ + +  DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
Subjt:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN

Query:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
        EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
Subjt:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP

Query:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
        DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRN+L
Subjt:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL

Query:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE
        YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDD+SEPLDSLDDVDIVEDVAKEIDEEEAEEE
Subjt:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE

Query:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
        EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
Subjt:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE

Query:  LKNRPPRRWSQEWEVELAIKIMHKAI
        LKNRPPRRWSQEWEVELAIKIMHK I
Subjt:  LKNRPPRRWSQEWEVELAIKIMHKAI

XP_022930365.1 uncharacterized protein LOC111436825 isoform X2 [Cucurbita moschata]4.9e-29397.34Show/hide
Query:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
        P + F TF+ + +  DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
Subjt:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN

Query:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
        EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
Subjt:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP

Query:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
        DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRN+L
Subjt:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL

Query:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE
        YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDD+SEPLDSLDDVDIVEDVAKEIDEEEAEEE
Subjt:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE

Query:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
        EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
Subjt:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE

Query:  LKNRPPRRWSQEWEVELAIKIMHKAI
        LKNRPPRRWSQEWEVELAIKIMHK I
Subjt:  LKNRPPRRWSQEWEVELAIKIMHKAI

XP_023531020.1 uncharacterized protein LOC111793400 isoform X2 [Cucurbita pepo subsp. pepo]5.5e-29296.77Show/hide
Query:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
        P + F TF+ + +  DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGL+TFPGGTKALHN
Subjt:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN

Query:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
        EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
Subjt:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP

Query:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
        DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADY+RVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
Subjt:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL

Query:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE
        YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDD+SEPLDSLDDVDIVEDVAKEIDEEEAEEE
Subjt:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE

Query:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
        EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKD+DQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
Subjt:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE

Query:  LKNRPPRRWSQEWEVELAIKIMHKAI
        LKNRPPRRWSQEWEVELAIKIMHK I
Subjt:  LKNRPPRRWSQEWEVELAIKIMHKAI

TrEMBL top hitse value%identityAlignment
A0A6J1DBV3 uncharacterized protein LOC1110195959.1e-27792.59Show/hide
Query:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
        P + F TF+ + +  D MKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTF GGTKALHN
Subjt:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN

Query:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
        EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
Subjt:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP

Query:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
        DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL ALGDASEADY RVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVL
Subjt:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL

Query:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE
        YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTE+WKRRFLGEGLD+N+VKPSEDD+SEPLDSLDDVDIVED AKEI+EEE  EE
Subjt:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE

Query:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
        EEVE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQT+TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKR+VFD SDMYTIADVWGWTWERE
Subjt:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE

Query:  LKNRPPRRWSQEWEVELAIKIMHKAI
        LKNRPPRRWSQEWEVELA KIMHK I
Subjt:  LKNRPPRRWSQEWEVELAIKIMHKAI

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X12.4e-29397.34Show/hide
Query:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
        P + F TF+ + +  DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
Subjt:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN

Query:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
        EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
Subjt:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP

Query:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
        DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRN+L
Subjt:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL

Query:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE
        YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDD+SEPLDSLDDVDIVEDVAKEIDEEEAEEE
Subjt:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE

Query:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
        EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
Subjt:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE

Query:  LKNRPPRRWSQEWEVELAIKIMHKAI
        LKNRPPRRWSQEWEVELAIKIMHK I
Subjt:  LKNRPPRRWSQEWEVELAIKIMHKAI

A0A6J1EWQ7 uncharacterized protein LOC111436825 isoform X22.4e-29397.34Show/hide
Query:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
        P + F TF+ + +  DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
Subjt:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN

Query:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
        EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
Subjt:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP

Query:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
        DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRN+L
Subjt:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL

Query:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE
        YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDD+SEPLDSLDDVDIVEDVAKEIDEEEAEEE
Subjt:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE

Query:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
        EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
Subjt:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE

Query:  LKNRPPRRWSQEWEVELAIKIMHKAI
        LKNRPPRRWSQEWEVELAIKIMHK I
Subjt:  LKNRPPRRWSQEWEVELAIKIMHKAI

A0A6J1KW34 uncharacterized protein LOC111499221 isoform X27.7e-29296.58Show/hide
Query:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
        P + F TF+ + +  DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALH+
Subjt:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN

Query:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
        EG+FGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
Subjt:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP

Query:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
        DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADY+RVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
Subjt:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL

Query:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE
        YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDD+SEPLDSLDDVD+VEDVAKEIDEEEAEEE
Subjt:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE

Query:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
        EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRK+FDESDMYTIADVWGWTWERE
Subjt:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE

Query:  LKNRPPRRWSQEWEVELAIKIMHKAI
        LKNRPPRRWSQEWEVELAIKIMHK I
Subjt:  LKNRPPRRWSQEWEVELAIKIMHKAI

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X17.7e-29296.58Show/hide
Query:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
        P + F TF+ + +  DYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALH+
Subjt:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN

Query:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
        EG+FGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
Subjt:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP

Query:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
        DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADY+RVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
Subjt:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL

Query:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE
        YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDD+SEPLDSLDDVD+VEDVAKEIDEEEAEEE
Subjt:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEE

Query:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE
        EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRK+FDESDMYTIADVWGWTWERE
Subjt:  EEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERE

Query:  LKNRPPRRWSQEWEVELAIKIMHKAI
        LKNRPPRRWSQEWEVELAIKIMHK I
Subjt:  LKNRPPRRWSQEWEVELAIKIMHKAI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G04260.1 plastid transcriptionally active 32.4e-22172.32Show/hide
Query:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN
        P + + TF+ + +   +MKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKR+QPN++TYALLVECFTKYCV++EAIRHFR LK F GGT  LHN
Subjt:  PWLKFDTFQ-LNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHN

Query:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP
         GNF DPLSLYLRALCREGR+VEL++AL+AM +DNQ IP RAMI+SRKYR+LVSSWIEPLQEEAE GYEIDY+ARYIEEGGLTGERKRWVPRRGKTPLDP
Subjt:  EGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDP

Query:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL
        DA GFIYSNP+ETSFKQRCLEDWK++HRK+L+TLQ+EGL  LGDASE+DY+RV ERL+ IIKGP  N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVL
Subjt:  DADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVL

Query:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSE--------------DDRSEPLDSLDDVDIVE
        YQRVQKARRIN+SRGRPLWVPP+EEEEEEVDEE+D+LI RIKLHEG+TEFWKRRFLGEGL   +V+  E              +D S+  D+ +D D  E
Subjt:  YQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSE--------------DDRSEPLDSLDDVDIVE

Query:  DVAKEIDEEEAEEEEEVEPTENQ-DGERVIK-KEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDES
            E D+E  EEE  V  TEN+ +GE ++K K  +AKK  QMIGVQLLK+ D+ + T KK  +R SR ++EDD DEDWFPE+ FEAF E+R+RKVFD +
Subjt:  DVAKEIDEEEAEEEEEVEPTENQ-DGERVIK-KEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDES

Query:  DMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKAI
        DMYTIADVWGWTWE++ KN+ PR+WSQEWEVELAI +M K I
Subjt:  DMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCTGGACCAACTTTCAGCTGGTTTTGCTTTATGTGCAGTTTGTAACTGAGCTCTGCTCATATTCTCCATGGTTGAAGTTTGATACTTTTCAACTAAATCATATGAC
AGATTACATGAAGCCCGACACTGAGACATATAATTGGGTGATCCAAGCATATACAAGAGCTGAATCTTATGATAGGGTGCAAGATGTTGCTGAGTTACTTGGCATGATGG
TTGAAGACCACAAGCGTCTACAGCCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACCAAGTATTGTGTTATACGAGAAGCCATCAGGCATTTTCGTGGGCTA
AAAACTTTTCCAGGTGGAACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCCTTATATCTTCGAGCTTTATGTAGAGAAGGTAGGGTTGTAGAGCTCTT
AGAAGCATTAGAAGCTATGGCTAGAGACAACCAACAGATTCCTTCGAGAGCCATGATCTTGAGCAGAAAGTATCGATCACTTGTGAGCTCATGGATTGAACCTTTACAGG
AAGAAGCTGAACATGGATACGAGATAGACTACATTGCAAGATACATTGAAGAGGGTGGACTCACTGGAGAACGCAAGAGATGGGTGCCTCGAAGAGGAAAAACTCCTCTA
GATCCTGATGCAGATGGATTCATCTATTCAAATCCTATGGAAACATCCTTTAAGCAACGATGTCTAGAAGATTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCA
GAATGAAGGACTTGCAGCTCTTGGGGATGCATCTGAAGCTGATTATATTAGAGTCGAGGAGAGATTGAAGAAAATTATAAAGGGTCCTGATCCAAATATTTTAAAGCCAA
AGGCTGCAAGTAAGATGCTTGTATCAGAATTAAAAGAAGAATTGGAAGCACAAGGATTACCGATTGATGGAACTAGAAATGTTCTTTACCAGCGGGTTCAAAAAGCAAGG
AGAATAAATCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAGGAGGTTGATGAAGAGCTGGATGAACTAATTTCACGAATAAAGCTACACGA
AGGAAACACAGAGTTCTGGAAACGCCGTTTTCTTGGAGAAGGCTTGGACAGTAATAATGTTAAACCTTCTGAAGATGATCGGTCAGAACCTCTTGATTCTTTGGATGATG
TTGACATTGTAGAAGACGTTGCAAAGGAGATTGATGAAGAAGAAGCCGAGGAGGAAGAGGAGGTTGAACCAACCGAGAATCAAGATGGTGAAAGAGTTATTAAGAAGGAA
GTTGAAGCTAAGAAGCCTCCTCAAATGATAGGTGTCCAATTGTTGAAAGACGTTGACCAAACCTCAACAACATCCAAAAAGTCAAGGAGAAGACGTTCTCGAGCATCAGT
TGAGGACGATCGTGATGAAGACTGGTTTCCTGAAGATTTATTCGAGGCATTTGGAGAGTTGCGAAAGAGGAAAGTCTTTGATGAATCTGACATGTACACAATAGCTGATG
TTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGAGTTGGCCATTAAAATTATGCACAAGGCAATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTCTGGACCAACTTTCAGCTGGTTTTGCTTTATGTGCAGTTTGTAACTGAGCTCTGCTCATATTCTCCATGGTTGAAGTTTGATACTTTTCAACTAAATCATATGAC
AGATTACATGAAGCCCGACACTGAGACATATAATTGGGTGATCCAAGCATATACAAGAGCTGAATCTTATGATAGGGTGCAAGATGTTGCTGAGTTACTTGGCATGATGG
TTGAAGACCACAAGCGTCTACAGCCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACCAAGTATTGTGTTATACGAGAAGCCATCAGGCATTTTCGTGGGCTA
AAAACTTTTCCAGGTGGAACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCCTTATATCTTCGAGCTTTATGTAGAGAAGGTAGGGTTGTAGAGCTCTT
AGAAGCATTAGAAGCTATGGCTAGAGACAACCAACAGATTCCTTCGAGAGCCATGATCTTGAGCAGAAAGTATCGATCACTTGTGAGCTCATGGATTGAACCTTTACAGG
AAGAAGCTGAACATGGATACGAGATAGACTACATTGCAAGATACATTGAAGAGGGTGGACTCACTGGAGAACGCAAGAGATGGGTGCCTCGAAGAGGAAAAACTCCTCTA
GATCCTGATGCAGATGGATTCATCTATTCAAATCCTATGGAAACATCCTTTAAGCAACGATGTCTAGAAGATTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCA
GAATGAAGGACTTGCAGCTCTTGGGGATGCATCTGAAGCTGATTATATTAGAGTCGAGGAGAGATTGAAGAAAATTATAAAGGGTCCTGATCCAAATATTTTAAAGCCAA
AGGCTGCAAGTAAGATGCTTGTATCAGAATTAAAAGAAGAATTGGAAGCACAAGGATTACCGATTGATGGAACTAGAAATGTTCTTTACCAGCGGGTTCAAAAAGCAAGG
AGAATAAATCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAGGAGGTTGATGAAGAGCTGGATGAACTAATTTCACGAATAAAGCTACACGA
AGGAAACACAGAGTTCTGGAAACGCCGTTTTCTTGGAGAAGGCTTGGACAGTAATAATGTTAAACCTTCTGAAGATGATCGGTCAGAACCTCTTGATTCTTTGGATGATG
TTGACATTGTAGAAGACGTTGCAAAGGAGATTGATGAAGAAGAAGCCGAGGAGGAAGAGGAGGTTGAACCAACCGAGAATCAAGATGGTGAAAGAGTTATTAAGAAGGAA
GTTGAAGCTAAGAAGCCTCCTCAAATGATAGGTGTCCAATTGTTGAAAGACGTTGACCAAACCTCAACAACATCCAAAAAGTCAAGGAGAAGACGTTCTCGAGCATCAGT
TGAGGACGATCGTGATGAAGACTGGTTTCCTGAAGATTTATTCGAGGCATTTGGAGAGTTGCGAAAGAGGAAAGTCTTTGATGAATCTGACATGTACACAATAGCTGATG
TTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGAGTTGGCCATTAAAATTATGCACAAGGCAATTTAA
Protein sequenceShow/hide protein sequence
MVWTNFQLVLLYVQFVTELCSYSPWLKFDTFQLNHMTDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGL
KTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPL
DPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNVLYQRVQKAR
RINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDRSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKE
VEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKAI