; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0014107 (gene) of Chayote v1 genome

Gene IDSed0014107
OrganismSechium edule (Chayote v1)
DescriptionSAP domain-containing protein
Genome locationLG06:7748640..7768871
RNA-Seq ExpressionSed0014107
SyntenySed0014107
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR003034 - SAP domain
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR036361 - SAP domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588298.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0092.2Show/hide
Query:  MSKLLLSHSHLLTLPHKRHSVSLNNGVL-PIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS
        MSK LLSHS LLTLPHK HS SL+NGVL P+RSVLS  EKRGRKKRQSRQQQL  KDDDS+ LE +LRFTFMEELMDRARN DP+GVSDVIYDMVAAGLS
Subjt:  MSKLLLSHSHLLTLPHKRHSVSLNNGVL-PIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVS+ LNADAEGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRG E+LAAMEKLNYDIR AWLILIEELV+NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFRGLKTF GGTK LHNEG FGDPLSLYLRALCREGRVVELLEALEAMA+DNQ+IP RAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM

Query:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG
        +LSRKYRSLVSSWIEPLQEEA+HGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPD DGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQN GLAALG
Subjt:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL
        DASEADY+RVEERLKKIIKGPD NILKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI+L
Subjt:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL

Query:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS
        HEGNTEFWKRRFLGEGL+SNNV PSE D+SEPLDSLDDVDIVEDVAKEI+EEEAEEEEEVE TE QDGERVIKKEVEAK P QMIGVQLLKDVDQT+ TS
Subjt:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS

Query:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA
        KK RRR SRAS+EDDRDEDWFPEDLFEAFGELRKRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIM KVIELGGIPTIGDCAMILRA
Subjt:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDE
        AIKAPLPSAF KILQTTH LGYVFGSSLYDE++TLCLDLGELDAAIAIVADLETTGISVPDE LDR+ISARQTNDA PK D+ IDIT NDHSL NDE
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDE

XP_008443746.1 PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo]0.0e+0091.31Show/hide
Query:  MSKLLLSHSHLLTLPHKRHSVSLNNGVLPIRSVLSAPEKRGRKKRQSR-QQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS
        MSKLLLSH+HLLTLP+   S SLN+G+LPIRSVLSAP+KRGRKKRQSR QQQL  KDDDS++LEN+LRFTFMEELMDRARN DP+GVSDVIYDMVAAGLS
Subjt:  MSKLLLSHSHLLTLPHKRHSVSLNNGVLPIRSVLSAPEKRGRKKRQSR-QQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVS+ LN D EGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RG E+LAAME+LNYDIR AWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFR LKTFQGGTK LHNEG FGDPLSLYLRALCREGRV++LLEALEAMA+DNQ+IPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM

Query:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG
        +LSRKYRSLVSSWIEPLQEEA+HG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPD DGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQN GL AL 
Subjt:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL
        DASEADY RV E+LKKIIKGPDQN+LKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI+L
Subjt:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL

Query:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS
        HEGNTEFWKRRFLGEGL+SNNV PSE DKS+ LDSLDDVD +EDVAKEIEEEEAEEEEEVEQTE QDGERVIKKEVEAK PLQMIGVQLLKDVDQ TATS
Subjt:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS

Query:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA
        KK RRRSSRAS+EDDRDEDWFPED+FEAF EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIM KVIELGG PTIGDCAMILRA
Subjt:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDEA
        AIKAPLPSAFLKILQTTHGLGYVFGS LYDEV+TLCLDLGELDAAIAIVADLETTGI VPDE LDRVIS RQTNDA+PKPD+AID T NDHSLANDEA
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDEA

XP_022930357.1 uncharacterized protein LOC111436825 isoform X1 [Cucurbita moschata]0.0e+0091.86Show/hide
Query:  MSKLLLSHSHLLTLPHKRHSVSLNNGVL-PIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS
        MSK LLSHS+LLTLPHK HS SL+NGV  PIRSVLS  EKRGRKKRQSRQQQL  KDDDS+  E +LRFTFMEELMDRARN DP+GVSDVIYDMVAAGLS
Subjt:  MSKLLLSHSHLLTLPHKRHSVSLNNGVL-PIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVS+ LNADAEGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRG E+LAAMEKLNYDIR AWLILIEELV+NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFRGLKTF GGTK LHNEG FGDPLSLYLRALCREGRVVELLEALEAMA+DNQ+IP RAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM

Query:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG
        +LSRKYRSLVSSWIEPLQEEA+HGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPD DGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQN GLAALG
Subjt:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL
        DASEADY+RVEERLKKIIKGPD NILKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI+L
Subjt:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL

Query:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS
        HEGNTEFWKRRFLGEGL+SNNV PSE D+SEPLDSLDDVDIVEDVAKEI+EEEAEEEEEVE TE QDGERVIKKEVEAK P QMIGVQLLKDVDQT+ TS
Subjt:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS

Query:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA
        KK RRR SRAS+EDDRDEDWFPEDLFEAFGELRKRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIM KVIELGGIPTIGDCAMILRA
Subjt:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDE
        AIKAPLPSAF KILQTTH LGYVFGS LYDE++TLCLDLGELDAAIAIVADLETTGISVPDE LDR+ISARQTNDA PK D+ IDIT NDHSL NDE
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDE

XP_023531019.1 uncharacterized protein LOC111793400 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0091.76Show/hide
Query:  MSKLLLSHSHLLTLPHKRHSVSLNNGVL--PIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGL
        MSK LLSHS LLTLPHK HS SL+NG+L  PIRSVLS  EKRGRKKRQSRQQQL  KDDDS+ LE +LRFTFMEELMDRARN DP+GVSDVIYDMVAAGL
Subjt:  MSKLLLSHSHLLTLPHKRHSVSLNNGVL--PIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGL

Query:  SPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGA
        SPGPRSFHGLVVS+ LNADAEGAMQSLR+ELS GLRP+HETFVALVRLFG+KGLATRG E+LAAMEKLNYDIR AWLILIEELV+NKYLEDAN+VFLKGA
Subjt:  SPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGA

Query:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
        KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
Subjt:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ

Query:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRA
        DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFRGL+TF GGTK LHNEG FGDPLSLYLRALCREGRVVELLEALEAMA+DNQ+IP RA
Subjt:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRA

Query:  MLLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAAL
        M+LSRKYRSLVSSWIEPLQEEA+HG+EIDYIARYIEEGGLTGERKRWVPRRGKTPLDPD DGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQN GLAAL
Subjt:  MLLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAAL

Query:  GDASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIR
        GDASEADYLRVEERLKKIIKGPD NILKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI+
Subjt:  GDASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIR

Query:  LHEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTAT
        LHEGNTEFWKRRFLGEGL+SNNV PSE D+SEPLDSLDDVDIVEDVAKEI+EEEAEEEEEVE TE QDGERVIKKEVEAK P QMIGVQLLKD+DQT+ T
Subjt:  LHEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTAT

Query:  SKKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILR
        SKK RRR SRAS+EDDRDEDWFPEDLFEAFGELRKRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIM KVIELGGIPTIGDCAMILR
Subjt:  SKKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILR

Query:  AAIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDE
        AAIKAPLPSAF KILQTTH LGYVFGS LYDE++TLCLDLGELDAAIAIVADLETTGISVPDE LDR+ISARQTNDA PK D+ IDIT NDHSLANDE
Subjt:  AAIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDE

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida]0.0e+0091.98Show/hide
Query:  MSKLLLSHSHLLTLPHKRHSVSLNNGVLPIRSVLSAPEKRGRKKRQSRQQQLL-PKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS
        MSK LLSH+HLLTLP+K HS SLN+GV+PIRSVLSAP+KRGRKKRQ+RQQQ L  KD DS+ALE +LRFTFMEELMDRARN DP+GVSDVIYDMVAAGLS
Subjt:  MSKLLLSHSHLLTLPHKRHSVSLNNGVLPIRSVLSAPEKRGRKKRQSRQQQLL-PKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVS+ LN D EGAMQSLRRELSAGL PLHETFVALVRLFGSKGLATRG E+LAAMEKLNYDIR AWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFR LKTFQGGTK LHNEG FGDPLSLYLRALCREGRVVELLEALEAMA+DNQ+IPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM

Query:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG
        +LSRKYRSLVSSWIEPLQEEA+HG+EIDYIARYIEEGGLTGERKRWVPRRGKTPLDPD DGFIYSNPMETSFKQRCLEDWKM+HRKILKTLQN GLAALG
Subjt:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL
         ASEADY RV ERLKKIIKGPDQN+LKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI+L
Subjt:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL

Query:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS
        HEGNTEFWKRRFLGEGL+SNNV PSE DKSEPLDSLDDVD VEDVAKEIEEEEAEEEEEVEQTE QDGERVIKKEVEAK PLQMIGVQLLKDVDQ T TS
Subjt:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS

Query:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA
        KK RRRSSRAS+EDDRDEDWFPED+FEAF ELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIM KVIELGG PTIGDCAMILRA
Subjt:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDEA
        AIKAPLPS+FLKILQTTHGLGY FGS LYDEV+TLCLDLGELDAAIAIVADLETTGI VPDE LDRVIS RQTND++PKPD+AID T NDHSLA+DEA
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDEA

TrEMBL top hitse value%identityAlignment
A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X10.0e+0091.31Show/hide
Query:  MSKLLLSHSHLLTLPHKRHSVSLNNGVLPIRSVLSAPEKRGRKKRQSR-QQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS
        MSKLLLSH+HLLTLP+   S SLN+G+LPIRSVLSAP+KRGRKKRQSR QQQL  KDDDS++LEN+LRFTFMEELMDRARN DP+GVSDVIYDMVAAGLS
Subjt:  MSKLLLSHSHLLTLPHKRHSVSLNNGVLPIRSVLSAPEKRGRKKRQSR-QQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVS+ LN D EGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RG E+LAAME+LNYDIR AWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFR LKTFQGGTK LHNEG FGDPLSLYLRALCREGRV++LLEALEAMA+DNQ+IPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM

Query:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG
        +LSRKYRSLVSSWIEPLQEEA+HG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPD DGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQN GL AL 
Subjt:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL
        DASEADY RV E+LKKIIKGPDQN+LKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI+L
Subjt:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL

Query:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS
        HEGNTEFWKRRFLGEGL+SNNV PSE DKS+ LDSLDDVD +EDVAKEIEEEEAEEEEEVEQTE QDGERVIKKEVEAK PLQMIGVQLLKDVDQ TATS
Subjt:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS

Query:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA
        KK RRRSSRAS+EDDRDEDWFPED+FEAF EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIM KVIELGG PTIGDCAMILRA
Subjt:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDEA
        AIKAPLPSAFLKILQTTHGLGYVFGS LYDEV+TLCLDLGELDAAIAIVADLETTGI VPDE LDRVIS RQTNDA+PKPD+AID T NDHSLANDEA
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDEA

A0A1S3B9H7 uncharacterized protein LOC103487261 isoform X20.0e+0091.43Show/hide
Query:  MSKLLLSHSHLLTLPHKRHSVSLNNGVLPIRSVLSAPEKRGRKKRQSR-QQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS
        MSKLLLSH+HLLTLP+   S SLN+G+LPIRSVLSAP+KRGRKKRQSR QQQL  KDDDS++LEN+LRFTFMEELMDRARN DP+GVSDVIYDMVAAGLS
Subjt:  MSKLLLSHSHLLTLPHKRHSVSLNNGVLPIRSVLSAPEKRGRKKRQSR-QQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVS+ LN D EGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RG E+LAAME+LNYDIR AWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFR LKTFQGGTK LHNEG FGDPLSLYLRALCREGRV++LLEALEAMA+DNQ+IPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM

Query:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG
        +LSRKYRSLVSSWIEPLQEEA+HG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPD DGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQN GL AL 
Subjt:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL
        DASEADY RV E+LKKIIKGPDQN+LKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI+L
Subjt:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL

Query:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS
        HEGNTEFWKRRFLGEGL+SNNV PSE DKS+ LDSLDDVD +EDVAKEIEEEEAEEEEEVEQTE QDGERVIKKEVEAK PLQMIGVQLLKDVDQ TATS
Subjt:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS

Query:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA
        KK RRRSSRAS+EDDRDEDWFPED+FEAF EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIM KVIELGG PTIGDCAMILRA
Subjt:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSSL
        AIKAPLPSAFLKILQTTHGLGYVFG +L
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSSL

A0A6J1DBV3 uncharacterized protein LOC1110195950.0e+0091.11Show/hide
Query:  MSKLLL-SHSHLLTLPHKRHSVSL--NNGVLPIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAG
        MSKLLL SH+HLLTLPHKR S+ L  +NGVLPIRSVLSAPEKRGRKKRQ R     PKDD S+ALE  LRFTFMEELMDRAR+ D VGVSDVIYDMVAAG
Subjt:  MSKLLL-SHSHLLTLPHKRHSVSL--NNGVLPIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAG

Query:  LSPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKG
        LSPGPRSFHGLVVS++LN D EGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLA+RG E+L+AMEKLNYDIR AWLILI+ELVRNKYLEDAN+ FLKG
Subjt:  LSPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKG

Query:  AKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRV
        AKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDRV
Subjt:  AKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRV

Query:  QDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPR
        QDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFR LKTFQGGTK LHNEG FGDPLSLYLRALCREGRVVELLEALEAMA+DNQ+IP R
Subjt:  QDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPR

Query:  AMLLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAA
        AM+LSRKYRSLVSSWIEPLQEEA+HGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPD DGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQN GL A
Subjt:  AMLLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAA

Query:  LGDASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI
        LGDASEADY RVEERLKKIIKGPDQN+LKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI
Subjt:  LGDASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI

Query:  RLHEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTA
        +LHEGNTE+WKRRFLGEGL++N+V PSE DKSEPLDSLDDVDIVED AKEIEEEE  EEEEVEQTE QDGERVIKKEVEAK PLQMIGVQLLKDVDQTT 
Subjt:  RLHEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTA

Query:  TSKKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMIL
        TSKK RRRSSRAS+EDDRDEDWFPED+FEAF ELRKR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIM KVIELGG PTIGDCAMIL
Subjt:  TSKKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMIL

Query:  RAAIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDEA
        RAAI++PLPSAFLKILQTTH LGYVFGS LYDEV+TLCLDLGELDAAIAIVADLETTGI VPDE LDRVISARQTNDA+PKPD AID T NDHSLANDEA
Subjt:  RAAIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDEA

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X10.0e+0091.86Show/hide
Query:  MSKLLLSHSHLLTLPHKRHSVSLNNGVL-PIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS
        MSK LLSHS+LLTLPHK HS SL+NGV  PIRSVLS  EKRGRKKRQSRQQQL  KDDDS+  E +LRFTFMEELMDRARN DP+GVSDVIYDMVAAGLS
Subjt:  MSKLLLSHSHLLTLPHKRHSVSLNNGVL-PIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVS+ LNADAEGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRG E+LAAMEKLNYDIR AWLILIEELV+NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFRGLKTF GGTK LHNEG FGDPLSLYLRALCREGRVVELLEALEAMA+DNQ+IP RAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM

Query:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG
        +LSRKYRSLVSSWIEPLQEEA+HGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPD DGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQN GLAALG
Subjt:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL
        DASEADY+RVEERLKKIIKGPD NILKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI+L
Subjt:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL

Query:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS
        HEGNTEFWKRRFLGEGL+SNNV PSE D+SEPLDSLDDVDIVEDVAKEI+EEEAEEEEEVE TE QDGERVIKKEVEAK P QMIGVQLLKDVDQT+ TS
Subjt:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS

Query:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA
        KK RRR SRAS+EDDRDEDWFPEDLFEAFGELRKRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIM KVIELGGIPTIGDCAMILRA
Subjt:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDE
        AIKAPLPSAF KILQTTH LGYVFGS LYDE++TLCLDLGELDAAIAIVADLETTGISVPDE LDR+ISARQTNDA PK D+ IDIT NDHSL NDE
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDE

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X10.0e+0091.86Show/hide
Query:  MSKLLLSHSHLLTLPHKRHSVSLNNGVL-PIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS
        MSK LLSHS LLTLPHK HS SL+N VL PIRSVLS  EKRGRKKRQSRQQQL  KD DS+ LE +LRFTFMEELMDRARN DP+GVSDVIYDMVAAGLS
Subjt:  MSKLLLSHSHLLTLPHKRHSVSLNNGVL-PIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK
        PGPRSFHGLVVS+ LNADAEGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRG E+LAAMEKLNYDIR AWLILIEELV+NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFRGLKTF GGTK LH+EG FGDPLSLYLRALCREGRVVELLEALEAMA+DNQ+IP RAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAM

Query:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG
        +LSRKYRSLVSSWIEPLQEEA+HGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPD DGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQN GLAALG
Subjt:  LLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL
        DASEADYLRVEERLKKIIKGPD NILKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI+L
Subjt:  DASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRL

Query:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS
        HEGNTEFWKRRFLGEGL+SNNV PSE D+SEPLDSLDDVD+VEDVAKEI+EEEAEEEEEVE TE QDGERVIKKEVEAK P QMIGVQLLKDVDQT+ TS
Subjt:  HEGNTEFWKRRFLGEGLNSNNVIPSE-DKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATS

Query:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA
        KK RRR SRAS+EDDRDEDWFPEDLFEAFGELRKRK+FD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIM KVIELGGIPTIGDCAMILRA
Subjt:  KKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDE
        AIKAPLPSAF KILQTTH LGYVFGS LYDEV+TLCLDLGELDAAIAIVADLETTGISVPDE LDR+ISARQTNDA PK D+ IDIT NDHSLA+DE
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDAAIDITPNDHSLANDE

SwissProt top hitse value%identityAlignment
O04504 Pentatricopeptide repeat-containing protein At1g098209.7e-0722.89Show/hide
Query:  VIYDMVAAGLSPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRG--FEVLAAMEKL-NYDIRP---AWLILIEELV
        V+ +MV   +SP   +F+ L+  +  + +  G+M+  +  L   ++P   ++ +L+      GL   G   E ++  +K+ +  ++P    +  LI    
Subjt:  VIYDMVAAGLSPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRG--FEVLAAMEKL-NYDIRP---AWLILIEELV

Query:  RNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYN
        +N  L++A  +F      G   T ++Y++LI+  CK G   +   +  EME  G +     +NCL++     G  E A   F+ +        PD  T++
Subjt:  RNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYN

Query:  WVIQAYTR-AESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTK
         +++ Y R  ES      + E+  M       L+P   TY ++++ + K
Subjt:  WVIQAYTR-AESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTK

Q9FVX2 Pentatricopeptide repeat-containing protein At1g77360, mitochondrial2.6e-0719.39Show/hide
Query:  NALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLSPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAM
        N + F  +   + +++N   V  +  +++ +    +P  +++  L+  +    +   A +  R  + AG  P   T+  +V +    G       ++ +M
Subjt:  NALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLSPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAM

Query:  E-KLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFS
        +  +       + +L+        LE+A   FL+  + G++A   +++ LI   CKA    N   +  EM++ G    +   N +L      G  + AF 
Subjt:  E-KLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFS

Query:  TFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLV
         F  M       +PD +TY  VI+ +   +  +    V + +       K + P+M T+++L+
Subjt:  TFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLV

Q9LMH5 Putative pentatricopeptide repeat-containing protein At1g138003.7e-0621.66Show/hide
Query:  VIYDMVAAGLSPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDI-RPAWLILIEELVRNKYL
        V+ DM   G+ P    +  ++  +  N +   A+    + L    R       ++++ +   G  +  +++     + N  + R  + +  + L +   +
Subjt:  VIYDMVAAGLSPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDI-RPAWLILIEELVRNKYL

Query:  EDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQA
        E+A ++F +    G+      Y  LI   C  G  S+A ++  EM+  G+      +N L    AT G+ + AF T + ME     +KP   T+N VI+ 
Subjt:  EDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQA

Query:  YTRAESYDRVQDVAELL
           A   D+ +   E L
Subjt:  YTRAESYDRVQDVAELL

Q9LN22 Pentatricopeptide repeat-containing protein At1g20300, mitochondrial4.8e-0626.29Show/hide
Query:  ETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRP---AWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEA
        ETF  L+R +   GLA+        ME  +Y   P   A+ I+I  L R +   +A Q F    K        +Y  L+   C+AG+ S A ++  EM+ 
Subjt:  ETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRP---AWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEA

Query:  AGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRL--QPNMRTYALLVE
        AG     + ++ ++     CG    A   F +M   +    P+  T+N +++ + +A    R + V ++   M    K+L  +P+  TY  L+E
Subjt:  AGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRL--QPNMRTYALLVE

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic3.0e-0823.75Show/hide
Query:  GLRPLHETFVALVRLFGSKGL--ATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS
        G++P   TF +L+ +    GL  A R         ++  D+  ++  L++ + +   ++ A ++  +     +      Y  +I+   KAG    AL + 
Subjt:  GLRPLHETFVALVRLFGSKGL--ATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS

Query:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
         EM   G       +N LLS+    G  E A      M      +K D  TYN ++  Y +   YD V+ V      M  +H  + PN+ TY+ L++ ++
Subjt:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT

Query:  KYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKD
        K  + +EA+  FR  K+      V+          S  + ALC+ G V   +  ++ M K+
Subjt:  KYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKD

Arabidopsis top hitse value%identityAlignment
AT1G13800.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.6e-0721.66Show/hide
Query:  VIYDMVAAGLSPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDI-RPAWLILIEELVRNKYL
        V+ DM   G+ P    +  ++  +  N +   A+    + L    R       ++++ +   G  +  +++     + N  + R  + +  + L +   +
Subjt:  VIYDMVAAGLSPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDI-RPAWLILIEELVRNKYL

Query:  EDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQA
        E+A ++F +    G+      Y  LI   C  G  S+A ++  EM+  G+      +N L    AT G+ + AF T + ME     +KP   T+N VI+ 
Subjt:  EDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQA

Query:  YTRAESYDRVQDVAELL
           A   D+ +   E L
Subjt:  YTRAESYDRVQDVAELL

AT1G20300.1 Pentatricopeptide repeat (PPR) superfamily protein3.4e-0726.29Show/hide
Query:  ETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRP---AWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEA
        ETF  L+R +   GLA+        ME  +Y   P   A+ I+I  L R +   +A Q F    K        +Y  L+   C+AG+ S A ++  EM+ 
Subjt:  ETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRP---AWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEA

Query:  AGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRL--QPNMRTYALLVE
        AG     + ++ ++     CG    A   F +M   +    P+  T+N +++ + +A    R + V ++   M    K+L  +P+  TY  L+E
Subjt:  AGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRL--QPNMRTYALLVE

AT1G77360.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-0819.39Show/hide
Query:  NALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLSPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAM
        N + F  +   + +++N   V  +  +++ +    +P  +++  L+  +    +   A +  R  + AG  P   T+  +V +    G       ++ +M
Subjt:  NALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLSPGPRSFHGLVVSYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAM

Query:  E-KLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFS
        +  +       + +L+        LE+A   FL+  + G++A   +++ LI   CKA    N   +  EM++ G    +   N +L      G  + AF 
Subjt:  E-KLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFS

Query:  TFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLV
         F  M       +PD +TY  VI+ +   +  +    V + +       K + P+M T+++L+
Subjt:  TFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLV

AT2G31400.1 genomes uncoupled 12.2e-0923.75Show/hide
Query:  GLRPLHETFVALVRLFGSKGL--ATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS
        G++P   TF +L+ +    GL  A R         ++  D+  ++  L++ + +   ++ A ++  +     +      Y  +I+   KAG    AL + 
Subjt:  GLRPLHETFVALVRLFGSKGL--ATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS

Query:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
         EM   G       +N LLS+    G  E A      M      +K D  TYN ++  Y +   YD V+ V      M  +H  + PN+ TY+ L++ ++
Subjt:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT

Query:  KYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKD
        K  + +EA+  FR  K+      V+          S  + ALC+ G V   +  ++ M K+
Subjt:  KYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKD

AT3G04260.1 plastid transcriptionally active 30.0e+0073.49Show/hide
Query:  GVLPIRSVLSAPEKRGRKKRQSRQQQLLPKDDD--------SSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLSPGPRSFHGLVVSYALNAD
        G+  IR  +SAPEK+ R++R+ ++      DD          SALE +LR TFM+ELM+RARNRD  GVS+VIYDM+AAGLSPGPRSFHGLVV++ALN D
Subjt:  GVLPIRSVLSAPEKRGRKKRQSRQQQLLPKDDD--------SSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLSPGPRSFHGLVVSYALNAD

Query:  AEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDC
         +GAM SLR+EL AG RPL ET +ALVRL GSKG ATRG E+LAAMEKL YDIR AWLIL+EEL+R  +LEDAN+VFLKGA+GG+RATD++YDL+IEEDC
Subjt:  AEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDC

Query:  KAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN
        KAGDHSNAL+ISYEMEAAGRMATTFHFNCLLSVQATCGIPE+A++TFENMEYGE +MKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKR+QPN
Subjt:  KAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN

Query:  MRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAMLLSRKYRSLVSSWIEPLQ
        ++TYALLVECFTKYCVV+EAIRHFR LK F+GGT +LHN G F DPLSLYLRALCREGR+VEL++AL+AM KDNQ IPPRAM++SRKYR+LVSSWIEPLQ
Subjt:  MRTYALLVECFTKYCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAMLLSRKYRSLVSSWIEPLQ

Query:  EEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALGDASEADYLRVEERLKKII
        EEA+ GYEIDY+ARYIEEGGLTGERKRWVPRRGKTPLDPD  GFIYSNP+ETSFKQRCLEDWK++HRK+L+TLQ+ GL  LGDASE+DY+RV ERL+ II
Subjt:  EEADHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALGDASEADYLRVEERLKKII

Query:  KGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRLHEGNTEFWKRRFLGEGLN
        KGP  N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN+SRGRPLWVPP+EEEEEEVDEE+D+LI RI+LHEG+TEFWKRRFLGEGL 
Subjt:  KGPDQNILKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRLHEGNTEFWKRRFLGEGLN

Query:  SNNVIPSEDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQD-----------------GERVIK-KEVEAKNPLQMIGVQLLKDVDQTTATSK
          +V  S++ +E + + +    +ED++KE + EE ++EEE E  E  D                 GE ++K K  +AK  LQMIGVQLLK+ D+   T K
Subjt:  SNNVIPSEDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQTETQD-----------------GERVIK-KEVEAKNPLQMIGVQLLKDVDQTTATSK

Query:  KLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRAA
        K  +R+SR ++EDD DEDWFPE+ FEAF E+R+RKVFDV+DMYTIADVWGWTWE++ KN+ PR+WSQEWEVELAI +M KVIELGGIPTIGDCA+ILRAA
Subjt:  KLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMDKVIELGGIPTIGDCAMILRAA

Query:  IKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDA
        ++AP+PSAFLKILQTTH LGY FGS LYDE++TLCLDLGELDAAIAIVAD+ETTGI+VPD+ LD+VISARQ+N++
Subjt:  IKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAAACTCCTGCTCTCTCACTCCCATCTCCTCACCCTTCCCCACAAACGCCATTCCGTTTCTCTCAACAATGGCGTCCTCCCAATCCGCTCGGTCCTCTCTGCTCC
AGAGAAGCGAGGTCGAAAGAAGCGGCAGTCGCGGCAGCAGCAACTGCTCCCCAAGGACGACGATTCCTCTGCGCTTGAAAACGCCCTTCGCTTCACGTTCATGGAGGAGC
TCATGGACCGCGCTCGAAACCGCGATCCGGTCGGCGTCTCGGACGTCATTTACGATATGGTTGCTGCTGGATTGAGCCCTGGCCCTCGCTCCTTCCATGGATTGGTTGTT
TCTTATGCTCTCAATGCCGACGCCGAGGGTGCGATGCAATCTTTGAGAAGGGAATTAAGTGCTGGACTTCGTCCTCTTCATGAAACGTTTGTTGCATTAGTTCGGCTATT
TGGTTCCAAGGGCCTTGCTACTAGGGGCTTTGAAGTCCTTGCAGCCATGGAGAAATTGAACTATGACATTCGCCCAGCTTGGCTCATTCTTATCGAGGAACTCGTAAGGA
ACAAGTATTTAGAAGACGCCAATCAAGTGTTTTTAAAGGGGGCTAAAGGGGGTCTAAGAGCCACCGACAAGATTTACGATCTTCTAATCGAGGAAGACTGCAAAGCTGGG
GATCATTCAAATGCCTTAGAGATTTCATATGAAATGGAGGCTGCCGGGCGGATGGCAACGACCTTTCATTTCAATTGCCTTCTCAGTGTCCAGGCTACTTGTGGAATACC
TGAAATTGCTTTCTCAACATTTGAGAACATGGAATATGGAGAAGATTACATGAAGCCCGACACAGAGACATATAATTGGGTGATTCAAGCATATACAAGAGCTGAATCTT
ATGATAGGGTGCAAGATGTTGCCGAGTTACTTGGCATGATGGTTGAAGACCACAAGCGCCTACAGCCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACTAAG
TATTGCGTTGTACGAGAAGCCATTAGGCATTTTCGTGGACTAAAAACCTTTCAAGGTGGAACAAAAGTTTTGCATAATGAAGGAAAATTTGGCGATCCACTTTCCTTATA
TCTTCGAGCTTTATGCAGAGAAGGTAGGGTTGTAGAGCTCTTAGAAGCGTTAGAAGCCATGGCTAAAGATAACCAAAAAATTCCTCCAAGAGCCATGCTCTTGAGCAGAA
AGTATCGTTCACTGGTGAGCTCATGGATTGAACCGTTACAGGAAGAAGCTGATCATGGATATGAGATAGACTACATTGCAAGATACATTGAAGAAGGTGGACTCACTGGA
GAACGCAAGAGATGGGTTCCTCGAAGAGGGAAAACTCCTCTAGATCCTGATGTAGATGGATTCATCTATTCAAATCCTATGGAAACATCCTTTAAGCAACGATGTCTAGA
AGATTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCAGAATGTTGGACTTGCAGCTCTTGGGGATGCATCTGAAGCTGATTATCTTAGAGTCGAGGAGAGATTGA
AGAAAATTATAAAGGGTCCTGATCAAAATATTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAACTAAAAGAAGAACTGGAAGCACAAGGTTTGCCAATTGAT
GGAACTAGAAATGTTCTTTACCAACGCGTTCAAAAAGCAAGGAGAATAAATCGGTCTCGTGGTCGACCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGTTGA
TGAAGAGCTCGATGAGCTAATTTCGCGAATAAGGCTTCACGAAGGAAATACAGAGTTCTGGAAACGCCGCTTTCTTGGAGAAGGCTTGAACAGTAATAATGTAATACCTT
CAGAAGACAAGTCAGAACCTCTCGATTCTTTGGATGATGTTGACATTGTAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCCGAGGAGGAAGAGGAGGTTGAACAA
ACTGAAACTCAAGATGGTGAAAGAGTTATTAAGAAGGAAGTTGAAGCTAAGAACCCTCTTCAAATGATAGGTGTCCAATTGTTAAAAGATGTTGACCAAACCACGGCAAC
ATCCAAAAAGTTAAGGAGGAGAAGTTCTCGAGCATCAATTGAGGACGATCGCGATGAAGACTGGTTTCCTGAAGATTTATTTGAAGCATTCGGAGAGTTGCGAAAGAGGA
AAGTCTTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAGTGGGAAGTG
GAGTTGGCTATTAAAATTATGGACAAGGTCATAGAATTGGGTGGAATACCAACAATTGGAGACTGTGCCATGATCTTGCGAGCTGCCATTAAGGCTCCTCTACCGTCTGC
ATTTTTGAAAATCTTGCAGACAACCCATGGTCTTGGCTATGTATTTGGGAGCTCTTTATACGATGAGGTTGTTACCCTGTGTCTTGATCTTGGGGAACTAGATGCAGCCA
TTGCAATTGTAGCAGACCTGGAGACAACAGGAATCTCGGTTCCCGATGAAATGCTTGATCGGGTAATCTCTGCTAGACAGACGAATGATGCCGTGCCCAAACCCGATGCA
GCCATTGACATTACACCCAATGATCATAGTTTAGCCAATGATGAAGCATTATAA
mRNA sequenceShow/hide mRNA sequence
CAGTATATCACAATAGTTGCAGTGTTCATTATCGTCTCTCTTCATCAGTCCCGCCATTTTCTCTCTGCTTCTTATCCAGATTCCGGCCGCCATTAATGTCCAAACTCCTG
CTCTCTCACTCCCATCTCCTCACCCTTCCCCACAAACGCCATTCCGTTTCTCTCAACAATGGCGTCCTCCCAATCCGCTCGGTCCTCTCTGCTCCAGAGAAGCGAGGTCG
AAAGAAGCGGCAGTCGCGGCAGCAGCAACTGCTCCCCAAGGACGACGATTCCTCTGCGCTTGAAAACGCCCTTCGCTTCACGTTCATGGAGGAGCTCATGGACCGCGCTC
GAAACCGCGATCCGGTCGGCGTCTCGGACGTCATTTACGATATGGTTGCTGCTGGATTGAGCCCTGGCCCTCGCTCCTTCCATGGATTGGTTGTTTCTTATGCTCTCAAT
GCCGACGCCGAGGGTGCGATGCAATCTTTGAGAAGGGAATTAAGTGCTGGACTTCGTCCTCTTCATGAAACGTTTGTTGCATTAGTTCGGCTATTTGGTTCCAAGGGCCT
TGCTACTAGGGGCTTTGAAGTCCTTGCAGCCATGGAGAAATTGAACTATGACATTCGCCCAGCTTGGCTCATTCTTATCGAGGAACTCGTAAGGAACAAGTATTTAGAAG
ACGCCAATCAAGTGTTTTTAAAGGGGGCTAAAGGGGGTCTAAGAGCCACCGACAAGATTTACGATCTTCTAATCGAGGAAGACTGCAAAGCTGGGGATCATTCAAATGCC
TTAGAGATTTCATATGAAATGGAGGCTGCCGGGCGGATGGCAACGACCTTTCATTTCAATTGCCTTCTCAGTGTCCAGGCTACTTGTGGAATACCTGAAATTGCTTTCTC
AACATTTGAGAACATGGAATATGGAGAAGATTACATGAAGCCCGACACAGAGACATATAATTGGGTGATTCAAGCATATACAAGAGCTGAATCTTATGATAGGGTGCAAG
ATGTTGCCGAGTTACTTGGCATGATGGTTGAAGACCACAAGCGCCTACAGCCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACTAAGTATTGCGTTGTACGA
GAAGCCATTAGGCATTTTCGTGGACTAAAAACCTTTCAAGGTGGAACAAAAGTTTTGCATAATGAAGGAAAATTTGGCGATCCACTTTCCTTATATCTTCGAGCTTTATG
CAGAGAAGGTAGGGTTGTAGAGCTCTTAGAAGCGTTAGAAGCCATGGCTAAAGATAACCAAAAAATTCCTCCAAGAGCCATGCTCTTGAGCAGAAAGTATCGTTCACTGG
TGAGCTCATGGATTGAACCGTTACAGGAAGAAGCTGATCATGGATATGAGATAGACTACATTGCAAGATACATTGAAGAAGGTGGACTCACTGGAGAACGCAAGAGATGG
GTTCCTCGAAGAGGGAAAACTCCTCTAGATCCTGATGTAGATGGATTCATCTATTCAAATCCTATGGAAACATCCTTTAAGCAACGATGTCTAGAAGATTGGAAGATGTA
CCACCGAAAGATTTTGAAAACCTTGCAGAATGTTGGACTTGCAGCTCTTGGGGATGCATCTGAAGCTGATTATCTTAGAGTCGAGGAGAGATTGAAGAAAATTATAAAGG
GTCCTGATCAAAATATTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAACTAAAAGAAGAACTGGAAGCACAAGGTTTGCCAATTGATGGAACTAGAAATGTT
CTTTACCAACGCGTTCAAAAAGCAAGGAGAATAAATCGGTCTCGTGGTCGACCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGTTGATGAAGAGCTCGATGA
GCTAATTTCGCGAATAAGGCTTCACGAAGGAAATACAGAGTTCTGGAAACGCCGCTTTCTTGGAGAAGGCTTGAACAGTAATAATGTAATACCTTCAGAAGACAAGTCAG
AACCTCTCGATTCTTTGGATGATGTTGACATTGTAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCCGAGGAGGAAGAGGAGGTTGAACAAACTGAAACTCAAGAT
GGTGAAAGAGTTATTAAGAAGGAAGTTGAAGCTAAGAACCCTCTTCAAATGATAGGTGTCCAATTGTTAAAAGATGTTGACCAAACCACGGCAACATCCAAAAAGTTAAG
GAGGAGAAGTTCTCGAGCATCAATTGAGGACGATCGCGATGAAGACTGGTTTCCTGAAGATTTATTTGAAGCATTCGGAGAGTTGCGAAAGAGGAAAGTCTTTGATGTAT
CTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAGTGGGAAGTGGAGTTGGCTATTAAA
ATTATGGACAAGGTCATAGAATTGGGTGGAATACCAACAATTGGAGACTGTGCCATGATCTTGCGAGCTGCCATTAAGGCTCCTCTACCGTCTGCATTTTTGAAAATCTT
GCAGACAACCCATGGTCTTGGCTATGTATTTGGGAGCTCTTTATACGATGAGGTTGTTACCCTGTGTCTTGATCTTGGGGAACTAGATGCAGCCATTGCAATTGTAGCAG
ACCTGGAGACAACAGGAATCTCGGTTCCCGATGAAATGCTTGATCGGGTAATCTCTGCTAGACAGACGAATGATGCCGTGCCCAAACCCGATGCAGCCATTGACATTACA
CCCAATGATCATAGTTTAGCCAATGATGAAGCATTATAATCAATCAGTTTGTTCTTATTTTCCTTTGTACAGTTCAGTTCTAGGGTTGCCAGCAAAATGCTAATGTATTT
AGAACGTTAGAATTAAAATATTTTGATTTCAGAAACAATATTATGAAAATTTTGTTTATCAATCACTTGCTTTATATTTGTTTAATGTTTCACCGGATGTAGGCAGGGTA
GAGCAG
Protein sequenceShow/hide protein sequence
MSKLLLSHSHLLTLPHKRHSVSLNNGVLPIRSVLSAPEKRGRKKRQSRQQQLLPKDDDSSALENALRFTFMEELMDRARNRDPVGVSDVIYDMVAAGLSPGPRSFHGLVV
SYALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGFEVLAAMEKLNYDIRPAWLILIEELVRNKYLEDANQVFLKGAKGGLRATDKIYDLLIEEDCKAG
DHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTK
YCVVREAIRHFRGLKTFQGGTKVLHNEGKFGDPLSLYLRALCREGRVVELLEALEAMAKDNQKIPPRAMLLSRKYRSLVSSWIEPLQEEADHGYEIDYIARYIEEGGLTG
ERKRWVPRRGKTPLDPDVDGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNVGLAALGDASEADYLRVEERLKKIIKGPDQNILKPKAASKMIVSELKEELEAQGLPID
GTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIRLHEGNTEFWKRRFLGEGLNSNNVIPSEDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQ
TETQDGERVIKKEVEAKNPLQMIGVQLLKDVDQTTATSKKLRRRSSRASIEDDRDEDWFPEDLFEAFGELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEV
ELAIKIMDKVIELGGIPTIGDCAMILRAAIKAPLPSAFLKILQTTHGLGYVFGSSLYDEVVTLCLDLGELDAAIAIVADLETTGISVPDEMLDRVISARQTNDAVPKPDA
AIDITPNDHSLANDEAL