; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh11G009790 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh11G009790
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionSAP domain-containing protein
Genome locationCmo_Chr11:5185000..5196841
RNA-Seq ExpressionCmoCh11G009790
SyntenyCmoCh11G009790
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR003034 - SAP domain
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR036361 - SAP domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588298.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0099.22Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
        MSKFLLSHS LLTLPHKHHSFSLHNGV PP+RSVLSTEKRGRKKRQSRQQQLQQKDDDSTV EKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP

Query:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
        GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
Subjt:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG

Query:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV
        GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV
Subjt:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV

Query:  AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI
        AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI
Subjt:  AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI

Query:  LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD
        LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD
Subjt:  LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD

Query:  ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH
        ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH
Subjt:  ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH

Query:  EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK
        EGNTEFWKRRFLGEGLDSNNVKPSEDD+SEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK
Subjt:  EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK

Query:  KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA
        KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA
Subjt:  KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA

Query:  IKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES
        IKAPLPSAFFKILQTTHSLGYVFGS LYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES
Subjt:  IKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES

XP_022930357.1 uncharacterized protein LOC111436825 isoform X1 [Cucurbita moschata]0.0e+00100Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
        MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP

Query:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
        GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
Subjt:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG

Query:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV
        GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV
Subjt:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV

Query:  AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI
        AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI
Subjt:  AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI

Query:  LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD
        LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD
Subjt:  LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD

Query:  ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH
        ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH
Subjt:  ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH

Query:  EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK
        EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK
Subjt:  EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK

Query:  KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA
        KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA
Subjt:  KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA

Query:  IKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES
        IKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES
Subjt:  IKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES

XP_023006519.1 uncharacterized protein LOC111499221 isoform X1 [Cucurbita maxima]0.0e+0098.44Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
        MSKFLLSHS LLTLPHKHHSFSLHN V PPIRSVLSTEKRGRKKRQSRQQQLQQKD DSTV EKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP

Query:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
        GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
Subjt:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG

Query:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV
        GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV
Subjt:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV

Query:  AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI
        AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALH+EG+FGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI
Subjt:  AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI

Query:  LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD
        LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD
Subjt:  LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD

Query:  ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH
        ASEADY+RVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH
Subjt:  ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH

Query:  EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK
        EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVD+VEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK
Subjt:  EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK

Query:  KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA
        KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRK+FDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA
Subjt:  KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA

Query:  IKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES
        IKAPLPSAFFKILQTTHSLGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSL +DEES
Subjt:  IKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES

XP_023531019.1 uncharacterized protein LOC111793400 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0098.67Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLHNG-VFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSHS LLTLPHKHHSFSLHNG + PPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTV EKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLHNG-VFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAK
        PGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRP+HETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGL+TFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG

Query:  DASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADY+RVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKD+DQTSTTS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTS

Query:  KKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA
        KKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA
Subjt:  KKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES
        AIKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSL NDEES
Subjt:  AIKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida]0.0e+0092.67Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLST-EKRGRKKRQSR-QQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
        MSKFLLSH++LLTLP+KHHSFSL++GV  PIRSVLS  +KRGRKKRQ+R QQQL  KD DST  EKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLST-EKRGRKKRQSR-QQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL

Query:  SPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGA
        SPGPRSFHGLVVSHVLN D EGAMQSLR+ELS GL PLHETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQAWLIL EELV+NKYLEDANKVFLKGA
Subjt:  SPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGA

Query:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
        KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
Subjt:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ

Query:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRA
        DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTF GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP RA
Subjt:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRA

Query:  MILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAAL
        MILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKM+HRKILKTLQNEGLAAL
Subjt:  MILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAAL

Query:  GDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
        G ASEADY RV ERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
Subjt:  GDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK

Query:  LHEGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTT
        LHEGNTEFWKRRFLGEGLDSNNVKPSEDD+SEPLDSLDDVD VEDVAKEI+EEEAEEEEEVE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQ  TT
Subjt:  LHEGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTT

Query:  SKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILR
        SKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILR
Subjt:  SKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILR

Query:  AAIKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES
        AAIKAPLPS+F KILQTTH LGY FGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDR+IS RQTND+ PK DS ID TLNDHSL +DE S
Subjt:  AAIKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES

TrEMBL top hitse value%identityAlignment
A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X10.0e+0091.33Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLST-EKRGRKKRQSR-QQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
        MSK LLSH++LLTLP+ H SFSL++G+  PIRSVLS  +KRGRKKRQSR QQQLQ KDDDST  E SLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLST-EKRGRKKRQSR-QQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL

Query:  SPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGA
        SPGPRSFHGLVVSH LN D EGAMQSLR+ELS+GLRPLHETFVALVRLFG+KGLA RGLEILAAME+LNYDIRQAWLIL EELV+NKYLEDANKVFLKGA
Subjt:  SPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGA

Query:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
        K GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
Subjt:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ

Query:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRA
        DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTF GGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIP RA
Subjt:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRA

Query:  MILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAAL
        MILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL
Subjt:  MILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAAL

Query:  GDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
         DASEADY RV E+LKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
Subjt:  GDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK

Query:  LHEGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTT
        LHEGNTEFWKRRFLGEGLDSNNVKPSEDD+S+ LDSLDDVD +EDVAKEI+EEEAEEEEEVE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQ + T
Subjt:  LHEGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTT

Query:  SKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILR
        SKKSRRR SRAS+EDDRDEDWFPED+FEAF EL+KRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILR
Subjt:  SKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILR

Query:  AAIKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES
        AAIKAPLPSAF KILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDR+IS RQTNDA PK DS ID T+NDHSL NDE S
Subjt:  AAIKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES

A0A1S3B9H7 uncharacterized protein LOC103487261 isoform X20.0e+0091.68Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLST-EKRGRKKRQSR-QQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
        MSK LLSH++LLTLP+ H SFSL++G+  PIRSVLS  +KRGRKKRQSR QQQLQ KDDDST  E SLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLST-EKRGRKKRQSR-QQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL

Query:  SPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGA
        SPGPRSFHGLVVSH LN D EGAMQSLR+ELS+GLRPLHETFVALVRLFG+KGLA RGLEILAAME+LNYDIRQAWLIL EELV+NKYLEDANKVFLKGA
Subjt:  SPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGA

Query:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
        K GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
Subjt:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ

Query:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRA
        DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTF GGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIP RA
Subjt:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRA

Query:  MILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAAL
        MILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL
Subjt:  MILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAAL

Query:  GDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
         DASEADY RV E+LKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
Subjt:  GDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK

Query:  LHEGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTT
        LHEGNTEFWKRRFLGEGLDSNNVKPSEDD+S+ LDSLDDVD +EDVAKEI+EEEAEEEEEVE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQ + T
Subjt:  LHEGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTT

Query:  SKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILR
        SKKSRRR SRAS+EDDRDEDWFPED+FEAF EL+KRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILR
Subjt:  SKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILR

Query:  AAIKAPLPSAFFKILQTTHSLGYVFGSPL
        AAIKAPLPSAF KILQTTH LGYVFG  L
Subjt:  AAIKAPLPSAFFKILQTTHSLGYVFGSPL

A0A6J1DBV3 uncharacterized protein LOC1110195950.0e+0090.91Show/hide
Query:  MSKFLL-SHSYLLTLPHKHHSFSL--HNGVFPPIRSVLST-EKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAA
        MSK LL SH++LLTLPHK  S  L  HNGV  PIRSVLS  EKRGRKKRQ R      KDD ST  EK LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKFLL-SHSYLLTLPHKHHSFSL--HNGVFPPIRSVLST-EKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLK
        GLSPGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLA+RGLEIL+AMEKLNYDIRQAWLILI+ELV+NKYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLK

Query:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDR
        GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPS
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTF GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPS

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA
        RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLA

Query:  ALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALGDASEADY RVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTS
        IKLHEGNTE+WKRRFLGEGLD+N+VKPSEDD+SEPLDSLDDVDIVED AKEI+EEE  EEEEVE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQT+
Subjt:  IKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTS

Query:  TTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
        TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKR+VFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI

Query:  LRAAIKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDE
        LRAAI++PLPSAF KILQTTHSLGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDR+ISARQTNDA PK D+ ID TLNDHSL NDE
Subjt:  LRAAIKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDE

Query:  ES
         S
Subjt:  ES

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X10.0e+00100Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
        MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP

Query:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
        GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
Subjt:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG

Query:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV
        GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV
Subjt:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV

Query:  AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI
        AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI
Subjt:  AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI

Query:  LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD
        LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD
Subjt:  LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD

Query:  ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH
        ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH
Subjt:  ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH

Query:  EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK
        EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK
Subjt:  EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK

Query:  KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA
        KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA
Subjt:  KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA

Query:  IKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES
        IKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES
Subjt:  IKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X10.0e+0098.44Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
        MSKFLLSHS LLTLPHKHHSFSLHN V PPIRSVLSTEKRGRKKRQSRQQQLQQKD DSTV EKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSP

Query:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
        GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG
Subjt:  GPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKG

Query:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV
        GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV
Subjt:  GLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV

Query:  AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI
        AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALH+EG+FGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI
Subjt:  AELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMI

Query:  LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD
        LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD
Subjt:  LSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGD

Query:  ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH
        ASEADY+RVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH
Subjt:  ASEADYIRVEERLKKIIKGPDPNILKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLH

Query:  EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK
        EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVD+VEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK
Subjt:  EGNTEFWKRRFLGEGLDSNNVKPSEDDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSK

Query:  KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA
        KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRK+FDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA
Subjt:  KSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAA

Query:  IKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES
        IKAPLPSAFFKILQTTHSLGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSL +DEES
Subjt:  IKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES

SwissProt top hitse value%identityAlignment
O04504 Pentatricopeptide repeat-containing protein At1g098201.3e-0622.54Show/hide
Query:  VIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQ-AWLILIEELVKNKYL
        V+ +MV   +SP   +F+ L+     + +  G+M+  ++ L   ++P   ++ +L+    N G  +  + +   M           +  LI    KN  L
Subjt:  VIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQ-AWLILIEELVKNKYL

Query:  EDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQA
        ++A  +F      G   T ++Y++LI+  CK G   +   +  EME  G +     +NCL++     G  E A   F+ +        PD  T++ +++ 
Subjt:  EDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQA

Query:  YTR-AESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTK
        Y R  ES      + E+  M       L+P   TY ++++ + K
Subjt:  YTR-AESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTK

Q0WPZ6 Pentatricopeptide repeat-containing protein At2g171401.2e-0922.74Show/hide
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVKN
        VS +  DMV  G++P   +F+ L+ +   ++  + A +   +    G +P   TF  LVR +   GL  +GLE+L AME       +  +  ++    + 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVKN

Query:  KYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTET
           +D+ K+  K  + GL      ++  I   CK G   +A  I  +ME    +      +  +N +L      G+ E A + FE++   +D      ++
Subjt:  KYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTET

Query:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLS--LYLRALCREGRVV
        YN  +Q   R   +   + V + +       K + P++ +Y +L++   K  ++ +A       KT  G    +   G   D ++    L   C  G+V 
Subjt:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLS--LYLRALCREGRVV

Query:  ELLEALEAMARDN---QQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG
             L+ M R+N          ++ S      +S   E L++  E GY +D +   I   GL G
Subjt:  ELLEALEAMARDN---QQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG

Q9LMH5 Putative pentatricopeptide repeat-containing protein At1g138001.7e-0622.44Show/hide
Query:  LQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKG
        L  K D    + K +R    E  ++ A +        V+ DM   G+ P    +  ++  H  N +   A+    K L    R       ++++ +   G
Subjt:  LQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKG

Query:  LATRGLEILAAMEKLNYDI-RQAWLILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSV
          +   ++     + N  + R  + +  + L K   +E+A ++F +    G+      Y  LI   C  G  S+A ++  EM+  G+      +N L   
Subjt:  LATRGLEILAAMEKLNYDI-RQAWLILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSV

Query:  QATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELL
         AT G+ + AF T + ME     +KP   T+N VI+    A   D+ +   E L
Subjt:  QATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELL

Q9SAK0 Pentatricopeptide repeat-containing protein At1g79490, mitochondrial9.4e-1023.02Show/hide
Query:  RNHDPLGVSDVIYDMVAAGLSPGPRSF--HGLVVSHVLNAD-AEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDI-RQAW
        +  D +G+  +  +MV    S G  SF  +  V+ ++  A+  E A    +K   +G +   +T+  L+ LF NKGL  +  EI  +MEK +  +    +
Subjt:  RNHDPLGVSDVIYDMVAAGLSPGPRSF--HGLVVSHVLNAD-AEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDI-RQAW

Query:  LILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYM
         ++I  L K+  L+ A K+F +  +  LR +  ++  L++   KAG    ++++  EM+  G   +   F  L+   A  G  + A   ++ M+  +   
Subjt:  LILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYM

Query:  KPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVE
        +P+   Y  +I+++ ++   +    V + +     +     P   TY+ L+E
Subjt:  KPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVE

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic1.8e-0824.71Show/hide
Query:  GLRPLHETFVALVRLFGNKGL--ATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS
        G++P   TF +L+ +    GL  A R L       ++  D+  ++  L++ + K   ++ A ++  +     +      Y  +I+   KAG    AL + 
Subjt:  GLRPLHETFVALVRLFGNKGL--ATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS

Query:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
         EM   G       +N LLS+    G  E A      M      +K D  TYN ++  Y +   YD V+ V      M  +H  + PN+ TY+ L++ ++
Subjt:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT

Query:  KYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLEALEAMARD
        K  + +EA+  FR  K+   G +A        D +  S  + ALC+ G V   +  ++ M ++
Subjt:  KYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLEALEAMARD

Arabidopsis top hitse value%identityAlignment
AT1G09820.1 Pentatricopeptide repeat (PPR-like) superfamily protein9.1e-0822.54Show/hide
Query:  VIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQ-AWLILIEELVKNKYL
        V+ +MV   +SP   +F+ L+     + +  G+M+  ++ L   ++P   ++ +L+    N G  +  + +   M           +  LI    KN  L
Subjt:  VIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQ-AWLILIEELVKNKYL

Query:  EDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQA
        ++A  +F      G   T ++Y++LI+  CK G   +   +  EME  G +     +NCL++     G  E A   F+ +        PD  T++ +++ 
Subjt:  EDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQA

Query:  YTR-AESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTK
        Y R  ES      + E+  M       L+P   TY ++++ + K
Subjt:  YTR-AESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTK

AT1G79490.1 Pentatricopeptide repeat (PPR) superfamily protein6.7e-1123.02Show/hide
Query:  RNHDPLGVSDVIYDMVAAGLSPGPRSF--HGLVVSHVLNAD-AEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDI-RQAW
        +  D +G+  +  +MV    S G  SF  +  V+ ++  A+  E A    +K   +G +   +T+  L+ LF NKGL  +  EI  +MEK +  +    +
Subjt:  RNHDPLGVSDVIYDMVAAGLSPGPRSF--HGLVVSHVLNAD-AEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDI-RQAW

Query:  LILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYM
         ++I  L K+  L+ A K+F +  +  LR +  ++  L++   KAG    ++++  EM+  G   +   F  L+   A  G  + A   ++ M+  +   
Subjt:  LILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYM

Query:  KPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVE
        +P+   Y  +I+++ ++   +    V + +     +     P   TY+ L+E
Subjt:  KPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVE

AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein8.7e-1122.74Show/hide
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVKN
        VS +  DMV  G++P   +F+ L+ +   ++  + A +   +    G +P   TF  LVR +   GL  +GLE+L AME       +  +  ++    + 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVKN

Query:  KYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTET
           +D+ K+  K  + GL      ++  I   CK G   +A  I  +ME    +      +  +N +L      G+ E A + FE++   +D      ++
Subjt:  KYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTET

Query:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLS--LYLRALCREGRVV
        YN  +Q   R   +   + V + +       K + P++ +Y +L++   K  ++ +A       KT  G    +   G   D ++    L   C  G+V 
Subjt:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLS--LYLRALCREGRVV

Query:  ELLEALEAMARDN---QQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG
             L+ M R+N          ++ S      +S   E L++  E GY +D +   I   GL G
Subjt:  ELLEALEAMARDN---QQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG

AT2G31400.1 genomes uncoupled 11.3e-0924.71Show/hide
Query:  GLRPLHETFVALVRLFGNKGL--ATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS
        G++P   TF +L+ +    GL  A R L       ++  D+  ++  L++ + K   ++ A ++  +     +      Y  +I+   KAG    AL + 
Subjt:  GLRPLHETFVALVRLFGNKGL--ATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS

Query:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
         EM   G       +N LLS+    G  E A      M      +K D  TYN ++  Y +   YD V+ V      M  +H  + PN+ TY+ L++ ++
Subjt:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT

Query:  KYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLEALEAMARD
        K  + +EA+  FR  K+   G +A        D +  S  + ALC+ G V   +  ++ M ++
Subjt:  KYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLEALEAMARD

AT3G04260.1 plastid transcriptionally active 30.0e+0074.28Show/hide
Query:  SVLSTEKRGRKKRQSRQQQLQQKDDD--------STVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQS
        S+ + EK+ R++R+ ++    + DD          +  E+SLR TFM+ELM+RARN D  GVS+VIYDM+AAGLSPGPRSFHGLVV+H LN D +GAM S
Subjt:  SVLSTEKRGRKKRQSRQQQLQQKDDD--------STVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHVLNADAEGAMQS

Query:  LRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSN
        LRKEL  G RPL ET +ALVRL G+KG ATRGLEILAAMEKL YDIRQAWLIL+EEL++  +LEDANKVFLKGA+GG+RATD++YDL+IEEDCKAGDHSN
Subjt:  LRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSN

Query:  ALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALL
        AL+ISYEMEAAGRMATTFHFNCLLSVQATCGIPE+A++TFENMEYGE +MKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKR+QPN++TYALL
Subjt:  ALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALL

Query:  VECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGY
        VECFTKYCV++EAIRHFR LK F GGT  LHN GNF DPLSLYLRALCREGR+VEL++AL+AM +DNQ IP RAMI+SRKYR+LVSSWIEPLQEEAE GY
Subjt:  VECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEAEHGY

Query:  EIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNI
        EIDY+ARYIEEGGLTGERKRWVPRRGKTPLDPDA GFIYSNP+ETSFKQRCLEDWK++HRK+L+TLQ+EGL  LGDASE+DY+RV ERL+ IIKGP  N+
Subjt:  EIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNI

Query:  LKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPS
        LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRIN+SRGRPLWVPP+EEEEEEVDEE+D+LI RIKLHEG+TEFWKRRFLGEGL   +V+  
Subjt:  LKPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPS

Query:  E--------------DDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQ-DGERVIK-KEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSR
        E              +D S+  D+ +D D  E    E D+E  EEE  V  TEN+ +GE ++K K  +AKK  QMIGVQLLK+ D+ + T KK  +R SR
Subjt:  E--------------DDQSEPLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQ-DGERVIK-KEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSR

Query:  ASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAAIKAPLPSA
         ++EDD DEDWFPE+ FEAF E+R+RKVFD +DMYTIADVWGWTWE++ KN+ PR+WSQEWEVELAI +M KVIELGGIPTIGDCA+ILRAA++AP+PSA
Subjt:  ASVEDDRDEDWFPEDLFEAFGELRKRKVFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAAIKAPLPSA

Query:  FFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDA
        F KILQTTHSLGY FGSPLYDEIITLCLDLGELDAAIAIVAD+ETTGI+VPD+TLD++ISARQ+N++
Subjt:  FFKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGISVPDETLDRIISARQTNDA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAAATTCCTGCTCTCTCACTCCTACCTTCTCACCCTTCCCCACAAGCATCATTCCTTTTCCCTCCACAATGGCGTCTTCCCCCCCATCCGCTCAGTTCTC
TCTACTGAGAAGCGGGGTAGAAAGAAGCGGCAGTCGCGGCAGCAACAATTGCAACAAAAGGACGACGATTCCACTGTGTTTGAGAAGTCCCTTCGCTTCACTTTC
ATGGAGGAACTCATGGACCGCGCTAGAAACCACGATCCACTTGGCGTTTCTGATGTCATTTATGATATGGTTGCCGCTGGATTGAGCCCTGGTCCTCGCTCCTTC
CATGGCTTGGTTGTTTCACATGTTCTTAATGCTGATGCTGAGGGAGCGATGCAATCTCTGAGAAAGGAACTAAGTACTGGACTTCGTCCCCTTCACGAAACGTTT
GTTGCATTAGTTCGGTTATTTGGTAACAAGGGTCTTGCTACTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAACTATGACATTCGCCAAGCTTGGCTC
ATTCTTATTGAGGAACTCGTAAAGAACAAATATTTAGAAGACGCCAATAAAGTGTTCTTAAAGGGTGCCAAAGGGGGCCTAAGAGCCACGGACAAGATTTACGAT
CTTCTAATTGAGGAAGACTGTAAAGCTGGGGACCATTCAAATGCCTTGGAAATTTCATATGAAATGGAGGCTGCTGGGCGGATGGCAACAACCTTCCATTTCAAT
TGCCTTCTCAGTGTCCAGGCTACTTGTGGAATACCTGAAATTGCTTTCTCAACATTTGAGAACATGGAATATGGAGAAGATTACATGAAGCCTGACACTGAGACA
TATAATTGGGTGATCCAAGCATATACAAGAGCTGAATCTTATGATAGGGTGCAAGATGTTGCTGAGTTACTTGGCATGATGGTTGAAGACCACAAGCGTCTACAG
CCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACCAAGTATTGTGTTATACGAGAAGCCATCAGGCATTTTCGTGGGCTAAAAACTTTTCCAGGTGGA
ACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCCTTGTATCTTCGAGCTTTATGTAGAGAAGGTAGGGTTGTAGAGCTCTTAGAAGCATTAGAA
GCTATGGCTAGAGACAACCAACAGATTCCTTCGAGAGCCATGATCTTGAGCAGAAAGTATCGATCACTTGTGAGCTCATGGATTGAACCTTTACAGGAAGAAGCT
GAACATGGATACGAGATAGACTACATTGCAAGATACATTGAAGAGGGTGGACTCACTGGAGAACGCAAGAGATGGGTGCCTCGAAGAGGAAAAACTCCTCTAGAT
CCTGATGCAGATGGATTCATCTATTCAAATCCTATGGAAACATCCTTTAAGCAACGATGTCTAGAAGATTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTG
CAGAATGAAGGACTTGCAGCTCTTGGGGATGCATCTGAAGCTGATTATATTAGAGTCGAGGAGAGATTGAAGAAAATTATAAAGGGTCCTGATCCAAATATTTTA
AAGCCAAAGGCTGCAAGTAAGATGCTTGTATCAGAATTAAAAGAAGAATTGGAAGCACAAGGTTTACCGATTGATGGAACTAGAAATATTCTTTACCAGCGGGTT
CAAAAAGCAAGGAGAATAAATCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAGGAGGTTGATGAAGAGCTGGATGAACTAATTTCA
CGAATAAAGCTACACGAAGGAAACACAGAGTTCTGGAAACGCCGTTTTCTTGGAGAAGGCTTGGACAGTAATAATGTTAAACCTTCTGAAGATGATCAGTCAGAA
CCTCTTGATTCTTTGGATGATGTTGACATTGTAGAAGACGTTGCAAAGGAGATTGATGAAGAAGAAGCCGAGGAGGAAGAGGAGGTTGAACCAACCGAGAATCAA
GATGGTGAAAGAGTTATTAAGAAGGAAGTTGAAGCTAAGAAGCCTCCTCAAATGATAGGTGTCCAATTGTTGAAAGACGTTGACCAAACCTCAACAACATCCAAA
AAGTCAAGGAGAAGACGTTCTCGAGCATCAGTTGAGGACGATCGTGATGAAGACTGGTTTCCTGAAGATTTATTCGAGGCATTTGGAGAGTTGCGAAAGAGGAAA
GTCTTTGATGAATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAA
GTGGAGTTGGCCATTAAAATTATGCACAAGGTGATTGAATTGGGTGGGATACCAACAATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCTCTT
CCGTCTGCCTTTTTTAAGATCTTGCAGACAACTCATAGTCTTGGCTATGTATTTGGGAGCCCATTATATGATGAGATTATTACCCTGTGTCTTGATCTTGGGGAA
CTAGATGCAGCCATTGCCATCGTAGCAGATCTGGAAACCACAGGAATCTCGGTTCCCGACGAAACACTCGATCGGATAATCTCCGCTAGACAGACAAACGATGCT
GCGCCCAAGCGTGATTCACCCATTGATATTACACTCAATGATCATAGTTTAGGCAATGATGAAGAATCATAA
mRNA sequenceShow/hide mRNA sequence
AGTTCATTATCGTCTCTCCCTCAGCACCCGCCTTCTTTCTTCTCTTCGCGGAGGCTCTGTTTCTTATCCAAATTCCCGCAGCCATTAATGTCCAAATTCCTGCTC
TCTCACTCCTACCTTCTCACCCTTCCCCACAAGCATCATTCCTTTTCCCTCCACAATGGCGTCTTCCCCCCCATCCGCTCAGTTCTCTCTACTGAGAAGCGGGGT
AGAAAGAAGCGGCAGTCGCGGCAGCAACAATTGCAACAAAAGGACGACGATTCCACTGTGTTTGAGAAGTCCCTTCGCTTCACTTTCATGGAGGAACTCATGGAC
CGCGCTAGAAACCACGATCCACTTGGCGTTTCTGATGTCATTTATGATATGGTTGCCGCTGGATTGAGCCCTGGTCCTCGCTCCTTCCATGGCTTGGTTGTTTCA
CATGTTCTTAATGCTGATGCTGAGGGAGCGATGCAATCTCTGAGAAAGGAACTAAGTACTGGACTTCGTCCCCTTCACGAAACGTTTGTTGCATTAGTTCGGTTA
TTTGGTAACAAGGGTCTTGCTACTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAACTATGACATTCGCCAAGCTTGGCTCATTCTTATTGAGGAACTC
GTAAAGAACAAATATTTAGAAGACGCCAATAAAGTGTTCTTAAAGGGTGCCAAAGGGGGCCTAAGAGCCACGGACAAGATTTACGATCTTCTAATTGAGGAAGAC
TGTAAAGCTGGGGACCATTCAAATGCCTTGGAAATTTCATATGAAATGGAGGCTGCTGGGCGGATGGCAACAACCTTCCATTTCAATTGCCTTCTCAGTGTCCAG
GCTACTTGTGGAATACCTGAAATTGCTTTCTCAACATTTGAGAACATGGAATATGGAGAAGATTACATGAAGCCTGACACTGAGACATATAATTGGGTGATCCAA
GCATATACAAGAGCTGAATCTTATGATAGGGTGCAAGATGTTGCTGAGTTACTTGGCATGATGGTTGAAGACCACAAGCGTCTACAGCCTAACATGAGAACCTAT
GCGCTCTTGGTAGAGTGTTTTACCAAGTATTGTGTTATACGAGAAGCCATCAGGCATTTTCGTGGGCTAAAAACTTTTCCAGGTGGAACAAAAGCTTTGCATAAT
GAAGGAAATTTTGGTGATCCACTTTCCTTGTATCTTCGAGCTTTATGTAGAGAAGGTAGGGTTGTAGAGCTCTTAGAAGCATTAGAAGCTATGGCTAGAGACAAC
CAACAGATTCCTTCGAGAGCCATGATCTTGAGCAGAAAGTATCGATCACTTGTGAGCTCATGGATTGAACCTTTACAGGAAGAAGCTGAACATGGATACGAGATA
GACTACATTGCAAGATACATTGAAGAGGGTGGACTCACTGGAGAACGCAAGAGATGGGTGCCTCGAAGAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTC
ATCTATTCAAATCCTATGGAAACATCCTTTAAGCAACGATGTCTAGAAGATTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGCA
GCTCTTGGGGATGCATCTGAAGCTGATTATATTAGAGTCGAGGAGAGATTGAAGAAAATTATAAAGGGTCCTGATCCAAATATTTTAAAGCCAAAGGCTGCAAGT
AAGATGCTTGTATCAGAATTAAAAGAAGAATTGGAAGCACAAGGTTTACCGATTGATGGAACTAGAAATATTCTTTACCAGCGGGTTCAAAAAGCAAGGAGAATA
AATCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAGGAGGTTGATGAAGAGCTGGATGAACTAATTTCACGAATAAAGCTACACGAA
GGAAACACAGAGTTCTGGAAACGCCGTTTTCTTGGAGAAGGCTTGGACAGTAATAATGTTAAACCTTCTGAAGATGATCAGTCAGAACCTCTTGATTCTTTGGAT
GATGTTGACATTGTAGAAGACGTTGCAAAGGAGATTGATGAAGAAGAAGCCGAGGAGGAAGAGGAGGTTGAACCAACCGAGAATCAAGATGGTGAAAGAGTTATT
AAGAAGGAAGTTGAAGCTAAGAAGCCTCCTCAAATGATAGGTGTCCAATTGTTGAAAGACGTTGACCAAACCTCAACAACATCCAAAAAGTCAAGGAGAAGACGT
TCTCGAGCATCAGTTGAGGACGATCGTGATGAAGACTGGTTTCCTGAAGATTTATTCGAGGCATTTGGAGAGTTGCGAAAGAGGAAAGTCTTTGATGAATCTGAC
ATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGAGTTGGCCATTAAA
ATTATGCACAAGGTGATTGAATTGGGTGGGATACCAACAATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCTCTTCCGTCTGCCTTTTTTAAG
ATCTTGCAGACAACTCATAGTCTTGGCTATGTATTTGGGAGCCCATTATATGATGAGATTATTACCCTGTGTCTTGATCTTGGGGAACTAGATGCAGCCATTGCC
ATCGTAGCAGATCTGGAAACCACAGGAATCTCGGTTCCCGACGAAACACTCGATCGGATAATCTCCGCTAGACAGACAAACGATGCTGCGCCCAAGCGTGATTCA
CCCATTGATATTACACTCAATGATCATAGTTTAGGCAATGATGAAGAATCATAATCATCAAACATGTTCTTGTTTTCCTTTTGTACAGTTCAGTTCTAGAACATT
GGAGTTGAAAAATTTTGATCTGTTAATCCGTGTAATCAATCACTTGCTTTACTTTATTTAATCGTTCCACAAATTGTTCTTGGCACTGATATTGACTCTCTTCAT
AATGTTAAGTTTTATATCTGAGCTTAATTTTGTTTCTTATTTTATTTTGAAA
Protein sequenceShow/hide protein sequence
MSKFLLSHSYLLTLPHKHHSFSLHNGVFPPIRSVLSTEKRGRKKRQSRQQQLQQKDDDSTVFEKSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSF
HGLVVSHVLNADAEGAMQSLRKELSTGLRPLHETFVALVRLFGNKGLATRGLEILAAMEKLNYDIRQAWLILIEELVKNKYLEDANKVFLKGAKGGLRATDKIYD
LLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQ
PNMRTYALLVECFTKYCVIREAIRHFRGLKTFPGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPSRAMILSRKYRSLVSSWIEPLQEEA
EHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYIRVEERLKKIIKGPDPNIL
KPKAASKMLVSELKEELEAQGLPIDGTRNILYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDQSE
PLDSLDDVDIVEDVAKEIDEEEAEEEEEVEPTENQDGERVIKKEVEAKKPPQMIGVQLLKDVDQTSTTSKKSRRRRSRASVEDDRDEDWFPEDLFEAFGELRKRK
VFDESDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAAIKAPLPSAFFKILQTTHSLGYVFGSPLYDEIITLCLDLGE
LDAAIAIVADLETTGISVPDETLDRIISARQTNDAAPKRDSPIDITLNDHSLGNDEES