; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc03G12830 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc03G12830
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionSAP domain-containing protein
Genome locationClcChr03:20967283..21028836
RNA-Seq ExpressionClc03G12830
SyntenyClc03G12830
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR003034 - SAP domain
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR036361 - SAP domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443746.1 PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo]0.0e+0093.99Show/hide
Query:  MSKFLLSHAYLLTLPHKNHSFSLNHGVLPIRSVLSAPEKRGKKKRQSR-QQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN
        MSK LLSHA+LLTLP+ + SFSLNHG+LPIRSVLSAP+KRG+KKRQSR QQQ   KDDDST+LE +LRFTFMEELMDRARN DPLGVSDVIYDM+AAGL+
Subjt:  MSKFLLSHAYLLTLPHKNHSFSLNHGVLPIRSVLSAPEKRGKKKRQSR-QQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN

Query:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSH LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQ WLILTEELVRNKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFN LLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFRALKTF+GGTKALHNEGNFGDPLSLYLRALCREGRV++LL+ALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYI+EGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL 
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG

Query:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVE+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDD S+ LDSLDDVDT+EDVAKEIEEEEAEEEEEVE TENQDGERVIKKE+EAKKPLQMIGVQLLKDVDQ T TS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES
        AIKAP+PSAFLKILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGILVPDETLDRVIS RQTNDAMPKPDSAIDTT+NDHSLAND+ S
Subjt:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES

XP_011660243.1 uncharacterized protein LOC101209618 isoform X1 [Cucumis sativus]0.0e+0094.33Show/hide
Query:  MSKFLLSHAYLLTLPHKNHSFSLNHGVLPIRSVLSAPEKRGKKKRQSR-QQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN
        MSKFLLSHA+LLTLP  + SFSLNHG+LPIRSVLSAP+KRG+KKRQSR QQQ  PKD+DST+LE +LRFTFMEELMDRARN DPLGVSDVIYDM+AAGL+
Subjt:  MSKFLLSHAYLLTLPHKNHSFSLNHGVLPIRSVLSAPEKRGKKKRQSR-QQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN

Query:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSH LNGDTEGAMQSLRRELSAGL PLHETFVALVRLFGSKGLA RGLEILAAMEKLNYDIRQ WLILTEELVR+KYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFN LLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFRAL+TFEGGT ALHNEGNFGDPLSLYLRALCREGRVVELL+ALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYI+EGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL 
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG

Query:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGL SNNVKPSEDD S+PLDSLDDVDT+EDVAKEIEEEEAEEEEEVE TENQDGERVIKKE+EAKKPLQMIGVQLLKDVDQ TTTS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES
        AIKAP+PSAFLKILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGILV DETLDRVISARQTNDAMPKPDSAIDTTLNDHSLAND+ S
Subjt:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES

XP_022930357.1 uncharacterized protein LOC111436825 isoform X1 [Cucurbita moschata]0.0e+0092.99Show/hide
Query:  MSKFLLSHAYLLTLPHKNHSFSLNHGVL-PIRSVLSAPEKRGKKKRQSRQQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN
        MSKFLLSH+YLLTLPHK+HSFSL++GV  PIRSVLS  EKRG+KKRQSRQQQ   KDDDST  EK+LRFTFMEELMDRARN DPLGVSDVIYDM+AAGL+
Subjt:  MSKFLLSHAYLLTLPHKNHSFSLNHGVL-PIRSVLSAPEKRGKKKRQSRQQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN

Query:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQ WLIL EELV+NKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFN LLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFR LKTF GGTKALHNEGNFGDPLSLYLRALCREGRVVELL+ALEAMARDNQQIP RAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYI+EGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG

Query:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADY RV ERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDD SEPLDSLDDVD VEDVAKEI+EEEAEEEEEVEPTENQDGERVIKKE+EAKKP QMIGVQLLKDVDQT+TTS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES
        AIKAP+PSAF KILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGI VPDETLDR+ISARQTNDA PK DS ID TLNDHSL ND+ES
Subjt:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES

XP_023531019.1 uncharacterized protein LOC111793400 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0092.89Show/hide
Query:  MSKFLLSHAYLLTLPHKNHSFSLNHGVL--PIRSVLSAPEKRGKKKRQSRQQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGL
        MSKFLLSH+ LLTLPHK+HSFSL++G+L  PIRSVLS  EKRG+KKRQSRQQQ   KDDDST LEK+LRFTFMEELMDRARN DPLGVSDVIYDM+AAGL
Subjt:  MSKFLLSHAYLLTLPHKNHSFSLNHGVL--PIRSVLSAPEKRGKKKRQSRQQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGL

Query:  NPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGA
        +PGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRP+HETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQ WLIL EELV+NKYLEDANKVFLKGA
Subjt:  NPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGA

Query:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
        KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFN LLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
Subjt:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ

Query:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRA
        DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFR L+TF GGTKALHNEGNFGDPLSLYLRALCREGRVVELL+ALEAMARDNQQIP RA
Subjt:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRA

Query:  MILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAAL
        MILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYI+EGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAAL
Subjt:  MILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAAL

Query:  GDASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
        GDASEADY RV ERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
Subjt:  GDASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK

Query:  LHEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTT
        LHEGNTEFWKRRFLGEGLDSNNVKPSEDD SEPLDSLDDVD VEDVAKEI+EEEAEEEEEVEPTENQDGERVIKKE+EAKKP QMIGVQLLKD+DQT+TT
Subjt:  LHEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTT

Query:  SKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILR
        SKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILR
Subjt:  SKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILR

Query:  AAIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES
        AAIKAP+PSAF KILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGI VPDETLDR+ISARQTNDA PK DS ID TLNDHSLAND+ES
Subjt:  AAIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida]0.0e+0095.44Show/hide
Query:  MSKFLLSHAYLLTLPHKNHSFSLNHGVLPIRSVLSAPEKRGKKKRQSR-QQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN
        MSKFLLSHA+LLTLP+K+HSFSLNHGV+PIRSVLSAP+KRG+KKRQ+R QQQ H KD DSTALEK+LRFTFMEELMDRARN DPLGVSDVIYDM+AAGL+
Subjt:  MSKFLLSHAYLLTLPHKNHSFSLNHGVLPIRSVLSAPEKRGKKKRQSR-QQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN

Query:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSH LNGDTEGAMQSLRRELSAGL PLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQ WLILTEELVRNKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFN LLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFRALKTF+GGTKALHNEGNFGDPLSLYLRALCREGRVVELL+ALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYI+EGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKM+HRKILKTLQNEGLAALG
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG

Query:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
         ASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDD SEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVE TENQDGERVIKKE+EAKKPLQMIGVQLLKDVDQ  TTS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAF+ELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES
        AIKAP+PS+FLKILQTTH LGY FGSPLYDE+ITLCLDLGELDAAIAIVADLETTGILVPDETLDRVIS RQTND+MPKPDSAIDTTLNDHSLA+D+ S
Subjt:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES

TrEMBL top hitse value%identityAlignment
A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X10.0e+0093.99Show/hide
Query:  MSKFLLSHAYLLTLPHKNHSFSLNHGVLPIRSVLSAPEKRGKKKRQSR-QQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN
        MSK LLSHA+LLTLP+ + SFSLNHG+LPIRSVLSAP+KRG+KKRQSR QQQ   KDDDST+LE +LRFTFMEELMDRARN DPLGVSDVIYDM+AAGL+
Subjt:  MSKFLLSHAYLLTLPHKNHSFSLNHGVLPIRSVLSAPEKRGKKKRQSR-QQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN

Query:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSH LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQ WLILTEELVRNKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFN LLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFRALKTF+GGTKALHNEGNFGDPLSLYLRALCREGRV++LL+ALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYI+EGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL 
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG

Query:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVE+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDD S+ LDSLDDVDT+EDVAKEIEEEEAEEEEEVE TENQDGERVIKKE+EAKKPLQMIGVQLLKDVDQ T TS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES
        AIKAP+PSAFLKILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGILVPDETLDRVIS RQTNDAMPKPDSAIDTT+NDHSLAND+ S
Subjt:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES

A0A1S3B9H7 uncharacterized protein LOC103487261 isoform X20.0e+0093.84Show/hide
Query:  MSKFLLSHAYLLTLPHKNHSFSLNHGVLPIRSVLSAPEKRGKKKRQSR-QQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN
        MSK LLSHA+LLTLP+ + SFSLNHG+LPIRSVLSAP+KRG+KKRQSR QQQ   KDDDST+LE +LRFTFMEELMDRARN DPLGVSDVIYDM+AAGL+
Subjt:  MSKFLLSHAYLLTLPHKNHSFSLNHGVLPIRSVLSAPEKRGKKKRQSR-QQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN

Query:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSH LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQ WLILTEELVRNKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFN LLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFRALKTF+GGTKALHNEGNFGDPLSLYLRALCREGRV++LL+ALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYI+EGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL 
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG

Query:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVE+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDD S+ LDSLDDVDT+EDVAKEIEEEEAEEEEEVE TENQDGERVIKKE+EAKKPLQMIGVQLLKDVDQ T TS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAF+EL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPVPSAFLKILQTTHSLGYVFGSPL
        AIKAP+PSAFLKILQTTH LGYVFG  L
Subjt:  AIKAPVPSAFLKILQTTHSLGYVFGSPL

A0A6J1DBV3 uncharacterized protein LOC1110195950.0e+0093.12Show/hide
Query:  MSKFLL-SHAYLLTLPHKNHSFSL-NH-GVLPIRSVLSAPEKRGKKKRQSRQQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAG
        MSK LL SH +LLTLPHK  S  L NH GVLPIRSVLSAPEKRG+KKRQ R    HPKDD STALEK LRFTFMEELMDRAR+ D +GVSDVIYDM+AAG
Subjt:  MSKFLL-SHAYLLTLPHKNHSFSL-NH-GVLPIRSVLSAPEKRGKKKRQSRQQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAG

Query:  LNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKG
        L+PGPRSFHGLVVSH+LNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLA+RGLEIL+AMEKLNYDIRQ WLIL +ELVRNKYLEDANK FLKG
Subjt:  LNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKG

Query:  AKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRV
        AKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFN LLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDRV
Subjt:  AKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRV

Query:  QDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPR
        QDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFRALKTF+GGTKALHNEGNFGDPLSLYLRALCREGRVVELL+ALEAMARDNQQIP R
Subjt:  QDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPR

Query:  AMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAA
        AMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYI+EGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL A
Subjt:  AMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAA

Query:  LGDASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI
        LGDASEADYHRV ERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI
Subjt:  LGDASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI

Query:  KLHEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTT
        KLHEGNTE+WKRRFLGEGLD+N+VKPSEDD SEPLDSLDDVD VED AKEIEEEE  EEEEVE TENQDGERVIKKE+EAKKPLQMIGVQLLKDVDQTTT
Subjt:  KLHEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTT

Query:  TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMIL
        TSKKSRRRSSRASLEDDRDEDWFPEDIFEAF+ELRKR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGGTPTIGDCAMIL
Subjt:  TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMIL

Query:  RAAIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDE
        RAAI++P+PSAFLKILQTTHSLGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDRVISARQTNDAMPKPD+AIDTTLNDHSLAND+ 
Subjt:  RAAIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDE

Query:  S
        S
Subjt:  S

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X10.0e+0092.99Show/hide
Query:  MSKFLLSHAYLLTLPHKNHSFSLNHGVL-PIRSVLSAPEKRGKKKRQSRQQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN
        MSKFLLSH+YLLTLPHK+HSFSL++GV  PIRSVLS  EKRG+KKRQSRQQQ   KDDDST  EK+LRFTFMEELMDRARN DPLGVSDVIYDM+AAGL+
Subjt:  MSKFLLSHAYLLTLPHKNHSFSLNHGVL-PIRSVLSAPEKRGKKKRQSRQQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN

Query:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQ WLIL EELV+NKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFN LLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFR LKTF GGTKALHNEGNFGDPLSLYLRALCREGRVVELL+ALEAMARDNQQIP RAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYI+EGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG

Query:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADY RV ERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDD SEPLDSLDDVD VEDVAKEI+EEEAEEEEEVEPTENQDGERVIKKE+EAKKP QMIGVQLLKDVDQT+TTS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES
        AIKAP+PSAF KILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGI VPDETLDR+ISARQTNDA PK DS ID TLNDHSL ND+ES
Subjt:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X10.0e+0092.55Show/hide
Query:  MSKFLLSHAYLLTLPHKNHSFSLNHGVL-PIRSVLSAPEKRGKKKRQSRQQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN
        MSKFLLSH+ LLTLPHK+HSFSL++ VL PIRSVLS  EKRG+KKRQSRQQQ   KD DST LEK+LRFTFMEELMDRARN DPLGVSDVIYDM+AAGL+
Subjt:  MSKFLLSHAYLLTLPHKNHSFSLNHGVL-PIRSVLSAPEKRGKKKRQSRQQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLN

Query:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQ WLIL EELV+NKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFN LLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCV+REAIRHFR LKTF GGTKALH+EG+FGDPLSLYLRALCREGRVVELL+ALEAMARDNQQIP RAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYI+EGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALG

Query:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADY RV ERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDD SEPLDSLDDVD VEDVAKEI+EEEAEEEEEVEPTENQDGERVIKKE+EAKKP QMIGVQLLKDVDQT+TTS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKRK+FD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES
        AIKAP+PSAF KILQTTHSLGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDR+ISARQTNDA PK DS ID TLNDHSLA+D+ES
Subjt:  AIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDDES

SwissProt top hitse value%identityAlignment
O64624 Pentatricopeptide repeat-containing protein At2g18940, chloroplastic1.4e-0528.07Show/hide
Query:  RDPLGVSDVIYDMLAAGLNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQ-TWLILTE
        R  LGV D   +M + GL     +   ++ + A  G    A +      S G  P   T+ AL+++FG  G+ T  L +L  ME+ +      T+  L  
Subjt:  RDPLGVSDVIYDMLAAGLNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQ-TWLILTE

Query:  ELVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSV
          VR  + ++A  V     K G+      Y  +I+   KAG    AL++ Y M+ AG +  T  +N++LS+
Subjt:  ELVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSV

Q0WPZ6 Pentatricopeptide repeat-containing protein At2g171408.0e-0923.43Show/hide
Query:  VSDVIYDMLAAGLNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLN-YDIRQTWLILTEELVRN
        VS +  DM+  G+ P   +F+ L+ +   +   + A +        G +P   TF  LVR +   GL  +GLE+L AME       +  +  +     R 
Subjt:  VSDVIYDMLAAGLNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLN-YDIRQTWLILTEELVRN

Query:  KYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTET
           +D+ K+  K  + GL      ++  I   CK G   +A  I  +ME    +      +  +N +L      G+ E A + FE++   +D      ++
Subjt:  KYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTET

Query:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLS--LYLRALCREGRVV
        YN  +Q   R   +   + V + +       K + P++ +Y +L++   K  +L +A       KT  G    +   G   D ++    L   C  G+V 
Subjt:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLS--LYLRALCREGRVV

Query:  ELLDALEAMARDNQQIPPRAMILSRKYRSL-----VSSWIEPLQEEAEHGFEIDYIARYIKEGGLTG
             L+ M R+N    P A   +    SL     +S   E L++  E G+ +D +   I   GL G
Subjt:  ELLDALEAMARDNQQIPPRAMILSRKYRSL-----VSSWIEPLQEEAEHGFEIDYIARYIKEGGLTG

Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial6.3e-0622.18Show/hide
Query:  DPLGVSDVIYDMLAAGLNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIR-QTWLILTEE
        D +    + ++M   GL P   +F  L+  +   G  + A +     + AG  P   T+  L+     +G      E+L  M K+       T+  +   
Subjt:  DPLGVSDVIYDMLAAGLNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIR-QTWLILTEE

Query:  LVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMK-----
        L ++  +E+A K+  +    GL A    Y  L++  CK+G+   A EI  EM   G   T   FN L++           F     +E GE  +      
Subjt:  LVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMK-----

Query:  ---PDTETYNWVIQAYTRAESYDRVQDVAELLGMMVED--HKRLQPNMRTYALLVECFTKYCVLREAIRHFRALK
           P+  T+N +++ Y        +++  +    + +D   + + P+ +TY  LV+   K   ++EA   F+ +K
Subjt:  ---PDTETYNWVIQAYTRAESYDRVQDVAELLGMMVED--HKRLQPNMRTYALLVECFTKYCVLREAIRHFRALK

Q9LN22 Pentatricopeptide repeat-containing protein At1g20300, mitochondrial6.3e-0625.93Show/hide
Query:  LEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQ
        + +A KVF +    G+      Y ++I+  C+ G  S A ++  +M  +G       FN+L+ V    G  E     +  M+  +   +PDT TYN++I+
Subjt:  LEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQ

Query:  AYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTY
        A+ R E+   +++  ++L  M++  K+ + N  T+
Subjt:  AYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTY

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic6.8e-0824.33Show/hide
Query:  GLRPLHETFVALVRLFGSKGL--ATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS
        G++P   TF +L+ +    GL  A R L       ++  D+  ++  L + + +   ++ A ++  +     +      Y  +I+   KAG    AL + 
Subjt:  GLRPLHETFVALVRLFGSKGL--ATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS

Query:  YEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
         EM   G       +N+LLS+    G  E A      M      +K D  TYN ++  Y +   YD V+ V      M  +H  + PN+ TY+ L++ ++
Subjt:  YEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT

Query:  KYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLDALEAMARD
        K  + +EA+  FR  K+   G +A        D +  S  + ALC+ G V   +  ++ M ++
Subjt:  KYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLDALEAMARD

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein4.5e-0722.18Show/hide
Query:  DPLGVSDVIYDMLAAGLNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIR-QTWLILTEE
        D +    + ++M   GL P   +F  L+  +   G  + A +     + AG  P   T+  L+     +G      E+L  M K+       T+  +   
Subjt:  DPLGVSDVIYDMLAAGLNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIR-QTWLILTEE

Query:  LVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMK-----
        L ++  +E+A K+  +    GL A    Y  L++  CK+G+   A EI  EM   G   T   FN L++           F     +E GE  +      
Subjt:  LVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMK-----

Query:  ---PDTETYNWVIQAYTRAESYDRVQDVAELLGMMVED--HKRLQPNMRTYALLVECFTKYCVLREAIRHFRALK
           P+  T+N +++ Y        +++  +    + +D   + + P+ +TY  LV+   K   ++EA   F+ +K
Subjt:  ---PDTETYNWVIQAYTRAESYDRVQDVAELLGMMVED--HKRLQPNMRTYALLVECFTKYCVLREAIRHFRALK

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein4.5e-0722.18Show/hide
Query:  DPLGVSDVIYDMLAAGLNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIR-QTWLILTEE
        D +    + ++M   GL P   +F  L+  +   G  + A +     + AG  P   T+  L+     +G      E+L  M K+       T+  +   
Subjt:  DPLGVSDVIYDMLAAGLNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIR-QTWLILTEE

Query:  LVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMK-----
        L ++  +E+A K+  +    GL A    Y  L++  CK+G+   A EI  EM   G   T   FN L++           F     +E GE  +      
Subjt:  LVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMK-----

Query:  ---PDTETYNWVIQAYTRAESYDRVQDVAELLGMMVED--HKRLQPNMRTYALLVECFTKYCVLREAIRHFRALK
           P+  T+N +++ Y        +++  +    + +D   + + P+ +TY  LV+   K   ++EA   F+ +K
Subjt:  ---PDTETYNWVIQAYTRAESYDRVQDVAELLGMMVED--HKRLQPNMRTYALLVECFTKYCVLREAIRHFRALK

AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein5.7e-1023.43Show/hide
Query:  VSDVIYDMLAAGLNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLN-YDIRQTWLILTEELVRN
        VS +  DM+  G+ P   +F+ L+ +   +   + A +        G +P   TF  LVR +   GL  +GLE+L AME       +  +  +     R 
Subjt:  VSDVIYDMLAAGLNPGPRSFHGLVVSHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLN-YDIRQTWLILTEELVRN

Query:  KYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTET
           +D+ K+  K  + GL      ++  I   CK G   +A  I  +ME    +      +  +N +L      G+ E A + FE++   +D      ++
Subjt:  KYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTET

Query:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLS--LYLRALCREGRVV
        YN  +Q   R   +   + V + +       K + P++ +Y +L++   K  +L +A       KT  G    +   G   D ++    L   C  G+V 
Subjt:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLS--LYLRALCREGRVV

Query:  ELLDALEAMARDNQQIPPRAMILSRKYRSL-----VSSWIEPLQEEAEHGFEIDYIARYIKEGGLTG
             L+ M R+N    P A   +    SL     +S   E L++  E G+ +D +   I   GL G
Subjt:  ELLDALEAMARDNQQIPPRAMILSRKYRSL-----VSSWIEPLQEEAEHGFEIDYIARYIKEGGLTG

AT2G31400.1 genomes uncoupled 14.8e-0924.33Show/hide
Query:  GLRPLHETFVALVRLFGSKGL--ATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS
        G++P   TF +L+ +    GL  A R L       ++  D+  ++  L + + +   ++ A ++  +     +      Y  +I+   KAG    AL + 
Subjt:  GLRPLHETFVALVRLFGSKGL--ATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS

Query:  YEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
         EM   G       +N+LLS+    G  E A      M      +K D  TYN ++  Y +   YD V+ V      M  +H  + PN+ TY+ L++ ++
Subjt:  YEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT

Query:  KYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLDALEAMARD
        K  + +EA+  FR  K+   G +A        D +  S  + ALC+ G V   +  ++ M ++
Subjt:  KYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLDALEAMARD

AT3G04260.1 plastid transcriptionally active 30.0e+0075.4Show/hide
Query:  GVLPIRSVLSAPEKRGKKKRQSRQQQFHPKDDD--------STALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLNPGPRSFHGLVVSHALNGD
        G+  IR  +SAPEK+ +++R+ ++      DD          +ALE++LR TFM+ELM+RARNRD  GVS+VIYDM+AAGL+PGPRSFHGLVV+HALNGD
Subjt:  GVLPIRSVLSAPEKRGKKKRQSRQQQFHPKDDD--------STALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLNPGPRSFHGLVVSHALNGD

Query:  TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDC
         +GAM SLR+EL AG RPL ET +ALVRL GSKG ATRGLEILAAMEKL YDIRQ WLIL EEL+R  +LEDANKVFLKGA+GG+RATD++YDL+IEEDC
Subjt:  TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDC

Query:  KAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN
        KAGDHSNAL+ISYEMEAAGRMATTFHFN LLSVQATCGIPE+A++TFENMEYGE +MKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKR+QPN
Subjt:  KAGDHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN

Query:  MRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQ
        ++TYALLVECFTKYCV++EAIRHFRALK FEGGT  LHN GNF DPLSLYLRALCREGR+VEL+DAL+AM +DNQ IPPRAMI+SRKYR+LVSSWIEPLQ
Subjt:  MRTYALLVECFTKYCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQ

Query:  EEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVERLKKII
        EEAE G+EIDY+ARYI+EGGLTGERKRWVPRRGKTPLDPDA GFIYSNP+ETSFKQRCLEDWK++HRK+L+TLQ+EGL  LGDASE+DY RVVERL+ II
Subjt:  EEAEHGFEIDYIARYIKEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVERLKKII

Query:  KGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLD
        KGP  N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN+SRGRPLWVPP+EEEEEEVDEE+D+LI RIKLHEG+TEFWKRRFLGEGL 
Subjt:  KGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLD

Query:  SNNVKPSE--------------DDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQ-DGERVIK-KEIEAKKPLQMIGVQLLKDVDQTTTTSKK
          +V+  E              +DIS+  D+ +D D  E    E ++E  EEE  V  TEN+ +GE ++K K  +AKK LQMIGVQLLK+ D+   T KK
Subjt:  SNNVKPSE--------------DDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVEPTENQ-DGERVIK-KEIEAKKPLQMIGVQLLKDVDQTTTTSKK

Query:  SRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAI
          +R+SR +LEDD DEDWFPE+ FEAF+E+R+RKVFDV+DMYTIADVWGWTWE++ KN+ PR+WSQEWEVELAI +M KVIELGG PTIGDCA+ILRAA+
Subjt:  SRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRAAI

Query:  KAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDA
        +AP+PSAFLKILQTTHSLGY FGSPLYDEIITLCLDLGELDAAIAIVAD+ETTGI VPD+TLD+VISARQ+N++
Subjt:  KAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAAATTCCTGCTCTCTCACGCTTACCTTCTCACCCTTCCCCACAAAAACCATTCCTTTTCTCTCAACCATGGCGTCCTTCCCATCCGCTCAGTCCTCTCTGCTCC
AGAGAAGCGAGGTAAGAAGAAGCGGCAGTCGCGGCAGCAACAATTCCACCCAAAGGACGACGATTCCACTGCACTTGAGAAGGCCCTCCGCTTCACTTTCATGGAGGAAC
TCATGGACCGCGCTAGAAACCGCGACCCTCTCGGCGTTTCTGATGTCATTTACGATATGCTTGCCGCTGGATTGAACCCTGGACCTCGCTCCTTCCATGGATTAGTTGTT
TCTCATGCTCTCAATGGTGATACTGAGGGAGCGATGCAATCTCTGAGAAGGGAATTAAGTGCTGGACTTCGTCCTCTTCATGAAACGTTTGTTGCATTAGTTCGGTTATT
TGGTTCCAAGGGTCTTGCTACTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAATTATGACATCCGTCAAACATGGCTCATTCTTACTGAGGAACTTGTAAGAA
ACAAATATTTAGAAGACGCCAACAAAGTGTTCTTAAAGGGTGCCAAAGGGGGTCTCAGAGCCACCGACAAGATTTATGATCTTCTGATTGAGGAAGATTGTAAAGCCGGG
GATCATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCTGGGCGGATGGCAACAACCTTTCATTTCAATTCCCTTCTTAGTGTCCAGGCTACTTGTGGAATACC
TGAAATTGCTTTCTCAACATTTGAGAACATGGAGTATGGAGAAGATTACATGAAACCTGATACTGAGACATATAATTGGGTGATCCAAGCATATACAAGGGCTGAATCTT
ATGATAGGGTGCAAGATGTTGCTGAGTTACTTGGCATGATGGTTGAAGACCACAAGCGTCTACAGCCGAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACTAAG
TATTGTGTTTTACGAGAAGCTATTAGGCATTTTCGTGCACTAAAAACCTTTGAAGGTGGAACAAAAGCTTTGCATAATGAAGGGAATTTTGGTGACCCACTTTCCTTATA
TCTTCGAGCTTTATGTAGAGAAGGTAGGGTTGTTGAACTGTTAGATGCCTTGGAGGCTATGGCTAGAGATAACCAACAGATTCCTCCAAGAGCCATGATCTTGAGCAGAA
AGTATCGATCACTGGTGAGCTCATGGATTGAACCTTTACAAGAAGAAGCTGAACATGGATTCGAGATAGACTACATTGCAAGATACATTAAAGAGGGTGGACTCACTGGA
GAACGCAAGAGATGGGTCCCTCGAAGAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATCTATTCAAATCCTATGGAGACATCCTTTAAGCAACGATGTCTAGA
AGACTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGCGGCTCTTGGGGATGCATCTGAAGCTGATTATCATAGAGTCGTGGAGAGATTGA
AGAAAATAATAAAGGGTCCTGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAATTAGAAGCACAAGGTTTACCAATTGAC
GGAACTAGAAATGTTCTTTACCAGCGTGTTCAGAAAGCAAGGAGAATTAACCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGTTGA
TGAAGAGCTGGATGAACTAATTTCACGAATAAAGCTACACGAAGGAAATACGGAGTTCTGGAAACGCCGCTTTCTCGGAGAAGGCTTGGACAGTAATAATGTTAAACCTT
CTGAAGATGATATATCAGAACCTCTTGATTCTTTGGATGATGTTGACACTGTAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCTGAGGAGGAAGAGGAGGTAGAA
CCAACTGAGAATCAAGATGGGGAAAGAGTTATTAAGAAGGAAATTGAAGCTAAGAAGCCTCTTCAAATGATAGGTGTTCAATTGTTAAAAGATGTTGACCAAACCACAAC
GACATCCAAAAAGTCGAGGAGGAGAAGTTCTCGAGCATCACTTGAGGATGATCGTGATGAAGATTGGTTTCCTGAAGATATATTTGAGGCATTTGAGGAGTTGCGAAAGA
GGAAAGTCTTTGATGTATCTGACATGTACACAATAGCTGATGTCTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAA
GTGGAGCTGGCTATTAAAATTATGCACAAGGTGATTGAATTGGGTGGAACACCAACAATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCTGTACCTTC
TGCCTTCTTGAAAATCTTGCAGACAACTCATAGTCTTGGCTATGTATTTGGGAGCCCTTTATATGACGAGATTATCACCCTGTGCCTTGATCTTGGGGAACTAGATGCAG
CCATTGCGATTGTAGCAGACCTGGAAACAACAGGAATCTTGGTTCCCGATGAAACACTCGATCGGGTCATCTCCGCTAGACAGACGAACGATGCTATGCCCAAGCCTGAT
TCAGCCATTGATACTACACTAAATGATCACAGTTTAGCCAATGATGATGAATCATAA
mRNA sequenceShow/hide mRNA sequence
CGAACTCTCAAGTCTCAACAGGTCCGAGTGCCAAACACCCTCCTCTTTGTTGTTTATTATCTCATCAGCAATCCCACCTTTTTAAACAGCAACACACTTCACTATACTTC
CCATTTTTCCCTCCTCCTTCTCGCTGTTTCTTATCCAAATTCCCGCAGCCATTAATGTCCAAATTCCTGCTCTCTCACGCTTACCTTCTCACCCTTCCCCACAAAAACCA
TTCCTTTTCTCTCAACCATGGCGTCCTTCCCATCCGCTCAGTCCTCTCTGCTCCAGAGAAGCGAGGTAAGAAGAAGCGGCAGTCGCGGCAGCAACAATTCCACCCAAAGG
ACGACGATTCCACTGCACTTGAGAAGGCCCTCCGCTTCACTTTCATGGAGGAACTCATGGACCGCGCTAGAAACCGCGACCCTCTCGGCGTTTCTGATGTCATTTACGAT
ATGCTTGCCGCTGGATTGAACCCTGGACCTCGCTCCTTCCATGGATTAGTTGTTTCTCATGCTCTCAATGGTGATACTGAGGGAGCGATGCAATCTCTGAGAAGGGAATT
AAGTGCTGGACTTCGTCCTCTTCATGAAACGTTTGTTGCATTAGTTCGGTTATTTGGTTCCAAGGGTCTTGCTACTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAAT
TGAATTATGACATCCGTCAAACATGGCTCATTCTTACTGAGGAACTTGTAAGAAACAAATATTTAGAAGACGCCAACAAAGTGTTCTTAAAGGGTGCCAAAGGGGGTCTC
AGAGCCACCGACAAGATTTATGATCTTCTGATTGAGGAAGATTGTAAAGCCGGGGATCATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCTGGGCGGATGGC
AACAACCTTTCATTTCAATTCCCTTCTTAGTGTCCAGGCTACTTGTGGAATACCTGAAATTGCTTTCTCAACATTTGAGAACATGGAGTATGGAGAAGATTACATGAAAC
CTGATACTGAGACATATAATTGGGTGATCCAAGCATATACAAGGGCTGAATCTTATGATAGGGTGCAAGATGTTGCTGAGTTACTTGGCATGATGGTTGAAGACCACAAG
CGTCTACAGCCGAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACTAAGTATTGTGTTTTACGAGAAGCTATTAGGCATTTTCGTGCACTAAAAACCTTTGAAGG
TGGAACAAAAGCTTTGCATAATGAAGGGAATTTTGGTGACCCACTTTCCTTATATCTTCGAGCTTTATGTAGAGAAGGTAGGGTTGTTGAACTGTTAGATGCCTTGGAGG
CTATGGCTAGAGATAACCAACAGATTCCTCCAAGAGCCATGATCTTGAGCAGAAAGTATCGATCACTGGTGAGCTCATGGATTGAACCTTTACAAGAAGAAGCTGAACAT
GGATTCGAGATAGACTACATTGCAAGATACATTAAAGAGGGTGGACTCACTGGAGAACGCAAGAGATGGGTCCCTCGAAGAGGAAAAACTCCTCTAGATCCTGATGCAGA
TGGATTCATCTATTCAAATCCTATGGAGACATCCTTTAAGCAACGATGTCTAGAAGACTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCAGAATGAAGGACTTG
CGGCTCTTGGGGATGCATCTGAAGCTGATTATCATAGAGTCGTGGAGAGATTGAAGAAAATAATAAAGGGTCCTGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAG
ATGATTGTATCAGAATTAAAAGAAGAATTAGAAGCACAAGGTTTACCAATTGACGGAACTAGAAATGTTCTTTACCAGCGTGTTCAGAAAGCAAGGAGAATTAACCGGTC
TCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGTTGATGAAGAGCTGGATGAACTAATTTCACGAATAAAGCTACACGAAGGAAATACGGAGT
TCTGGAAACGCCGCTTTCTCGGAGAAGGCTTGGACAGTAATAATGTTAAACCTTCTGAAGATGATATATCAGAACCTCTTGATTCTTTGGATGATGTTGACACTGTAGAA
GACGTTGCAAAGGAGATTGAAGAAGAAGAAGCTGAGGAGGAAGAGGAGGTAGAACCAACTGAGAATCAAGATGGGGAAAGAGTTATTAAGAAGGAAATTGAAGCTAAGAA
GCCTCTTCAAATGATAGGTGTTCAATTGTTAAAAGATGTTGACCAAACCACAACGACATCCAAAAAGTCGAGGAGGAGAAGTTCTCGAGCATCACTTGAGGATGATCGTG
ATGAAGATTGGTTTCCTGAAGATATATTTGAGGCATTTGAGGAGTTGCGAAAGAGGAAAGTCTTTGATGTATCTGACATGTACACAATAGCTGATGTCTGGGGTTGGACT
TGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGAGCTGGCTATTAAAATTATGCACAAGGTGATTGAATTGGGTGGAACACCAAC
AATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCTGTACCTTCTGCCTTCTTGAAAATCTTGCAGACAACTCATAGTCTTGGCTATGTATTTGGGAGCC
CTTTATATGACGAGATTATCACCCTGTGCCTTGATCTTGGGGAACTAGATGCAGCCATTGCGATTGTAGCAGACCTGGAAACAACAGGAATCTTGGTTCCCGATGAAACA
CTCGATCGGGTCATCTCCGCTAGACAGACGAACGATGCTATGCCCAAGCCTGATTCAGCCATTGATACTACACTAAATGATCACAGTTTAGCCAATGATGATGAATCATA
ATGGATCATTTTGTGTTCTTATTTTTCTTTTGTACAGTTCAGTCGTAGGGTTGCCAGCAAAATGGTTCTTGTAAATTTTATTTGTAGTTGCTTTAGTGGTTTTGTATTTT
GAACTTTGGAGTTGAAATATTTGATTTTAGAAACAAGATATTGAGATTTTTTTTTTTCAATCAAGTTTTAATCATTATGGTAAGTAGCTCCTTTCTTTTGCACCTTCGTT
TGAGCTAACATGCGAAGGAACTCGTACTGGTTTAGTGATGTGTCATATCCACTCCAATGTCTAAGTTAGTAAACAAAGTGTTTAGTTCAACTAAGGACTCAAATATTGCA
TATCTTGTAAGTGGATAGAACAAAGTTATTTATAGGACTCATTATCTATCATTCACTTGGGGCGCGAGCAATGTTGGCGGTGTCAAAATGGATGAAATTGTCAAGGTTGT
AGAGTTCGACCCGTGAGCTCATCTTGCTCATTTCCTCTTATGTGAAATGGGAGGACACTAGGCCAAATTGACCTTGGGACTGAGGACCGC
Protein sequenceShow/hide protein sequence
MSKFLLSHAYLLTLPHKNHSFSLNHGVLPIRSVLSAPEKRGKKKRQSRQQQFHPKDDDSTALEKALRFTFMEELMDRARNRDPLGVSDVIYDMLAAGLNPGPRSFHGLVV
SHALNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQTWLILTEELVRNKYLEDANKVFLKGAKGGLRATDKIYDLLIEEDCKAG
DHSNALEISYEMEAAGRMATTFHFNSLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTK
YCVLREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLDALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIKEGGLTG
ERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLAALGDASEADYHRVVERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPID
GTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDISEPLDSLDDVDTVEDVAKEIEEEEAEEEEEVE
PTENQDGERVIKKEIEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFEELRKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWE
VELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPVPSAFLKILQTTHSLGYVFGSPLYDEIITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISARQTNDAMPKPD
SAIDTTLNDHSLANDDES