; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020083 (gene) of Snake gourd v1 genome

Gene IDTan0020083
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSAP domain-containing protein
Genome locationLG09:40754575..40807241
RNA-Seq ExpressionTan0020083
SyntenyTan0020083
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR003034 - SAP domain
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR036361 - SAP domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588298.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0094.33Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLNNGVL-PIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSHS LLTLPHKHHSFSL+NGVL P+RS+LS  EKRGRKKRQSRQQQL QKDD ST LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLNNGVL-PIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSH LNADAEGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQAWLILIEELV+NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPD ETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTF GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP RAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWK+YHRKILKTLQNEGLAALG
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADY+RVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVK SEDD+SEPLDSLDDVDIVEDVAKEI+EEEAEEEEEVE  ENQDGERVIKKEVEAKKP QMIGVQLLKDVDQT+TTS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRR SRAS+EDDRDEDWFPEDLFEAFGELRKRK+FD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS
        AIKAPLPSAF KILQTTH LGYVFGS LYDE+ITLCLDLGELDAAIAIVADLETTGISV DETLDR+ISARQT+DAAPK D  IDITL+DHSL NDE S
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS

XP_022930357.1 uncharacterized protein LOC111436825 isoform X1 [Cucurbita moschata]0.0e+0094.33Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLNNGVL-PIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSHSYLLTLPHKHHSFSL+NGV  PIRS+LS  EKRGRKKRQSRQQQL QKDD ST  E +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLNNGVL-PIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSH LNADAEGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQAWLILIEELV+NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPD ETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTF GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP RAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWK+YHRKILKTLQNEGLAALG
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADY+RVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVK SEDD+SEPLDSLDDVDIVEDVAKEI+EEEAEEEEEVE  ENQDGERVIKKEVEAKKP QMIGVQLLKDVDQT+TTS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRR SRAS+EDDRDEDWFPEDLFEAFGELRKRK+FD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS
        AIKAPLPSAF KILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGISV DETLDR+ISARQT+DAAPK D  IDITL+DHSL NDE S
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS

XP_023006519.1 uncharacterized protein LOC111499221 isoform X1 [Cucurbita maxima]0.0e+0094.33Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLNNGVL-PIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSHS LLTLPHKHHSFSL+N VL PIRS+LS  EKRGRKKRQSRQQQL QKD  ST LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLNNGVL-PIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSH LNADAEGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQAWLILIEELV+NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPD ETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTF GGTKALH+EG+FGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP RAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWK+YHRKILKTLQNEGLAALG
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYLRVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVK SEDD+SEPLDSLDDVD+VEDVAKEI+EEEAEEEEEVE  ENQDGERVIKKEVEAKKP QMIGVQLLKDVDQT+TTS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRR SRAS+EDDRDEDWFPEDLFEAFGELRKRKIFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS
        AIKAPLPSAF KILQTTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISV DETLDR+ISARQT+DAAPK D  IDITL+DHSLA+DE S
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS

XP_023531019.1 uncharacterized protein LOC111793400 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0094.11Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLNNGVL--PIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
        MSKFLLSHS LLTLPHKHHSFSL+NG+L  PIRS+LS  EKRGRKKRQSRQQQL QKDD ST LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLNNGVL--PIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL

Query:  SPGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGA
        SPGPRSFHGLVVSH LNADAEGAMQSLR+ELS GLRP+HETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQAWLILIEELV+NKYLEDAN+VFLKGA
Subjt:  SPGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGA

Query:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQ
        KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPD ETYNWVIQAYTRAESYDRVQ
Subjt:  KGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQ

Query:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRA
        DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGL+TF GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP RA
Subjt:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRA

Query:  MILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAAL
        MILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWK+YHRKILKTLQNEGLAAL
Subjt:  MILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAAL

Query:  GDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
        GDASEADYLRVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
Subjt:  GDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK

Query:  LHEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTT
        LHEGNTEFWKRRFLGEGLDSNNVK SEDD+SEPLDSLDDVDIVEDVAKEI+EEEAEEEEEVE  ENQDGERVIKKEVEAKKP QMIGVQLLKD+DQT+TT
Subjt:  LHEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTT

Query:  SKKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILR
        SKKSRRR SRAS+EDDRDEDWFPEDLFEAFGELRKRK+FD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILR
Subjt:  SKKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILR

Query:  AAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS
        AAIKAPLPSAF KILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGISV DETLDR+ISARQT+DAAPK D  IDITL+DHSLANDE S
Subjt:  AAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida]0.0e+0094.66Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLNNGVLPIRSLLSAPEKRGRKKRQSR-QQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSH++LLTLP+KHHSFSLN+GV+PIRS+LSAP+KRGRKKRQ+R QQQLH KD  STALE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLNNGVLPIRSLLSAPEKRGRKKRQSR-QQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSH LN D EGAMQSLRRELSAGL PLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPD ETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWK++HRKILKTLQNEGLAALG
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
         ASEADY RV ERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVK SEDDKSEPLDSLDDVD VEDVAKEIEEEEAEEEEEVEQ ENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ  TTS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPED+FEAF ELRKRK+FDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS
        AIKAPLPS+FLKILQTTHGLGY FGSPLYDEVITLCLDLGELDAAIAIVADLETTGI V DETLDRVIS RQT+D+ PKPD AID TL+DHSLA+DEAS
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS

TrEMBL top hitse value%identityAlignment
A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X10.0e+0093.44Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLNNGVLPIRSLLSAPEKRGRKKRQSR-QQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSH++LLTLP+ H SFSLN+G+LPIRS+LSAP+KRGRKKRQSR QQQL  KDD ST+LEN+LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLNNGVLPIRSLLSAPEKRGRKKRQSR-QQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSH LN D EGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQAWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPD ETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWK+YHRKILKTLQNEGL AL 
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADY RV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVK SEDDKS+ LDSLDDVD +EDVAKEIEEEEAEEEEEVEQ ENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T TS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPED+FEAF EL+KRK+FDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS
        AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI V DETLDRVIS RQT+DA PKPD AID T++DHSLANDEAS
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS

A0A1S3B9H7 uncharacterized protein LOC103487261 isoform X20.0e+0093.72Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLNNGVLPIRSLLSAPEKRGRKKRQSR-QQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSH++LLTLP+ H SFSLN+G+LPIRS+LSAP+KRGRKKRQSR QQQL  KDD ST+LEN+LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLNNGVLPIRSLLSAPEKRGRKKRQSR-QQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSH LN D EGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQAWLIL EELVRNKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPD ETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWK+YHRKILKTLQNEGL AL 
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADY RV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVK SEDDKS+ LDSLDDVD +EDVAKEIEEEEAEEEEEVEQ ENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T TS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPED+FEAF EL+KRK+FDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPL
        AIKAPLPSAFLKILQTTHGLGYVFG  L
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPL

A0A6J1DBV3 uncharacterized protein LOC1110195950.0e+0093.01Show/hide
Query:  MSKFLL-SHSYLLTLPHKHHSFSL--NNGVLPIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAG
        MSK LL SH++LLTLPHK  S  L  +NGVLPIRS+LSAPEKRGRKKRQ R    H KDD STALE  LRFTFMEELMDRAR+ D +GVSDVIYDMVAAG
Subjt:  MSKFLL-SHSYLLTLPHKHHSFSL--NNGVLPIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAG

Query:  LSPGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKG
        LSPGPRSFHGLVVSH+LN D EGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLA+RGLEIL+AMEKLNYDIRQAWLILI+ELVRNKYLEDAN+ FLKG
Subjt:  LSPGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKG

Query:  AKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRV
        AKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPD ETYNWVIQAYTRAESYDRV
Subjt:  AKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRV

Query:  QDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPR
        QDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP R
Subjt:  QDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPR

Query:  AMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAA
        AMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWK+YHRKILKTLQNEGL A
Subjt:  AMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAA

Query:  LGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI
        LGDASEADY RVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI
Subjt:  LGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI

Query:  KLHEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTT
        KLHEGNTE+WKRRFLGEGLD+N+VK SEDDKSEPLDSLDDVDIVED AKEIEEEE  EEEEVEQ ENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTT
Subjt:  KLHEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTT

Query:  TSKKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMIL
        TSKKSRRRSSRASLEDDRDEDWFPED+FEAF ELRKR++FDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGGTPTIGDCAMIL
Subjt:  TSKKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMIL

Query:  RAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEA
        RAAI++PLPSAFLKILQTTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI V DETLDRVISARQT+DA PKPD AID TL+DHSLANDEA
Subjt:  RAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEA

Query:  S
        S
Subjt:  S

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X10.0e+0094.33Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLNNGVL-PIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSHSYLLTLPHKHHSFSL+NGV  PIRS+LS  EKRGRKKRQSRQQQL QKDD ST  E +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLNNGVL-PIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSH LNADAEGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQAWLILIEELV+NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPD ETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTF GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP RAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWK+YHRKILKTLQNEGLAALG
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADY+RVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVK SEDD+SEPLDSLDDVDIVEDVAKEI+EEEAEEEEEVE  ENQDGERVIKKEVEAKKP QMIGVQLLKDVDQT+TTS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRR SRAS+EDDRDEDWFPEDLFEAFGELRKRK+FD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS
        AIKAPLPSAF KILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGISV DETLDR+ISARQT+DAAPK D  IDITL+DHSL NDE S
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X10.0e+0094.33Show/hide
Query:  MSKFLLSHSYLLTLPHKHHSFSLNNGVL-PIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSHS LLTLPHKHHSFSL+N VL PIRS+LS  EKRGRKKRQSRQQQL QKD  ST LE +LRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHSYLLTLPHKHHSFSLNNGVL-PIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK
        PGPRSFHGLVVSH LNADAEGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLATRGLEILAAMEKLNYDIRQAWLILIEELV+NKYLEDAN+VFLKGAK
Subjt:  PGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAK

Query:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD
        GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPD ETYNWVIQAYTRAESYDRVQD
Subjt:  GGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTF GGTKALH+EG+FGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP RAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG
        ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWK+YHRKILKTLQNEGLAALG
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALG

Query:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYLRVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS
        HEGNTEFWKRRFLGEGLDSNNVK SEDD+SEPLDSLDDVD+VEDVAKEI+EEEAEEEEEVE  ENQDGERVIKKEVEAKKP QMIGVQLLKDVDQT+TTS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRR SRAS+EDDRDEDWFPEDLFEAFGELRKRKIFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS
        AIKAPLPSAF KILQTTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISV DETLDR+ISARQT+DAAPK D  IDITL+DHSLA+DE S
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDPAIDITLDDHSLANDEAS

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351303.4e-0723.15Show/hide
Query:  PLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLIL----IEELVRNK-YLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS
        P  +T+  L++ +   GL  R   +L  M+  +   +   + +    IE L++ K   E+A +VF +  +   + T + Y+L+I    KA     + ++ 
Subjt:  PLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLIL----IEELVRNK-YLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS

Query:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
         EM +         +  L++  A  G+ E A   FE ++  ED ++PD+  YN ++++Y+RA         AE+  +M   H   +P+  +Y ++V+ + 
Subjt:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT

Query:  KYCVIREAIRHFRGLK
        +  +  +A   F  +K
Subjt:  KYCVIREAIRHFRGLK

Q0WPZ6 Pentatricopeptide repeat-containing protein At2g171404.2e-1023.43Show/hide
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVRN
        VS +  DMV  G++P   +F+ L+ +   ++  + A +        G +P   TF  LVR +   GL  +GLE+L AME       +  +  ++    R 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVRN

Query:  KYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIET
           +D+ ++  K  + GL      ++  I   CK G   +A  I  +ME    +      +  +N +L      G+ E A + FE++   +D     +++
Subjt:  KYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIET

Query:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLS--LYLRALCREGRVV
        YN  +Q   R   +   + V + +       K + P++ +Y +L++   K  ++ +A       KT  G    +   G   D ++    L   C  G+V 
Subjt:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLS--LYLRALCREGRVV

Query:  ELLEALEAMARDNQQIPPRAMILSRKYRSL-----VSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG
             L+ M R+N    P A   +    SL     +S   E L++  E GY +D +   I   GL G
Subjt:  ELLEALEAMARDNQQIPPRAMILSRKYRSL-----VSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG

Q9LMH5 Putative pentatricopeptide repeat-containing protein At1g138009.8e-0722.58Show/hide
Query:  VIYDMVAAGLSPGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDI-RQAWLILIEELVRNKYL
        V+ DM   G+ P    +  ++  H  N +   A+    + L    R       ++++ +   G  +   ++     + N  + R  + +  + L +   +
Subjt:  VIYDMVAAGLSPGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDI-RQAWLILIEELVRNKYL

Query:  EDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQA
        E+A E+F +    G+      Y  LI   C  G  S+A ++  EM+  G+      +N L    AT G+ + AF T + ME     +KP   T+N VI+ 
Subjt:  EDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQA

Query:  YTRAESYDRVQDVAELL
           A   D+ +   E L
Subjt:  YTRAESYDRVQDVAELL

Q9LN22 Pentatricopeptide repeat-containing protein At1g20300, mitochondrial2.4e-0523.9Show/hide
Query:  ETFVALVRLFGSKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAG
        ETF  L+R +   GLA+  +     ME       + A+ I+I  L R +   +A + F    K        +Y  L+   C+AG+ S A ++  EM+ AG
Subjt:  ETFVALVRLFGSKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAG

Query:  RMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRL--QPNMRTYALLVECFTKYCVI
             + ++ ++     CG    A   F +M   +    P+  T+N +++ + +A    R + V ++   M    K+L  +P+  TY  L+E   +   +
Subjt:  RMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRL--QPNMRTYALLVECFTKYCVI

Query:  REAIR
          A++
Subjt:  REAIR

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic6.1e-0924.71Show/hide
Query:  GLRPLHETFVALVRLFGSKGL--ATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS
        G++P   TF +L+ +    GL  A R L       ++  D+  ++  L++ + +   ++ A E+  +     +      Y  +I+   KAG    AL + 
Subjt:  GLRPLHETFVALVRLFGSKGL--ATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS

Query:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
         EM   G       +N LLS+    G  E A      M      +K D+ TYN ++  Y +   YD V+ V      M  +H  + PN+ TY+ L++ ++
Subjt:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT

Query:  KYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLEALEAMARD
        K  + +EA+  FR  K+   G +A        D +  S  + ALC+ G V   +  ++ M ++
Subjt:  KYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLEALEAMARD

Arabidopsis top hitse value%identityAlignment
AT1G13800.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.9e-0822.58Show/hide
Query:  VIYDMVAAGLSPGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDI-RQAWLILIEELVRNKYL
        V+ DM   G+ P    +  ++  H  N +   A+    + L    R       ++++ +   G  +   ++     + N  + R  + +  + L +   +
Subjt:  VIYDMVAAGLSPGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDI-RQAWLILIEELVRNKYL

Query:  EDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQA
        E+A E+F +    G+      Y  LI   C  G  S+A ++  EM+  G+      +N L    AT G+ + AF T + ME     +KP   T+N VI+ 
Subjt:  EDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQA

Query:  YTRAESYDRVQDVAELL
           A   D+ +   E L
Subjt:  YTRAESYDRVQDVAELL

AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein3.0e-1123.43Show/hide
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVRN
        VS +  DMV  G++P   +F+ L+ +   ++  + A +        G +P   TF  LVR +   GL  +GLE+L AME       +  +  ++    R 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLN-YDIRQAWLILIEELVRN

Query:  KYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIET
           +D+ ++  K  + GL      ++  I   CK G   +A  I  +ME    +      +  +N +L      G+ E A + FE++   +D     +++
Subjt:  KYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIET

Query:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLS--LYLRALCREGRVV
        YN  +Q   R   +   + V + +       K + P++ +Y +L++   K  ++ +A       KT  G    +   G   D ++    L   C  G+V 
Subjt:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLS--LYLRALCREGRVV

Query:  ELLEALEAMARDNQQIPPRAMILSRKYRSL-----VSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG
             L+ M R+N    P A   +    SL     +S   E L++  E GY +D +   I   GL G
Subjt:  ELLEALEAMARDNQQIPPRAMILSRKYRSL-----VSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG

AT2G31400.1 genomes uncoupled 14.3e-1024.71Show/hide
Query:  GLRPLHETFVALVRLFGSKGL--ATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS
        G++P   TF +L+ +    GL  A R L       ++  D+  ++  L++ + +   ++ A E+  +     +      Y  +I+   KAG    AL + 
Subjt:  GLRPLHETFVALVRLFGSKGL--ATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS

Query:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
         EM   G       +N LLS+    G  E A      M      +K D+ TYN ++  Y +   YD V+ V      M  +H  + PN+ TY+ L++ ++
Subjt:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT

Query:  KYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLEALEAMARD
        K  + +EA+  FR  K+   G +A        D +  S  + ALC+ G V   +  ++ M ++
Subjt:  KYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLEALEAMARD

AT2G35130.2 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-0823.15Show/hide
Query:  PLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLIL----IEELVRNK-YLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS
        P  +T+  L++ +   GL  R   +L  M+  +   +   + +    IE L++ K   E+A +VF +  +   + T + Y+L+I    KA     + ++ 
Subjt:  PLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLIL----IEELVRNK-YLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS

Query:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
         EM +         +  L++  A  G+ E A   FE ++  ED ++PD+  YN ++++Y+RA         AE+  +M   H   +P+  +Y ++V+ + 
Subjt:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT

Query:  KYCVIREAIRHFRGLK
        +  +  +A   F  +K
Subjt:  KYCVIREAIRHFRGLK

AT3G04260.1 plastid transcriptionally active 30.0e+0074.15Show/hide
Query:  GVLPIRSLLSAPEKRGRKKRQSRQQQLHQKDD----GS----TALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHALNAD
        G+  IR  +SAPEK+ R++R+ ++    + DD    GS    +ALE +LR TFM+ELM+RARN D  GVS+VIYDM+AAGLSPGPRSFHGLVV+HALN D
Subjt:  GVLPIRSLLSAPEKRGRKKRQSRQQQLHQKDD----GS----TALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHALNAD

Query:  AEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDC
         +GAM SLR+EL AG RPL ET +ALVRL GSKG ATRGLEILAAMEKL YDIRQAWLIL+EEL+R  +LEDAN+VFLKGA+GG+RATD++YDL+IEEDC
Subjt:  AEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDC

Query:  KAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN
        KAGDHSNAL+ISYEMEAAGRMATTFHFNCLLSVQATCGIPE+A++TFENMEYGE +MKPD ETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKR+QPN
Subjt:  KAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN

Query:  MRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQ
        ++TYALLVECFTKYCV++EAIRHFR LK F+GGT  LHN GNF DPLSLYLRALCREGR+VEL++AL+AM +DNQ IPPRAMI+SRKYR+LVSSWIEPLQ
Subjt:  MRTYALLVECFTKYCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQ

Query:  EEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALGDASEADYLRVEERLKKII
        EEAE GYEIDY+ARYIEEGGLTGERKRWVPRRGKTPLDPDA GFIYSNP+ETSFKQRCLEDWK++HRK+L+TLQ+EGL  LGDASE+DY+RV ERL+ II
Subjt:  EEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALGDASEADYLRVEERLKKII

Query:  KGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLD
        KGP  N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN+SRGRPLWVPP+EEEEEEVDEE+D+LI RIKLHEG+TEFWKRRFLGEGL 
Subjt:  KGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLD

Query:  SNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQD-----------------GERVIK-KEVEAKKPLQMIGVQLLKDVDQTTTTS
          +V++ E   +E + + +    +ED++KE + EE ++EEE E  E+ D                 GE ++K K  +AKK LQMIGVQLLK+ D+   T 
Subjt:  SNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVEQPENQD-----------------GERVIK-KEVEAKKPLQMIGVQLLKDVDQTTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KK  +R+SR +LEDD DEDWFPE+ FEAF E+R+RK+FDV+DMYTIADVWGWTWE++ KN+ PR+WSQEWEVELAI +M KVIELGG PTIGDCA+ILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDP
        A++AP+PSAFLKILQTTH LGY FGSPLYDE+ITLCLDLGELDAAIAIVAD+ETTGI+V D+TLD+VISARQ S+ +P+ +P
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAAATTCCTGCTCTCTCACTCCTATCTTCTCACCCTTCCCCACAAACACCATTCCTTTTCTCTCAACAATGGCGTCCTCCCCATCCGCTCACTCCTCTCTGCTCC
TGAGAAGCGAGGTAGAAAGAAGCGGCAGTCGCGGCAGCAACAACTGCACCAAAAGGACGACGGTTCCACTGCGCTCGAGAACGCTCTTCGCTTCACTTTCATGGAGGAAC
TCATGGACCGCGCTAGAAACCACGATCCGCTTGGCGTTTCTGATGTCATTTACGATATGGTTGCCGCTGGATTGAGCCCTGGACCTCGCTCCTTCCATGGCTTGGTTGTG
TCTCATGCTCTCAATGCTGATGCTGAGGGAGCGATGCAATCTCTGAGAAGGGAATTAAGTGCTGGACTTCGTCCTCTTCATGAAACGTTTGTTGCATTAGTTCGGTTATT
TGGCTCCAAGGGTCTTGCTACTAGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAACTATGACATTCGCCAAGCTTGGCTCATTCTTATTGAGGAACTCGTAAGGA
ACAAATATTTAGAAGACGCCAATGAAGTGTTCTTAAAGGGTGCCAAAGGGGGTCTCAGAGCCACCGACAAGATTTACGATCTTCTAATTGAAGAAGACTGTAAAGCTGGG
GACCATTCAAATGCCTTAGAGATTTCATATGAAATGGAGGCTGCTGGGCGGATGGCAACAACCTTTCATTTCAATTGCCTTCTCAGTGTTCAGGCTACTTGTGGAATACC
TGAAATTGCTTTCTCAACATTTGAGAACATGGAATATGGAGAAGATTACATGAAGCCCGATATTGAGACATATAATTGGGTGATCCAAGCATATACAAGAGCTGAATCTT
ATGATAGGGTGCAAGATGTTGCTGAGTTACTTGGCATGATGGTTGAAGACCACAAGCGTCTACAGCCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACGAAG
TATTGTGTTATACGAGAAGCCATTAGGCATTTTCGTGGACTAAAAACCTTTCAAGGTGGAACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGACCCACTTTCCTTATA
TCTTCGAGCTTTATGTAGAGAAGGTAGGGTTGTAGAGCTCTTAGAAGCGTTAGAAGCTATGGCTAGAGATAACCAACAGATTCCTCCAAGAGCCATGATCTTGAGCAGAA
AGTATCGATCACTGGTGAGCTCATGGATTGAACCTTTACAGGAAGAAGCTGAACATGGATACGAGATAGACTACATTGCAAGATACATTGAAGAGGGGGGACTCACTGGA
GAACGCAAGAGATGGGTCCCTCGAAGAGGAAAAACTCCTCTAGATCCTGATGCTGATGGATTCATCTATTCAAATCCTATGGAAACATCCTTTAAGCAACGATGTCTAGA
AGATTGGAAGATATACCACCGAAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGCAGCTCTTGGGGATGCATCTGAAGCTGATTATCTTAGAGTCGAGGAGAGATTGA
AGAAAATTATAAAGGGCCCTGATCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAATTGGAAGCGCAAGGTTTACCAATTGAT
GGAACTAGAAATGTTCTTTACCAGCGTGTTCAAAAAGCAAGGAGAATAAATCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGTTGA
TGAAGAGTTGGATGAACTAATTTCACGAATAAAGCTACATGAAGGAAACACAGAGTTCTGGAAACGCCGCTTTCTTGGAGAAGGCTTGGACAGTAATAATGTTAAAGCTT
CTGAAGATGATAAGTCAGAACCTCTTGATTCTTTGGATGATGTTGACATTGTAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCCGAGGAGGAAGAGGAGGTTGAA
CAACCTGAGAATCAAGATGGTGAAAGAGTTATCAAGAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATAGGTGTCCAATTGTTAAAAGATGTTGACCAAACCACAAC
AACATCCAAAAAGTCAAGGAGGAGAAGTTCTCGAGCATCACTCGAGGACGATCGTGATGAAGACTGGTTTCCTGAAGATTTATTTGAGGCATTTGGAGAGTTGCGAAAGA
GGAAAATCTTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAA
GTGGAGTTGGCTATTAAAATTATGCACAAGGTGATAGAATTGGGTGGAACACCAACAATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCGCTTCCGTC
TGCTTTTTTGAAAATCTTGCAGACAACTCATGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGATGAGGTTATTACCCTGTGTCTTGATCTTGGGGAACTAGATGCAG
CCATTGCAATTGTAGCAGACCTGGAAACCACAGGAATCTCGGTTTCTGATGAAACGCTCGATCGGGTAATCTCCGCTAGACAGACGAGCGATGCTGCACCTAAGCCTGAT
CCAGCCATTGATATTACACTCGATGATCATAGTTTAGCCAATGATGAAGCATCATAA
mRNA sequenceShow/hide mRNA sequence
GTGGCATTATCATCTCTCCTCAGCAGTCCCGCTATTTTCTCTTCGCCAAGGCTCTGTTTCTTTCTTATCCAAATTTCCGCAGCCATTAATGTCCAAATTCCTGCTCTCTC
ACTCCTATCTTCTCACCCTTCCCCACAAACACCATTCCTTTTCTCTCAACAATGGCGTCCTCCCCATCCGCTCACTCCTCTCTGCTCCTGAGAAGCGAGGTAGAAAGAAG
CGGCAGTCGCGGCAGCAACAACTGCACCAAAAGGACGACGGTTCCACTGCGCTCGAGAACGCTCTTCGCTTCACTTTCATGGAGGAACTCATGGACCGCGCTAGAAACCA
CGATCCGCTTGGCGTTTCTGATGTCATTTACGATATGGTTGCCGCTGGATTGAGCCCTGGACCTCGCTCCTTCCATGGCTTGGTTGTGTCTCATGCTCTCAATGCTGATG
CTGAGGGAGCGATGCAATCTCTGAGAAGGGAATTAAGTGCTGGACTTCGTCCTCTTCATGAAACGTTTGTTGCATTAGTTCGGTTATTTGGCTCCAAGGGTCTTGCTACT
AGAGGCTTAGAAATCCTTGCAGCCATGGAGAAATTGAACTATGACATTCGCCAAGCTTGGCTCATTCTTATTGAGGAACTCGTAAGGAACAAATATTTAGAAGACGCCAA
TGAAGTGTTCTTAAAGGGTGCCAAAGGGGGTCTCAGAGCCACCGACAAGATTTACGATCTTCTAATTGAAGAAGACTGTAAAGCTGGGGACCATTCAAATGCCTTAGAGA
TTTCATATGAAATGGAGGCTGCTGGGCGGATGGCAACAACCTTTCATTTCAATTGCCTTCTCAGTGTTCAGGCTACTTGTGGAATACCTGAAATTGCTTTCTCAACATTT
GAGAACATGGAATATGGAGAAGATTACATGAAGCCCGATATTGAGACATATAATTGGGTGATCCAAGCATATACAAGAGCTGAATCTTATGATAGGGTGCAAGATGTTGC
TGAGTTACTTGGCATGATGGTTGAAGACCACAAGCGTCTACAGCCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACGAAGTATTGTGTTATACGAGAAGCCA
TTAGGCATTTTCGTGGACTAAAAACCTTTCAAGGTGGAACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGACCCACTTTCCTTATATCTTCGAGCTTTATGTAGAGAA
GGTAGGGTTGTAGAGCTCTTAGAAGCGTTAGAAGCTATGGCTAGAGATAACCAACAGATTCCTCCAAGAGCCATGATCTTGAGCAGAAAGTATCGATCACTGGTGAGCTC
ATGGATTGAACCTTTACAGGAAGAAGCTGAACATGGATACGAGATAGACTACATTGCAAGATACATTGAAGAGGGGGGACTCACTGGAGAACGCAAGAGATGGGTCCCTC
GAAGAGGAAAAACTCCTCTAGATCCTGATGCTGATGGATTCATCTATTCAAATCCTATGGAAACATCCTTTAAGCAACGATGTCTAGAAGATTGGAAGATATACCACCGA
AAGATTTTGAAAACCTTGCAGAATGAAGGACTTGCAGCTCTTGGGGATGCATCTGAAGCTGATTATCTTAGAGTCGAGGAGAGATTGAAGAAAATTATAAAGGGCCCTGA
TCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAATTGGAAGCGCAAGGTTTACCAATTGATGGAACTAGAAATGTTCTTTACC
AGCGTGTTCAAAAAGCAAGGAGAATAAATCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGTTGATGAAGAGTTGGATGAACTAATT
TCACGAATAAAGCTACATGAAGGAAACACAGAGTTCTGGAAACGCCGCTTTCTTGGAGAAGGCTTGGACAGTAATAATGTTAAAGCTTCTGAAGATGATAAGTCAGAACC
TCTTGATTCTTTGGATGATGTTGACATTGTAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCCGAGGAGGAAGAGGAGGTTGAACAACCTGAGAATCAAGATGGTG
AAAGAGTTATCAAGAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATAGGTGTCCAATTGTTAAAAGATGTTGACCAAACCACAACAACATCCAAAAAGTCAAGGAGG
AGAAGTTCTCGAGCATCACTCGAGGACGATCGTGATGAAGACTGGTTTCCTGAAGATTTATTTGAGGCATTTGGAGAGTTGCGAAAGAGGAAAATCTTTGATGTATCTGA
CATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGAGTTGGCTATTAAAATTA
TGCACAAGGTGATAGAATTGGGTGGAACACCAACAATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCGCTTCCGTCTGCTTTTTTGAAAATCTTGCAG
ACAACTCATGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGATGAGGTTATTACCCTGTGTCTTGATCTTGGGGAACTAGATGCAGCCATTGCAATTGTAGCAGACCT
GGAAACCACAGGAATCTCGGTTTCTGATGAAACGCTCGATCGGGTAATCTCCGCTAGACAGACGAGCGATGCTGCACCTAAGCCTGATCCAGCCATTGATATTACACTCG
ATGATCATAGTTTAGCCAATGATGAAGCATCATAATCAATCAGTTTGTGTTCTTACTTTCTTTTGTACAGTTCAGTTCTACTTCTAGGGTTGCCAGCAAAATGGTTCTTG
TAAAATTGATCTGTTGTTCTGGCTTTAGTGGTTATATATTTAGACCTTTGGAGTTGAAATATTTTGATTTCAGTTACAAGATTTGGAAAAATTGTCCATCAAGCCCTAAT
CACCACTTTTACTTTAATTTCTTTAACGGTTTAATGTTTCACCGGATGTAGTGAGTAGCAATGGGTTTTCCAAGTTTCTTTTTGTAGTTCTTATTTATTTATGGGAATAA
TTTTGGTGCAGCATGAGCTACACACATCTACCATGCGG
Protein sequenceShow/hide protein sequence
MSKFLLSHSYLLTLPHKHHSFSLNNGVLPIRSLLSAPEKRGRKKRQSRQQQLHQKDDGSTALENALRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVV
SHALNADAEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLATRGLEILAAMEKLNYDIRQAWLILIEELVRNKYLEDANEVFLKGAKGGLRATDKIYDLLIEEDCKAG
DHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDIETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTK
YCVIREAIRHFRGLKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG
ERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKIYHRKILKTLQNEGLAALGDASEADYLRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPID
GTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKASEDDKSEPLDSLDDVDIVEDVAKEIEEEEAEEEEEVE
QPENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDLFEAFGELRKRKIFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWE
VELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGISVSDETLDRVISARQTSDAAPKPD
PAIDITLDDHSLANDEAS