; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0019052 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0019052
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionSAP domain-containing protein
Genome locationchr02:10306968..10354473
RNA-Seq ExpressionIVF0019052
SyntenyIVF0019052
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR003034 - SAP domain
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR036361 - SAP domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443746.1 PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo]0.099.67Show/hide
Query:  MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK

Query:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGER+RWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR

Query:  DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDEAS
        AIKAPLPSAFLKIL+TTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTT+NDHSLANDEAS
Subjt:  AIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDEAS

XP_008443747.1 PREDICTED: uncharacterized protein LOC103487261 isoform X2 [Cucumis melo]0.099.52Show/hide
Query:  MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK

Query:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGER+RWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR

Query:  DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILRTTHGLGYVFGSPL
        AIKAPLPSAFLKIL+TTHGLGYVFG  L
Subjt:  AIKAPLPSAFLKILRTTHGLGYVFGSPL

XP_011660243.1 uncharacterized protein LOC101209618 isoform X1 [Cucumis sativus]0.097.44Show/hide
Query:  MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSHAHLLTLP NHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQ KD+DSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELS+GL PLHETFVALVRLFGSKGLANRGLEILAAME+LNYDIRQAWLILTEELVR+KYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK

Query:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGT ALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGER+RWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR

Query:  DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVE+L+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS
        HEGNTEFWKRRFLGEGL SNNVKPSEDDKSD LDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT TS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDEAS
        AIKAPLPSAFLKIL+TTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILV DETLDRVIS RQTNDAMPKPDSAIDTTLNDHSLANDEAS
Subjt:  AIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDEAS

XP_022151680.1 uncharacterized protein LOC111019595 [Momordica charantia]0.092.35Show/hide
Query:  MSKLLLS-HAHLLTLPYNHRSFSL-NH-GLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAA
        MSKLLLS H HLLTLP+   S  L NH G+LPIRSVLSAP+KRGRKKRQ RH      KDD ST+LE  LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKLLLS-HAHLLTLPYNHRSFSL-NH-GLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLK
        GLSPGPRSFHGLVVSH+LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA+RGLEIL+AME+LNYDIRQAWLIL +ELVRNKYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLK

Query:  GAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDR
        GAK GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGER+RWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTE+WKRRFLGEGLD+N+VKPSEDDKS+ LDSLDDVD +ED AKEIEEEE EEEE VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  ATSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMI
         TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGGTPTIGDCAMI
Subjt:  ATSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAI++PLPSAFLKIL+TTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI VPDETLDRVIS RQTNDAMPKPD+AIDTTLNDHSLANDE
Subjt:  LRAAIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida]0.095.66Show/hide
Query:  MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSHAHLLTLP  H SFSLNHG++PIRSVLSAPDKRGRKKRQ+R QQQL  KD DST+LE SLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSH LNGDTEGAMQSLRRELS+GL PLHETFVALVRLFGSKGLA RGLEILAAME+LNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK

Query:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGER+RWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKM+HRKILKTLQNEGL AL 
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR

Query:  DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
         ASEADYHRVVE+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDDKS+ LDSLDDVDT+EDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT TS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDEAS
        AIKAPLPS+FLKIL+TTHGLGY FGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTND+MPKPDSAIDTTLNDHSLA+DEAS
Subjt:  AIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDEAS

TrEMBL top hitse value%identityAlignment
A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X10.0e+0099.67Show/hide
Query:  MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK

Query:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGER+RWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR

Query:  DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDEAS
        AIKAPLPSAFLKIL+TTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTT+NDHSLANDEAS
Subjt:  AIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDEAS

A0A1S3B9H7 uncharacterized protein LOC103487261 isoform X20.0e+0099.52Show/hide
Query:  MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAK

Query:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGER+RWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR

Query:  DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS
        HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS
Subjt:  HEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILRA

Query:  AIKAPLPSAFLKILRTTHGLGYVFGSPL
        AIKAPLPSAFLKIL+TTHGLGYVFG  L
Subjt:  AIKAPLPSAFLKILRTTHGLGYVFGSPL

A0A6J1DBV3 uncharacterized protein LOC1110195950.0e+0092.35Show/hide
Query:  MSKLLL-SHAHLLTLPYNHRSFSL-NH-GLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAA
        MSKLLL SH HLLTLP+   S  L NH G+LPIRSVLSAP+KRGRKKRQ RH      KDD ST+LE  LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKLLL-SHAHLLTLPYNHRSFSL-NH-GLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLK
        GLSPGPRSFHGLVVSH+LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA+RGLEIL+AME+LNYDIRQAWLIL +ELVRNKYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLK

Query:  GAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDR
        GAK GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGER+RWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTE+WKRRFLGEGLD+N+VKPSEDDKS+ LDSLDDVD +ED AKEIEEEE  EEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  ATSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMI
         TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGGTPTIGDCAMI
Subjt:  ATSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAI++PLPSAFLKIL+TTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI VPDETLDRVIS RQTNDAMPKPD+AIDTTLNDHSLANDE
Subjt:  LRAAIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X10.0e+0091.22Show/hide
Query:  MSKLLLSHAHLLTLPYNHRSFSLNHGLL-PIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
        MSK LLSH++LLTLP+ H SFSL++G+  PIRSVLS  +KRGRKKRQSR QQQLQ KDDDST  E SLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
Subjt:  MSKLLLSHAHLLTLPYNHRSFSLNHGLL-PIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL

Query:  SPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGA
        SPGPRSFHGLVVSH LN D EGAMQSLR+ELS+GLRPLHETFVALVRLFG+KGLA RGLEILAAME+LNYDIRQAWLIL EELV+NKYLEDANKVFLKGA
Subjt:  SPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGA

Query:  KAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
        K GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
Subjt:  KAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ

Query:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRA
        DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTF GGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIP RA
Subjt:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRA

Query:  MILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVAL
        MILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGER+RWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL
Subjt:  MILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVAL

Query:  RDASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
         DASEADY RV E+LKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
Subjt:  RDASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK

Query:  LHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTAT
        LHEGNTEFWKRRFLGEGLDSNNVKPSEDD+S+ LDSLDDVD +EDVAKEI+EEEAEEEEEVE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQ + T
Subjt:  LHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTAT

Query:  SKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILR
        SKKSRRR SRAS+EDDRDEDWFPED+FEAF EL+KRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILR
Subjt:  SKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILR

Query:  AAIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDEAS
        AAIKAPLPSAF KIL+TTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDR+IS RQTNDA PK DS ID TLNDHSL NDE S
Subjt:  AAIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDEAS

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X10.0e+0091.11Show/hide
Query:  MSKLLLSHAHLLTLPYNHRSFSLNHGLL-PIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
        MSK LLSH+ LLTLP+ H SFSL++ +L PIRSVLS  +KRGRKKRQSR QQQLQ KD DST LE SLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
Subjt:  MSKLLLSHAHLLTLPYNHRSFSLNHGLL-PIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL

Query:  SPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGA
        SPGPRSFHGLVVSH LN D EGAMQSLR+ELS+GLRPLHETFVALVRLFG+KGLA RGLEILAAME+LNYDIRQAWLIL EELV+NKYLEDANKVFLKGA
Subjt:  SPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGA

Query:  KAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
        K GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
Subjt:  KAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ

Query:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRA
        DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTF GGTKALH+EG+FGDPLSLYLRALCREGRV++LLEALEAMARDNQQIP RA
Subjt:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRA

Query:  MILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVAL
        MILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGER+RWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL
Subjt:  MILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVAL

Query:  RDASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
         DASEADY RV E+LKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
Subjt:  RDASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK

Query:  LHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTAT
        LHEGNTEFWKRRFLGEGLDSNNVKPSEDD+S+ LDSLDDVD +EDVAKEI+EEEAEEEEEVE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQ + T
Subjt:  LHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTAT

Query:  SKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILR
        SKKSRRR SRAS+EDDRDEDWFPED+FEAF EL+KRK+FD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILR
Subjt:  SKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMILR

Query:  AAIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDEAS
        AAIKAPLPSAF KIL+TTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI VPDETLDR+IS RQTNDA PK DS ID TLNDHSLA+DE S
Subjt:  AAIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKPDSAIDTTLNDHSLANDEAS

SwissProt top hitse value%identityAlignment
Q0WPZ6 Pentatricopeptide repeat-containing protein At2g171401.6e-0923.43Show/hide
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLN-YDIRQAWLILTEELVRN
        VS +  DMV  G++P   +F+ L+ +   +   + A +        G +P   TF  LVR +   GL ++GLE+L AME       +  +  +     R 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLN-YDIRQAWLILTEELVRN

Query:  KYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTET
           +D+ K+  K  + GL      ++  I   CK G   +A  I  +ME    +      +  +N +L      G+ E A + FE++   +D      ++
Subjt:  KYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTET

Query:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLS--LYLRALCREGRVL
        YN  +Q   R   +   + V + +       K + P++ +Y +L++   K  ++ +A       KT  G    +   G   D ++    L   C  G+V 
Subjt:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLS--LYLRALCREGRVL

Query:  DLLEALEAMARDNQQIPPRAMILSRKYRSL-----VSSWIEPLQEEAEHGFEIDYIARYIEEGGLTG
             L+ M R+N    P A   +    SL     +S   E L++  E G+ +D +   I   GL G
Subjt:  DLLEALEAMARDNQQIPPRAMILSRKYRSL-----VSSWIEPLQEEAEHGFEIDYIARYIEEGGLTG

Q8GYP6 Pentatricopeptide repeat-containing protein At1g189005.2e-0820.96Show/hide
Query:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMER--
        +T M   + RA+      ++ ++ +MV  G  P   +++ L+ S+        AM    +   +G +P   T+  L+ +    G  +  +++   M+   
Subjt:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMER--

Query:  LNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE
        L+ D    + ++   L +  +L  A+K+F +    G       Y++M++   KA ++ NAL++  +M+ AG       ++ ++ V   CG  E A + F 
Subjt:  LNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE

Query:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA
         M+  +    PD   Y  ++  + +A + ++     + +      H  L+PN+ T   L+  F +   I EA
Subjt:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA

Q9LN22 Pentatricopeptide repeat-containing protein At1g20300, mitochondrial4.8e-0626.67Show/hide
Query:  LEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQ
        + +A KVF +   AG+      Y ++I+  C+ G  S A ++  +M  +G       FN L+ V    G  E     +  M+  +   +PDT TYN++I+
Subjt:  LEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQ

Query:  AYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTY
        A+ R E+   +++  ++L  M++  K+ + N  T+
Subjt:  AYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTY

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028606.1e-0923.71Show/hide
Query:  KRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVA
        KR S HQ+  Q+ ++   +  +  + T+   L    ++H P     V+ +MV  G SP   +++ L+ ++  +G  + AM+   +    G +P   T+  
Subjt:  KRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVA

Query:  LVRLFGSKGLANRGLEILAAME----RLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRM
        L+  F   G     + I   M     + N     A++ +     R K+ E   K+F +    GL      ++ ++    + G  S    +  EM+ AG +
Subjt:  LVRLFGSKGLANRGLEILAAME----RLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRM

Query:  ATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV-AELLGMMVEDHKRLQPNMRTYALLVECF
             FN L+S  + CG  E A + +  M   +  + PD  TYN V+ A  R   +++ + V AE+      +  R +PN  TY  L+  +
Subjt:  ATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV-AELLGMMVEDHKRLQPNMRTYALLVECF

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic6.8e-0824.62Show/hide
Query:  SGLRPLHETFVALVRLFGSKGL--ANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEI
        +G++P   TF +L+ +    GL  A R L       R+  D+  ++  L + + +   ++ A ++  +     +      Y  +I+   KAG    AL +
Subjt:  SGLRPLHETFVALVRLFGSKGL--ANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEI

Query:  SYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECF
          EM   G       +N LLS+    G  E A      M      +K D  TYN ++  Y +   YD V+ V      M  +H  + PN+ TY+ L++ +
Subjt:  SYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECF

Query:  TKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPL--SLYLRALCREGRVLDLLEALEAMARD
        +K  + +EA+  FR  K+   G +A        D +  S  + ALC+ G V   +  ++ M ++
Subjt:  TKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPL--SLYLRALCREGRVLDLLEALEAMARD

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein3.7e-0920.96Show/hide
Query:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMER--
        +T M   + RA+      ++ ++ +MV  G  P   +++ L+ S+        AM    +   +G +P   T+  L+ +    G  +  +++   M+   
Subjt:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMER--

Query:  LNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE
        L+ D    + ++   L +  +L  A+K+F +    G       Y++M++   KA ++ NAL++  +M+ AG       ++ ++ V   CG  E A + F 
Subjt:  LNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE

Query:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA
         M+  +    PD   Y  ++  + +A + ++     + +      H  L+PN+ T   L+  F +   I EA
Subjt:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein3.7e-0920.96Show/hide
Query:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMER--
        +T M   + RA+      ++ ++ +MV  G  P   +++ L+ S+        AM    +   +G +P   T+  L+ +    G  +  +++   M+   
Subjt:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMER--

Query:  LNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE
        L+ D    + ++   L +  +L  A+K+F +    G       Y++M++   KA ++ NAL++  +M+ AG       ++ ++ V   CG  E A + F 
Subjt:  LNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE

Query:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA
         M+  +    PD   Y  ++  + +A + ++     + +      H  L+PN+ T   L+  F +   I EA
Subjt:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA

AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-1023.43Show/hide
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLN-YDIRQAWLILTEELVRN
        VS +  DMV  G++P   +F+ L+ +   +   + A +        G +P   TF  LVR +   GL ++GLE+L AME       +  +  +     R 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLN-YDIRQAWLILTEELVRN

Query:  KYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTET
           +D+ K+  K  + GL      ++  I   CK G   +A  I  +ME    +      +  +N +L      G+ E A + FE++   +D      ++
Subjt:  KYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTET

Query:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLS--LYLRALCREGRVL
        YN  +Q   R   +   + V + +       K + P++ +Y +L++   K  ++ +A       KT  G    +   G   D ++    L   C  G+V 
Subjt:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLS--LYLRALCREGRVL

Query:  DLLEALEAMARDNQQIPPRAMILSRKYRSL-----VSSWIEPLQEEAEHGFEIDYIARYIEEGGLTG
             L+ M R+N    P A   +    SL     +S   E L++  E G+ +D +   I   GL G
Subjt:  DLLEALEAMARDNQQIPPRAMILSRKYRSL-----VSSWIEPLQEEAEHGFEIDYIARYIEEGGLTG

AT3G04260.1 plastid transcriptionally active 30.0e+0073.69Show/hide
Query:  GLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTS---------LENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLN
        G+  IR  +SAP+K+ R++R+ +     +  +DDS S         LE SLR TFM+ELM+RARN D  GVS+VIYDM+AAGLSPGPRSFHGLVV+H LN
Subjt:  GLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTS---------LENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLN

Query:  GDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEE
        GD +GAM SLR+EL +G RPL ET +ALVRL GSKG A RGLEILAAME+L YDIRQAWLIL EEL+R  +LEDANKVFLKGA+ G+RATD++YDLMIEE
Subjt:  GDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEE

Query:  DCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQ
        DCKAGDHSNAL+ISYEMEAAGRMATTFHFNCLLSVQATCGIPE+A++TFENMEYGE +MKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKR+Q
Subjt:  DCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQ

Query:  PNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEP
        PN++TYALLVECFTKYCV++EAIRHFRALK F+GGT  LHN GNF DPLSLYLRALCREGR+++L++AL+AM +DNQ IPPRAMI+SRKYR+LVSSWIEP
Subjt:  PNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEP

Query:  LQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEADYHRVVEKLKK
        LQEEAE G+EIDY+ARYIEEGGLTGER+RWVPR+GKTPLDPDA GFIYSNP+ETSFKQRCLEDWK++HRK+L+TLQ+EGL  L DASE+DY RVVE+L+ 
Subjt:  LQEEAEHGFEIDYIARYIEEGGLTGERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEADYHRVVEKLKK

Query:  IIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEG
        IIKGP  N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN+SRGRPLWVPP+EEEEEEVDEE+D+LI RIKLHEG+TEFWKRRFLGEG
Subjt:  IIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEG

Query:  LDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEE----------------VEQTENQ-DGERVIK-KEVEAKKPLQMIGVQLLKDVDQPTA
        L   +V+  E   ++S+ + +    IED++KE + EE ++EEE                V +TEN+ +GE ++K K  +AKK LQMIGVQLLK+ D+   
Subjt:  LDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEE----------------VEQTENQ-DGERVIK-KEVEAKKPLQMIGVQLLKDVDQPTA

Query:  TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMIL
        T KK  +R+SR +LEDD DEDWFPE+ FEAFKE+++RKVFDV+DMYTIADVWGWTWE++ KN+ PR+WSQEWEVELAI +M KVIELGG PTIGDCA+IL
Subjt:  TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGTPTIGDCAMIL

Query:  RAAIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDA
        RAA++AP+PSAFLKIL+TTH LGY FGSPLYDE+ITLCLDLGELDAAIAIVAD+ETTGI VPD+TLD+VIS RQ+N++
Subjt:  RAAIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDA

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein4.3e-1023.71Show/hide
Query:  KRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVA
        KR S HQ+  Q+ ++   +  +  + T+   L    ++H P     V+ +MV  G SP   +++ L+ ++  +G  + AM+   +    G +P   T+  
Subjt:  KRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSSGLRPLHETFVA

Query:  LVRLFGSKGLANRGLEILAAME----RLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRM
        L+  F   G     + I   M     + N     A++ +     R K+ E   K+F +    GL      ++ ++    + G  S    +  EM+ AG +
Subjt:  LVRLFGSKGLANRGLEILAAME----RLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRM

Query:  ATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV-AELLGMMVEDHKRLQPNMRTYALLVECF
             FN L+S  + CG  E A + +  M   +  + PD  TYN V+ A  R   +++ + V AE+      +  R +PN  TY  L+  +
Subjt:  ATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV-AELLGMMVEDHKRLQPNMRTYALLVECF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAAACTCCTGCTCTCTCACGCTCACCTTCTCACACTTCCCTACAATCACCGTTCCTTTTCTCTCAACCATGGCCTCCTCCCCATTCGCTCAGTCCTCTCTGCTCC
AGACAAGCGAGGTAGAAAGAAGCGGCAGTCTCGCCACCAACAACAATTGCAACTAAAGGACGATGATTCCACTTCACTTGAGAACTCCCTCCGTTTCACTTTCATGGAGG
AACTCATGGACCGCGCTAGAAACCACGATCCACTTGGTGTTTCTGATGTCATTTACGATATGGTTGCTGCTGGATTGAGCCCTGGACCTCGCTCCTTCCATGGATTAGTT
GTTTCTCATACTCTAAATGGCGATACTGAGGGAGCGATGCAGTCTCTGAGAAGGGAATTAAGTTCTGGACTTCGTCCTCTCCATGAAACTTTTGTTGCATTAGTTCGGTT
ATTTGGTTCCAAGGGTCTTGCTAATAGAGGCTTAGAAATCCTTGCTGCCATGGAGAGATTGAATTATGACATCCGTCAAGCATGGCTCATTCTTACGGAGGAACTCGTAA
GGAACAAATATTTGGAAGATGCCAATAAAGTGTTCTTAAAGGGTGCCAAAGCGGGTCTCAGAGCCACAGACAAAATTTATGATCTTATGATTGAGGAAGATTGTAAAGCT
GGAGATCATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCTGGGCGGATGGCAACGACCTTCCATTTCAATTGCCTTCTTAGTGTCCAGGCTACTTGTGGAAT
ACCTGAAATTGCTTTCTCAACATTTGAGAACATGGAATATGGAGAAGATTACATGAAGCCCGATACTGAGACATATAATTGGGTGATCCAAGCATATACAAGGGCTGAAT
CTTATGATAGGGTGCAAGATGTTGCAGAGTTACTTGGCATGATGGTTGAAGACCACAAGCGTCTACAACCCAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACT
AAGTACTGTGTTATACGAGAAGCTATTAGGCATTTTCGTGCACTAAAAACCTTTCAAGGTGGAACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCCTT
ATATCTTCGAGCTTTATGTAGAGAAGGTAGGGTGTTAGATCTCTTAGAAGCATTAGAGGCTATGGCTAGAGATAACCAACAGATTCCTCCAAGAGCCATGATTTTGAGCA
GAAAGTATCGATCACTGGTGAGCTCATGGATTGAACCTTTACAAGAAGAAGCCGAACATGGATTTGAGATAGACTACATTGCAAGATACATTGAAGAGGGCGGACTCACT
GGAGAACGCAGGAGGTGGGTTCCTCGAAAAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATTTATTCAAATCCTATGGAGACATCCTTTAAGCAACGATGTCT
AGAAGATTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCAAAATGAAGGACTTGTGGCTCTGAGGGATGCATCTGAAGCTGATTATCATAGAGTCGTGGAGAAAT
TGAAGAAAATAATAAAGGGTCCTGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAGTTAGAAGCACAAGGTTTACCAATT
GACGGAACTAGAAATGTTCTTTACCAGCGTGTTCAAAAAGCAAGGAGAATAAACCGGTCTCGTGGTCGGCCCCTTTGGGTTCCCCCAGTGGAAGAGGAGGAAGAAGAGGT
TGATGAAGAGCTGGATGAACTAATTTCACGAATAAAGCTACATGAAGGAAATACGGAGTTTTGGAAACGCCGTTTTCTCGGAGAAGGCTTGGACAGTAATAATGTTAAAC
CTTCTGAAGATGATAAATCAGATTCTCTTGATTCTTTGGATGATGTTGACACTATAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCTGAGGAGGAAGAGGAGGTA
GAACAAACTGAGAATCAAGATGGTGAAAGGGTTATTAAGAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATTGGTGTCCAATTGTTAAAAGATGTTGACCAACCTAC
AGCAACCTCCAAAAAGTCAAGGAGGAGAAGTTCCCGAGCATCACTCGAGGACGATCGTGATGAAGATTGGTTTCCTGAAGATATATTTGAAGCATTTAAAGAGTTGCAAA
AGAGGAAAGTCTTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGG
GAAGTGGAGCTGGCTATTAAAATTATGCACAAGGTGATAGAATTGGGTGGAACGCCAACAATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCATTACC
TTCTGCCTTCTTGAAAATCTTGCGGACAACTCACGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGATGAGGTTATCACCCTTTGCCTTGATCTTGGGGAACTAGATG
CAGCCATTGCAATTGTAGCAGATCTTGAAACAACAGGAATCTTGGTTCCTGATGAAACACTGGATCGGGTAATTTCCACTAGACAGACAAACGATGCTATGCCCAAGCCT
GACTCAGCCATTGATACCACACTGAATGATCACAGTTTAGCCAATGATGAAGCATCATAA
mRNA sequenceShow/hide mRNA sequence
ATCATCGACAAATCCCGCCTTTTTAGCACTCAACACTTCACTTCCCATTTTTCCCTCCCTCTTCATTCTCGCTGTTTCTTATCCAAATTTCTGGGCAGCCATTAATGTCC
AAACTCCTGCTCTCTCACGCTCACCTTCTCACACTTCCCTACAATCACCGTTCCTTTTCTCTCAACCATGGCCTCCTCCCCATTCGCTCAGTCCTCTCTGCTCCAGACAA
GCGAGGTAGAAAGAAGCGGCAGTCTCGCCACCAACAACAATTGCAACTAAAGGACGATGATTCCACTTCACTTGAGAACTCCCTCCGTTTCACTTTCATGGAGGAACTCA
TGGACCGCGCTAGAAACCACGATCCACTTGGTGTTTCTGATGTCATTTACGATATGGTTGCTGCTGGATTGAGCCCTGGACCTCGCTCCTTCCATGGATTAGTTGTTTCT
CATACTCTAAATGGCGATACTGAGGGAGCGATGCAGTCTCTGAGAAGGGAATTAAGTTCTGGACTTCGTCCTCTCCATGAAACTTTTGTTGCATTAGTTCGGTTATTTGG
TTCCAAGGGTCTTGCTAATAGAGGCTTAGAAATCCTTGCTGCCATGGAGAGATTGAATTATGACATCCGTCAAGCATGGCTCATTCTTACGGAGGAACTCGTAAGGAACA
AATATTTGGAAGATGCCAATAAAGTGTTCTTAAAGGGTGCCAAAGCGGGTCTCAGAGCCACAGACAAAATTTATGATCTTATGATTGAGGAAGATTGTAAAGCTGGAGAT
CATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCTGGGCGGATGGCAACGACCTTCCATTTCAATTGCCTTCTTAGTGTCCAGGCTACTTGTGGAATACCTGA
AATTGCTTTCTCAACATTTGAGAACATGGAATATGGAGAAGATTACATGAAGCCCGATACTGAGACATATAATTGGGTGATCCAAGCATATACAAGGGCTGAATCTTATG
ATAGGGTGCAAGATGTTGCAGAGTTACTTGGCATGATGGTTGAAGACCACAAGCGTCTACAACCCAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACTAAGTAC
TGTGTTATACGAGAAGCTATTAGGCATTTTCGTGCACTAAAAACCTTTCAAGGTGGAACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCCTTATATCT
TCGAGCTTTATGTAGAGAAGGTAGGGTGTTAGATCTCTTAGAAGCATTAGAGGCTATGGCTAGAGATAACCAACAGATTCCTCCAAGAGCCATGATTTTGAGCAGAAAGT
ATCGATCACTGGTGAGCTCATGGATTGAACCTTTACAAGAAGAAGCCGAACATGGATTTGAGATAGACTACATTGCAAGATACATTGAAGAGGGCGGACTCACTGGAGAA
CGCAGGAGGTGGGTTCCTCGAAAAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATTTATTCAAATCCTATGGAGACATCCTTTAAGCAACGATGTCTAGAAGA
TTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCAAAATGAAGGACTTGTGGCTCTGAGGGATGCATCTGAAGCTGATTATCATAGAGTCGTGGAGAAATTGAAGA
AAATAATAAAGGGTCCTGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTAAAAGAAGAGTTAGAAGCACAAGGTTTACCAATTGACGGA
ACTAGAAATGTTCTTTACCAGCGTGTTCAAAAAGCAAGGAGAATAAACCGGTCTCGTGGTCGGCCCCTTTGGGTTCCCCCAGTGGAAGAGGAGGAAGAAGAGGTTGATGA
AGAGCTGGATGAACTAATTTCACGAATAAAGCTACATGAAGGAAATACGGAGTTTTGGAAACGCCGTTTTCTCGGAGAAGGCTTGGACAGTAATAATGTTAAACCTTCTG
AAGATGATAAATCAGATTCTCTTGATTCTTTGGATGATGTTGACACTATAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCTGAGGAGGAAGAGGAGGTAGAACAA
ACTGAGAATCAAGATGGTGAAAGGGTTATTAAGAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATTGGTGTCCAATTGTTAAAAGATGTTGACCAACCTACAGCAAC
CTCCAAAAAGTCAAGGAGGAGAAGTTCCCGAGCATCACTCGAGGACGATCGTGATGAAGATTGGTTTCCTGAAGATATATTTGAAGCATTTAAAGAGTTGCAAAAGAGGA
AAGTCTTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTG
GAGCTGGCTATTAAAATTATGCACAAGGTGATAGAATTGGGTGGAACGCCAACAATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCATTACCTTCTGC
CTTCTTGAAAATCTTGCGGACAACTCACGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGATGAGGTTATCACCCTTTGCCTTGATCTTGGGGAACTAGATGCAGCCA
TTGCAATTGTAGCAGATCTTGAAACAACAGGAATCTTGGTTCCTGATGAAACACTGGATCGGGTAATTTCCACTAGACAGACAAACGATGCTATGCCCAAGCCTGACTCA
GCCATTGATACCACACTGAATGATCACAGTTTAGCCAATGATGAAGCATCATAATGGATCGGTGTTCTTATTTTTGTTTTGTACAGTTCAGTCTTAGATTTGCTAGCAAA
ATGGTTGTAAATTTTATCTGTAGTTGTTTTAGTGGTTTAGTATTTTGAATTTTGTGTTGAAGCATTTAATTTTAGAAACAAGATTTTGAGCAAAATTGTTTATCAAGTCA
TAATAATTACGGTTGTCGCACAAATCTTGTCAAATTGCGCTAAGACTTAACGATCGATTATCTTGTGAGGTGGT
Protein sequenceShow/hide protein sequence
MSKLLLSHAHLLTLPYNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQLKDDDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLV
VSHTLNGDTEGAMQSLRRELSSGLRPLHETFVALVRLFGSKGLANRGLEILAAMERLNYDIRQAWLILTEELVRNKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKA
GDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
KYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVLDLLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLT
GERRRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEADYHRVVEKLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPI
DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLDSNNVKPSEDDKSDSLDSLDDVDTIEDVAKEIEEEEAEEEEEV
EQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTATSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEW
EVELAIKIMHKVIELGGTPTIGDCAMILRAAIKAPLPSAFLKILRTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVPDETLDRVISTRQTNDAMPKP
DSAIDTTLNDHSLANDEAS