; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G21659 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G21659
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionSAP domain-containing protein
Genome locationctg949:1028895..1048412
RNA-Seq ExpressionCucsat.G21659
SyntenyCucsat.G21659
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR003034 - SAP domain
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR036361 - SAP domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443746.1 PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo]0.097.55Show/hide
Query:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSHAHLLTLP NHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQ KD+DSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELS+GL PLHETFVALVRLFGSKGLANRGLEILAAME+LNYDIRQAWLILTEELVR+KYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAK

Query:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGT ALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR

Query:  DASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVE+L+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTS
        HEGNTEFWKRRFLGEGL SNNVKPSEDDKSD LDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT TS
Subjt:  HEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS
        AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILV DETLDRVIS RQTNDAMPKPDSAIDTT+NDHSLANDEAS
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS

XP_011660243.1 uncharacterized protein LOC101209618 isoform X1 [Cucumis sativus]0.0100Show/hide
Query:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAK

Query:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR

Query:  DASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTS
        HEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTS
Subjt:  HEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS
        AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS

XP_022151680.1 uncharacterized protein LOC111019595 [Momordica charantia]0.092.35Show/hide
Query:  MSKFLLS-HAHLLTLPSNHRSFSL-NH-GLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAA
        MSK LLS H HLLTLP    S  L NH G+LPIRSVLSAP+KRGRKKRQ RH     PKD+ ST+LE  LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKFLLS-HAHLLTLPSNHRSFSL-NH-GLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLK
        GLSPGPRSFHGLVVSH+LNGDTEGAMQSLRRELSAGL PLHETFVALVRLFGSKGLA+RGLEIL+AMEKLNYDIRQAWLIL +ELVR+KYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLK

Query:  GAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDR
        GAK GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGT ALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV ERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTE+WKRRFLGEGL +N+VKPSEDDKS+PLDSLDDVD +ED AKEIEEEE EEEE VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
        TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAI++PLPSAFLKILQTTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI V DETLDRVISARQTNDAMPKPD+AIDTTLNDHSLANDE
Subjt:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

XP_023531019.1 uncharacterized protein LOC111793400 isoform X1 [Cucurbita pepo subsp. pepo]0.092.01Show/hide
Query:  MSKFLLSHAHLLTLPSNHRSFSLNHGLL--PIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAG
        MSKFLLSH+ LLTLP  H SFSL++GLL  PIRSVLS  +KRGRKKRQSR QQQLQ KD+DST LE SLRFTFMEELMDRARNHDPLGVSDVIYDMVAAG
Subjt:  MSKFLLSHAHLLTLPSNHRSFSLNHGLL--PIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAG

Query:  LSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKG
        LSPGPRSFHGLVVSH LN D EGAMQSLR+ELS GL P+HETFVALVRLFG+KGLA RGLEILAAMEKLNYDIRQAWLIL EELV++KYLEDANKVFLKG
Subjt:  LSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKG

Query:  AKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRV
        AK GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRV
Subjt:  AKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRV

Query:  QDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPR
        QDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LRTF GGT ALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP R
Subjt:  QDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPR

Query:  AMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVA
        AMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL A
Subjt:  AMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVA

Query:  LRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI
        L DASEADY RV ERL+KIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI
Subjt:  LRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRI

Query:  KLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTT
        KLHEGNTEFWKRRFLGEGL SNNVKPSEDD+S+PLDSLDDVD +EDVAKEI+EEEAEEEEEVE TENQDGERVIKKEVEAKKP QMIGVQLLKD+DQ +T
Subjt:  KLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTT

Query:  TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMIL
        TSKKSRRR SRAS+EDDRDEDWFPED+FEAF EL+KRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMIL
Subjt:  TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMIL

Query:  RAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEA
        RAAIKAPLPSAF KILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI V DETLDR+ISARQTNDA PK DS ID TLNDHSLANDE 
Subjt:  RAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEA

Query:  S
        S
Subjt:  S

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida]0.095.66Show/hide
Query:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSKFLLSHAHLLTLP+ H SFSLNHG++PIRSVLSAPDKRGRKKRQ+R QQQL  KD+DST+LE SLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAK
        PGPRSFHGLVVSH LNGDTEGAMQSLRRELSAGL PLHETFVALVRLFGSKGLA RGLEILAAMEKLNYDIRQAWLILTEELVR+KYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAK

Query:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
         GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGT ALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKM+HRKILKTLQNEGL AL 
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR

Query:  DASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
         ASEADYHRVVERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTS
        HEGNTEFWKRRFLGEGL SNNVKPSEDDKS+PLDSLDDVDT+EDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTT S
Subjt:  HEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS
        AIKAPLPS+FLKILQTTHGLGY FGSPLYDEVITLCLDLGELDAAIAIVADLETTGILV DETLDRVIS RQTND+MPKPDSAIDTTLNDHSLA+DEAS
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS

TrEMBL top hitse value%identityAlignment
A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X10.097.55Show/hide
Query:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSHAHLLTLP NHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQ KD+DSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELS+GL PLHETFVALVRLFGSKGLANRGLEILAAME+LNYDIRQAWLILTEELVR+KYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAK

Query:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGT ALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR

Query:  DASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVE+L+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTS
        HEGNTEFWKRRFLGEGL SNNVKPSEDDKSD LDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT TS
Subjt:  HEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS
        AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILV DETLDRVIS RQTNDAMPKPDSAIDTT+NDHSLANDEAS
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS

A0A1S3B9H7 uncharacterized protein LOC103487261 isoform X20.097.46Show/hide
Query:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
        MSK LLSHAHLLTLP NHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQ KD+DSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS
Subjt:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLS

Query:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAK
        PGPRSFHGLVVSHTLNGDTEGAMQSLRRELS+GL PLHETFVALVRLFGSKGLANRGLEILAAME+LNYDIRQAWLILTEELVR+KYLEDANKVFLKGAK
Subjt:  PGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAK

Query:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
        AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
Subjt:  AGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM
        VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGT ALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIPPRAM
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAM

Query:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
        ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR
Subjt:  ILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALR

Query:  DASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
        DASEADYHRVVE+L+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL
Subjt:  DASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKL

Query:  HEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTS
        HEGNTEFWKRRFLGEGL SNNVKPSEDDKSD LDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT TS
Subjt:  HEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTS

Query:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA
        KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMILRA
Subjt:  KKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRA

Query:  AIKAPLPSAFLKILQTTHGLGYVFGSPL
        AIKAPLPSAFLKILQTTHGLGYVFG  L
Subjt:  AIKAPLPSAFLKILQTTHGLGYVFGSPL

A0A6J1DBV3 uncharacterized protein LOC1110195950.092.35Show/hide
Query:  MSKFLLS-HAHLLTLPSNHRSFSL-NH-GLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAA
        MSK LLS H HLLTLP    S  L NH G+LPIRSVLSAP+KRGRKKRQ RH     PKD+ ST+LE  LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKFLLS-HAHLLTLPSNHRSFSL-NH-GLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLK
        GLSPGPRSFHGLVVSH+LNGDTEGAMQSLRRELSAGL PLHETFVALVRLFGSKGLA+RGLEIL+AMEKLNYDIRQAWLIL +ELVR+KYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLK

Query:  GAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDR
        GAK GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGT ALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV ERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTE+WKRRFLGEGL +N+VKPSEDDKS+PLDSLDDVD +ED AKEIEEEE EEEE VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
        TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAI++PLPSAFLKILQTTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI V DETLDRVISARQTNDAMPKPD+AIDTTLNDHSLANDE
Subjt:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X10.091.56Show/hide
Query:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLP-IRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
        MSKFLLSH++LLTLP  H SFSL++G+ P IRSVLS  +KRGRKKRQSR QQQLQ KD+DST  E SLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
Subjt:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLP-IRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL

Query:  SPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGA
        SPGPRSFHGLVVSH LN D EGAMQSLR+ELS GL PLHETFVALVRLFG+KGLA RGLEILAAMEKLNYDIRQAWLIL EELV++KYLEDANKVFLKGA
Subjt:  SPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGA

Query:  KAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
        K GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
Subjt:  KAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ

Query:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRA
        DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR L+TF GGT ALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP RA
Subjt:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRA

Query:  MILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVAL
        MILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL
Subjt:  MILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVAL

Query:  RDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
         DASEADY RV ERL+KIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
Subjt:  RDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK

Query:  LHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTT
        LHEGNTEFWKRRFLGEGL SNNVKPSEDD+S+PLDSLDDVD +EDVAKEI+EEEAEEEEEVE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQ +TT
Subjt:  LHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTT

Query:  SKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILR
        SKKSRRR SRAS+EDDRDEDWFPED+FEAF EL+KRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILR
Subjt:  SKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILR

Query:  AAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS
        AAIKAPLPSAF KILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI V DETLDR+ISARQTNDA PK DS ID TLNDHSL NDE S
Subjt:  AAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X10.091.56Show/hide
Query:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLP-IRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
        MSKFLLSH+ LLTLP  H SFSL++ +LP IRSVLS  +KRGRKKRQSR QQQLQ KD DST LE SLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL
Subjt:  MSKFLLSHAHLLTLPSNHRSFSLNHGLLP-IRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGL

Query:  SPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGA
        SPGPRSFHGLVVSH LN D EGAMQSLR+ELS GL PLHETFVALVRLFG+KGLA RGLEILAAMEKLNYDIRQAWLIL EELV++KYLEDANKVFLKGA
Subjt:  SPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGA

Query:  KAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
        K GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ
Subjt:  KAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQ

Query:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRA
        DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR L+TF GGT ALH+EG+FGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP RA
Subjt:  DVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRA

Query:  MILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVAL
        MILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL AL
Subjt:  MILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVAL

Query:  RDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
         DASEADY RV ERL+KIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK
Subjt:  RDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIK

Query:  LHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTT
        LHEGNTEFWKRRFLGEGL SNNVKPSEDD+S+PLDSLDDVD +EDVAKEI+EEEAEEEEEVE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQ +TT
Subjt:  LHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTT

Query:  SKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILR
        SKKSRRR SRAS+EDDRDEDWFPED+FEAF EL+KRK+FD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILR
Subjt:  SKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILR

Query:  AAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS
        AAIKAPLPSAF KILQTTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI V DETLDR+ISARQTNDA PK DS ID TLNDHSLA+DE S
Subjt:  AAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS

SwissProt top hitse value%identityAlignment
Q0WPZ6 Pentatricopeptide repeat-containing protein At2g171401.5e-0722.54Show/hide
Query:  LSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRS
        +   H L L S+ +   L+  LL + S+ +  +   +   Q +  +   P++  S  L N L    +E  +   R      VS +  DMV  G++P   +
Subjt:  LSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRS

Query:  FHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLN-YDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLR
        F+ L+ +   +   + A +        G  P   TF  LVR +   GL ++GLE+L AME       +  +  +     R    +D+ K+  K  + GL 
Subjt:  FHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLN-YDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLR

Query:  ATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD
             ++  I   CK G   +A  I  +ME    +      +  +N +L      G+ E A + FE++   +D      ++YN  +Q   R   +   + 
Subjt:  ATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQD

Query:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLS--LYLRALCREGRVVELLEALEAMARDNQQIPPR
        V + +       K + P++ +Y +L++   K  ++ +A       +T  G    +   G   D ++    L   C  G+V      L+ M R+N    P 
Subjt:  VAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLS--LYLRALCREGRVVELLEALEAMARDNQQIPPR

Query:  AMILSRKYRSL-----VSSWIEPLQEEAEHGFEIDYIARYIEEGGLTG
        A   +    SL     +S   E L++  E G+ +D +   I   GL G
Subjt:  AMILSRKYRSL-----VSSWIEPLQEEAEHGFEIDYIARYIEEGGLTG

Q8GYP6 Pentatricopeptide repeat-containing protein At1g189002.3e-0821.32Show/hide
Query:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEK--
        +T M   + RA+      ++ ++ +MV  G  P   +++ L+ S+        AM    +   AG  P   T+  L+ +    G  +  +++   M+   
Subjt:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEK--

Query:  LNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE
        L+ D    + ++   L ++ +L  A+K+F +    G       Y++M++   KA ++ NAL++  +M+ AG       ++ ++ V   CG  E A + F 
Subjt:  LNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE

Query:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA
         M+  +    PD   Y  ++  + +A + ++     + +      H  L+PN+ T   L+  F +   I EA
Subjt:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011101.4e-0524.05Show/hide
Query:  MVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLI--------LTEELVRS
        M   G SPG  +++ ++     +G  E A +     L +GL P   T+ +L+     KG        +   EK+  D+R   ++        +     RS
Subjt:  MVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLI--------LTEELVRS

Query:  KYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWV
          L+ A   F    +AGL   + IY ++I+  C+ G  S A+ +  EM   G       +N +L       +   A   F  M   E  + PD+ T   +
Subjt:  KYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWV

Query:  IQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEAL
        I  + +      +Q+  EL   M E  KR++ ++ TY  L++ F K   I  A   +  + + E   T +          S+ + ALC +G + E     
Subjt:  IQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEAL

Query:  EAMARDNQQIPPRAMI
        + M   N  I P  MI
Subjt:  EAMARDNQQIPPRAMI

Q9LN22 Pentatricopeptide repeat-containing protein At1g20300, mitochondrial3.7e-0626.62Show/hide
Query:  RSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYN
        R+  + +A KVF +   AG+      Y ++I+  C+ G  S A ++  +M  +G       FN L+ V    G  E     +  M+  +   +PDT TYN
Subjt:  RSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYN

Query:  WVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTY
        ++I+A+ R E+   +++  ++L  M++  K+ + N  T+
Subjt:  WVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTY

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028604.0e-0824.05Show/hide
Query:  KRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVA
        KR S HQ+  Q  +    +  +  + T+   L    ++H P     V+ +MV  G SP   +++ L+ ++  +G  + AM+   +    G  P   T+  
Subjt:  KRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVA

Query:  LVRLFGSKGLANRGLEILAAME----KLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRM
        L+  F   G     + I   M     K N     A++ +     R K+ E   K+F +    GL      ++ ++    + G  S    +  EM+ AG +
Subjt:  LVRLFGSKGLANRGLEILAAME----KLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRM

Query:  ATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV-AELLGMMVEDHKRLQPNMRTYALLVECF
             FN L+S  + CG  E A + +  M   +  + PD  TYN V+ A  R   +++ + V AE+      +  R +PN  TY  L+  +
Subjt:  ATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV-AELLGMMVEDHKRLQPNMRTYALLVECF

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-0921.32Show/hide
Query:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEK--
        +T M   + RA+      ++ ++ +MV  G  P   +++ L+ S+        AM    +   AG  P   T+  L+ +    G  +  +++   M+   
Subjt:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEK--

Query:  LNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE
        L+ D    + ++   L ++ +L  A+K+F +    G       Y++M++   KA ++ NAL++  +M+ AG       ++ ++ V   CG  E A + F 
Subjt:  LNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE

Query:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA
         M+  +    PD   Y  ++  + +A + ++     + +      H  L+PN+ T   L+  F +   I EA
Subjt:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein1.7e-0921.32Show/hide
Query:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEK--
        +T M   + RA+      ++ ++ +MV  G  P   +++ L+ S+        AM    +   AG  P   T+  L+ +    G  +  +++   M+   
Subjt:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEK--

Query:  LNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE
        L+ D    + ++   L ++ +L  A+K+F +    G       Y++M++   KA ++ NAL++  +M+ AG       ++ ++ V   CG  E A + F 
Subjt:  LNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE

Query:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA
         M+  +    PD   Y  ++  + +A + ++     + +      H  L+PN+ T   L+  F +   I EA
Subjt:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA

AT1G18900.3 Pentatricopeptide repeat (PPR) superfamily protein1.7e-0921.32Show/hide
Query:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEK--
        +T M   + RA+      ++ ++ +MV  G  P   +++ L+ S+        AM    +   AG  P   T+  L+ +    G  +  +++   M+   
Subjt:  FTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEK--

Query:  LNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE
        L+ D    + ++   L ++ +L  A+K+F +    G       Y++M++   KA ++ NAL++  +M+ AG       ++ ++ V   CG  E A + F 
Subjt:  LNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE

Query:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA
         M+  +    PD   Y  ++  + +A + ++     + +      H  L+PN+ T   L+  F +   I EA
Subjt:  NMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREA

AT3G04260.1 plastid transcriptionally active 30.0e+0074.71Show/hide
Query:  GLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDS-------TSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGD
        G+  IR  +SAP+K+ R++R+ +     +  D+ S       ++LE SLR TFM+ELM+RARN D  GVS+VIYDM+AAGLSPGPRSFHGLVV+H LNGD
Subjt:  GLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDS-------TSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGD

Query:  TEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDC
         +GAM SLR+EL AG  PL ET +ALVRL GSKG A RGLEILAAMEKL YDIRQAWLIL EEL+R  +LEDANKVFLKGA+ G+RATD++YDLMIEEDC
Subjt:  TEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDC

Query:  KAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN
        KAGDHSNAL+ISYEMEAAGRMATTFHFNCLLSVQATCGIPE+A++TFENMEYGE +MKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKR+QPN
Subjt:  KAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN

Query:  MRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQ
        ++TYALLVECFTKYCV++EAIRHFRAL+ FEGGT  LHN GNF DPLSLYLRALCREGR+VEL++AL+AM +DNQ IPPRAMI+SRKYR+LVSSWIEPLQ
Subjt:  MRTYALLVECFTKYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQ

Query:  EEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEADYHRVVERLRKII
        EEAE G+EIDY+ARYIEEGGLTGERKRWVPR+GKTPLDPDA GFIYSNP+ETSFKQRCLEDWK++HRK+L+TLQ+EGL  L DASE+DY RVVERLR II
Subjt:  EEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEADYHRVVERLRKII

Query:  KGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLY
        KGP  N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN+SRGRPLWVPP+EEEEEEVDEE+D+LI RIKLHEG+TEFWKRRFLGEGL 
Subjt:  KGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLY

Query:  SNNVKPSE--------------DDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQ-DGERVIK-KEVEAKKPLQMIGVQLLKDVDQPTTTSKK
          +V+  E              +D S   D+ +D D  E    E ++E  EEE  V +TEN+ +GE ++K K  +AKK LQMIGVQLLK+ D+   T KK
Subjt:  SNNVKPSE--------------DDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQ-DGERVIK-KEVEAKKPLQMIGVQLLKDVDQPTTTSKK

Query:  SRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAAI
          +R+SR +LEDD DEDWFPE+ FEAFKE+++RKVFDV+DMYTIADVWGWTWE++ KN+ PR+WSQEWEVELAI +M KVIELGGIPTIGDCA+ILRAA+
Subjt:  SRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAAI

Query:  KAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDA
        +AP+PSAFLKILQTTH LGY FGSPLYDE+ITLCLDLGELDAAIAIVAD+ETTGI V D+TLD+VISARQ+N++
Subjt:  KAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDA

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein2.8e-0924.05Show/hide
Query:  KRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVA
        KR S HQ+  Q  +    +  +  + T+   L    ++H P     V+ +MV  G SP   +++ L+ ++  +G  + AM+   +    G  P   T+  
Subjt:  KRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLVVSHTLNGDTEGAMQSLRRELSAGLLPLHETFVA

Query:  LVRLFGSKGLANRGLEILAAME----KLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRM
        L+  F   G     + I   M     K N     A++ +     R K+ E   K+F +    GL      ++ ++    + G  S    +  EM+ AG +
Subjt:  LVRLFGSKGLANRGLEILAAME----KLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKAGDHSNALEISYEMEAAGRM

Query:  ATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV-AELLGMMVEDHKRLQPNMRTYALLVECF
             FN L+S  + CG  E A + +  M   +  + PD  TYN V+ A  R   +++ + V AE+      +  R +PN  TY  L+  +
Subjt:  ATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDV-AELLGMMVEDHKRLQPNMRTYALLVECF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAAATTCCTGCTCTCTCACGCTCATCTTCTCACACTTCCCTCCAATCACCGTTCCTTTTCTCTCAACCATGGCCTCCTCCCCATACGCTCAGTCCTCTCTGCTCC
GGACAAGCGAGGTAGAAAGAAGCGCCAGTCTCGCCACCAACAACAATTGCAACCAAAGGACAATGATTCCACTTCACTTGAGAACTCCCTCCGCTTCACTTTCATGGAGG
AACTCATGGACCGCGCTAGAAACCACGATCCACTTGGTGTTTCTGATGTCATTTACGATATGGTTGCTGCTGGATTGAGCCCTGGACCTCGCTCCTTCCATGGATTAGTT
GTTTCTCATACTCTCAATGGCGATACTGAGGGAGCGATGCAGTCTCTGAGAAGGGAATTAAGTGCTGGACTTCTTCCTCTCCATGAAACGTTTGTTGCATTAGTTCGGTT
ATTTGGTTCCAAGGGTCTTGCTAATAGAGGCTTAGAAATCCTTGCCGCCATGGAGAAATTGAATTATGACATCCGTCAAGCATGGCTCATTCTTACTGAGGAACTCGTAA
GGAGCAAATATTTAGAAGACGCCAATAAAGTGTTCTTAAAAGGTGCCAAAGCGGGTCTCAGAGCCACTGACAAAATTTATGATCTTATGATTGAGGAAGATTGTAAAGCT
GGAGATCATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCTGGGCGGATGGCAACGACCTTCCATTTTAATTGCCTTCTTAGTGTCCAGGCCACTTGTGGAAT
ACCTGAAATTGCTTTCTCAACATTTGAGAACATGGAATATGGAGAAGATTACATGAAGCCCGATACTGAGACATATAATTGGGTGATCCAAGCATATACAAGGGCTGAAT
CTTATGATAGGGTGCAAGATGTTGCCGAGCTACTTGGCATGATGGTTGAAGACCACAAACGTCTACAACCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACT
AAGTATTGTGTTATACGAGAAGCTATTAGGCATTTTCGTGCACTAAGAACATTTGAAGGTGGAACAACAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCCTT
ATATCTTCGAGCTTTATGTAGAGAAGGTAGGGTGGTAGAACTCTTAGAAGCATTAGAGGCTATGGCTAGAGATAACCAACAGATTCCTCCAAGAGCAATGATCTTGAGCA
GAAAGTATCGATCACTCGTGAGCTCATGGATTGAACCTTTACAAGAAGAAGCTGAACATGGATTCGAGATAGACTACATTGCAAGATACATTGAAGAGGGCGGACTAACT
GGAGAGCGCAAGAGGTGGGTCCCTCGGAAAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATCTATTCAAATCCTATGGAGACATCCTTTAAGCAGCGATGCCT
CGAAGATTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGTGGCTCTGAGGGATGCATCTGAAGCTGATTATCATAGAGTAGTGGAGAGAT
TGAGGAAAATAATAAAGGGTCCTGACCAAAATGTTTTAAAGCCGAAGGCTGCAAGTAAGATGATTGTATCGGAATTAAAAGAAGAGTTAGAAGCACAAGGTTTACCAATT
GACGGAACTAGAAATGTTCTTTACCAGCGTGTTCAAAAAGCAAGAAGAATAAACCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGT
TGATGAAGAGCTGGATGAACTAATTTCACGAATAAAGCTACATGAAGGAAATACGGAGTTCTGGAAACGCCGTTTTCTCGGAGAAGGCTTGTACAGTAATAATGTTAAAC
CTTCTGAGGATGATAAATCAGATCCTCTTGATTCTTTGGATGATGTTGACACTATAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCTGAGGAGGAAGAGGAGGTA
GAACAAACTGAGAATCAAGATGGTGAAAGGGTTATTAAAAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATTGGTGTCCAATTGTTAAAAGATGTTGACCAACCTAC
AACAACATCCAAAAAGTCAAGGAGGAGAAGTTCTCGAGCATCACTCGAGGATGATCGTGATGAAGATTGGTTTCCTGAAGATATATTTGAAGCATTTAAAGAGTTGCAAA
AGAGGAAAGTCTTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGG
GAAGTGGAGCTGGCTATTAAAATTATGCACAAGGTGATAGAATTGGGTGGAATACCAACAATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCATTACC
TTCTGCCTTCTTGAAAATCTTGCAGACAACTCACGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGATGAGGTTATCACCCTTTGCCTTGATCTTGGGGAACTAGATG
CAGCCATTGCAATTGTAGCAGATCTTGAAACAACAGGAATCTTGGTCCATGATGAAACGCTCGATCGGGTAATCTCTGCTAGACAGACGAATGATGCTATGCCCAAGCCT
GACTCAGCCATTGATACCACACTGAATGATCACAGTTTAGCCAATGATGAAGCATCATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCAAATTCCTGCTCTCTCACGCTCATCTTCTCACACTTCCCTCCAATCACCGTTCCTTTTCTCTCAACCATGGCCTCCTCCCCATACGCTCAGTCCTCTCTGCTCC
GGACAAGCGAGGTAGAAAGAAGCGCCAGTCTCGCCACCAACAACAATTGCAACCAAAGGACAATGATTCCACTTCACTTGAGAACTCCCTCCGCTTCACTTTCATGGAGG
AACTCATGGACCGCGCTAGAAACCACGATCCACTTGGTGTTTCTGATGTCATTTACGATATGGTTGCTGCTGGATTGAGCCCTGGACCTCGCTCCTTCCATGGATTAGTT
GTTTCTCATACTCTCAATGGCGATACTGAGGGAGCGATGCAGTCTCTGAGAAGGGAATTAAGTGCTGGACTTCTTCCTCTCCATGAAACGTTTGTTGCATTAGTTCGGTT
ATTTGGTTCCAAGGGTCTTGCTAATAGAGGCTTAGAAATCCTTGCCGCCATGGAGAAATTGAATTATGACATCCGTCAAGCATGGCTCATTCTTACTGAGGAACTCGTAA
GGAGCAAATATTTAGAAGACGCCAATAAAGTGTTCTTAAAAGGTGCCAAAGCGGGTCTCAGAGCCACTGACAAAATTTATGATCTTATGATTGAGGAAGATTGTAAAGCT
GGAGATCATTCAAATGCCTTAGAGATCTCATATGAAATGGAGGCTGCTGGGCGGATGGCAACGACCTTCCATTTTAATTGCCTTCTTAGTGTCCAGGCCACTTGTGGAAT
ACCTGAAATTGCTTTCTCAACATTTGAGAACATGGAATATGGAGAAGATTACATGAAGCCCGATACTGAGACATATAATTGGGTGATCCAAGCATATACAAGGGCTGAAT
CTTATGATAGGGTGCAAGATGTTGCCGAGCTACTTGGCATGATGGTTGAAGACCACAAACGTCTACAACCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACT
AAGTATTGTGTTATACGAGAAGCTATTAGGCATTTTCGTGCACTAAGAACATTTGAAGGTGGAACAACAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCCTT
ATATCTTCGAGCTTTATGTAGAGAAGGTAGGGTGGTAGAACTCTTAGAAGCATTAGAGGCTATGGCTAGAGATAACCAACAGATTCCTCCAAGAGCAATGATCTTGAGCA
GAAAGTATCGATCACTCGTGAGCTCATGGATTGAACCTTTACAAGAAGAAGCTGAACATGGATTCGAGATAGACTACATTGCAAGATACATTGAAGAGGGCGGACTAACT
GGAGAGCGCAAGAGGTGGGTCCCTCGGAAAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATCTATTCAAATCCTATGGAGACATCCTTTAAGCAGCGATGCCT
CGAAGATTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGTGGCTCTGAGGGATGCATCTGAAGCTGATTATCATAGAGTAGTGGAGAGAT
TGAGGAAAATAATAAAGGGTCCTGACCAAAATGTTTTAAAGCCGAAGGCTGCAAGTAAGATGATTGTATCGGAATTAAAAGAAGAGTTAGAAGCACAAGGTTTACCAATT
GACGGAACTAGAAATGTTCTTTACCAGCGTGTTCAAAAAGCAAGAAGAATAAACCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGT
TGATGAAGAGCTGGATGAACTAATTTCACGAATAAAGCTACATGAAGGAAATACGGAGTTCTGGAAACGCCGTTTTCTCGGAGAAGGCTTGTACAGTAATAATGTTAAAC
CTTCTGAGGATGATAAATCAGATCCTCTTGATTCTTTGGATGATGTTGACACTATAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCTGAGGAGGAAGAGGAGGTA
GAACAAACTGAGAATCAAGATGGTGAAAGGGTTATTAAAAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATTGGTGTCCAATTGTTAAAAGATGTTGACCAACCTAC
AACAACATCCAAAAAGTCAAGGAGGAGAAGTTCTCGAGCATCACTCGAGGATGATCGTGATGAAGATTGGTTTCCTGAAGATATATTTGAAGCATTTAAAGAGTTGCAAA
AGAGGAAAGTCTTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGG
GAAGTGGAGCTGGCTATTAAAATTATGCACAAGGTGATAGAATTGGGTGGAATACCAACAATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCATTACC
TTCTGCCTTCTTGAAAATCTTGCAGACAACTCACGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGATGAGGTTATCACCCTTTGCCTTGATCTTGGGGAACTAGATG
CAGCCATTGCAATTGTAGCAGATCTTGAAACAACAGGAATCTTGGTCCATGATGAAACGCTCGATCGGGTAATCTCTGCTAGACAGACGAATGATGCTATGCCCAAGCCT
GACTCAGCCATTGATACCACACTGAATGATCACAGTTTAGCCAATGATGAAGCATCATAA
Protein sequenceShow/hide protein sequence
MSKFLLSHAHLLTLPSNHRSFSLNHGLLPIRSVLSAPDKRGRKKRQSRHQQQLQPKDNDSTSLENSLRFTFMEELMDRARNHDPLGVSDVIYDMVAAGLSPGPRSFHGLV
VSHTLNGDTEGAMQSLRRELSAGLLPLHETFVALVRLFGSKGLANRGLEILAAMEKLNYDIRQAWLILTEELVRSKYLEDANKVFLKGAKAGLRATDKIYDLMIEEDCKA
GDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
KYCVIREAIRHFRALRTFEGGTTALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLT
GERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPI
DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEV
EQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEW
EVELAIKIMHKVIELGGIPTIGDCAMILRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKP
DSAIDTTLNDHSLANDEAS