; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g1187 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g1187
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionSAP domain-containing protein
Genome locationMC06:15057164..15095583
RNA-Seq ExpressionMC06g1187
SyntenyMC06g1187
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0004601 - peroxidase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR002016 - Haem peroxidase
IPR003034 - SAP domain
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR036361 - SAP domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443746.1 PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo]0.092.46Show/hide
Query:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHP-----KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA
        MSKLLLS H HLLTLP+   S  L NH G+LPIRSVLSAP+KRGRKKRQ RH      KDD ST+LE  LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHP-----KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK
        GLSPGPRSFHGLVVSH+LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA+RGLEIL+AME+LNYDIRQAWLIL +ELVRNKYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK

Query:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR
        GAK GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGNTE+WKRRFLGEGLD+N+VKPSEDDKS+ LDSLDDVD +ED AKEIEEEE EEEE VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI
         TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGGTPTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI

Query:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE
        LRAAI++PLPSAFLKILQTTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI VPDETLDRVIS RQTNDAMPKPD+AIDTT+NDHSLANDE
Subjt:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

XP_011660243.1 uncharacterized protein LOC101209618 isoform X1 [Cucumis sativus]0.092.35Show/hide
Query:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRH-----PKDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA
        MSK LLS H HLLTLP    S  L NH G+LPIRSVLSAP+KRGRKKRQ RH     PKD+ ST+LE  LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRH-----PKDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK
        GLSPGPRSFHGLVVSH+LNGDTEGAMQSLRRELSAGL PLHETFVALVRLFGSKGLA+RGLEIL+AMEKLNYDIRQAWLIL +ELVR+KYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK

Query:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR
        GAK GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGT ALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV ERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGNTE+WKRRFLGEGL +N+VKPSEDDKS+PLDSLDDVD +ED AKEIEEEE EEEE VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI
        TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI

Query:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE
        LRAAI++PLPSAFLKILQTTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI V DETLDRVISARQTNDAMPKPD+AIDTTLNDHSLANDE
Subjt:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

XP_022151680.1 uncharacterized protein LOC111019595 [Momordica charantia]0.0100Show/hide
Query:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHPKDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPG
        MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHPKDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPG
Subjt:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHPKDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPG

Query:  PRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGG
        PRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGG
Subjt:  PRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGG

Query:  LRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVA
        LRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVA
Subjt:  LRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVA

Query:  ELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQRAMIL
        ELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQRAMIL
Subjt:  ELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQRAMIL

Query:  SRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALGDA
        SRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALGDA
Subjt:  SRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALGDA

Query:  SEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHE
        SEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHE
Subjt:  SEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHE

Query:  GNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKS
        GNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKS
Subjt:  GNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKS

Query:  RRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMILRAAIR
        RRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMILRAAIR
Subjt:  RRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMILRAAIR

Query:  SPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDEAS
        SPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDEAS
Subjt:  SPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDEAS

XP_022930357.1 uncharacterized protein LOC111436825 isoform X1 [Cucurbita moschata]0.090.91Show/hide
Query:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLP-IRSVLSAPEKRGRKKRQPRHP----KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA
        MSK LLS H++LLTLPHK  S  LHN  GV P IRSVLS  EKRGRKKRQ R      KDD ST  EK LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLP-IRSVLSAPEKRGRKKRQPRHP----KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK
        GLSPGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLA+RGLEIL+AMEKLNYDIRQAWLILI+ELV+NKYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK

Query:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR
        GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTF GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALGDASEADY RVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGNTE+WKRRFLGEGLD+N+VKPSEDD+SEPLDSLDDVDIVED AKEI+EEE EEEE VE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQT+
Subjt:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI
        TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKR+VFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI

Query:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE
        LRAAI++PLPSAF KILQTTHSLGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDR+ISARQTNDA PK D+ ID TLNDHSL NDE
Subjt:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE

Query:  AS
         S
Subjt:  AS

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida]0.093.24Show/hide
Query:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPR-----HPKDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA
        MSK LLS H HLLTLP+K  S  L NH GV+PIRSVLSAP+KRGRKKRQ R     H KD  STALEK LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPR-----HPKDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK
        GLSPGPRSFHGLVVSH LNGDTEGAMQSLRRELSAGL PLHETFVALVRLFGSKGLA+RGLEIL+AMEKLNYDIRQAWLIL +ELVRNKYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK

Query:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR
        GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKM+HRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALG ASEADYHRV ERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGNTE+WKRRFLGEGLD+N+VKPSEDDKSEPLDSLDDVD VED AKEIEEEE EEEE VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI
        T SKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGGTPTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI

Query:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE
        LRAAI++PLPS+FLKILQTTH LGY FGSPLYDEVITLCLDLGELDAAIAIVADLETTGI VPDETLDRVIS RQTND+MPKPD+AIDTTLNDHSLA+DE
Subjt:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

TrEMBL top hitse value%identityAlignment
A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X10.092.46Show/hide
Query:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHP-----KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA
        MSKLLLS H HLLTLP+   S  L NH G+LPIRSVLSAP+KRGRKKRQ RH      KDD ST+LE  LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHP-----KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK
        GLSPGPRSFHGLVVSH+LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA+RGLEIL+AME+LNYDIRQAWLIL +ELVRNKYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK

Query:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR
        GAK GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGNTE+WKRRFLGEGLD+N+VKPSEDDKS+ LDSLDDVD +ED AKEIEEEE EEEE VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI
         TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGGTPTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI

Query:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE
        LRAAI++PLPSAFLKILQTTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI VPDETLDRVIS RQTNDAMPKPD+AIDTT+NDHSLANDE
Subjt:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

A0A1S3B9H7 uncharacterized protein LOC103487261 isoform X20.092.06Show/hide
Query:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHP-----KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA
        MSKLLLS H HLLTLP+   S  L NH G+LPIRSVLSAP+KRGRKKRQ RH      KDD ST+LE  LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHP-----KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK
        GLSPGPRSFHGLVVSH+LNGDTEGAMQSLRRELS+GLRPLHETFVALVRLFGSKGLA+RGLEIL+AME+LNYDIRQAWLIL +ELVRNKYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK

Query:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR
        GAK GLRATDKIYDL+IEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV E+LKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGNTE+WKRRFLGEGLD+N+VKPSEDDKS+ LDSLDDVD +ED AKEIEEEE EEEE VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI
         TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGGTPTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI

Query:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPL
        LRAAI++PLPSAFLKILQTTH LGYVFG  L
Subjt:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPL

A0A6J1DBV3 uncharacterized protein LOC1110195950.0100Show/hide
Query:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHPKDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPG
        MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHPKDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPG
Subjt:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHPKDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPG

Query:  PRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGG
        PRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGG
Subjt:  PRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGG

Query:  LRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVA
        LRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVA
Subjt:  LRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVA

Query:  ELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQRAMIL
        ELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQRAMIL
Subjt:  ELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQRAMIL

Query:  SRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALGDA
        SRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALGDA
Subjt:  SRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALGDA

Query:  SEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHE
        SEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHE
Subjt:  SEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHE

Query:  GNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKS
        GNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKS
Subjt:  GNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKS

Query:  RRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMILRAAIR
        RRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMILRAAIR
Subjt:  RRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMILRAAIR

Query:  SPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDEAS
        SPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDEAS
Subjt:  SPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDEAS

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X10.090.91Show/hide
Query:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLP-IRSVLSAPEKRGRKKRQPRHP----KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA
        MSK LLS H++LLTLPHK  S  LHN  GV P IRSVLS  EKRGRKKRQ R      KDD ST  EK LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLP-IRSVLSAPEKRGRKKRQPRHP----KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK
        GLSPGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLA+RGLEIL+AMEKLNYDIRQAWLILI+ELV+NKYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK

Query:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR
        GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTF GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALGDASEADY RVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGNTE+WKRRFLGEGLD+N+VKPSEDD+SEPLDSLDDVDIVED AKEI+EEE EEEE VE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQT+
Subjt:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI
        TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKR+VFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI

Query:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE
        LRAAI++PLPSAF KILQTTHSLGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI VPDETLDR+ISARQTNDA PK D+ ID TLNDHSL NDE
Subjt:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE

Query:  AS
         S
Subjt:  AS

A0A6J1L2D9 uncharacterized protein LOC111499221 isoform X10.090.69Show/hide
Query:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLP-IRSVLSAPEKRGRKKRQPRHP----KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA
        MSK LLS H+ LLTLPHK  S  LHN   VLP IRSVLS  EKRGRKKRQ R      KD  ST LEK LRFTFMEELMDRAR+ D +GVSDVIYDMVAA
Subjt:  MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLP-IRSVLSAPEKRGRKKRQPRHP----KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAA

Query:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK
        GLSPGPRSFHGLVVSH LN D EGAMQSLR+ELS GLRPLHETFVALVRLFG+KGLA+RGLEIL+AMEKLNYDIRQAWLILI+ELV+NKYLEDANK FLK
Subjt:  GLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLK

Query:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR
        GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGED MKPDTETYNWVIQAYTRAESYDR
Subjt:  GAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDR

Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR LKTF GGTKALH+EG+FGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQ

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALGDASEADY RVEERLKKIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT
        IKLHEGNTE+WKRRFLGEGLD+N+VKPSEDD+SEPLDSLDDVD+VED AKEI+EEE EEEE VE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQT+
Subjt:  IKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEE-VEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI
        TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF ELRKR++FD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMI

Query:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE
        LRAAI++PLPSAF KILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI VPDETLDR+ISARQTNDA PK D+ ID TLNDHSLA+DE
Subjt:  LRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTAIDTTLNDHSLANDE

Query:  AS
         S
Subjt:  AS

SwissProt top hitse value%identityAlignment
Q0WPZ6 Pentatricopeptide repeat-containing protein At2g171401.2e-0923.01Show/hide
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLN-YDIRQAWLILIDELVRN
        VS +  DMV  G++P   +F+ L+ +   +   + A +        G +P   TF  LVR +   GL  +GLE+L+AME       +  +  ++    R 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLN-YDIRQAWLILIDELVRN

Query:  KYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTET
           +D+ K   K  + GL      ++  I   CK G   +A  I  +ME    +      +  +N +L      G+ E A + FE++   +D      ++
Subjt:  KYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTET

Query:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLS--LYLRALCREGRVV
        YN  +Q   R   +   + V + +       K + P++ +Y +L++   K  ++ +A       KT  G    +   G   D ++    L   C  G+V 
Subjt:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLS--LYLRALCREGRVV

Query:  ELLEALEAMARDN---QQIPQRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG
             L+ M R+N          ++ S      +S   E L++  E GY +D +   I   GL G
Subjt:  ELLEALEAMARDN---QQIPQRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG

Q9LMH5 Putative pentatricopeptide repeat-containing protein At1g138002.6e-0723.11Show/hide
Query:  KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLAS
        K D   A  K +R    E  ++ A S        V+ DM   G+ P    +  ++  H  N +   A+    + L    R       ++++ +   G  S
Subjt:  KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLAS

Query:  RGLEILSAMEKLNYDI-RQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQAT
           ++     + N  + R  + +  D L +   +E+A + F +    G+      Y  LI   C  G  S+A ++  EM+  G+      +N L    AT
Subjt:  RGLEILSAMEKLNYDI-RQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQAT

Query:  CGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELL
         G+ + AF T + ME     +KP   T+N VI+    A   D+ +   E L
Subjt:  CGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELL

Q9LN22 Pentatricopeptide repeat-containing protein At1g20300, mitochondrial1.7e-0624.39Show/hide
Query:  ETFVALVRLFGSKGLASRGLEILSAMEKLN-YDIRQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAG
        ETF  L+R +   GLAS  +   + ME       + A+ I+I  L R +   +A ++F    K        +Y  L+   C+AG+ S A ++  EM+ AG
Subjt:  ETFVALVRLFGSKGLASRGLEILSAMEKLN-YDIRQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAG

Query:  RMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRL--QPNMRTYALLVECFTKYCVI
             + ++ ++     CG    A   F +M   +    P+  T+N +++ + +A    R + V ++   M    K+L  +P+  TY  L+E   +   +
Subjt:  RMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRL--QPNMRTYALLVECFTKYCVI

Query:  REAIR
          A++
Subjt:  REAIR

Q9SH60 Pentatricopeptide repeat-containing protein At1g641001.7e-0622.18Show/hide
Query:  LDSVGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEK--LNYDIRQAWLILI
        L++V + D    MV  GL+P   +F+ L+    L G    A   + + +  GL     T+  +V      G     L +LS ME+  +  D+   +  +I
Subjt:  LDSVGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEK--LNYDIRQAWLILI

Query:  DELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDT
        D L ++ +  DA   F +  + G+      Y+ +I+  C  G  S+A  +  +M           FN L+S     G    A    + M +   C+ PDT
Subjt:  DELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDT

Query:  ETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL
         TYN +I  + +   +D  + + +L+           P++ T+  +++ + +   + E ++  R +
Subjt:  ETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic2.3e-0824.71Show/hide
Query:  GLRPLHETFVALVRLFGSKGL--ASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS
        G++P   TF +L+ +    GL  A+R L       ++  D+  ++  L+D + +   ++ A +   +     +      Y  +I+   KAG    AL + 
Subjt:  GLRPLHETFVALVRLFGSKGL--ASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS

Query:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
         EM   G       +N LLS+    G  E A      M      +K D  TYN ++  Y +   YD V+ V      M  +H  + PN+ TY+ L++ ++
Subjt:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT

Query:  KYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLEALEAMARD
        K  + +EA+  FR  K+   G +A        D +  S  + ALC+ G V   +  ++ M ++
Subjt:  KYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLEALEAMARD

Arabidopsis top hitse value%identityAlignment
AT1G13800.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-0823.11Show/hide
Query:  KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLAS
        K D   A  K +R    E  ++ A S        V+ DM   G+ P    +  ++  H  N +   A+    + L    R       ++++ +   G  S
Subjt:  KDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLAS

Query:  RGLEILSAMEKLNYDI-RQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQAT
           ++     + N  + R  + +  D L +   +E+A + F +    G+      Y  LI   C  G  S+A ++  EM+  G+      +N L    AT
Subjt:  RGLEILSAMEKLNYDI-RQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQAT

Query:  CGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELL
         G+ + AF T + ME     +KP   T+N VI+    A   D+ +   E L
Subjt:  CGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELL

AT1G64100.1 pentatricopeptide (PPR) repeat-containing protein1.2e-0722.18Show/hide
Query:  LDSVGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEK--LNYDIRQAWLILI
        L++V + D    MV  GL+P   +F+ L+    L G    A   + + +  GL     T+  +V      G     L +LS ME+  +  D+   +  +I
Subjt:  LDSVGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEK--LNYDIRQAWLILI

Query:  DELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDT
        D L ++ +  DA   F +  + G+      Y+ +I+  C  G  S+A  +  +M           FN L+S     G    A    + M +   C+ PDT
Subjt:  DELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDT

Query:  ETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL
         TYN +I  + +   +D  + + +L+           P++ T+  +++ + +   + E ++  R +
Subjt:  ETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL

AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein8.7e-1123.01Show/hide
Query:  VSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLN-YDIRQAWLILIDELVRN
        VS +  DMV  G++P   +F+ L+ +   +   + A +        G +P   TF  LVR +   GL  +GLE+L+AME       +  +  ++    R 
Subjt:  VSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLN-YDIRQAWLILIDELVRN

Query:  KYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTET
           +D+ K   K  + GL      ++  I   CK G   +A  I  +ME    +      +  +N +L      G+ E A + FE++   +D      ++
Subjt:  KYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEISYEMEAAGRMA----TTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTET

Query:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLS--LYLRALCREGRVV
        YN  +Q   R   +   + V + +       K + P++ +Y +L++   K  ++ +A       KT  G    +   G   D ++    L   C  G+V 
Subjt:  YNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLS--LYLRALCREGRVV

Query:  ELLEALEAMARDN---QQIPQRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG
             L+ M R+N          ++ S      +S   E L++  E GY +D +   I   GL G
Subjt:  ELLEALEAMARDN---QQIPQRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTG

AT2G31400.1 genomes uncoupled 11.6e-0924.71Show/hide
Query:  GLRPLHETFVALVRLFGSKGL--ASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS
        G++P   TF +L+ +    GL  A+R L       ++  D+  ++  L+D + +   ++ A +   +     +      Y  +I+   KAG    AL + 
Subjt:  GLRPLHETFVALVRLFGSKGL--ASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGDHSNALEIS

Query:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT
         EM   G       +N LLS+    G  E A      M      +K D  TYN ++  Y +   YD V+ V      M  +H  + PN+ TY+ L++ ++
Subjt:  YEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFT

Query:  KYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLEALEAMARD
        K  + +EA+  FR  K+   G +A        D +  S  + ALC+ G V   +  ++ M ++
Subjt:  KYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPL--SLYLRALCREGRVVELLEALEAMARD

AT3G04260.1 plastid transcriptionally active 30.0e+0074.6Show/hide
Query:  GVLPIRSVLSAPEKRGRKKRQPRH----PKDD--------ASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGD
        G+  IR  +SAPEK+ R++R+ +       DD        A +ALE+ LR TFM+ELM+RAR+ D+ GVS+VIYDM+AAGLSPGPRSFHGLVV+H+LNGD
Subjt:  GVLPIRSVLSAPEKRGRKKRQPRH----PKDD--------ASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPGPRSFHGLVVSHSLNGD

Query:  TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDC
         +GAM SLR+EL AG RPL ET +ALVRL GSKG A+RGLEIL+AMEKL YDIRQAWLIL++EL+R  +LEDANK FLKGA+GG+RATD++YDL+IEEDC
Subjt:  TEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDC

Query:  KAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN
        KAGDHSNAL+ISYEMEAAGRMATTFHFNCLLSVQATCGIPE+A++TFENMEYGE  MKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKR+QPN
Subjt:  KAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPN

Query:  MRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQRAMILSRKYRSLVSSWIEPLQ
        ++TYALLVECFTKYCV++EAIRHFRALK F+GGT  LHN GNF DPLSLYLRALCREGR+VEL++AL+AM +DNQ IP RAMI+SRKYR+LVSSWIEPLQ
Subjt:  MRTYALLVECFTKYCVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQRAMILSRKYRSLVSSWIEPLQ

Query:  EEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALGDASEADYHRVEERLKKII
        EEAE GYEIDY+ARYIEEGGLTGERKRWVPRRGKTPLDPDA GFIYSNP+ETSFKQRCLEDWK++HRK+L+TLQ+EGL  LGDASE+DY RV ERL+ II
Subjt:  EEAEHGYEIDYIARYIEEGGLTGERKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALGDASEADYHRVEERLKKII

Query:  KGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEYWKRRFLGEGLD
        KGP  N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN+SRGRPLWVPP+EEEEEEVDEE+D+LI RIKLHEG+TE+WKRRFLGEGL 
Subjt:  KGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEYWKRRFLGEGLD

Query:  NNSVKPSE--------------DDKSEPLDSLDDVDIVE-DGAKEIEEEEVEEEEVEQTENQ-DGERVIK-KEVEAKKPLQMIGVQLLKDVDQTTTTSKK
          SV+  E              +D S+  D+ +D D  E +G ++ +E E EE  V +TEN+ +GE ++K K  +AKK LQMIGVQLLK+ D+   T KK
Subjt:  NNSVKPSE--------------DDKSEPLDSLDDVDIVE-DGAKEIEEEEVEEEEVEQTENQ-DGERVIK-KEVEAKKPLQMIGVQLLKDVDQTTTTSKK

Query:  SRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMILRAAI
          +R+SR +LEDD DEDWFPE+ FEAFKE+R+R+VFDV+DMYTIADVWGWTWE++ KN+ PR+WSQEWEVELA  +M KVIELGG PTIGDCA+ILRAA+
Subjt:  SRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELATKIMHKVIELGGTPTIGDCAMILRAAI

Query:  RSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDA
        R+P+PSAFLKILQTTHSLGY FGSPLYDE+ITLCLDLGELDAAIAIVAD+ETTGI VPD+TLD+VISARQ+N++
Subjt:  RSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAAACTCCTGCTCTCCTCCCACACCCACCTTCTCACCCTTCCCCACAAACGCCCTTCATTGTGTCTTCACAATCACAATGGCGTCCTGCCCATCCGCTCAGTCCT
CTCCGCTCCAGAGAAGCGTGGCAGAAAGAAGAGGCAGCCGCGGCACCCTAAGGATGACGCTTCCACTGCGCTTGAGAAGGGCCTTCGCTTCACTTTCATGGAGGAACTCA
TGGACCGCGCCAGAAGCCTCGACTCCGTCGGCGTTTCTGATGTCATTTACGATATGGTTGCCGCTGGATTGAGCCCTGGACCTCGCTCCTTCCACGGCTTGGTTGTGTCT
CATTCTCTCAACGGCGACACCGAGGGAGCGATGCAATCTCTGAGAAGGGAACTAAGTGCTGGACTTCGTCCTCTCCATGAGACGTTTGTTGCATTAGTTCGGTTATTTGG
TTCCAAGGGTCTTGCTTCCAGAGGCTTAGAAATCCTTTCAGCCATGGAGAAATTGAACTATGACATTCGCCAAGCTTGGCTCATTCTTATTGACGAACTCGTAAGGAATA
AATACTTAGAAGACGCCAATAAAGCGTTTTTAAAGGGTGCCAAAGGGGGTCTCAGAGCCACTGACAAGATTTATGATCTCCTAATTGAGGAAGACTGTAAAGCTGGGGAT
CATTCAAATGCCTTAGAGATCTCATATGAGATGGAGGCTGCTGGGCGAATGGCAACAACCTTTCATTTCAATTGTCTCCTTAGTGTCCAGGCTACTTGTGGGATACCTGA
AATTGCTTTCTCAACATTTGAAAACATGGAATATGGAGAAGATTGCATGAAACCTGATACTGAGACATATAATTGGGTGATCCAAGCATATACAAGAGCTGAATCTTACG
ATAGGGTGCAAGATGTTGCTGAGTTACTTGGCATGATGGTTGAAGACCATAAGCGTCTACAGCCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACCAAGTAT
TGTGTCATACGAGAAGCCATTAGGCATTTCCGTGCACTAAAAACCTTTCAAGGTGGAACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCTTTATATCT
TCGAGCTTTATGTAGAGAAGGTAGAGTTGTAGAGCTCTTAGAAGCTTTAGAAGCTATGGCTAGAGATAACCAACAGATTCCTCAAAGAGCCATGATTTTGAGCAGAAAGT
ATCGATCACTGGTGAGCTCATGGATTGAACCTTTACAGGAAGAAGCTGAACATGGATACGAGATAGACTACATTGCAAGATACATTGAAGAGGGTGGACTCACTGGAGAA
CGCAAGAGGTGGGTCCCCCGAAGAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATTTATTCAAATCCAATGGAAACATCCTTTAAGCAACGATGTCTAGAAGA
TTGGAAGATGTACCACCGAAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGTAGCTCTTGGGGATGCATCTGAAGCTGATTATCATAGAGTTGAGGAGAGATTGAAGA
AAATTATAAAGGGTCCCGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTGAAAGAAGAATTAGAAGCACAAGGTTTACCAATTGATGGA
ACTAGAAATGTTCTTTATCAGCGTGTTCAAAAAGCAAGGAGAATAAATAGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCCGTTGAGGAGGAGGAAGAAGAGGTTGATGA
AGAGCTGGATGAACTAATTTCACGAATAAAGCTACATGAAGGAAATACAGAGTACTGGAAACGCCGCTTTCTTGGAGAAGGCTTGGACAATAATAGTGTTAAACCATCTG
AAGACGATAAATCAGAACCTCTTGATTCTCTGGATGATGTTGACATTGTAGAAGATGGTGCAAAGGAGATTGAAGAAGAAGAAGTGGAAGAGGAGGAGGTAGAACAAACT
GAGAATCAAGATGGTGAAAGAGTTATTAAGAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATTGGTGTCCAATTGTTAAAAGACGTCGACCAAACCACAACAACATC
CAAAAAGTCAAGGAGGAGGAGTTCTCGAGCATCACTCGAGGATGATCGTGATGAAGACTGGTTTCCTGAAGATATATTTGAGGCATTTAAAGAGTTGCGAAAGAGAAGAG
TCTTTGATGTATCGGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGAG
TTGGCTACTAAAATTATGCACAAGGTGATTGAATTGGGTGGAACACCAACAATTGGCGACTGTGCGATGATCTTGCGAGCTGCCATAAGGTCTCCTCTACCCTCTGCTTT
TTTGAAGATCTTGCAGACAACACACAGTCTTGGCTATGTTTTTGGCAGCCCTCTATATGATGAGGTTATCACCCTGTGTCTTGATCTTGGGGAGCTAGATGCAGCCATAG
CAATTGTAGCAGACCTGGAAACCACGGGAATTCCAGTTCCCGATGAAACGCTCGATCGGGTAATCTCTGCTCGGCAGACGAACGATGCTATGCCCAAGCCTGATACAGCC
ATTGATACCACACTCAATGATCATAGTTTAGCCAATGATGAAGCATCATAG
mRNA sequenceShow/hide mRNA sequence
GATTTGTAGAAGTTAAAAATTTTAGATTTTGTAGCCGCTTGACTTGAAGAATAGCATAGTTTGAAGTTTCAGATTCGAAACTGGAGGCCAAAATAGGAGATGATAAAATC
TCTGACTATATTTGTAAATATGCTAAAATAAATAGCTCACCATTGCAATTACCCTTAATATCAATATTGATATATTTTCACACTTGGATATTTGACGCTATAGCTGTAGT
TGCAGACTACGGTCGTTCATTTATCGTCAGGAATCCCGCCTTTTTATTCGGCAACGCCCCTTCCGCCGAGGCTCTGTTTCTTATCCAAAATTCGGCAGCCATTAATGTCC
AAACTCCTGCTCTCCTCCCACACCCACCTTCTCACCCTTCCCCACAAACGCCCTTCATTGTGTCTTCACAATCACAATGGCGTCCTGCCCATCCGCTCAGTCCTCTCCGC
TCCAGAGAAGCGTGGCAGAAAGAAGAGGCAGCCGCGGCACCCTAAGGATGACGCTTCCACTGCGCTTGAGAAGGGCCTTCGCTTCACTTTCATGGAGGAACTCATGGACC
GCGCCAGAAGCCTCGACTCCGTCGGCGTTTCTGATGTCATTTACGATATGGTTGCCGCTGGATTGAGCCCTGGACCTCGCTCCTTCCACGGCTTGGTTGTGTCTCATTCT
CTCAACGGCGACACCGAGGGAGCGATGCAATCTCTGAGAAGGGAACTAAGTGCTGGACTTCGTCCTCTCCATGAGACGTTTGTTGCATTAGTTCGGTTATTTGGTTCCAA
GGGTCTTGCTTCCAGAGGCTTAGAAATCCTTTCAGCCATGGAGAAATTGAACTATGACATTCGCCAAGCTTGGCTCATTCTTATTGACGAACTCGTAAGGAATAAATACT
TAGAAGACGCCAATAAAGCGTTTTTAAAGGGTGCCAAAGGGGGTCTCAGAGCCACTGACAAGATTTATGATCTCCTAATTGAGGAAGACTGTAAAGCTGGGGATCATTCA
AATGCCTTAGAGATCTCATATGAGATGGAGGCTGCTGGGCGAATGGCAACAACCTTTCATTTCAATTGTCTCCTTAGTGTCCAGGCTACTTGTGGGATACCTGAAATTGC
TTTCTCAACATTTGAAAACATGGAATATGGAGAAGATTGCATGAAACCTGATACTGAGACATATAATTGGGTGATCCAAGCATATACAAGAGCTGAATCTTACGATAGGG
TGCAAGATGTTGCTGAGTTACTTGGCATGATGGTTGAAGACCATAAGCGTCTACAGCCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACCAAGTATTGTGTC
ATACGAGAAGCCATTAGGCATTTCCGTGCACTAAAAACCTTTCAAGGTGGAACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCTTTATATCTTCGAGC
TTTATGTAGAGAAGGTAGAGTTGTAGAGCTCTTAGAAGCTTTAGAAGCTATGGCTAGAGATAACCAACAGATTCCTCAAAGAGCCATGATTTTGAGCAGAAAGTATCGAT
CACTGGTGAGCTCATGGATTGAACCTTTACAGGAAGAAGCTGAACATGGATACGAGATAGACTACATTGCAAGATACATTGAAGAGGGTGGACTCACTGGAGAACGCAAG
AGGTGGGTCCCCCGAAGAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATTTATTCAAATCCAATGGAAACATCCTTTAAGCAACGATGTCTAGAAGATTGGAA
GATGTACCACCGAAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGTAGCTCTTGGGGATGCATCTGAAGCTGATTATCATAGAGTTGAGGAGAGATTGAAGAAAATTA
TAAAGGGTCCCGACCAAAATGTTTTAAAGCCAAAGGCTGCAAGTAAGATGATTGTATCAGAATTGAAAGAAGAATTAGAAGCACAAGGTTTACCAATTGATGGAACTAGA
AATGTTCTTTATCAGCGTGTTCAAAAAGCAAGGAGAATAAATAGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCCGTTGAGGAGGAGGAAGAAGAGGTTGATGAAGAGCT
GGATGAACTAATTTCACGAATAAAGCTACATGAAGGAAATACAGAGTACTGGAAACGCCGCTTTCTTGGAGAAGGCTTGGACAATAATAGTGTTAAACCATCTGAAGACG
ATAAATCAGAACCTCTTGATTCTCTGGATGATGTTGACATTGTAGAAGATGGTGCAAAGGAGATTGAAGAAGAAGAAGTGGAAGAGGAGGAGGTAGAACAAACTGAGAAT
CAAGATGGTGAAAGAGTTATTAAGAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATTGGTGTCCAATTGTTAAAAGACGTCGACCAAACCACAACAACATCCAAAAA
GTCAAGGAGGAGGAGTTCTCGAGCATCACTCGAGGATGATCGTGATGAAGACTGGTTTCCTGAAGATATATTTGAGGCATTTAAAGAGTTGCGAAAGAGAAGAGTCTTTG
ATGTATCGGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGAGTTGGCT
ACTAAAATTATGCACAAGGTGATTGAATTGGGTGGAACACCAACAATTGGCGACTGTGCGATGATCTTGCGAGCTGCCATAAGGTCTCCTCTACCCTCTGCTTTTTTGAA
GATCTTGCAGACAACACACAGTCTTGGCTATGTTTTTGGCAGCCCTCTATATGATGAGGTTATCACCCTGTGTCTTGATCTTGGGGAGCTAGATGCAGCCATAGCAATTG
TAGCAGACCTGGAAACCACGGGAATTCCAGTTCCCGATGAAACGCTCGATCGGGTAATCTCTGCTCGGCAGACGAACGATGCTATGCCCAAGCCTGATACAGCCATTGAT
ACCACACTCAATGATCATAGTTTAGCCAATGATGAAGCATCATAGTCAATCATTGTGTTCTTATTTTCCACTTTTGTACAGTCCAATTCTGGGGTTGCCAGCAAATTGTT
CTTGTAAAATTGATTAGTAGTTCTTGTAAACTAGATTTTGAAATTTTGTTTATCAAGTTGTAATCATCACTTGTTTTAATTTAATCTCTTTTCCAATTTAATGATTCATA
TGATTTATTGTCTAAAGCAATCAATTTTGCTAATTGCTTTTCGAAGCTTCTCTTTAGTTTTTTATTTTGATGTTGCCTCTCAC
Protein sequenceShow/hide protein sequence
MSKLLLSSHTHLLTLPHKRPSLCLHNHNGVLPIRSVLSAPEKRGRKKRQPRHPKDDASTALEKGLRFTFMEELMDRARSLDSVGVSDVIYDMVAAGLSPGPRSFHGLVVS
HSLNGDTEGAMQSLRRELSAGLRPLHETFVALVRLFGSKGLASRGLEILSAMEKLNYDIRQAWLILIDELVRNKYLEDANKAFLKGAKGGLRATDKIYDLLIEEDCKAGD
HSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGEDCMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKY
CVIREAIRHFRALKTFQGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPQRAMILSRKYRSLVSSWIEPLQEEAEHGYEIDYIARYIEEGGLTGE
RKRWVPRRGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALGDASEADYHRVEERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDG
TRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEYWKRRFLGEGLDNNSVKPSEDDKSEPLDSLDDVDIVEDGAKEIEEEEVEEEEVEQT
ENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQTTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELRKRRVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVE
LATKIMHKVIELGGTPTIGDCAMILRAAIRSPLPSAFLKILQTTHSLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGIPVPDETLDRVISARQTNDAMPKPDTA
IDTTLNDHSLANDEAS