CuGenDBv2

CSPI01G32820 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G32820
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: SAP domain-containing protein
Genome location: Chr1:27647902..27666509
RNA-Seq Expression: CSPI01G32820
Synteny: CSPI01G32820
Gene Ontology terms:
    GO:0006807 - nitrogen compound metabolic process (biological process)
    GO:0006979 - response to oxidative stress (biological process)
    GO:0044238 - primary metabolic process (biological process)
    GO:0044260 - cellular macromolecule metabolic process (biological process)
    GO:0098869 - cellular oxidant detoxification (biological process)
    GO:0004601 - peroxidase activity (molecular function)
    GO:0020037 - heme binding (molecular function)
InterPro domains:
    IPR003034 - SAP domain
    IPR036361 - SAP domain superfamily
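
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below parses such a string and reports the span of the locus. The names `Region` and `parse_location` are hypothetical helpers introduced for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'Chr:start..end' string into a Region."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Region(chrom, start, end)


region = parse_location("Chr1:27647902..27666509")
print(region.chrom, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the locus spans 18,608 bp.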


Homology
GenBank top hits (e value, % identity, alignment)
XP_008443746.1 PREDICTED: uncharacterized protein LOC103487261 isoform X1 [Cucumis melo] (e value: 0.0e+00; % identity: 97.84)
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALRDASEADYHRVVE+L+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTEFWKRRFLGEGL SNNVKPSEDDKSD LDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
Subjt:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
         TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILV DETLDRVIS RQTNDAMPKPDSAIDTT+NDHSLANDE
Subjt:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

XP_011660243.1 uncharacterized protein LOC101209618 isoform X1 [Cucumis sativus] (e value: 0.0e+00; % identity: 99.83)
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGT ALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
Subjt:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
        TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
Subjt:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

XP_022151680.1 uncharacterized protein LOC111019595 [Momordica charantia] (e value: 0.0e+00; % identity: 95.18)
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV ERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTE+WKRRFLGEGL +N+VKPSEDDKS+PLDSLDDVD +ED AKEIEEEE  EEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
        TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAI++PLPSAFLKILQTTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI V DETLDRVISARQTNDAMPKPD+AIDTTLNDHSLANDE
Subjt:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

XP_031740953.1 uncharacterized protein LOC101209618 isoform X2 [Cucumis sativus] (e value: 0.0e+00; % identity: 99.83)
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGT ALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
Subjt:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
        TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
Subjt:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

XP_038879291.1 uncharacterized protein LOC120071230 [Benincasa hispida] (e value: 0.0e+00; % identity: 96.68)
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKM+HRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL  ASEADYHRVVERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTEFWKRRFLGEGL SNNVKPSEDDKS+PLDSLDDVDT+EDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQP 
Subjt:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
        TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAIKAPLPS+FLKILQTTHGLGY FGSPLYDEVITLCLDLGELDAAIAIVADLETTGILV DETLDRVIS RQTND+MPKPDSAIDTTLNDHSLA+DE
Subjt:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0M091 SAP domain-containing protein (e value: 0.0e+00; % identity: 99.83)
Query:  MMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAMILSRKY
        MMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGT ALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAMILSRKY
Subjt:  MMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAMILSRKY

Query:  RSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEAD
        RSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEAD
Subjt:  RSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEAD

Query:  YHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTE
        YHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTE
Subjt:  YHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTE

Query:  FWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTSKKSRRR
        FWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTSKKSRRR
Subjt:  FWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTSKKSRRR

Query:  SSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAAIKAPL
        SSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAAIKAPL
Subjt:  SSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAAIKAPL

Query:  PSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS
        PSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS
Subjt:  PSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS

A0A1S3B8T6 uncharacterized protein LOC103487261 isoform X1 (e value: 0.0e+00; % identity: 97.84)
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGTKALHNEGNFGDPLSLYLRALCREGRV++LLEALEAMARDNQQIPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        ALRDASEADYHRVVE+L+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTEFWKRRFLGEGL SNNVKPSEDDKSD LDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
Subjt:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
         TSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILV DETLDRVIS RQTNDAMPKPDSAIDTT+NDHSLANDE
Subjt:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

A0A6J1DBV3 uncharacterized protein LOC111019595 (e value: 0.0e+00; % identity: 95.18)
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRAL+TF+GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADYHRV ERL+KIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTE+WKRRFLGEGL +N+VKPSEDDKS+PLDSLDDVD +ED AKEIEEEE  EEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQ T
Subjt:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
        TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKEL+KR+VFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELA KIMHKVIELGG PTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAI++PLPSAFLKILQTTH LGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGI V DETLDRVISARQTNDAMPKPD+AIDTTLNDHSLANDE
Subjt:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
        AS
Subjt:  AS

A0A6J1EQ88 uncharacterized protein LOC111436825 isoform X1 (e value: 0.0e+00; % identity: 92.86)
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR L+TF GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADY RV ERL+KIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTEFWKRRFLGEGL SNNVKPSEDD+S+PLDSLDDVD +EDVAKEI+EEEAEEEEEVE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQ +
Subjt:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
        TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF EL+KRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAIKAPLPSAF KILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI V DETLDR+ISARQTNDA PK DS ID TLNDHSL NDE
Subjt:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
         S
Subjt:  AS

A0A6J1EWQ7 uncharacterized protein LOC111436825 isoform X2 (e value: 0.0e+00; % identity: 92.86)
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFR L+TF GGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIP 
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMILSRKYRSLVSSWIEPLQEEAEHG+EIDYIARYIEEGGLTGERKRWVPR+GKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
        AL DASEADY RV ERL+KIIKGPD N+LKPKAASKM+VSELKEELEAQGLPIDGTRN+LYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
Subjt:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT
        IKLHEGNTEFWKRRFLGEGL SNNVKPSEDD+S+PLDSLDDVD +EDVAKEI+EEEAEEEEEVE TENQDGERVIKKEVEAKKP QMIGVQLLKDVDQ +
Subjt:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSEDDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPT

Query:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
        TTSKKSRRR SRAS+EDDRDEDWFPED+FEAF EL+KRKVFD SDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI
Subjt:  TTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMI

Query:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE
        LRAAIKAPLPSAF KILQTTH LGYVFGSPLYDE+ITLCLDLGELDAAIAIVADLETTGI V DETLDR+ISARQTNDA PK DS ID TLNDHSL NDE
Subjt:  LRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDE

Query:  AS
         S
Subjt:  AS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G04260.1 plastid transcriptionally active 3 (e value: 2.5e-256; % identity: 75.29)
Query:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP
        VQDVAELLGMMVEDHKR+QPN++TYALLVECFTKYCV++EAIRHFRAL+ FEGGT  LHN GNF DPLSLYLRALCREGR+VEL++AL+AM +DNQ IPP
Subjt:  VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPP

Query:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV
        RAMI+SRKYR+LVSSWIEPLQEEAE G+EIDY+ARYIEEGGLTGERKRWVPR+GKTPLDPDA GFIYSNP+ETSFKQRCLEDWK++HRK+L+TLQ+EGL 
Subjt:  RAMILSRKYRSLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLV

Query:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR
         L DASE+DY RVVERLR IIKGP  N+LKPKAASKM+VSELKEELEAQGLPIDGTRNVLYQRVQKARRIN+SRGRPLWVPP+EEEEEEVDEE+D+LI R
Subjt:  ALRDASEADYHRVVERLRKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISR

Query:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSE--------------DDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQ-DGERVIK-KEVEAKKP
        IKLHEG+TEFWKRRFLGEGL   +V+  E              +D S   D+ +D D  E    E ++E  EEE  V +TEN+ +GE ++K K  +AKK 
Subjt:  IKLHEGNTEFWKRRFLGEGLYSNNVKPSE--------------DDKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQ-DGERVIK-KEVEAKKP

Query:  LQMIGVQLLKDVDQPTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHK
        LQMIGVQLLK+ D+   T KK  +R+SR +LEDD DEDWFPE+ FEAFKE+++RKVFDV+DMYTIADVWGWTWE++ KN+ PR+WSQEWEVELAI +M K
Subjt:  LQMIGVQLLKDVDQPTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKVFDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHK

Query:  VIELGGIPTIGDCAMILRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDA
        VIELGGIPTIGDCA+ILRAA++AP+PSAFLKILQTTH LGY FGSPLYDE+ITLCLDLGELDAAIAIVAD+ETTGI V D+TLD+VISARQ+N++
Subjt:  VIELGGIPTIGDCAMILRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIAIVADLETTGILVHDETLDRVISARQTNDA
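
Each alignment block above has a Query line, a match line, and a Subjt line. In the match line a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. As a rough sketch, the snippet below tallies identities and positives for one block from its Query and match lines. The function name `identity_from_midline` is a hypothetical helper. The displayed blocks may not cover the full alignment, so the result is not expected to reproduce the % identity values listed with each hit.

```python
from typing import Tuple


def identity_from_midline(query: str, midline: str) -> Tuple[int, int, float]:
    """Return (identities, positives, percent identity) for one alignment block.

    `query` is the aligned query segment (gap characters included) and
    `midline` is the match line printed beneath it.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment block")
    # Trailing blanks in the match line are often lost when text is copied,
    # so pad (or trim) it to the width of the query segment.
    midline = midline.ljust(columns)[:columns]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, 100.0 * identities / columns


# A short stretch taken from the first GenBank alignment block.
ids, pos, pct = identity_from_midline("RALRTFEGGTK", "RAL+TF+GGTK")
print(ids, pos, round(pct, 2))
```

To handle a multi-block alignment, the per-block counts would be summed before dividing by the total number of columns.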


Sequences
CDS sequence
GTGCAAGATGTTGCCGAGCTACTTGGCATGATGGTTGAAGACCACAAACGTCTACAACCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACTAAGTATTGTGT
TATACGAGAAGCTATTAGGCATTTTCGTGCACTAAGAACATTTGAAGGTGGAACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCCTTATATCTTCGAG
CTTTATGTAGAGAAGGTAGGGTGGTAGAACTCTTAGAAGCATTAGAGGCTATGGCTAGAGATAACCAACAGATTCCTCCAAGAGCAATGATCTTGAGCAGAAAGTATCGA
TCACTCGTGAGCTCATGGATTGAACCTTTACAAGAAGAAGCTGAACATGGATTCGAGATAGACTACATTGCAAGATACATTGAAGAGGGCGGACTAACTGGAGAGCGCAA
GAGGTGGGTCCCTCGGAAAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATCTATTCAAATCCTATGGAGACATCCTTTAAGCAGCGATGCCTCGAAGATTGGA
AGATGTACCACCGAAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGTGGCTCTGAGGGATGCATCTGAAGCTGATTATCATAGAGTAGTGGAGAGATTGAGGAAAATA
ATAAAGGGTCCTGACCAAAATGTTTTAAAGCCGAAGGCTGCAAGTAAGATGATTGTATCGGAATTAAAAGAAGAGTTAGAAGCACAAGGTTTACCAATTGACGGAACTAG
AAATGTTCTTTACCAGCGTGTTCAAAAAGCAAGAAGAATAAACCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGTTGATGAAGAGC
TGGATGAACTAATTTCACGAATAAAGCTACATGAAGGAAATACGGAGTTCTGGAAACGCCGTTTTCTCGGAGAAGGCTTGTACAGTAATAATGTTAAACCTTCTGAGGAT
GATAAATCAGATCCTCTTGATTCTTTGGATGATGTTGACACTATAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCTGAGGAGGAAGAGGAGGTAGAACAAACTGA
GAATCAAGATGGTGAAAGGGTTATTAAAAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATTGGTGTCCAATTGTTAAAAGATGTTGACCAACCTACAACAACATCCA
AAAAGTCAAGGAGGAGAAGTTCTCGAGCATCACTCGAGGATGATCGTGATGAAGATTGGTTTCCTGAAGATATATTTGAAGCATTTAAAGAGTTGCAAAAGAGGAAAGTC
TTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGAGCT
GGCTATTAAAATTATGCACAAGGTGATAGAATTGGGTGGAATACCAACAATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCATTACCTTCTGCCTTCT
TGAAAATCTTGCAGACAACTCACGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGATGAGGTTATCACCCTTTGCCTTGATCTTGGGGAACTAGATGCAGCCATTGCA
ATTGTAGCAGATCTTGAAACAACAGGAATCTTGGTCCATGATGAAACGCTCGATCGGGTAATCTCTGCTAGACAGACGAATGATGCTATGCCCAAGCCTGACTCAGCCAT
TGATACCACACTGAATGATCACAGTTTAGCCAATGATGAAGCATCATAA
mRNA sequence
GGTGCAAGATGTTGCCGAGCTACTTGGCATGATGGTTGAAGACCACAAACGTCTACAACCTAACATGAGAACCTATGCGCTCTTGGTAGAGTGTTTTACTAAGTATTGTG
TTATACGAGAAGCTATTAGGCATTTTCGTGCACTAAGAACATTTGAAGGTGGAACAAAAGCTTTGCATAATGAAGGAAATTTTGGTGATCCACTTTCCTTATATCTTCGA
GCTTTATGTAGAGAAGGTAGGGTGGTAGAACTCTTAGAAGCATTAGAGGCTATGGCTAGAGATAACCAACAGATTCCTCCAAGAGCAATGATCTTGAGCAGAAAGTATCG
ATCACTCGTGAGCTCATGGATTGAACCTTTACAAGAAGAAGCTGAACATGGATTCGAGATAGACTACATTGCAAGATACATTGAAGAGGGCGGACTAACTGGAGAGCGCA
AGAGGTGGGTCCCTCGGAAAGGAAAAACTCCTCTAGATCCTGATGCAGATGGATTCATCTATTCAAATCCTATGGAGACATCCTTTAAGCAGCGATGCCTCGAAGATTGG
AAGATGTACCACCGAAAGATTTTGAAAACCTTGCAGAATGAAGGACTTGTGGCTCTGAGGGATGCATCTGAAGCTGATTATCATAGAGTAGTGGAGAGATTGAGGAAAAT
AATAAAGGGTCCTGACCAAAATGTTTTAAAGCCGAAGGCTGCAAGTAAGATGATTGTATCGGAATTAAAAGAAGAGTTAGAAGCACAAGGTTTACCAATTGACGGAACTA
GAAATGTTCTTTACCAGCGTGTTCAAAAAGCAAGAAGAATAAACCGGTCTCGTGGTCGGCCCCTTTGGGTTCCTCCAGTGGAGGAGGAGGAAGAAGAGGTTGATGAAGAG
CTGGATGAACTAATTTCACGAATAAAGCTACATGAAGGAAATACGGAGTTCTGGAAACGCCGTTTTCTCGGAGAAGGCTTGTACAGTAATAATGTTAAACCTTCTGAGGA
TGATAAATCAGATCCTCTTGATTCTTTGGATGATGTTGACACTATAGAAGACGTTGCAAAGGAGATTGAAGAAGAAGAAGCTGAGGAGGAAGAGGAGGTAGAACAAACTG
AGAATCAAGATGGTGAAAGGGTTATTAAAAAGGAAGTTGAAGCTAAGAAGCCTCTTCAAATGATTGGTGTCCAATTGTTAAAAGATGTTGACCAACCTACAACAACATCC
AAAAAGTCAAGGAGGAGAAGTTCTCGAGCATCACTCGAGGATGATCGTGATGAAGATTGGTTTCCTGAAGATATATTTGAAGCATTTAAAGAGTTGCAAAAGAGGAAAGT
CTTTGATGTATCTGACATGTACACAATAGCTGATGTTTGGGGTTGGACTTGGGAGAGAGAACTTAAGAACAGACCTCCCAGGAGGTGGTCACAGGAATGGGAAGTGGAGC
TGGCTATTAAAATTATGCACAAGGTGATAGAATTGGGTGGAATACCAACAATTGGCGACTGTGCCATGATCTTGCGAGCTGCCATCAAGGCTCCATTACCTTCTGCCTTC
TTGAAAATCTTGCAGACAACTCACGGTCTTGGCTATGTATTTGGGAGCCCTTTATATGATGAGGTTATCACCCTTTGCCTTGATCTTGGGGAACTAGATGCAGCCATTGC
AATTGTAGCAGATCTTGAAACAACAGGAATCTTGGTCCATGATGAAACGCTCGATCGGGTAATCTCTGCTAGACAGACGAATGATGCTATGCCCAAGCCTGACTCAGCCA
TTGATACCACACTGAATGATCACAGTTTAGCCAATGATGAAGCATCATAATGGATCAGTGTTCTTATTTTTATTTTGTACAGTGCAGTCGTAGAGTTGCCAACAAAATGG
TTCTTGTAAATTTTATTTGTAGTTGCTTTAGTGGTTTAGTATTTTGAATTTTTGAGTTGAAACATTTAATTTTAGAAACAAGATTTTGAGCATAATTATTTATCGTCATA
ATCATTATGGTTGCC
Protein sequence
VQDVAELLGMMVEDHKRLQPNMRTYALLVECFTKYCVIREAIRHFRALRTFEGGTKALHNEGNFGDPLSLYLRALCREGRVVELLEALEAMARDNQQIPPRAMILSRKYR
SLVSSWIEPLQEEAEHGFEIDYIARYIEEGGLTGERKRWVPRKGKTPLDPDADGFIYSNPMETSFKQRCLEDWKMYHRKILKTLQNEGLVALRDASEADYHRVVERLRKI
IKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLYSNNVKPSED
DKSDPLDSLDDVDTIEDVAKEIEEEEAEEEEEVEQTENQDGERVIKKEVEAKKPLQMIGVQLLKDVDQPTTTSKKSRRRSSRASLEDDRDEDWFPEDIFEAFKELQKRKV
FDVSDMYTIADVWGWTWERELKNRPPRRWSQEWEVELAIKIMHKVIELGGIPTIGDCAMILRAAIKAPLPSAFLKILQTTHGLGYVFGSPLYDEVITLCLDLGELDAAIA
IVADLETTGILVHDETLDRVISARQTNDAMPKPDSAIDTTLNDHSLANDEAS
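
The protein sequence above corresponds codon by codon to the CDS sequence. As a minimal sketch, the snippet below translates a CDS with the standard genetic code, which is an assumption here since the record does not name the translation table it used. The names `CODON_TABLE` and `translate` are hypothetical and introduced for illustration. The two calls check only the first and last codons of the CDS as listed, which begins with GTG rather than ATG.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped text
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("GTGCAAGATGTTGCCGAG"))  # first six codons of the CDS
print(translate("GCATCATAA"))           # last three codons of the CDS
```

Passing the full CDS text, with or without its line breaks, should return a string that can be compared directly with the listed protein sequence.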