; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g20010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g20010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionEnzymatic polyprotein
Genome locationchr6:15662179..15667846
RNA-Seq ExpressionMoc06g20010
SyntenyMoc06g20010
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052109.1 Enzymatic polyprotein [Cucumis melo var. makuwa]4.6e-16946.72Show/hide
Query:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------
        +T+ DF   + K E  KNEALVK+LQADGQ+A+I+  TVWVT  GKE+AS +PPEEEA F H  IPAIKM+SSPYKTI+EDKVQKVG             
Subjt:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------

Query:  -----------------------------------------------------------------------------------------------ASSST
                                                                                                       AS   
Subjt:  -----------------------------------------------------------------------------------------------ASSST

Query:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR
        ILPV    ++KNHYP+PSPPD+GWDDL H++R+YDG S+ITWNIDGYSEAQMMNTFQ+M++AATA+STKKS  +TA ILI G +GNLRSWWHN LT++DR
Subjt:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR

Query:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------
         +ILTAT+++VK E ++T +Q++EPDM                                                                         
Subjt:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------

Query:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR
             T   NS   +IDWA LT+ DI++T Q I VNLC +N+HT KVIKD DYRKELGTFCKQYGL   P+EE+KKKKK  S+K+ F K K KD E P+R
Subjt:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR

Query:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD
        +R +Y + KGKK YS     K++ +C+KCN+KGHY+++CPLKDKIN+LTIDEET+QSLLYAIR ++++S  +ESS + D IN++ EEG S EE FYSQSD
Subjt:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD

Query:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN
        SS+++  IPCTG CAG+C GHINVI++DQE LF LI+++PDEE+KR CL+KL+Q+LE +A Q K   N I YS+QDIL RVKGEAK PIQ+EDLH EVK 
Subjt:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN

Query:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPE-------QTLQIGSPSGINYISKM
        LK++VA NKQRL  LE AF  FQES+  +  + +SR +       + L I     IN ISK+
Subjt:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPE-------QTLQIGSPSGINYISKM

KAA0056776.1 Enzymatic polyprotein [Cucumis melo var. makuwa]2.8e-17447.76Show/hide
Query:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------
        +T+ DF   + K E  KNEAL K+LQADGQVA+I+  TVWVTA GKE+AS +PPEEEA FSH  IPAIKMVSSPYKTI+EDKVQKVG             
Subjt:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------

Query:  -----------------------------------------------------------------------------------------------ASSST
                                                                                                       AS S 
Subjt:  -----------------------------------------------------------------------------------------------ASSST

Query:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR
        ILPV    ++KNHYP+PSPPD+GWDDL H++R+YDG S+ITWNIDGYSEAQMMNTFQ+M++AATA+STKKS  +TA ILI G +GNLRSWWHN LT++DR
Subjt:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR

Query:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------
         +ILTAT+++VK E ++T +Q++EPDM                                                                         
Subjt:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------

Query:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR
             T   NS   +IDWA LT+ DI++T Q ICVNLC +N+HT KVIKD DYRKELGTFCKQYGL   P+EE+KKKKK  S+K+ F KSK+KD E PRR
Subjt:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR

Query:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD
        +R++YN+ K KK YS     K+  +C+KCN+KGHY+++CPLKDKIN++TIDEET+QSLLYAIRS+++++  +ESS + D IN++ EEG S EE FYSQSD
Subjt:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD

Query:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN
        SS+++  IPCTG CAG+C GHINVI++DQE LFDLI+++PDEE+KR CL+KL+Q+LE +A Q K   N I YS+QDIL RVKGEAK PIQ+EDLH EVK 
Subjt:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN

Query:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPEQT----LQIGSPSGINYISKME
        LK++VA NKQRL  LE AF  FQ S+A++ E++S    ++    L I     IN ISK++
Subjt:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPEQT----LQIGSPSGINYISKME

TYJ97599.1 Enzymatic polyprotein [Cucumis melo var. makuwa]4.0e-17347.37Show/hide
Query:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------
        +T+ DF   + K E  KNEAL K+LQADGQVA+I+  TVWVTA GKE+AS +PPEEEA FSH  IPAIKMVSSPYKTI+EDKVQKVG             
Subjt:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------

Query:  -----------------------------------------------------------------------------------------------ASSST
                                                                                                       AS S 
Subjt:  -----------------------------------------------------------------------------------------------ASSST

Query:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR
        ILPV    ++KNHYP+PSPPD+GWDDL H++R+YDG S+ITWNIDGYSEAQMMNTFQ+M++AATA+STKKS  +TA ILI G +GNLRSWWHN LT++DR
Subjt:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR

Query:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------
         +ILTAT+++VK E ++T +Q++EPDM                                                                         
Subjt:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------

Query:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR
             T   NS   +IDWA LT+ DI++T Q ICVNLC +N+HT KVIKD DYRKELGTFCKQYGL   P+EE+KKKKK  S+K+ F KSK+KD E P+R
Subjt:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR

Query:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD
        ++++YN+ K KK YS     K+  +C+KCN+KGHY+++CPLKDKIN++TIDEET+QSLLYAIRS+++++  +ESS + D IN++ EEG S EE FYSQSD
Subjt:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD

Query:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN
        SS+++  IPCTG CAG+C GHINVI++DQE LFDLI+++PDEE+KR CL+KL+Q+LE +A Q K   N I YS+QDIL RVKGEAK PIQ+EDLH EVK 
Subjt:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN

Query:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPEQT----LQIGSPSGINYISKME
        LK++VA NKQRL  LE AF  FQ S+A++ E++S    ++    L I     IN IS+++
Subjt:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPEQT----LQIGSPSGINYISKME

TYJ98087.1 Enzymatic polyprotein [Cucumis melo var. makuwa]4.6e-16946.72Show/hide
Query:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------
        +T+ DF   + K E  KNEALVK+LQADGQ+A+I+  TVWVT  GKE+AS +PPEEEA F H  IPAIKM+SSPYKTI+EDKVQKVG             
Subjt:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------

Query:  -----------------------------------------------------------------------------------------------ASSST
                                                                                                       AS   
Subjt:  -----------------------------------------------------------------------------------------------ASSST

Query:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR
        ILPV    ++KNHYP+PSPPD+GWDDL H++R+YDG S+ITWNIDGYSEAQMMNTFQ+M++AATA+STKKS  +TA ILI G +GNLRSWWHN LT++DR
Subjt:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR

Query:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------
         +ILTAT+++VK E ++T +Q++EPDM                                                                         
Subjt:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------

Query:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR
             T   NS   +IDWA LT+ DI++T Q I VNLC +N+HT KVIKD DYRKELGTFCKQYGL   P+EE+KKKKK  S+K+ F K K KD E P+R
Subjt:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR

Query:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD
        +R +Y + KGKK YS     K++ +C+KCN+KGHY+++CPLKDKIN+LTIDEET+QSLLYAIR ++++S  +ESS + D IN++ EEG S EE FYSQSD
Subjt:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD

Query:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN
        SS+++  IPCTG CAG+C GHINVI++DQE LF LI+++PDEE+KR CL+KL+Q+LE +A Q K   N I YS+QDIL RVKGEAK PIQ+EDLH EVK 
Subjt:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN

Query:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPE-------QTLQIGSPSGINYISKM
        LK++VA NKQRL  LE AF  FQES+  +  + +SR +       + L I     IN ISK+
Subjt:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPE-------QTLQIGSPSGINYISKM

XP_022151716.1 uncharacterized protein LOC111019629 [Momordica charantia]3.2e-18674.05Show/hide
Query:  ASSSTILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQL
        ASSSTIL VTMHTEVKNHYPRPSPPDMGWDDLRHDQR+YD SSIITWNIDGYSEAQMMNTFQ+MMMAATAFSTKK VLQTA ILIS LSGNLRSWWHNQL
Subjt:  ASSSTILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQL

Query:  TDEDRTKILTATKSIVKQEGSNTMQIDEPDM---------------------------------------------------------------------
        TDEDRTKIL ATK++VKQEGSN MQIDEPDM                                                                     
Subjt:  TDEDRTKILTATKSIVKQEGSNTMQIDEPDM---------------------------------------------------------------------

Query:  ---------TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKSSNKRLFNKSKSKDSEL
                 T VTNS TNRIDWAELT  DINAT QQICVNLCL+N+HTAKVIK+PDYRKELGTFCKQYGLD+R EEERKKKKKSSNKRLF+KSKSKDSEL
Subjt:  ---------TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKSSNKRLFNKSKSKDSEL

Query:  PRRKRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEGSDEETFYSQ
        PRRKRKYYNRNKGKKDYSKNRPHKSSV CYKCNRKGHYSSKCPLKDKINSLTIDE+TR+SLLYAIRSEEE+S SSESS DNDEINLINEE SDEETF+SQ
Subjt:  PRRKRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEGSDEETFYSQ

Query:  SDSSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEY
        SDSSEED IIPCTGHCAG+CHGHINVI++DQEALFDLID+LPDE+SKRMCLVKLR++LEAEALQ+KP+ ++ +Y
Subjt:  SDSSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEY

TrEMBL top hitse value%identityAlignment
A0A5A7UF59 Enzymatic polyprotein2.2e-16946.72Show/hide
Query:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------
        +T+ DF   + K E  KNEALVK+LQADGQ+A+I+  TVWVT  GKE+AS +PPEEEA F H  IPAIKM+SSPYKTI+EDKVQKVG             
Subjt:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------

Query:  -----------------------------------------------------------------------------------------------ASSST
                                                                                                       AS   
Subjt:  -----------------------------------------------------------------------------------------------ASSST

Query:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR
        ILPV    ++KNHYP+PSPPD+GWDDL H++R+YDG S+ITWNIDGYSEAQMMNTFQ+M++AATA+STKKS  +TA ILI G +GNLRSWWHN LT++DR
Subjt:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR

Query:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------
         +ILTAT+++VK E ++T +Q++EPDM                                                                         
Subjt:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------

Query:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR
             T   NS   +IDWA LT+ DI++T Q I VNLC +N+HT KVIKD DYRKELGTFCKQYGL   P+EE+KKKKK  S+K+ F K K KD E P+R
Subjt:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR

Query:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD
        +R +Y + KGKK YS     K++ +C+KCN+KGHY+++CPLKDKIN+LTIDEET+QSLLYAIR ++++S  +ESS + D IN++ EEG S EE FYSQSD
Subjt:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD

Query:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN
        SS+++  IPCTG CAG+C GHINVI++DQE LF LI+++PDEE+KR CL+KL+Q+LE +A Q K   N I YS+QDIL RVKGEAK PIQ+EDLH EVK 
Subjt:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN

Query:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPE-------QTLQIGSPSGINYISKM
        LK++VA NKQRL  LE AF  FQES+  +  + +SR +       + L I     IN ISK+
Subjt:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPE-------QTLQIGSPSGINYISKM

A0A5A7UR29 Enzymatic polyprotein1.4e-17447.76Show/hide
Query:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------
        +T+ DF   + K E  KNEAL K+LQADGQVA+I+  TVWVTA GKE+AS +PPEEEA FSH  IPAIKMVSSPYKTI+EDKVQKVG             
Subjt:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------

Query:  -----------------------------------------------------------------------------------------------ASSST
                                                                                                       AS S 
Subjt:  -----------------------------------------------------------------------------------------------ASSST

Query:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR
        ILPV    ++KNHYP+PSPPD+GWDDL H++R+YDG S+ITWNIDGYSEAQMMNTFQ+M++AATA+STKKS  +TA ILI G +GNLRSWWHN LT++DR
Subjt:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR

Query:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------
         +ILTAT+++VK E ++T +Q++EPDM                                                                         
Subjt:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------

Query:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR
             T   NS   +IDWA LT+ DI++T Q ICVNLC +N+HT KVIKD DYRKELGTFCKQYGL   P+EE+KKKKK  S+K+ F KSK+KD E PRR
Subjt:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR

Query:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD
        +R++YN+ K KK YS     K+  +C+KCN+KGHY+++CPLKDKIN++TIDEET+QSLLYAIRS+++++  +ESS + D IN++ EEG S EE FYSQSD
Subjt:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD

Query:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN
        SS+++  IPCTG CAG+C GHINVI++DQE LFDLI+++PDEE+KR CL+KL+Q+LE +A Q K   N I YS+QDIL RVKGEAK PIQ+EDLH EVK 
Subjt:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN

Query:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPEQT----LQIGSPSGINYISKME
        LK++VA NKQRL  LE AF  FQ S+A++ E++S    ++    L I     IN ISK++
Subjt:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPEQT----LQIGSPSGINYISKME

A0A5D3BEY3 Enzymatic polyprotein2.0e-17347.37Show/hide
Query:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------
        +T+ DF   + K E  KNEAL K+LQADGQVA+I+  TVWVTA GKE+AS +PPEEEA FSH  IPAIKMVSSPYKTI+EDKVQKVG             
Subjt:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------

Query:  -----------------------------------------------------------------------------------------------ASSST
                                                                                                       AS S 
Subjt:  -----------------------------------------------------------------------------------------------ASSST

Query:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR
        ILPV    ++KNHYP+PSPPD+GWDDL H++R+YDG S+ITWNIDGYSEAQMMNTFQ+M++AATA+STKKS  +TA ILI G +GNLRSWWHN LT++DR
Subjt:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR

Query:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------
         +ILTAT+++VK E ++T +Q++EPDM                                                                         
Subjt:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------

Query:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR
             T   NS   +IDWA LT+ DI++T Q ICVNLC +N+HT KVIKD DYRKELGTFCKQYGL   P+EE+KKKKK  S+K+ F KSK+KD E P+R
Subjt:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR

Query:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD
        ++++YN+ K KK YS     K+  +C+KCN+KGHY+++CPLKDKIN++TIDEET+QSLLYAIRS+++++  +ESS + D IN++ EEG S EE FYSQSD
Subjt:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD

Query:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN
        SS+++  IPCTG CAG+C GHINVI++DQE LFDLI+++PDEE+KR CL+KL+Q+LE +A Q K   N I YS+QDIL RVKGEAK PIQ+EDLH EVK 
Subjt:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN

Query:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPEQT----LQIGSPSGINYISKME
        LK++VA NKQRL  LE AF  FQ S+A++ E++S    ++    L I     IN IS+++
Subjt:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPEQT----LQIGSPSGINYISKME

A0A5D3BG41 Enzymatic polyprotein2.2e-16946.72Show/hide
Query:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------
        +T+ DF   + K E  KNEALVK+LQADGQ+A+I+  TVWVT  GKE+AS +PPEEEA F H  IPAIKM+SSPYKTI+EDKVQKVG             
Subjt:  MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVG-------------

Query:  -----------------------------------------------------------------------------------------------ASSST
                                                                                                       AS   
Subjt:  -----------------------------------------------------------------------------------------------ASSST

Query:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR
        ILPV    ++KNHYP+PSPPD+GWDDL H++R+YDG S+ITWNIDGYSEAQMMNTFQ+M++AATA+STKKS  +TA ILI G +GNLRSWWHN LT++DR
Subjt:  ILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDR

Query:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------
         +ILTAT+++VK E ++T +Q++EPDM                                                                         
Subjt:  TKILTATKSIVKQEGSNT-MQIDEPDM-------------------------------------------------------------------------

Query:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR
             T   NS   +IDWA LT+ DI++T Q I VNLC +N+HT KVIKD DYRKELGTFCKQYGL   P+EE+KKKKK  S+K+ F K K KD E P+R
Subjt:  -----TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKS-SNKRLFNKSKSKDSELPRR

Query:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD
        +R +Y + KGKK YS     K++ +C+KCN+KGHY+++CPLKDKIN+LTIDEET+QSLLYAIR ++++S  +ESS + D IN++ EEG S EE FYSQSD
Subjt:  KRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEG-SDEETFYSQSD

Query:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN
        SS+++  IPCTG CAG+C GHINVI++DQE LF LI+++PDEE+KR CL+KL+Q+LE +A Q K   N I YS+QDIL RVKGEAK PIQ+EDLH EVK 
Subjt:  SSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKN

Query:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPE-------QTLQIGSPSGINYISKM
        LK++VA NKQRL  LE AF  FQES+  +  + +SR +       + L I     IN ISK+
Subjt:  LKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPE-------QTLQIGSPSGINYISKM

A0A6J1DFI7 uncharacterized protein LOC1110196291.5e-18674.05Show/hide
Query:  ASSSTILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQL
        ASSSTIL VTMHTEVKNHYPRPSPPDMGWDDLRHDQR+YD SSIITWNIDGYSEAQMMNTFQ+MMMAATAFSTKK VLQTA ILIS LSGNLRSWWHNQL
Subjt:  ASSSTILPVTMHTEVKNHYPRPSPPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQL

Query:  TDEDRTKILTATKSIVKQEGSNTMQIDEPDM---------------------------------------------------------------------
        TDEDRTKIL ATK++VKQEGSN MQIDEPDM                                                                     
Subjt:  TDEDRTKILTATKSIVKQEGSNTMQIDEPDM---------------------------------------------------------------------

Query:  ---------TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKSSNKRLFNKSKSKDSEL
                 T VTNS TNRIDWAELT  DINAT QQICVNLCL+N+HTAKVIK+PDYRKELGTFCKQYGLD+R EEERKKKKKSSNKRLF+KSKSKDSEL
Subjt:  ---------TAVTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKSSNKRLFNKSKSKDSEL

Query:  PRRKRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEGSDEETFYSQ
        PRRKRKYYNRNKGKKDYSKNRPHKSSV CYKCNRKGHYSSKCPLKDKINSLTIDE+TR+SLLYAIRSEEE+S SSESS DNDEINLINEE SDEETF+SQ
Subjt:  PRRKRKYYNRNKGKKDYSKNRPHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEGSDEETFYSQ

Query:  SDSSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEY
        SDSSEED IIPCTGHCAG+CHGHINVI++DQEALFDLID+LPDE+SKRMCLVKLR++LEAEALQ+KP+ ++ +Y
Subjt:  SDSSEEDEIIPCTGHCAGRCHGHINVISRDQEALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCTTGGCGATTTCACCTCAAAGATACAAAAACGAGAGCTAGTAAAGAACGAAGCTCTAGTCAAAAGACTCCAAGCTGATGGACAGGTAGCGGTCATCAGAAATGG
CACTGTTTGGGTAACTGCCAGAGGCAAAGAGATAGCTTCCACCTTCCCCCCAGAAGAAGAAGCGACCTTCTCTCACCTGGTGATACCTGCCATAAAGATGGTGTCTTCAC
CCTATAAAACAATAGATGAAGACAAAGTCCAGAAAGTTGGGGCTTCATCCTCAACAATACTTCCGGTTACCATGCACACGGAAGTAAAGAATCATTATCCAAGACCATCT
CCTCCAGATATGGGATGGGACGATCTTCGCCATGACCAACGAAGTTATGACGGATCTTCTATAATTACTTGGAATATCGATGGGTATTCTGAAGCTCAAATGATGAATAC
TTTTCAAAAAATGATGATGGCAGCCACTGCCTTCAGCACCAAGAAGTCGGTTTTACAGACAGCCCACATCCTTATCTCGGGCCTTTCTGGAAACCTAAGAAGCTGGTGGC
ATAACCAGCTAACCGACGAAGATAGAACGAAAATCCTGACGGCGACTAAATCGATTGTCAAGCAGGAAGGTTCTAATACTATGCAGATTGATGAGCCAGATATGACTGCG
GTAACAAACTCTGCAACTAATCGTATCGATTGGGCGGAGTTAACATTCGAAGACATTAACGCCACGAATCAACAGATATGCGTTAATCTCTGTCTCAAGAATAGGCATAC
AGCCAAAGTCATCAAAGATCCTGACTACCGAAAGGAATTGGGAACTTTTTGCAAACAATATGGTCTTGACAATAGACCTGAAGAAGAACGGAAGAAGAAGAAGAAATCTT
CCAACAAGCGACTCTTCAACAAGAGCAAATCAAAAGATTCCGAATTACCAAGGCGTAAACGGAAATATTACAACAGGAACAAAGGAAAGAAGGATTATTCTAAGAATCGT
CCTCATAAGTCCTCTGTTGTCTGCTACAAATGCAACCGCAAAGGACACTACTCCAGTAAGTGCCCTTTGAAGGACAAAATCAACTCTCTGACTATAGATGAAGAAACAAG
ACAATCTCTTCTCTATGCCATCAGAAGCGAAGAAGAAAGCTCTTTGAGTTCCGAATCTTCTATCGACAATGATGAGATCAACCTCATAAACGAAGAAGGTTCTGATGAAG
AGACGTTCTATTCTCAAAGTGATTCCTCTGAAGAAGATGAAATTATTCCTTGCACTGGCCATTGCGCTGGAAGATGCCATGGCCATATCAATGTCATCAGTAGAGATCAA
GAGGCCCTCTTTGATCTAATTGATAGACTACCCGATGAAGAATCCAAGAGAATGTGCCTTGTGAAACTTCGGCAAAACCTTGAAGCAGAAGCTCTTCAAAGGAAACCAGA
TTATAACCTAATAGAATACTCTTTCCAAGATATTCTAAAAAGGGTCAAAGGAGAAGCCAAGAAGCCGATCCAAATTGAAGATCTCCACACTGAAGTGAAGAATCTCAAAA
AAAAAGTTGCTAGTAACAAGCAACGACTTTCTACTCTTGAATTCGCCTTTGGAAAATTCCAAGAGTCAGAAGCAACAGAAGGAGAAACCTCCTCTTCAAGACCTGAACAG
ACCTTACAGATTGGTTCACCAAGCGGGATCAATTACATCAGTAAAATGGAACCTCCTCCCGGCCGGAGACGGTCAAACTCTCAGCGGCCACCGCCGCAAACCAATCAGCG
ACCTCCGTCGCCAAGAAATGAGTCTTCCATTTCTCCCCAAACAGTAGCATCCTCTTCAAGGGCTGCTATCTCAAAGGGCAAAAGGCCCATTAATCAATCATCTGTACCAT
CACCGATGAGTGTAGAGAATTATGCCATGGATATCCAGTTTGAAACGGTATCCAGGCGTCAGCCAGGTTCTTCTCAGAGAACCTTGACTATCCAATCAGGCCCTCCGAGC
CTTCCAACTCTTTCAAGCACGTTGTTACGCCCCCGCGGCAATACAACGAGAAACAGGCGCCCTGCTACGGCAGCCGCCGCTCCCAGACCAACGATTTCGAGGAACCCTTC
CTCGTTTTCTCAAATAGTCAGGCCAAAGGTTTTCCAACCAAGGCCTCCCATCACTGGTTATTTCACCAAAACTACCCTGGTGGACTCAACCATTGAACCAGAGTTCGACG
GACCTTCGGTCCAAGAAGTCTGCAAACAGATATTTCCTCATGGCTTCAATTTCCTGCCAGAGGATCTTCAAAAGACCCAAACTTACTATGAGTTTATTCTGGTAGATTCA
AAGTCTGCAGAAATAACTCATGTTCCAGACAGAAATGATCCTTCTAGGACCATTTACTCAAAGCTCAGGATCTTCCGCATTCTTACCCCTTCCTCTTGGAAACAGGGCTT
ACAAATCTCACTTTCCAAATTGGTTTCAAACTTGGTGGAACTACTTCGGACTCTCCGACGAAATTTTTCCGGTAGAAGTTCAGAGCTAGGACCCAGTGGAAACTTCAAAG
CATTAAGCAAAGCTTTACGCATCAAATGGTGGGAAAAATTCGATTATTCCTACTTAGAATCTGACAAGATGAAGGATTGGTTGAAAACCAACGTTCATCTTCAAGACATG
ACAAGGCAAGAAGATGAGAGCTTCCTTCTGGCCAAAAATGCTGTCATGAGTTCGCTAGCTGGAGCCGGATCCCAAGCCGACTTCAACTCAGTACTCAATACCGTTGCAGT
TCAGATTTCTGATCCCGACGAAGCCCAGACGGATGTAGATTCTTCCGCCTCTGTCAACAATGATGCCGTAGACGACGAAGAAGACTTCGATCCCTTCGATGGATACAACA
TCAACGACCCATATCTAGATTCACAGCCTAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCCTTGGCGATTTCACCTCAAAGATACAAAAACGAGAGCTAGTAAAGAACGAAGCTCTAGTCAAAAGACTCCAAGCTGATGGACAGGTAGCGGTCATCAGAAATGG
CACTGTTTGGGTAACTGCCAGAGGCAAAGAGATAGCTTCCACCTTCCCCCCAGAAGAAGAAGCGACCTTCTCTCACCTGGTGATACCTGCCATAAAGATGGTGTCTTCAC
CCTATAAAACAATAGATGAAGACAAAGTCCAGAAAGTTGGGGCTTCATCCTCAACAATACTTCCGGTTACCATGCACACGGAAGTAAAGAATCATTATCCAAGACCATCT
CCTCCAGATATGGGATGGGACGATCTTCGCCATGACCAACGAAGTTATGACGGATCTTCTATAATTACTTGGAATATCGATGGGTATTCTGAAGCTCAAATGATGAATAC
TTTTCAAAAAATGATGATGGCAGCCACTGCCTTCAGCACCAAGAAGTCGGTTTTACAGACAGCCCACATCCTTATCTCGGGCCTTTCTGGAAACCTAAGAAGCTGGTGGC
ATAACCAGCTAACCGACGAAGATAGAACGAAAATCCTGACGGCGACTAAATCGATTGTCAAGCAGGAAGGTTCTAATACTATGCAGATTGATGAGCCAGATATGACTGCG
GTAACAAACTCTGCAACTAATCGTATCGATTGGGCGGAGTTAACATTCGAAGACATTAACGCCACGAATCAACAGATATGCGTTAATCTCTGTCTCAAGAATAGGCATAC
AGCCAAAGTCATCAAAGATCCTGACTACCGAAAGGAATTGGGAACTTTTTGCAAACAATATGGTCTTGACAATAGACCTGAAGAAGAACGGAAGAAGAAGAAGAAATCTT
CCAACAAGCGACTCTTCAACAAGAGCAAATCAAAAGATTCCGAATTACCAAGGCGTAAACGGAAATATTACAACAGGAACAAAGGAAAGAAGGATTATTCTAAGAATCGT
CCTCATAAGTCCTCTGTTGTCTGCTACAAATGCAACCGCAAAGGACACTACTCCAGTAAGTGCCCTTTGAAGGACAAAATCAACTCTCTGACTATAGATGAAGAAACAAG
ACAATCTCTTCTCTATGCCATCAGAAGCGAAGAAGAAAGCTCTTTGAGTTCCGAATCTTCTATCGACAATGATGAGATCAACCTCATAAACGAAGAAGGTTCTGATGAAG
AGACGTTCTATTCTCAAAGTGATTCCTCTGAAGAAGATGAAATTATTCCTTGCACTGGCCATTGCGCTGGAAGATGCCATGGCCATATCAATGTCATCAGTAGAGATCAA
GAGGCCCTCTTTGATCTAATTGATAGACTACCCGATGAAGAATCCAAGAGAATGTGCCTTGTGAAACTTCGGCAAAACCTTGAAGCAGAAGCTCTTCAAAGGAAACCAGA
TTATAACCTAATAGAATACTCTTTCCAAGATATTCTAAAAAGGGTCAAAGGAGAAGCCAAGAAGCCGATCCAAATTGAAGATCTCCACACTGAAGTGAAGAATCTCAAAA
AAAAAGTTGCTAGTAACAAGCAACGACTTTCTACTCTTGAATTCGCCTTTGGAAAATTCCAAGAGTCAGAAGCAACAGAAGGAGAAACCTCCTCTTCAAGACCTGAACAG
ACCTTACAGATTGGTTCACCAAGCGGGATCAATTACATCAGTAAAATGGAACCTCCTCCCGGCCGGAGACGGTCAAACTCTCAGCGGCCACCGCCGCAAACCAATCAGCG
ACCTCCGTCGCCAAGAAATGAGTCTTCCATTTCTCCCCAAACAGTAGCATCCTCTTCAAGGGCTGCTATCTCAAAGGGCAAAAGGCCCATTAATCAATCATCTGTACCAT
CACCGATGAGTGTAGAGAATTATGCCATGGATATCCAGTTTGAAACGGTATCCAGGCGTCAGCCAGGTTCTTCTCAGAGAACCTTGACTATCCAATCAGGCCCTCCGAGC
CTTCCAACTCTTTCAAGCACGTTGTTACGCCCCCGCGGCAATACAACGAGAAACAGGCGCCCTGCTACGGCAGCCGCCGCTCCCAGACCAACGATTTCGAGGAACCCTTC
CTCGTTTTCTCAAATAGTCAGGCCAAAGGTTTTCCAACCAAGGCCTCCCATCACTGGTTATTTCACCAAAACTACCCTGGTGGACTCAACCATTGAACCAGAGTTCGACG
GACCTTCGGTCCAAGAAGTCTGCAAACAGATATTTCCTCATGGCTTCAATTTCCTGCCAGAGGATCTTCAAAAGACCCAAACTTACTATGAGTTTATTCTGGTAGATTCA
AAGTCTGCAGAAATAACTCATGTTCCAGACAGAAATGATCCTTCTAGGACCATTTACTCAAAGCTCAGGATCTTCCGCATTCTTACCCCTTCCTCTTGGAAACAGGGCTT
ACAAATCTCACTTTCCAAATTGGTTTCAAACTTGGTGGAACTACTTCGGACTCTCCGACGAAATTTTTCCGGTAGAAGTTCAGAGCTAGGACCCAGTGGAAACTTCAAAG
CATTAAGCAAAGCTTTACGCATCAAATGGTGGGAAAAATTCGATTATTCCTACTTAGAATCTGACAAGATGAAGGATTGGTTGAAAACCAACGTTCATCTTCAAGACATG
ACAAGGCAAGAAGATGAGAGCTTCCTTCTGGCCAAAAATGCTGTCATGAGTTCGCTAGCTGGAGCCGGATCCCAAGCCGACTTCAACTCAGTACTCAATACCGTTGCAGT
TCAGATTTCTGATCCCGACGAAGCCCAGACGGATGTAGATTCTTCCGCCTCTGTCAACAATGATGCCGTAGACGACGAAGAAGACTTCGATCCCTTCGATGGATACAACA
TCAACGACCCATATCTAGATTCACAGCCTAGCTGA
Protein sequenceShow/hide protein sequence
MTLGDFTSKIQKRELVKNEALVKRLQADGQVAVIRNGTVWVTARGKEIASTFPPEEEATFSHLVIPAIKMVSSPYKTIDEDKVQKVGASSSTILPVTMHTEVKNHYPRPS
PPDMGWDDLRHDQRSYDGSSIITWNIDGYSEAQMMNTFQKMMMAATAFSTKKSVLQTAHILISGLSGNLRSWWHNQLTDEDRTKILTATKSIVKQEGSNTMQIDEPDMTA
VTNSATNRIDWAELTFEDINATNQQICVNLCLKNRHTAKVIKDPDYRKELGTFCKQYGLDNRPEEERKKKKKSSNKRLFNKSKSKDSELPRRKRKYYNRNKGKKDYSKNR
PHKSSVVCYKCNRKGHYSSKCPLKDKINSLTIDEETRQSLLYAIRSEEESSLSSESSIDNDEINLINEEGSDEETFYSQSDSSEEDEIIPCTGHCAGRCHGHINVISRDQ
EALFDLIDRLPDEESKRMCLVKLRQNLEAEALQRKPDYNLIEYSFQDILKRVKGEAKKPIQIEDLHTEVKNLKKKVASNKQRLSTLEFAFGKFQESEATEGETSSSRPEQ
TLQIGSPSGINYISKMEPPPGRRRSNSQRPPPQTNQRPPSPRNESSISPQTVASSSRAAISKGKRPINQSSVPSPMSVENYAMDIQFETVSRRQPGSSQRTLTIQSGPPS
LPTLSSTLLRPRGNTTRNRRPATAAAAPRPTISRNPSSFSQIVRPKVFQPRPPITGYFTKTTLVDSTIEPEFDGPSVQEVCKQIFPHGFNFLPEDLQKTQTYYEFILVDS
KSAEITHVPDRNDPSRTIYSKLRIFRILTPSSWKQGLQISLSKLVSNLVELLRTLRRNFSGRSSELGPSGNFKALSKALRIKWWEKFDYSYLESDKMKDWLKTNVHLQDM
TRQEDESFLLAKNAVMSSLAGAGSQADFNSVLNTVAVQISDPDEAQTDVDSSASVNNDAVDDEEDFDPFDGYNINDPYLDSQPS