; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0002209 (gene) of Chayote v1 genome

Gene IDSed0002209
OrganismSechium edule (Chayote v1)
DescriptionRING/FYVE/PHD zinc finger superfamily protein, putative isoform 1
Genome locationLG08:40301826..40320934
RNA-Seq ExpressionSed0002209
SyntenySed0002209
Gene Ontology termsGO:0006325 - chromatin organization (biological process)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001025 - Bromo adjacent homology (BAH) domain
IPR001965 - Zinc finger, PHD-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger
IPR043151 - Bromo adjacent homology (BAH) domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577472.1 PHD finger protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0072.1Show/hide
Query:  DKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHP
        +KR IEIHGD +   A+  L KKPR    + PNLRRVAEIVL +STMTALRAG  PT AE++LMAEARAKL+ IC+   P   VG E I+SLI DLGLH 
Subjt:  DKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHP

Query:  KPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQA
           D  LGFRGPRLTIA+K A  KKKMEDSKKY+PPSGYGS P Q    SSIE RG LP+VRMF SDKSS VP SVGGT   LP+GHVSV+GSTS+QVQ 
Subjt:  KPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQA

Query:  QLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQ
        QL  NEARAH ISSGFPI+ QGRD SSLLHGIERPLNG YGSQ+QVNSS NH LA+ PTWSAQTQSALSAKG PEHK P+HS  NAQG TDSR LRSSSQ
Subjt:  QLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQ

Query:  GARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH
         ARDQ+FRP I QT TGN+AGLQPHLQS+++VQGPS+SN+HN+IVKIIQKLLQPQ PDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH
Subjt:  GARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH

Query:  LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNANE-
        LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+   ++S   A+EQ  SAGQLKLV NGG DL S  QPA+ GSNANE 
Subjt:  LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNANE-

Query:  SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS--------QI
        SGI+I   EE HGN+ LPIRKD+DEKPTSSTSLNTPAKS+ +VC+PSS E+SSE   Q  KSSQA IGEDGSS K+EPP+ESQT  DN S        QI
Subjt:  SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS--------QI

Query:  VDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHTSNG
        VDQ+M SNG E+PYST +++  S+VKK+G E LQEN VENFEA II+REQSGT S+ LHDVEW+G PHQ+ D +AYY+SC +DGVTYKVEEF+LFH++NG
Subjt:  VDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHTSNG

Query:  KLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIFLCK
        KLMPY LQS+  E+ESG N AILK+CYF+EDLPKEVAHLCPCSPEQNEVYTSDG I L VGLIR PCE+LPV+KYKEE+ERR+QLG GAD+ IK  FLCK
Subjt:  KLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIFLCK

Query:  YVHFLKFFYHLVLLETVSSSFPLLNNVL
        + +       +   + V  SF ++   L
Subjt:  YVHFLKFFYHLVLLETVSSSFPLLNNVL

XP_022932252.1 uncharacterized protein LOC111438615 isoform X1 [Cucurbita moschata]0.0e+0072.34Show/hide
Query:  DKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHP
        +KR IEIHGD +   A+  L KKPR    + PNLRRVAEIVL +STMTALRAG  PT AE++LMAEARAKL+ IC+   P   VG E I+SLI DLGLH 
Subjt:  DKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHP

Query:  KPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQA
           D  LGFRGPRLTIA+K A  KKKMEDSKKY+PPSGYGS P Q    SSIE RG LP+VRMF SDKSS VP SVGGT   LPSGHVSV+GSTS+QVQ 
Subjt:  KPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQA

Query:  QLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQ
        QL  NE RAH ISSGFPI+ QGRD SSLLHGIERPLNGTYGSQ+QVNSS NH LA+ PTWSAQTQSALSAKG PEHK P+HS  NAQG TDSR LRSSSQ
Subjt:  QLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQ

Query:  GARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH
         ARDQ+FRP I QT TGN+AGLQPHLQS+++VQGPS+SN+HN+IVKIIQKLLQPQ PDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH
Subjt:  GARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH

Query:  LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNANE-
        LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+   ++S   A+EQ  SAGQLKLV NGG DL S  QPA+ GSNANE 
Subjt:  LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNANE-

Query:  SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS--------QI
        SGI+I   EE HGN+ LPIRKD+DEKPTSSTSLNTPAKS+ +VC+PSS E+SSE   Q  KSSQA IGEDGSS KSEPP+ESQT  DN S        QI
Subjt:  SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS--------QI

Query:  VDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHTSNG
        VDQ+M SNG E+PYST +++  S+VKKDG E LQEN VENFEA II+REQSGT S+ LHDVEW+G PHQ+ D +AYY+SC +DGVTY VEEF+LFH++NG
Subjt:  VDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHTSNG

Query:  KLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIFLCK
        KLMPY LQS+  E+ESG N AILK+CYF+EDLPKEVAHLCPCSPEQNEVYTSDG I L VGLIR PCE+LPV+KYKEE+ERR+QLG GAD+ IK  FLCK
Subjt:  KLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIFLCK

Query:  YVHFLKFFYHLVLLETVSSSFPLLNNVL
        + +       +   + V  SF ++   L
Subjt:  YVHFLKFFYHLVLLETVSSSFPLLNNVL

XP_022932253.1 uncharacterized protein LOC111438615 isoform X2 [Cucurbita moschata]0.0e+0074.1Show/hide
Query:  DKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHP
        +KR IEIHGD +   A+  L KKPR    + PNLRRVAEIVL +STMTALRAG  PT AE++LMAEARAKL+ IC+   P   VG E I+SLI DLGLH 
Subjt:  DKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHP

Query:  KPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQA
           D  LGFRGPRLTIA+K A  KKKMEDSKKY+PPSGYGS P Q    SSIE RG LP+VRMF SDKSS VP SVGGT   LPSGHVSV+GSTS+QVQ 
Subjt:  KPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQA

Query:  QLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQ
        QL  NE RAH ISSGFPI+ QGRD SSLLHGIERPLNGTYGSQ+QVNSS NH LA+ PTWSAQTQSALSAKG PEHK P+HS  NAQG TDSR LRSSSQ
Subjt:  QLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQ

Query:  GARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH
         ARDQ+FRP I QT TGN+AGLQPHLQS+++VQGPS+SN+HN+IVKIIQKLLQPQ PDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH
Subjt:  GARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH

Query:  LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNANE-
        LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+   ++S   A+EQ  SAGQLKLV NGG DL S  QPA+ GSNANE 
Subjt:  LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNANE-

Query:  SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS--------QI
        SGI+I   EE HGN+ LPIRKD+DEKPTSSTSLNTPAKS+ +VC+PSS E+SSE   Q  KSSQA IGEDGSS KSEPP+ESQT  DN S        QI
Subjt:  SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS--------QI

Query:  VDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHTSNG
        VDQ+M SNG E+PYST +++  S+VKKDG E LQEN VENFEA II+REQSGT S+ LHDVEW+G PHQ+ D +AYY+SC +DGVTY VEEF+LFH++NG
Subjt:  VDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHTSNG

Query:  KLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIFLCK
        KLMPY LQS+  E+ESG N AILK+CYF+EDLPKEVAHLCPCSPEQNEVYTSDG I L VGLIR PCE+LPV+KYKEE+ERR+QLG GAD+ IK  FLCK
Subjt:  KLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIFLCK

Query:  YVH
        + +
Subjt:  YVH

XP_023007695.1 uncharacterized protein LOC111500233 isoform X2 [Cucurbita maxima]0.0e+0073.82Show/hide
Query:  AADKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGL
        A +KR IEIHGD +   A+  L KKPR    + PNLRRVAEIVL +STMTALRAG  PT AE++LMAEARAKL+ IC+   P   VG E I+SLI DLGL
Subjt:  AADKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGL

Query:  HPKPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQV
        H    D  LGFRGPRLTIA+K A  KKKMEDSKKY+PPSGYGS P Q    SSIE RG LP+VRMF SDKSS VP SVGGT   LPSGHVSV+GS+S+QV
Subjt:  HPKPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQV

Query:  QAQLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSS
        Q QL  NE RAH ISSGFPI+ QGRDSSSLLHGIERPLNGTYGSQ+QVNSS NH LA+ PTWSAQTQSALSAKG PEHK P+HS  NAQG TDSR LRSS
Subjt:  QAQLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSS

Query:  SQGARDQNFRPSISQTVTGNIAGLQ-PHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEK
        SQ ARDQ+FRP I QT TGN+AGLQ PHLQS+++VQGPS+SN+HN+IVKIIQKLLQPQ PDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEK
Subjt:  SQGARDQNFRPSISQTVTGNIAGLQ-PHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEK

Query:  GFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNA
        GFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+   ++S   A+EQ  SAGQLKLV NGG DL S  QPA+ GSNA
Subjt:  GFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNA

Query:  NE-SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS-------
        NE SGI+I   EE HGN+FLPIRKD+DEKPTSSTSLNTPAKS+ +VC+PSS E+SSE   Q  KSSQA IGEDGSS K+EPP+ESQT  DN S       
Subjt:  NE-SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS-------

Query:  -QIVDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHT
         QI+DQKM SNG E+PY TE+++  S+VKKDG E LQEN VENFEA I++REQSGT S+ LHDVEW+G PHQ+ D +AYY+SC +DGVTYKVEEF+LFH+
Subjt:  -QIVDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHT

Query:  SNGKLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIF
        +NGKLMPY LQS+  E+ESG N AILK+CYF+EDLPKEVAHLCPCSPEQNEVYTSDG I L VGLIR PCE+LPV+KYK E+ERR+QLG GAD+ IK  F
Subjt:  SNGKLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIF

Query:  LCKYVH
        LCK+ +
Subjt:  LCKYVH

XP_038904765.1 uncharacterized protein LOC120091039 [Benincasa hispida]0.0e+0074.72Show/hide
Query:  ADKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLH
        ADKRPIEI+GD +   A+  L KKPR    + PNLRRVAEIVL +STMTALRAG  P+ AE++LMAEARAKL+ IC+   P   VG E I+SLI DLGLH
Subjt:  ADKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLH

Query:  PKPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQ
            D  LGFRGPRLTIA+K A AKKKMEDSKKY+PP GYGS P Q +F+SS+E+RG LPTVRMF+SDKSS VP SVGGTV  LPSGHV V+GSTSIQVQ
Subjt:  PKPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQ

Query:  AQLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNA-QGITDSRTLRSS
        AQLPSNE R HI+SSG+PI HQGRDSSSLLHGIERPLNGTYGSQ+QVNS  N PLA+  TWSAQTQS LSAKG PEHK P+HS VNA QG TDSR LRSS
Subjt:  AQLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNA-QGITDSRTLRSS

Query:  SQGARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKG
        SQ ARDQ+FRP ISQT TGNI GLQP LQSM++VQGPSL NNHN+IVKI+QKLLQPQ PDHPTWNPPSRDYMNKAVTCQTCQVTINEID+VLICDACEKG
Subjt:  SQGARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKG

Query:  FHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGT-AWEKRSGAIEQNTSAGQLKLVPNGGSDLQSAQPADHGSNANE-
        +HLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGT   EKRSGA+EQ  SAGQLKLV NGGSDL + QPAD+G N NE 
Subjt:  FHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGT-AWEKRSGAIEQNTSAGQLKLVPNGGSDLQSAQPADHGSNANE-

Query:  SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS--------QI
        SGI+I  V+EI GN+ LPIRKD+DEKPTSSTSLNTPAKS+ LVC+PSS ELSSE   QQ K+S ASIGED SS K+EP +ESQT ADN S        QI
Subjt:  SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS--------QI

Query:  VDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHTSNG
        VDQ+MVS G E P ST + +  S+VKKDG E LQEN VENFEA II+REQ GT S+ LH++EW+G  +QI D +AYY+SC +DGVTYKVEEFALF +SNG
Subjt:  VDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHTSNG

Query:  KLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIFLCK
        KLMPY LQS   E+ESG N AILK+CYFYEDLPKEVAHL   SPEQ+EVYTSDGYI L VGLIRGPCE+LPV+KYKEE ERR+QLG+GADN IK IFLCK
Subjt:  KLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIFLCK

Query:  YVH
        + +
Subjt:  YVH

TrEMBL top hitse value%identityAlignment
A0A5A7UL23 Uncharacterized protein0.0e+0071.79Show/hide
Query:  MEETA----------ADKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFV
        MEETA          +DKRPIEIH   D   A+    KKPR    + PNLRRVAEIVL +STMTALR G  P+ AE+ LMAEARAKL+ IC+   P   V
Subjt:  MEETA----------ADKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFV

Query:  GTEAITSLIHDLGLHPKPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALP
        G E I+SLI DLGLH +  D  LGFRGPRLTIA+K A +KKKMEDSKKY+ P  YGS   Q S +SS+E+RG LPTVRMF S+KS  VP SVGGT   LP
Subjt:  GTEAITSLIHDLGLHPKPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALP

Query:  SGHVSVSGSTSIQVQAQLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGV
        SGH SV+G TSIQVQAQ+PSNE R+HIISSG+ I HQG  SSSLLHG E+PLNG YGSQ+QVNS ANHPLA+ PTWSAQTQSAL+AKG PEHK P+HS V
Subjt:  SGHVSVSGSTSIQVQAQLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGV

Query:  NAQGITDSRTLRSSSQGARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTIN
        NAQG TDSR LRSSSQ ARDQ+FRP ISQT TGN  GL   LQ+M++VQGPSLSNNHN+IVKIIQKLLQPQ PDHPTWNPPSRDYMNKAVTCQTCQVTIN
Subjt:  NAQGITDSRTLRSSSQGARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTIN

Query:  EIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGT-AWEKRSGAIEQNTSAGQLKLVPNGGSDLQ
        EID+VLICDACEKG+HLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGT   EK+SGAIEQ  SAGQLKLV NGGSDL 
Subjt:  EIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGT-AWEKRSGAIEQNTSAGQLKLVPNGGSDLQ

Query:  SAQPADHGSNANES-GIRI--VEEIHGNHFLPIRKDMDEK--PTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQT
        + QPAD+GSNANES G+++  VEEIHGN+FLPIRKD+DEK  PTS TSLNTPAKS+ LVC+PSS ELSSE   Q  KSSQASIG+D SS K+EPP+ESQT
Subjt:  SAQPADHGSNANES-GIRI--VEEIHGNHFLPIRKDMDEK--PTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQT

Query:  MADNYS--------QIVDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDG
        MADN S        +IVDQKMVS G E P ST + +  S+VKKDG E LQEN VENFEA II+REQ G  S+ LH+VEW+G  +QI D +AYY+SC +DG
Subjt:  MADNYS--------QIVDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDG

Query:  VTYKVEEFALFHTSNGKLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQ
         TYKVEEFALF +SNGKLMPY L S   E+ESG   AILK+CYFYEDLPKEVAHL PCSPE++EVYTSDGYI L +GLIRGPCE+L V+KYKEE ERR+Q
Subjt:  VTYKVEEFALFHTSNGKLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQ

Query:  LGVGADNEIKRIFLCKYVH
        LG G DN IK IFLCK+ +
Subjt:  LGVGADNEIKRIFLCKYVH

A0A6J1EW47 uncharacterized protein LOC111438615 isoform X10.0e+0072.34Show/hide
Query:  DKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHP
        +KR IEIHGD +   A+  L KKPR    + PNLRRVAEIVL +STMTALRAG  PT AE++LMAEARAKL+ IC+   P   VG E I+SLI DLGLH 
Subjt:  DKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHP

Query:  KPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQA
           D  LGFRGPRLTIA+K A  KKKMEDSKKY+PPSGYGS P Q    SSIE RG LP+VRMF SDKSS VP SVGGT   LPSGHVSV+GSTS+QVQ 
Subjt:  KPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQA

Query:  QLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQ
        QL  NE RAH ISSGFPI+ QGRD SSLLHGIERPLNGTYGSQ+QVNSS NH LA+ PTWSAQTQSALSAKG PEHK P+HS  NAQG TDSR LRSSSQ
Subjt:  QLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQ

Query:  GARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH
         ARDQ+FRP I QT TGN+AGLQPHLQS+++VQGPS+SN+HN+IVKIIQKLLQPQ PDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH
Subjt:  GARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH

Query:  LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNANE-
        LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+   ++S   A+EQ  SAGQLKLV NGG DL S  QPA+ GSNANE 
Subjt:  LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNANE-

Query:  SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS--------QI
        SGI+I   EE HGN+ LPIRKD+DEKPTSSTSLNTPAKS+ +VC+PSS E+SSE   Q  KSSQA IGEDGSS KSEPP+ESQT  DN S        QI
Subjt:  SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS--------QI

Query:  VDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHTSNG
        VDQ+M SNG E+PYST +++  S+VKKDG E LQEN VENFEA II+REQSGT S+ LHDVEW+G PHQ+ D +AYY+SC +DGVTY VEEF+LFH++NG
Subjt:  VDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHTSNG

Query:  KLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIFLCK
        KLMPY LQS+  E+ESG N AILK+CYF+EDLPKEVAHLCPCSPEQNEVYTSDG I L VGLIR PCE+LPV+KYKEE+ERR+QLG GAD+ IK  FLCK
Subjt:  KLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIFLCK

Query:  YVHFLKFFYHLVLLETVSSSFPLLNNVL
        + +       +   + V  SF ++   L
Subjt:  YVHFLKFFYHLVLLETVSSSFPLLNNVL

A0A6J1F158 uncharacterized protein LOC111438615 isoform X20.0e+0074.1Show/hide
Query:  DKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHP
        +KR IEIHGD +   A+  L KKPR    + PNLRRVAEIVL +STMTALRAG  PT AE++LMAEARAKL+ IC+   P   VG E I+SLI DLGLH 
Subjt:  DKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHP

Query:  KPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQA
           D  LGFRGPRLTIA+K A  KKKMEDSKKY+PPSGYGS P Q    SSIE RG LP+VRMF SDKSS VP SVGGT   LPSGHVSV+GSTS+QVQ 
Subjt:  KPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQA

Query:  QLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQ
        QL  NE RAH ISSGFPI+ QGRD SSLLHGIERPLNGTYGSQ+QVNSS NH LA+ PTWSAQTQSALSAKG PEHK P+HS  NAQG TDSR LRSSSQ
Subjt:  QLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQ

Query:  GARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH
         ARDQ+FRP I QT TGN+AGLQPHLQS+++VQGPS+SN+HN+IVKIIQKLLQPQ PDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH
Subjt:  GARDQNFRPSISQTVTGNIAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFH

Query:  LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNANE-
        LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+   ++S   A+EQ  SAGQLKLV NGG DL S  QPA+ GSNANE 
Subjt:  LKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNANE-

Query:  SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS--------QI
        SGI+I   EE HGN+ LPIRKD+DEKPTSSTSLNTPAKS+ +VC+PSS E+SSE   Q  KSSQA IGEDGSS KSEPP+ESQT  DN S        QI
Subjt:  SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS--------QI

Query:  VDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHTSNG
        VDQ+M SNG E+PYST +++  S+VKKDG E LQEN VENFEA II+REQSGT S+ LHDVEW+G PHQ+ D +AYY+SC +DGVTY VEEF+LFH++NG
Subjt:  VDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHTSNG

Query:  KLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIFLCK
        KLMPY LQS+  E+ESG N AILK+CYF+EDLPKEVAHLCPCSPEQNEVYTSDG I L VGLIR PCE+LPV+KYKEE+ERR+QLG GAD+ IK  FLCK
Subjt:  KLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIFLCK

Query:  YVH
        + +
Subjt:  YVH

A0A6J1L3P7 uncharacterized protein LOC111500233 isoform X20.0e+0073.82Show/hide
Query:  AADKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGL
        A +KR IEIHGD +   A+  L KKPR    + PNLRRVAEIVL +STMTALRAG  PT AE++LMAEARAKL+ IC+   P   VG E I+SLI DLGL
Subjt:  AADKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGL

Query:  HPKPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQV
        H    D  LGFRGPRLTIA+K A  KKKMEDSKKY+PPSGYGS P Q    SSIE RG LP+VRMF SDKSS VP SVGGT   LPSGHVSV+GS+S+QV
Subjt:  HPKPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQV

Query:  QAQLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSS
        Q QL  NE RAH ISSGFPI+ QGRDSSSLLHGIERPLNGTYGSQ+QVNSS NH LA+ PTWSAQTQSALSAKG PEHK P+HS  NAQG TDSR LRSS
Subjt:  QAQLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSS

Query:  SQGARDQNFRPSISQTVTGNIAGLQ-PHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEK
        SQ ARDQ+FRP I QT TGN+AGLQ PHLQS+++VQGPS+SN+HN+IVKIIQKLLQPQ PDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEK
Subjt:  SQGARDQNFRPSISQTVTGNIAGLQ-PHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEK

Query:  GFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNA
        GFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+   ++S   A+EQ  SAGQLKLV NGG DL S  QPA+ GSNA
Subjt:  GFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNA

Query:  NE-SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS-------
        NE SGI+I   EE HGN+FLPIRKD+DEKPTSSTSLNTPAKS+ +VC+PSS E+SSE   Q  KSSQA IGEDGSS K+EPP+ESQT  DN S       
Subjt:  NE-SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS-------

Query:  -QIVDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHT
         QI+DQKM SNG E+PY TE+++  S+VKKDG E LQEN VENFEA I++REQSGT S+ LHDVEW+G PHQ+ D +AYY+SC +DGVTYKVEEF+LFH+
Subjt:  -QIVDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHT

Query:  SNGKLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIF
        +NGKLMPY LQS+  E+ESG N AILK+CYF+EDLPKEVAHLCPCSPEQNEVYTSDG I L VGLIR PCE+LPV+KYK E+ERR+QLG GAD+ IK  F
Subjt:  SNGKLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIF

Query:  LCKYVH
        LCK+ +
Subjt:  LCKYVH

A0A6J1L5P9 uncharacterized protein LOC111500233 isoform X10.0e+0072.08Show/hide
Query:  AADKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGL
        A +KR IEIHGD +   A+  L KKPR    + PNLRRVAEIVL +STMTALRAG  PT AE++LMAEARAKL+ IC+   P   VG E I+SLI DLGL
Subjt:  AADKRPIEIHGDADADDADLHLIKKPR----VAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGL

Query:  HPKPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQV
        H    D  LGFRGPRLTIA+K A  KKKMEDSKKY+PPSGYGS P Q    SSIE RG LP+VRMF SDKSS VP SVGGT   LPSGHVSV+GS+S+QV
Subjt:  HPKPTDHNLGFRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQV

Query:  QAQLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSS
        Q QL  NE RAH ISSGFPI+ QGRDSSSLLHGIERPLNGTYGSQ+QVNSS NH LA+ PTWSAQTQSALSAKG PEHK P+HS  NAQG TDSR LRSS
Subjt:  QAQLPSNEARAHIISSGFPISHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSS

Query:  SQGARDQNFRPSISQTVTGNIAGLQ-PHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEK
        SQ ARDQ+FRP I QT TGN+AGLQ PHLQS+++VQGPS+SN+HN+IVKIIQKLLQPQ PDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEK
Subjt:  SQGARDQNFRPSISQTVTGNIAGLQ-PHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEK

Query:  GFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNA
        GFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+   ++S   A+EQ  SAGQLKLV NGG DL S  QPA+ GSNA
Subjt:  GFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAWEKRS--GAIEQNTSAGQLKLVPNGGSDLQS-AQPADHGSNA

Query:  NE-SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS-------
        NE SGI+I   EE HGN+FLPIRKD+DEKPTSSTSLNTPAKS+ +VC+PSS E+SSE   Q  KSSQA IGEDGSS K+EPP+ESQT  DN S       
Subjt:  NE-SGIRI--VEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYS-------

Query:  -QIVDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHT
         QI+DQKM SNG E+PY TE+++  S+VKKDG E LQEN VENFEA I++REQSGT S+ LHDVEW+G PHQ+ D +AYY+SC +DGVTYKVEEF+LFH+
Subjt:  -QIVDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIEDGKAYYRSCCIDGVTYKVEEFALFHT

Query:  SNGKLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIF
        +NGKLMPY LQS+  E+ESG N AILK+CYF+EDLPKEVAHLCPCSPEQNEVYTSDG I L VGLIR PCE+LPV+KYK E+ERR+QLG GAD+ IK  F
Subjt:  SNGKLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERRRQLGVGADNEIKRIF

Query:  LCKYVHFLKFFYHLVLLETVSSSFPLLNNVL
        LCK+ +       +   + V  SF ++   L
Subjt:  LCKYVHFLKFFYHLVLLETVSSSFPLLNNVL

SwissProt top hitse value%identityAlignment
Q5PNS0 PHD finger protein At3g202801.8e-3645.57Show/hide
Query:  MSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNG
        M Y Q  S   NH +I KII K LQP+   +P WNPPSR+YM++A+ CQ C+VTINE+D++LICDACEK +HLKC+Q  N + +P+ EWHC RC+   NG
Subjt:  MSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNG

Query:  KPLPPKYGRVMR--SNPPPKLSVNTSGTAWEKRSGAIEQNTSAGQLKLVPNGGSDLQS
        KP PP YGR  R  +    K+    +G            +  A Q K + +  S LQ+
Subjt:  KPLPPKYGRVMR--SNPPPKLSVNTSGTAWEKRSGAIEQNTSAGQLKLVPNGGSDLQS

Q5SMU7 Origin of replication complex subunit 19.9e-0632.74Show/hide
Query:  QTCQVTINEIDSVLI-CDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGK----PLPPKYGRVMRSNPPPKLSVN------------TSGTAWEK-
        + C+V      +V++ CD C  GFHL+CV+ P +R +P G+W CP C     GK    P PP+  R++R+     LS +              G  W K 
Subjt:  QTCQVTINEIDSVLI-CDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGK----PLPPKYGRVMRSNPPPKLSVN------------TSGTAWEK-

Query:  RSGAIEQNTSAGQ
        R   I + T+AG+
Subjt:  RSGAIEQNTSAGQ

Q8BRB7 Histone acetyltransferase KAT6B2.0e-0638.6Show/hide
Query:  TCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPL
        TC  C+V     D++L CD+C++GFH++C   P  R +P+G W C  C     G+ L
Subjt:  TCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPL

Q8WML3 Histone acetyltransferase KAT6B2.0e-0638.6Show/hide
Query:  TCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPL
        TC  C+V     D++L CD+C++GFH++C   P  R +P+G W C  C     G+ L
Subjt:  TCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPL

Q8WYB5 Histone acetyltransferase KAT6B2.0e-0638.6Show/hide
Query:  TCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPL
        TC  C+V     D++L CD+C++GFH++C   P  R +P+G W C  C     G+ L
Subjt:  TCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPL

Arabidopsis top hitse value%identityAlignment
AT1G50620.1 RING/FYVE/PHD zinc finger superfamily protein2.2e-6931.1Show/hide
Query:  KKPRVAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHPKPTDHNLGFRGPRLTIADKFAHAKK
        KKPR    + RVAEIVL +S +  +R G  PT  E+ LM EA++KL+ +CQ+F P   +G +AI ++I DLG + K  D  LGFR P+LTI++K +  K+
Subjt:  KKPRVAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHPKPTDHNLGFRGPRLTIADKFAHAKK

Query:  KMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQAQLPSNEARAHIISSGFPISHQGRDS
        KME+ KK          P+       + T  T P V                                 ++ +  Q P++E +A   S     SH  R++
Subjt:  KMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQAQLPSNEARAHIISSGFPISHQGRDS

Query:  SSLLHG-IERPLNGTYGSQVQVNSS------ANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQGARDQNFRPSISQTVTGN
        S +    +ERP    + S +   +S      A +   N  TWSAQ  S             S S ++    +DS+    SS    D +FRP +SQT  G 
Subjt:  SSLLHG-IERPLNGTYGSQVQVNSS------ANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQGARDQNFRPSISQTVTGN

Query:  IAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWH
          G++    +      P  +NNH +I K+I K+LQP+   +  WNPPSR+YM+KA+TCQ CQ TINEI++VLICDACEKG+HLKC+ + N + +P+ EWH
Subjt:  IAGLQPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWH

Query:  CPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAW--EKRSGAIEQNTSAGQLKLVPNGGSDLQSAQPADHGSNANESGIRIVEEIHGNHFLPIRKD
        C RC+ + NGK  PPKYGRVMRS    K+S +T+      EK  G ++Q       K+   G   L++A+P     +A E  +   E++  N    + K 
Subjt:  CPRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTAW--EKRSGAIEQNTSAGQLKLVPNGGSDLQSAQPADHGSNANESGIRIVEEIHGNHFLPIRKD

Query:  MDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGE--DGSSLKSEPPQESQTMADNYSQIVDQKMVSNGSE-------KPYSTENTNH
        M E   +  +   P    E   D +   +  +  +Q  ++  A+I +  D +     P       +D+ S+ V      N  E       +  +TE +  
Subjt:  MDEKPTSSTSLNTPAKSVELVCDPSSTELSSERCTQQTKSSQASIGE--DGSSLKSEPPQESQTMADNYSQIVDQKMVSNGSE-------KPYSTENTNH

Query:  ISSVKKDGPEFLQENIVENFEAGII--SREQSGTKSDGLHDVEWVGSPHQIEDGKA
             K+     + ++ E   A     S  Q  ++SD  H  +   +P+  E+  A
Subjt:  ISSVKKDGPEFLQENIVENFEAGII--SREQSGTKSDGLHDVEWVGSPHQIEDGKA

AT3G01460.1 methyl-CPG-binding domain 93.8e-0532.93Show/hide
Query:  DIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLT
        ++V+ I     P P        P RD      +C  C      I+ V++CDACE+GFH+ CV +    A P  +W C  C T
Subjt:  DIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLT

AT3G20280.1 RING/FYVE/PHD zinc finger superfamily protein6.7e-6635.78Show/hide
Query:  KKPRVAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHPKPTDHNLGFRGPRLTIADKFAHAKK
        KKPR    + RVAEIVL +S +  +R G  PT  EL+LM EAR+KL  +C +F P   +  + + S+I DLG + K  D  LGFR P +TI++K +  K+
Subjt:  KKPRVAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHPKPTDHNLGFRGPRLTIADKFAHAKK

Query:  KMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSG--STSIQVQAQLPSNE-ARAHIISSGFPISHQG
        KME+++KY               TS++ T  TL                       + P+G ++  G  +    V  Q PS+E A A+   S F +    
Subjt:  KMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSG--STSIQVQAQLPSNE-ARAHIISSGFPISHQG

Query:  RDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQGARDQNFRPSISQTVTGNIAGL
                  +  LNG   SQ    SSAN+       WSAQ  S +S    P+ K+P  S V                   D +FRP    T TG     
Subjt:  RDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQGARDQNFRPSISQTVTGNIAGL

Query:  QPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRC
            Q M Y Q  S   NH +I KII K LQP+   +P WNPPSR+YM++A+ CQ C+VTINE+D++LICDACEK +HLKC+Q  N + +P+ EWHC RC
Subjt:  QPHLQSMSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRC

Query:  LTISNGKPLPPKYGRVMR--SNPPPKLSVNTSGTAWEKRSGAIEQNTSAGQLKLVPNGGSDLQS
        +   NGKP PP YGR  R  +    K+    +G            +  A Q K + +  S LQ+
Subjt:  LTISNGKPLPPKYGRVMR--SNPPPKLSVNTSGTAWEKRSGAIEQNTSAGQLKLVPNGGSDLQS

AT3G20280.2 RING/FYVE/PHD zinc finger superfamily protein1.3e-3745.57Show/hide
Query:  MSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNG
        M Y Q  S   NH +I KII K LQP+   +P WNPPSR+YM++A+ CQ C+VTINE+D++LICDACEK +HLKC+Q  N + +P+ EWHC RC+   NG
Subjt:  MSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNG

Query:  KPLPPKYGRVMR--SNPPPKLSVNTSGTAWEKRSGAIEQNTSAGQLKLVPNGGSDLQS
        KP PP YGR  R  +    K+    +G            +  A Q K + +  S LQ+
Subjt:  KPLPPKYGRVMR--SNPPPKLSVNTSGTAWEKRSGAIEQNTSAGQLKLVPNGGSDLQS

AT5G24330.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 62.3e-0526.32Show/hide
Query:  QPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLP------PKYGRVMRSNPPPKLS
        +PP H + +    D+      C+ C  +  +   +L+CD C+KGFHL C++ P   ++P+G W CP C      K  P        + R+ RS  P    
Subjt:  QPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLP------PKYGRVMRSNPPPKLS

Query:  VNTSGTAWEKRSGAIEQNTSAGQLKLVPNGGSD
        +++S  +  K+        S  + +L+P   S+
Subjt:  VNTSGTAWEKRSGAIEQNTSAGQLKLVPNGGSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAAACCGCCGCCGACAAGAGGCCAATTGAGATTCACGGCGACGCTGACGCTGACGACGCGGACCTCCACTTGATCAAGAAGCCCCGGGTTGCTCCTAATCTCAG
AAGGGTCGCCGAGATTGTGCTCGCCATCTCCACCATGACCGCCCTGCGTGCTGGCAACCACCCCACCCATGCCGAGCTTCAATTAATGGCTGAGGCTAGGGCTAAGCTCC
TTCCCATCTGCCAACAATTCCCGCCCAACCTGTTTGTCGGAACCGAAGCTATTACCTCTCTCATTCACGATTTGGGCCTTCATCCCAAGCCTACGGATCACAACTTAGGG
TTTCGGGGTCCCAGGCTGACCATAGCCGACAAGTTCGCCCACGCTAAGAAGAAGATGGAAGATTCCAAGAAATATATGCCACCGTCAGGTTATGGATCTCAACCAATCCA
AACAAGCTTCACTTCATCAATTGAGACCCGTGGGACATTGCCTACTGTTCGGATGTTTTCCTCAGATAAATCAAGTAATGTTCCGACTTCTGTGGGTGGCACAGTAGCTG
CTCTGCCTTCAGGTCATGTTTCTGTCTCTGGTTCTACATCTATACAGGTTCAAGCGCAACTACCAAGCAATGAAGCCAGAGCACATATTATTTCAAGTGGATTTCCTATT
AGTCATCAAGGAAGGGATTCTTCCTCACTCTTGCATGGCATTGAAAGACCACTAAATGGGACATATGGATCTCAAGTACAAGTTAATTCTTCAGCAAATCACCCTCTGGC
AAATGTTCCAACTTGGTCTGCTCAAACTCAATCCGCCTTGTCAGCTAAAGGTGTGCCAGAGCACAAACTGCCCAGTCATTCAGGTGTTAATGCCCAGGGAATCACAGATT
CCAGAACCTTGAGATCGTCTTCTCAAGGAGCCAGGGACCAGAATTTTAGACCTTCAATTTCCCAAACTGTGACAGGAAATATAGCTGGTTTGCAGCCACATTTACAGAGT
ATGAGCTATGTGCAAGGACCTTCACTTTCTAATAATCACAATGATATTGTCAAAATTATTCAGAAGCTGTTACAGCCACAGCCTCCAGATCATCCTACATGGAATCCTCC
TTCTAGAGATTATATGAACAAGGCCGTGACTTGCCAAACTTGTCAAGTTACCATTAATGAGATTGATAGTGTACTTATATGTGATGCTTGTGAGAAAGGATTTCACCTGA
AATGTGTACAGTCACCTAATCAGAGAGCAATTCCTAGAGGTGAATGGCACTGCCCAAGATGTTTAACTATAAGCAATGGGAAGCCTTTACCTCCTAAATATGGGCGTGTC
ATGAGGAGTAATCCCCCACCAAAGTTATCTGTCAATACCAGCGGAACAGCATGGGAGAAAAGATCAGGAGCCATAGAACAAAACACCAGTGCTGGTCAGCTGAAGTTAGT
TCCTAATGGAGGTTCTGATTTGCAAAGTGCTCAGCCTGCTGACCATGGAAGCAATGCCAATGAATCTGGTATTAGAATTGTGGAAGAAATTCATGGAAATCATTTTCTGC
CAATTAGAAAAGACATGGACGAGAAACCAACCTCTTCAACTTCCCTGAATACCCCAGCCAAATCCGTGGAACTGGTTTGTGACCCCTCTTCTACTGAGTTATCAAGTGAA
AGATGTACTCAGCAGACTAAAAGTTCTCAAGCATCAATAGGTGAAGATGGATCTTCATTGAAATCAGAGCCTCCACAAGAATCCCAAACAATGGCTGACAATTATTCTCA
AATTGTTGATCAGAAAATGGTTTCTAATGGTTCAGAAAAACCTTACTCAACAGAAAATACTAATCACATTTCAAGTGTCAAGAAAGACGGCCCTGAATTTCTGCAGGAAA
ATATTGTTGAAAATTTTGAAGCTGGCATCATAAGTAGAGAGCAGTCCGGGACTAAATCCGATGGCTTGCATGATGTTGAATGGGTGGGAAGCCCACATCAGATTGAAGAT
GGGAAGGCATACTACAGATCCTGTTGCATTGATGGTGTTACATATAAAGTTGAAGAATTTGCTCTTTTTCATACCAGTAACGGGAAATTGATGCCATATATGCTTCAGTC
CATGTGCAGTGAGTTTGAAAGTGGGTCGAACTTGGCTATTCTTAAGAGGTGTTACTTTTATGAAGACTTGCCAAAGGAAGTTGCCCACCTCTGTCCATGCTCCCCTGAAC
AAAATGAGGTATATACATCCGATGGATATATTGATTTACAAGTAGGTTTAATTCGAGGACCATGTGAAATTCTTCCTGTTTCCAAGTATAAAGAAGAATATGAAAGACGA
AGGCAATTGGGTGTTGGGGCAGATAATGAAATAAAGCGAATTTTCCTGTGCAAGTATGTCCATTTTCTTAAATTCTTTTACCACCTGGTGCTTTTAGAGACGGTATCGAG
CAGTTTTCCTTTATTGAACAATGTGTTGATCAATTATTTTTTAGACTGA
mRNA sequenceShow/hide mRNA sequence
GTTTTTGTTGTTGTGTGAGAAAAACCAACCGGAGTGGGTTTACCTCCACGAAGGAAGGTGGGAGTCTTTTTGTGTGAGATCGGCCGCGTCCACAGCACAGCAGGGCCCTT
CTTTCTTCTGTGTTCCCCCAAAACTCCGAATCCTTTTTTATTACTTCTCATTGTGTGCTCTTCTCACGCTTACCCGCCAAAATACTCTCTTTTAACGAACGTCTCTCAAA
CTTCCTACATTTTTTAAACTCCCCCATTCCCATGGAGGAAACCGCCGCCGACAAGAGGCCAATTGAGATTCACGGCGACGCTGACGCTGACGACGCGGACCTCCACTTGA
TCAAGAAGCCCCGGGTTGCTCCTAATCTCAGAAGGGTCGCCGAGATTGTGCTCGCCATCTCCACCATGACCGCCCTGCGTGCTGGCAACCACCCCACCCATGCCGAGCTT
CAATTAATGGCTGAGGCTAGGGCTAAGCTCCTTCCCATCTGCCAACAATTCCCGCCCAACCTGTTTGTCGGAACCGAAGCTATTACCTCTCTCATTCACGATTTGGGCCT
TCATCCCAAGCCTACGGATCACAACTTAGGGTTTCGGGGTCCCAGGCTGACCATAGCCGACAAGTTCGCCCACGCTAAGAAGAAGATGGAAGATTCCAAGAAATATATGC
CACCGTCAGGTTATGGATCTCAACCAATCCAAACAAGCTTCACTTCATCAATTGAGACCCGTGGGACATTGCCTACTGTTCGGATGTTTTCCTCAGATAAATCAAGTAAT
GTTCCGACTTCTGTGGGTGGCACAGTAGCTGCTCTGCCTTCAGGTCATGTTTCTGTCTCTGGTTCTACATCTATACAGGTTCAAGCGCAACTACCAAGCAATGAAGCCAG
AGCACATATTATTTCAAGTGGATTTCCTATTAGTCATCAAGGAAGGGATTCTTCCTCACTCTTGCATGGCATTGAAAGACCACTAAATGGGACATATGGATCTCAAGTAC
AAGTTAATTCTTCAGCAAATCACCCTCTGGCAAATGTTCCAACTTGGTCTGCTCAAACTCAATCCGCCTTGTCAGCTAAAGGTGTGCCAGAGCACAAACTGCCCAGTCAT
TCAGGTGTTAATGCCCAGGGAATCACAGATTCCAGAACCTTGAGATCGTCTTCTCAAGGAGCCAGGGACCAGAATTTTAGACCTTCAATTTCCCAAACTGTGACAGGAAA
TATAGCTGGTTTGCAGCCACATTTACAGAGTATGAGCTATGTGCAAGGACCTTCACTTTCTAATAATCACAATGATATTGTCAAAATTATTCAGAAGCTGTTACAGCCAC
AGCCTCCAGATCATCCTACATGGAATCCTCCTTCTAGAGATTATATGAACAAGGCCGTGACTTGCCAAACTTGTCAAGTTACCATTAATGAGATTGATAGTGTACTTATA
TGTGATGCTTGTGAGAAAGGATTTCACCTGAAATGTGTACAGTCACCTAATCAGAGAGCAATTCCTAGAGGTGAATGGCACTGCCCAAGATGTTTAACTATAAGCAATGG
GAAGCCTTTACCTCCTAAATATGGGCGTGTCATGAGGAGTAATCCCCCACCAAAGTTATCTGTCAATACCAGCGGAACAGCATGGGAGAAAAGATCAGGAGCCATAGAAC
AAAACACCAGTGCTGGTCAGCTGAAGTTAGTTCCTAATGGAGGTTCTGATTTGCAAAGTGCTCAGCCTGCTGACCATGGAAGCAATGCCAATGAATCTGGTATTAGAATT
GTGGAAGAAATTCATGGAAATCATTTTCTGCCAATTAGAAAAGACATGGACGAGAAACCAACCTCTTCAACTTCCCTGAATACCCCAGCCAAATCCGTGGAACTGGTTTG
TGACCCCTCTTCTACTGAGTTATCAAGTGAAAGATGTACTCAGCAGACTAAAAGTTCTCAAGCATCAATAGGTGAAGATGGATCTTCATTGAAATCAGAGCCTCCACAAG
AATCCCAAACAATGGCTGACAATTATTCTCAAATTGTTGATCAGAAAATGGTTTCTAATGGTTCAGAAAAACCTTACTCAACAGAAAATACTAATCACATTTCAAGTGTC
AAGAAAGACGGCCCTGAATTTCTGCAGGAAAATATTGTTGAAAATTTTGAAGCTGGCATCATAAGTAGAGAGCAGTCCGGGACTAAATCCGATGGCTTGCATGATGTTGA
ATGGGTGGGAAGCCCACATCAGATTGAAGATGGGAAGGCATACTACAGATCCTGTTGCATTGATGGTGTTACATATAAAGTTGAAGAATTTGCTCTTTTTCATACCAGTA
ACGGGAAATTGATGCCATATATGCTTCAGTCCATGTGCAGTGAGTTTGAAAGTGGGTCGAACTTGGCTATTCTTAAGAGGTGTTACTTTTATGAAGACTTGCCAAAGGAA
GTTGCCCACCTCTGTCCATGCTCCCCTGAACAAAATGAGGTATATACATCCGATGGATATATTGATTTACAAGTAGGTTTAATTCGAGGACCATGTGAAATTCTTCCTGT
TTCCAAGTATAAAGAAGAATATGAAAGACGAAGGCAATTGGGTGTTGGGGCAGATAATGAAATAAAGCGAATTTTCCTGTGCAAGTATGTCCATTTTCTTAAATTCTTTT
ACCACCTGGTGCTTTTAGAGACGGTATCGAGCAGTTTTCCTTTATTGAACAATGTGTTGATCAATTATTTTTTAGACTGATTTCTTCTTAGAAGATGATTAGCTTGCCGA
TTTTCCTTCATGATTGCAGATGGTTTTATAGTGAAACTGCCAAGGAATTTGTACCTTTTACCGGTTCCCTGTGAAAAATTCACGGTATTTCAGATACAGCAACCTGCAAG
GACAAGTTGATGGATAAATGCCCGCAAGCAAATGTTTAGAACTAGGTAGGGATATTTTTCAGTTTGTTTCTTCTTCCATGTGACTTGTAATTTAAAGTTGTATCATTTTA
TACTCCATCTGAATCCCCAAGTCCTCTCTTAGAATATCACCCTGTCTGTATATTATTTTATCAACAGCACTTGCCCTGCTTTCACAATGAAAGAGGTGCCTTTGTAACGT
AACCTGTCATTATTTTTCATTACACAAATTTATGACTTTTTTTACTTGTCGTGAGTATGATAATCGAG
Protein sequenceShow/hide protein sequence
MEETAADKRPIEIHGDADADDADLHLIKKPRVAPNLRRVAEIVLAISTMTALRAGNHPTHAELQLMAEARAKLLPICQQFPPNLFVGTEAITSLIHDLGLHPKPTDHNLG
FRGPRLTIADKFAHAKKKMEDSKKYMPPSGYGSQPIQTSFTSSIETRGTLPTVRMFSSDKSSNVPTSVGGTVAALPSGHVSVSGSTSIQVQAQLPSNEARAHIISSGFPI
SHQGRDSSSLLHGIERPLNGTYGSQVQVNSSANHPLANVPTWSAQTQSALSAKGVPEHKLPSHSGVNAQGITDSRTLRSSSQGARDQNFRPSISQTVTGNIAGLQPHLQS
MSYVQGPSLSNNHNDIVKIIQKLLQPQPPDHPTWNPPSRDYMNKAVTCQTCQVTINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCPRCLTISNGKPLPPKYGRV
MRSNPPPKLSVNTSGTAWEKRSGAIEQNTSAGQLKLVPNGGSDLQSAQPADHGSNANESGIRIVEEIHGNHFLPIRKDMDEKPTSSTSLNTPAKSVELVCDPSSTELSSE
RCTQQTKSSQASIGEDGSSLKSEPPQESQTMADNYSQIVDQKMVSNGSEKPYSTENTNHISSVKKDGPEFLQENIVENFEAGIISREQSGTKSDGLHDVEWVGSPHQIED
GKAYYRSCCIDGVTYKVEEFALFHTSNGKLMPYMLQSMCSEFESGSNLAILKRCYFYEDLPKEVAHLCPCSPEQNEVYTSDGYIDLQVGLIRGPCEILPVSKYKEEYERR
RQLGVGADNEIKRIFLCKYVHFLKFFYHLVLLETVSSSFPLLNNVLINYFLD