; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019255 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019255
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRING/FYVE/PHD zinc finger superfamily protein, putative isoform 1
Genome locationtig00153302:770948..792908
RNA-Seq ExpressionSgr019255
SyntenySgr019255
Gene Ontology termsGO:0006325 - chromatin organization (biological process)
GO:0003682 - chromatin binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001025 - Bromo adjacent homology (BAH) domain
IPR001965 - Zinc finger, PHD-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger
IPR043151 - Bromo adjacent homology (BAH) domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015540.1 PHD finger protein [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0074.24Show/hide
Query:  MTEPMEE-TVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGL
        M EPMEE TVVDGEPT PT    G+KR IE  GDD+L EP L KKPRNG ELG NLRRVAEIVLVMSTMTA+R GK P+DAEVELMAEARAKLVQICEGL
Subjt:  MTEPMEE-TVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGL

Query:  APKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGG
        APKDIVGREGISS+IEDLGLHG  +DQKLGFRGPRLTIAEKLAQ KKKMEDSKKYIPPS YGSHPTQ +  SS+E+RG LP+VRMFPSDKSS VP SVGG
Subjt:  APKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGG

Query:  TAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKL
        TA  LPSGHVSV GS S+QVQ QL  NEVRAH ISSGFPI+ QGRD SS LHG+ERPLNGTYGS MQVNS+VNH LASAPTWSAQ+QSALSAKGGPEHK 
Subjt:  TAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKL

Query:  PNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT
        PNHSA + QG TDS   RSSSQAARDQSFRP I QT TGNMAGLQPHLQS+NFVQGPS+SN+HNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT
Subjt:  PNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT

Query:  CQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSG-SIEQKASAGQLKLVS
        CQ+TINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHC RCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+QP EK  G ++EQKASAGQLKLVS
Subjt:  CQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSG-SIEQKASAGQLKLVS

Query:  NGGSDLQS--QPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQS
        NGG DL S  QPA+ GSNANESSG KIP+ EE HGN+ LPIRKDIDEKPTSSTSLNT AKSLG+VC+PSSAE+SSE SA HIKSSQ+P GEDGS  + + 
Subjt:  NGGSDLQS--QPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQS

Query:  LQK---------NPKQ-----------------------------------WLTILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYYK
         ++         NPK                                       +  +N ENFEASIINREQSGTSS+DL DVEWIG P  LTD +AYYK
Subjt:  LQK---------NPKQ-----------------------------------WLTILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYYK

Query:  SCRIDGVTYK--------------------SMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKEE
        SCR+DGVTYK                    S+ H++ +G NWA+LK+CYF+EDLPK VAHL PCSPE NEVY SDG ICL VGLIRSPCE+LPVAKYKEE
Subjt:  SCRIDGVTYK--------------------SMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKEE

Query:  YERRRQMGHEADNGIKPIFLCKWFYSE
        +ERR+Q+G  AD+GIKP FLCKWFY+E
Subjt:  YERRRQMGHEADNGIKPIFLCKWFYSE

XP_022151302.1 uncharacterized protein LOC111019267 isoform X1 [Momordica charantia]0.0e+0079.04Show/hide
Query:  MTEPMEETVVDGEPTEPTGTHSGDKRPIENQGDDQLV--EPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEG
        M EPM+E VVD EPTEPT THSGDKRPI  +G D +V  EP LSKKPRNG+ELGRNLRRVAEIVLVMSTMTAVRGGK PSDAEVELMAEARAKL QICEG
Subjt:  MTEPMEETVVDGEPTEPTGTHSGDKRPIENQGDDQLV--EPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEG

Query:  LAPKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVG
        LAPKDI+GREGISSVIEDLGL+ KAKD KLGFRGPRLTIAEKLA AKKKMEDSKKYIPPSAYGSHPTQTNFT SVESRGALPTVRMFPSDKSSHVPTSVG
Subjt:  LAPKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVG

Query:  GTAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHK
        GTAAALPSGHVSVTGS+SIQVQAQLPSNEVRAHIISSGFPISHQGRDSS FLHGVERPLNGTYGS MQVNS+VNHPLASAPTWSAQ+QSALSAKGGPEHK
Subjt:  GTAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHK

Query:  LPNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ
          NHSAVSVQ  TDSST RSSSQAAR+QS RPSISQTVTG+MAGLQPHLQSMNFVQG SLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ
Subjt:  LPNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ

Query:  TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSGSIEQKASAGQLKLVS
        TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHC RCLTIS+GKPLPPKYGRVMRSNPPPKLSVNT GTQPSEKRSG+IEQKASA QL LVS
Subjt:  TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSGSIEQKASAGQLKLVS

Query:  NGGSDLQS-QPADHGSNA--NESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQ
        NGGSDLQS Q ADHGSN   NE+SGTK PDVEEIHGNHFLPIRKD+DEKP SSTSLNT AKSLG VCDPSSAELSSERS   IKSSQSPKGEDGS   K 
Subjt:  NGGSDLQS-QPADHGSNA--NESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQ

Query:  SLQKNPK--------------------------------------------QWLTILASNFENFEASIINRE--QSGTSSDDLRDVEWIGSPQLLTDGKA
           ++P+                                                + A+N E FEASIINRE  QSGTSSDDLRD+EWIG P++L DGKA
Subjt:  SLQKNPK--------------------------------------------QWLTILASNFENFEASIINRE--QSGTSSDDLRDVEWIGSPQLLTDGKA

Query:  YYKSCRIDGVTYK--------------------SMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKY
        +Y SC IDGVTYK                    +M HD+N GSNWAVLKKCYF+EDLPK V HL P SPEH EVYASD    LM GLIRSPCE+L VAKY
Subjt:  YYKSCRIDGVTYK--------------------SMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKY

Query:  KEEYERRRQMGHEADNGIKPIFLCKWFYSE
        KEEYERRRQ+   ADNG+K IFLCKWFY E
Subjt:  KEEYERRRQMGHEADNGIKPIFLCKWFYSE

XP_022151303.1 uncharacterized protein LOC111019267 isoform X2 [Momordica charantia]0.0e+0079.04Show/hide
Query:  MTEPMEETVVDGEPTEPTGTHSGDKRPIENQGDDQLV--EPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEG
        M EPM+E VVD EPTEPT THSGDKRPI  +G D +V  EP LSKKPRNG+ELGRNLRRVAEIVLVMSTMTAVRGGK PSDAEVELMAEARAKL QICEG
Subjt:  MTEPMEETVVDGEPTEPTGTHSGDKRPIENQGDDQLV--EPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEG

Query:  LAPKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVG
        LAPKDI+GREGISSVIEDLGL+ KAKD KLGFRGPRLTIAEKLA AKKKMEDSKKYIPPSAYGSHPTQTNFT SVESRGALPTVRMFPSDKSSHVPTSVG
Subjt:  LAPKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVG

Query:  GTAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHK
        GTAAALPSGHVSVTGS+SIQVQAQLPSNEVRAHIISSGFPISHQGRDSS FLHGVERPLNGTYGS MQVNS+VNHPLASAPTWSAQ+QSALSAKGGPEHK
Subjt:  GTAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHK

Query:  LPNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ
          NHSAVSVQ  TDSST RSSSQAAR+QS RPSISQTVTG+MAGLQPHLQSMNFVQG SLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ
Subjt:  LPNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ

Query:  TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSGSIEQKASAGQLKLVS
        TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHC RCLTIS+GKPLPPKYGRVMRSNPPPKLSVNT GTQPSEKRSG+IEQKASA QL LVS
Subjt:  TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSGSIEQKASAGQLKLVS

Query:  NGGSDLQS-QPADHGSNA--NESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQ
        NGGSDLQS Q ADHGSN   NE+SGTK PDVEEIHGNHFLPIRKD+DEKP SSTSLNT AKSLG VCDPSSAELSSERS   IKSSQSPKGEDGS   K 
Subjt:  NGGSDLQS-QPADHGSNA--NESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQ

Query:  SLQKNPK--------------------------------------------QWLTILASNFENFEASIINRE--QSGTSSDDLRDVEWIGSPQLLTDGKA
           ++P+                                                + A+N E FEASIINRE  QSGTSSDDLRD+EWIG P++L DGKA
Subjt:  SLQKNPK--------------------------------------------QWLTILASNFENFEASIINRE--QSGTSSDDLRDVEWIGSPQLLTDGKA

Query:  YYKSCRIDGVTYK--------------------SMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKY
        +Y SC IDGVTYK                    +M HD+N GSNWAVLKKCYF+EDLPK V HL P SPEH EVYASD    LM GLIRSPCE+L VAKY
Subjt:  YYKSCRIDGVTYK--------------------SMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKY

Query:  KEEYERRRQMGHEADNGIKPIFLCKWFYSE
        KEEYERRRQ+   ADNG+K IFLCKWFY E
Subjt:  KEEYERRRQMGHEADNGIKPIFLCKWFYSE

XP_022932252.1 uncharacterized protein LOC111438615 isoform X1 [Cucurbita moschata]0.0e+0074.12Show/hide
Query:  MTEPMEE-TVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGL
        M EPMEE TVVDGEPT PT    G+KR IE  GDD+L EP L KKPRNG ELG NLRRVAEIVLVMSTMTA+R GK P+DAEVELMAEARAKLVQICEGL
Subjt:  MTEPMEE-TVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGL

Query:  APKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGG
        APKDIVGREGISS+IEDLGLHG  +DQKLGFRGPRLTIAEKLAQ KKKMEDSKKYIPPS YGSHPTQ +  SS+E+RG LP+VRMFPSDKSS VP SVGG
Subjt:  APKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGG

Query:  TAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKL
        TA  LPSGHVSV GS S+QVQ QL  NEVRAH ISSGFPI+ QGRD SS LHG+ERPLNGTYGS MQVNS+VNH LASAPTWSAQ+QSALSAKGGPEHK 
Subjt:  TAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKL

Query:  PNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT
        PNHSA + QG TDS   RSSSQAARDQSFRP I QT TGNMAGLQPHLQS+NFVQGPS+SN+HNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT
Subjt:  PNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT

Query:  CQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSG-SIEQKASAGQLKLVS
        CQ+TINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHC RCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+QP EK  G ++EQKASAGQLKLVS
Subjt:  CQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSG-SIEQKASAGQLKLVS

Query:  NGGSDLQS--QPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQS
        NGG DL S  QPA+ GSNANESSG KIP+ EE HGN+ LPIRKDIDEKPTSSTSLNT AKSLG+VC+PSSAE+SSE SA HIKSSQ+P GEDGS  + + 
Subjt:  NGGSDLQS--QPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQS

Query:  LQK---------NPKQ-----------------------------------WLTILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYYK
         ++         NPK                                       +  +N ENFEASIINREQSGTSS+DL DVEWIG P  LTD +AYYK
Subjt:  LQK---------NPKQ-----------------------------------WLTILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYYK

Query:  SCRIDGVTY--------------------KSMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKEE
        SCR+DGVTY                    +S+ H++ +G NWA+LK+CYF+EDLPK VAHL PCSPE NEVY SDG ICL VGLIRSPCE+LPVAKYKEE
Subjt:  SCRIDGVTY--------------------KSMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKEE

Query:  YERRRQMGHEADNGIKPIFLCKWFYSE
        +ERR+Q+G  AD+GIKP FLCKWFY+E
Subjt:  YERRRQMGHEADNGIKPIFLCKWFYSE

XP_022932253.1 uncharacterized protein LOC111438615 isoform X2 [Cucurbita moschata]0.0e+0074.12Show/hide
Query:  MTEPMEE-TVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGL
        M EPMEE TVVDGEPT PT    G+KR IE  GDD+L EP L KKPRNG ELG NLRRVAEIVLVMSTMTA+R GK P+DAEVELMAEARAKLVQICEGL
Subjt:  MTEPMEE-TVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGL

Query:  APKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGG
        APKDIVGREGISS+IEDLGLHG  +DQKLGFRGPRLTIAEKLAQ KKKMEDSKKYIPPS YGSHPTQ +  SS+E+RG LP+VRMFPSDKSS VP SVGG
Subjt:  APKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGG

Query:  TAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKL
        TA  LPSGHVSV GS S+QVQ QL  NEVRAH ISSGFPI+ QGRD SS LHG+ERPLNGTYGS MQVNS+VNH LASAPTWSAQ+QSALSAKGGPEHK 
Subjt:  TAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKL

Query:  PNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT
        PNHSA + QG TDS   RSSSQAARDQSFRP I QT TGNMAGLQPHLQS+NFVQGPS+SN+HNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT
Subjt:  PNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT

Query:  CQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSG-SIEQKASAGQLKLVS
        CQ+TINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHC RCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+QP EK  G ++EQKASAGQLKLVS
Subjt:  CQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSG-SIEQKASAGQLKLVS

Query:  NGGSDLQS--QPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQS
        NGG DL S  QPA+ GSNANESSG KIP+ EE HGN+ LPIRKDIDEKPTSSTSLNT AKSLG+VC+PSSAE+SSE SA HIKSSQ+P GEDGS  + + 
Subjt:  NGGSDLQS--QPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQS

Query:  LQK---------NPKQ-----------------------------------WLTILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYYK
         ++         NPK                                       +  +N ENFEASIINREQSGTSS+DL DVEWIG P  LTD +AYYK
Subjt:  LQK---------NPKQ-----------------------------------WLTILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYYK

Query:  SCRIDGVTY--------------------KSMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKEE
        SCR+DGVTY                    +S+ H++ +G NWA+LK+CYF+EDLPK VAHL PCSPE NEVY SDG ICL VGLIRSPCE+LPVAKYKEE
Subjt:  SCRIDGVTY--------------------KSMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKEE

Query:  YERRRQMGHEADNGIKPIFLCKWFYSE
        +ERR+Q+G  AD+GIKP FLCKWFY+E
Subjt:  YERRRQMGHEADNGIKPIFLCKWFYSE

TrEMBL top hitse value%identityAlignment
A0A6J1DBU3 uncharacterized protein LOC111019267 isoform X20.0e+0079.04Show/hide
Query:  MTEPMEETVVDGEPTEPTGTHSGDKRPIENQGDDQLV--EPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEG
        M EPM+E VVD EPTEPT THSGDKRPI  +G D +V  EP LSKKPRNG+ELGRNLRRVAEIVLVMSTMTAVRGGK PSDAEVELMAEARAKL QICEG
Subjt:  MTEPMEETVVDGEPTEPTGTHSGDKRPIENQGDDQLV--EPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEG

Query:  LAPKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVG
        LAPKDI+GREGISSVIEDLGL+ KAKD KLGFRGPRLTIAEKLA AKKKMEDSKKYIPPSAYGSHPTQTNFT SVESRGALPTVRMFPSDKSSHVPTSVG
Subjt:  LAPKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVG

Query:  GTAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHK
        GTAAALPSGHVSVTGS+SIQVQAQLPSNEVRAHIISSGFPISHQGRDSS FLHGVERPLNGTYGS MQVNS+VNHPLASAPTWSAQ+QSALSAKGGPEHK
Subjt:  GTAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHK

Query:  LPNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ
          NHSAVSVQ  TDSST RSSSQAAR+QS RPSISQTVTG+MAGLQPHLQSMNFVQG SLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ
Subjt:  LPNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ

Query:  TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSGSIEQKASAGQLKLVS
        TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHC RCLTIS+GKPLPPKYGRVMRSNPPPKLSVNT GTQPSEKRSG+IEQKASA QL LVS
Subjt:  TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSGSIEQKASAGQLKLVS

Query:  NGGSDLQS-QPADHGSNA--NESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQ
        NGGSDLQS Q ADHGSN   NE+SGTK PDVEEIHGNHFLPIRKD+DEKP SSTSLNT AKSLG VCDPSSAELSSERS   IKSSQSPKGEDGS   K 
Subjt:  NGGSDLQS-QPADHGSNA--NESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQ

Query:  SLQKNPK--------------------------------------------QWLTILASNFENFEASIINRE--QSGTSSDDLRDVEWIGSPQLLTDGKA
           ++P+                                                + A+N E FEASIINRE  QSGTSSDDLRD+EWIG P++L DGKA
Subjt:  SLQKNPK--------------------------------------------QWLTILASNFENFEASIINRE--QSGTSSDDLRDVEWIGSPQLLTDGKA

Query:  YYKSCRIDGVTYK--------------------SMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKY
        +Y SC IDGVTYK                    +M HD+N GSNWAVLKKCYF+EDLPK V HL P SPEH EVYASD    LM GLIRSPCE+L VAKY
Subjt:  YYKSCRIDGVTYK--------------------SMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKY

Query:  KEEYERRRQMGHEADNGIKPIFLCKWFYSE
        KEEYERRRQ+   ADNG+K IFLCKWFY E
Subjt:  KEEYERRRQMGHEADNGIKPIFLCKWFYSE

A0A6J1DD39 uncharacterized protein LOC111019267 isoform X10.0e+0079.04Show/hide
Query:  MTEPMEETVVDGEPTEPTGTHSGDKRPIENQGDDQLV--EPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEG
        M EPM+E VVD EPTEPT THSGDKRPI  +G D +V  EP LSKKPRNG+ELGRNLRRVAEIVLVMSTMTAVRGGK PSDAEVELMAEARAKL QICEG
Subjt:  MTEPMEETVVDGEPTEPTGTHSGDKRPIENQGDDQLV--EPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEG

Query:  LAPKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVG
        LAPKDI+GREGISSVIEDLGL+ KAKD KLGFRGPRLTIAEKLA AKKKMEDSKKYIPPSAYGSHPTQTNFT SVESRGALPTVRMFPSDKSSHVPTSVG
Subjt:  LAPKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVG

Query:  GTAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHK
        GTAAALPSGHVSVTGS+SIQVQAQLPSNEVRAHIISSGFPISHQGRDSS FLHGVERPLNGTYGS MQVNS+VNHPLASAPTWSAQ+QSALSAKGGPEHK
Subjt:  GTAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHK

Query:  LPNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ
          NHSAVSVQ  TDSST RSSSQAAR+QS RPSISQTVTG+MAGLQPHLQSMNFVQG SLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ
Subjt:  LPNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ

Query:  TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSGSIEQKASAGQLKLVS
        TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHC RCLTIS+GKPLPPKYGRVMRSNPPPKLSVNT GTQPSEKRSG+IEQKASA QL LVS
Subjt:  TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSGSIEQKASAGQLKLVS

Query:  NGGSDLQS-QPADHGSNA--NESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQ
        NGGSDLQS Q ADHGSN   NE+SGTK PDVEEIHGNHFLPIRKD+DEKP SSTSLNT AKSLG VCDPSSAELSSERS   IKSSQSPKGEDGS   K 
Subjt:  NGGSDLQS-QPADHGSNA--NESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQ

Query:  SLQKNPK--------------------------------------------QWLTILASNFENFEASIINRE--QSGTSSDDLRDVEWIGSPQLLTDGKA
           ++P+                                                + A+N E FEASIINRE  QSGTSSDDLRD+EWIG P++L DGKA
Subjt:  SLQKNPK--------------------------------------------QWLTILASNFENFEASIINRE--QSGTSSDDLRDVEWIGSPQLLTDGKA

Query:  YYKSCRIDGVTYK--------------------SMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKY
        +Y SC IDGVTYK                    +M HD+N GSNWAVLKKCYF+EDLPK V HL P SPEH EVYASD    LM GLIRSPCE+L VAKY
Subjt:  YYKSCRIDGVTYK--------------------SMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKY

Query:  KEEYERRRQMGHEADNGIKPIFLCKWFYSE
        KEEYERRRQ+   ADNG+K IFLCKWFY E
Subjt:  KEEYERRRQMGHEADNGIKPIFLCKWFYSE

A0A6J1EW47 uncharacterized protein LOC111438615 isoform X10.0e+0074.12Show/hide
Query:  MTEPMEE-TVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGL
        M EPMEE TVVDGEPT PT    G+KR IE  GDD+L EP L KKPRNG ELG NLRRVAEIVLVMSTMTA+R GK P+DAEVELMAEARAKLVQICEGL
Subjt:  MTEPMEE-TVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGL

Query:  APKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGG
        APKDIVGREGISS+IEDLGLHG  +DQKLGFRGPRLTIAEKLAQ KKKMEDSKKYIPPS YGSHPTQ +  SS+E+RG LP+VRMFPSDKSS VP SVGG
Subjt:  APKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGG

Query:  TAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKL
        TA  LPSGHVSV GS S+QVQ QL  NEVRAH ISSGFPI+ QGRD SS LHG+ERPLNGTYGS MQVNS+VNH LASAPTWSAQ+QSALSAKGGPEHK 
Subjt:  TAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKL

Query:  PNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT
        PNHSA + QG TDS   RSSSQAARDQSFRP I QT TGNMAGLQPHLQS+NFVQGPS+SN+HNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT
Subjt:  PNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT

Query:  CQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSG-SIEQKASAGQLKLVS
        CQ+TINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHC RCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+QP EK  G ++EQKASAGQLKLVS
Subjt:  CQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSG-SIEQKASAGQLKLVS

Query:  NGGSDLQS--QPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQS
        NGG DL S  QPA+ GSNANESSG KIP+ EE HGN+ LPIRKDIDEKPTSSTSLNT AKSLG+VC+PSSAE+SSE SA HIKSSQ+P GEDGS  + + 
Subjt:  NGGSDLQS--QPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQS

Query:  LQK---------NPKQ-----------------------------------WLTILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYYK
         ++         NPK                                       +  +N ENFEASIINREQSGTSS+DL DVEWIG P  LTD +AYYK
Subjt:  LQK---------NPKQ-----------------------------------WLTILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYYK

Query:  SCRIDGVTY--------------------KSMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKEE
        SCR+DGVTY                    +S+ H++ +G NWA+LK+CYF+EDLPK VAHL PCSPE NEVY SDG ICL VGLIRSPCE+LPVAKYKEE
Subjt:  SCRIDGVTY--------------------KSMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKEE

Query:  YERRRQMGHEADNGIKPIFLCKWFYSE
        +ERR+Q+G  AD+GIKP FLCKWFY+E
Subjt:  YERRRQMGHEADNGIKPIFLCKWFYSE

A0A6J1F158 uncharacterized protein LOC111438615 isoform X20.0e+0074.12Show/hide
Query:  MTEPMEE-TVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGL
        M EPMEE TVVDGEPT PT    G+KR IE  GDD+L EP L KKPRNG ELG NLRRVAEIVLVMSTMTA+R GK P+DAEVELMAEARAKLVQICEGL
Subjt:  MTEPMEE-TVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGL

Query:  APKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGG
        APKDIVGREGISS+IEDLGLHG  +DQKLGFRGPRLTIAEKLAQ KKKMEDSKKYIPPS YGSHPTQ +  SS+E+RG LP+VRMFPSDKSS VP SVGG
Subjt:  APKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGG

Query:  TAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKL
        TA  LPSGHVSV GS S+QVQ QL  NEVRAH ISSGFPI+ QGRD SS LHG+ERPLNGTYGS MQVNS+VNH LASAPTWSAQ+QSALSAKGGPEHK 
Subjt:  TAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKL

Query:  PNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT
        PNHSA + QG TDS   RSSSQAARDQSFRP I QT TGNMAGLQPHLQS+NFVQGPS+SN+HNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT
Subjt:  PNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQT

Query:  CQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSG-SIEQKASAGQLKLVS
        CQ+TINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHC RCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+QP EK  G ++EQKASAGQLKLVS
Subjt:  CQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSG-SIEQKASAGQLKLVS

Query:  NGGSDLQS--QPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQS
        NGG DL S  QPA+ GSNANESSG KIP+ EE HGN+ LPIRKDIDEKPTSSTSLNT AKSLG+VC+PSSAE+SSE SA HIKSSQ+P GEDGS  + + 
Subjt:  NGGSDLQS--QPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQS

Query:  LQK---------NPKQ-----------------------------------WLTILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYYK
         ++         NPK                                       +  +N ENFEASIINREQSGTSS+DL DVEWIG P  LTD +AYYK
Subjt:  LQK---------NPKQ-----------------------------------WLTILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYYK

Query:  SCRIDGVTY--------------------KSMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKEE
        SCR+DGVTY                    +S+ H++ +G NWA+LK+CYF+EDLPK VAHL PCSPE NEVY SDG ICL VGLIRSPCE+LPVAKYKEE
Subjt:  SCRIDGVTY--------------------KSMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKEE

Query:  YERRRQMGHEADNGIKPIFLCKWFYSE
        +ERR+Q+G  AD+GIKP FLCKWFY+E
Subjt:  YERRRQMGHEADNGIKPIFLCKWFYSE

A0A6J1L5P9 uncharacterized protein LOC111500233 isoform X10.0e+0073.91Show/hide
Query:  MTEPMEE-TVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGL
        M EPMEE TVVDGEPT      +G+KR IE  GDD+L EP L KKPRNG ELG NLRRVAEIVLVMSTMTA+R GK P+DAEVELMAEARAKLVQICEGL
Subjt:  MTEPMEE-TVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGL

Query:  APKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGG
        APKDIVGREGISS+IEDLGLHG  +DQKLGFRGPRLTIAEKLAQ KKKMEDSKKYIPPS YGSHPTQ +  SS+E+RG LP+VRMFPSDKSS VP SVGG
Subjt:  APKDIVGREGISSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGG

Query:  TAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKL
        TA  LPSGHVSV GS+S+QVQ QL  NEVRAH ISSGFPI+ QGRDSSS LHG+ERPLNGTYGS MQVNS+VNH LASAPTWSAQ+QSALSAKGGPEHK 
Subjt:  TAAALPSGHVSVTGSASIQVQAQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKL

Query:  PNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQ-PHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ
        PNHSA + QG TDS   RSSSQAARDQSFRP I QT TGNMAGLQ PHLQS+NFVQGPS+SN+HNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ
Subjt:  PNHSAVSVQGITDSSTSRSSSQAARDQSFRPSISQTVTGNMAGLQ-PHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQ

Query:  TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSG-SIEQKASAGQLKLV
        TCQ+TINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHC RCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSG+QP EK  G ++EQKASAGQLKLV
Subjt:  TCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSG-SIEQKASAGQLKLV

Query:  SNGGSDLQS--QPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGS-----
        SNGG DL S  QPA+ GSNANESSG KIP+ EE HGN+FLPIRKDIDEKPTSSTSLNT AKSLG+VC+PSSAE+SSE SA H+KSSQ+P GEDGS     
Subjt:  SNGGSDLQS--QPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTSLNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGS-----

Query:  -----------------------LNQKQSLQKNPKQWLT----------------ILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYY
                               L+QK       + +LT                +  +N ENFEASI+NREQSGTSS+DL DVEWIG P  LTD +AYY
Subjt:  -----------------------LNQKQSLQKNPKQWLT----------------ILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYY

Query:  KSCRIDGVTYK--------------------SMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKE
        KSC +DGVTYK                    S+ H++ +G NWA+LK+CYF+EDLPK VAHL PCSPE NEVY SDG ICL VGLIRSPCE+LPVAKYK 
Subjt:  KSCRIDGVTYK--------------------SMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKE

Query:  EYERRRQMGHEADNGIKPIFLCKWFYSE
        E+ERR+Q+G  AD+GIKP FLCKWFY+E
Subjt:  EYERRRQMGHEADNGIKPIFLCKWFYSE

SwissProt top hitse value%identityAlignment
Q5PNS0 PHD finger protein At3g202801.9e-3747.5Show/hide
Query:  MNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNG
        M++ Q  S   NH EI KII K LQP++  +P WNPPSR+YM++A+ CQ C++TINE+D++LICDACEK +HLKC+Q  N + +P+ EWHCSRC+   NG
Subjt:  MNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNG

Query:  KPLPPKYGRVMR--SNPPPKLSVNTSGT-QPSEKRSGSIEQKASAGQLKLVSNGGSDLQS
        KP PP YGR  R  +    K+    +G    S K+ G ++ KA+  Q K + +  S LQ+
Subjt:  KPLPPKYGRVMR--SNPPPKLSVNTSGT-QPSEKRSGSIEQKASAGQLKLVSNGGSDLQS

Q8BRB7 Histone acetyltransferase KAT6B2.3e-0636.84Show/hide
Query:  TCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPL
        TC  C++     D++L CD+C++GFH++C   P  R +P+G W C  C     G+ L
Subjt:  TCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPL

Q8WML3 Histone acetyltransferase KAT6B2.3e-0636.84Show/hide
Query:  TCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPL
        TC  C++     D++L CD+C++GFH++C   P  R +P+G W C  C     G+ L
Subjt:  TCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPL

Q8WYB5 Histone acetyltransferase KAT6B2.3e-0636.84Show/hide
Query:  TCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPL
        TC  C++     D++L CD+C++GFH++C   P  R +P+G W C  C     G+ L
Subjt:  TCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPL

Q92794 Histone acetyltransferase KAT6A1.1e-0536.67Show/hide
Query:  TCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPK
        TC +C+      D++L CD+C++GFH++C   P  R +P+G W C  C     G+ L  K
Subjt:  TCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPK

Arabidopsis top hitse value%identityAlignment
AT1G50620.1 RING/FYVE/PHD zinc finger superfamily protein9.3e-8039.32Show/hide
Query:  IENQGDDQLVE--PHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGLAPKDIVGREGISSVIEDLGLHGKAKD
        +E + D++ VE  P   KKPR   E    + RVAEIVLV+S +  +RGGK P++ E++LM EA++KLV +C+   PKDI+G + I +VIEDLG +GK KD
Subjt:  IENQGDDQLVE--PHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGLAPKDIVGREGISSVIEDLGLHGKAKD

Query:  QKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGGTAAALPSGHVSVTGSASIQVQAQLPS
        Q+LGFR P+LTI+EKL+  K+KME+ KK    S   + P   N                                                + +  Q P+
Subjt:  QKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGGTAAALPSGHVSVTGSASIQVQAQLPS

Query:  NEVRAHIISSGFPISHQGRDSSSFLHG-VERPL------NGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKLPNHSAVSVQGITDSSTSRS
        +E++A   S     SH  R++S      +ERP        GT   P        +   +  TWSAQ  S+              S +S    +DS     
Subjt:  NEVRAHIISSGFPISHQGRDSSSFLHG-VERPL------NGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKLPNHSAVSVQGITDSSTSRS

Query:  SSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQTCQITINEIDSVLICDACEK
        SS    D SFRP +SQT  G   G++    +      P  +NNH EI K+I K+LQP+   +  WNPPSR+YM+KA+TCQ CQ TINEI++VLICDACEK
Subjt:  SSQAARDQSFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQTCQITINEIDSVLICDACEK

Query:  GFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQ-PSEKRSGSIEQKAS
        G+HLKC+ + N + +P+ EWHCSRC+ + NGK  PPKYGRVMRS    K+S +T+  Q P+EK  G ++QK S
Subjt:  GFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQ-PSEKRSGSIEQKAS

AT3G01460.1 methyl-CPG-binding domain 95.8e-0529.2Show/hide
Query:  EIVKIIQKLLQPQLPDHPTWNP-PSRDYMNKAVTCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRS
        E+V+ I     P  P  P   P P RD      +C  C      I+ V++CDACE+GFH+ CV +    A P  +W CS C T      L P        
Subjt:  EIVKIIQKLLQPQLPDHPTWNP-PSRDYMNKAVTCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMRS

Query:  NPPPKLSVNTSGTQPSEKRSGSIEQKASAGQLKLVSN
            KL ++ + + PS+      E+ + + +  L S+
Subjt:  NPPPKLSVNTSGTQPSEKRSGSIEQKASAGQLKLVSN

AT3G20280.1 RING/FYVE/PHD zinc finger superfamily protein3.2e-7238.51Show/hide
Query:  IENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGLAPKDIVGREGISSVIEDLGLHGKAKDQK
        ++  G + +  P  +KKPR   E    + RVAEIVLV+S +  +RGG+ P+  E+ELM EAR+KL  +C    PKDI+ ++ + SVIEDLG +GK KDQ+
Subjt:  IENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGLAPKDIVGREGISSVIEDLGLHGKAKDQK

Query:  LGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGGTAAALPSGHVSVTGSASIQVQAQLPSNE
        LGFR P +TI+EKL+  K+KME+++KY   S   +  T +    S+ S G L       ++K+S                           V  Q PS+E
Subjt:  LGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGGTAAALPSGHVSVTGSASIQVQAQLPSNE

Query:  VRAHIISSGFPISHQGRDSSSFLHGVERPLNG-TYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKLPNHSAVSVQGITDSSTSRSSSQAARDQ
        V A   +SG   SH   D    +      LNG + G+P+   S+ N+    A  WSAQ  S +S    P+ K+P  S+V                   D 
Subjt:  VRAHIISSGFPISHQGRDSSSFLHGVERPLNG-TYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKLPNHSAVSVQGITDSSTSRSSSQAARDQ

Query:  SFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQTCQITINEIDSVLICDACEKGFHLKCVQ
        SFRP    T TG         Q M++ Q  S   NH EI KII K LQP++  +P WNPPSR+YM++A+ CQ C++TINE+D++LICDACEK +HLKC+Q
Subjt:  SFRPSISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQTCQITINEIDSVLICDACEKGFHLKCVQ

Query:  SPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMR--SNPPPKLSVNTSGT-QPSEKRSGSIEQKASAGQLKLVSNGGSDLQS
          N + +P+ EWHCSRC+   NGKP PP YGR  R  +    K+    +G    S K+ G ++ KA+  Q K + +  S LQ+
Subjt:  SPNQRAIPRGEWHCSRCLTISNGKPLPPKYGRVMR--SNPPPKLSVNTSGT-QPSEKRSGSIEQKASAGQLKLVSNGGSDLQS

AT3G20280.2 RING/FYVE/PHD zinc finger superfamily protein1.4e-3847.5Show/hide
Query:  MNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNG
        M++ Q  S   NH EI KII K LQP++  +P WNPPSR+YM++A+ CQ C++TINE+D++LICDACEK +HLKC+Q  N + +P+ EWHCSRC+   NG
Subjt:  MNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRCLTISNG

Query:  KPLPPKYGRVMR--SNPPPKLSVNTSGT-QPSEKRSGSIEQKASAGQLKLVSNGGSDLQS
        KP PP YGR  R  +    K+    +G    S K+ G ++ KA+  Q K + +  S LQ+
Subjt:  KPLPPKYGRVMR--SNPPPKLSVNTSGT-QPSEKRSGSIEQKASAGQLKLVSNGGSDLQS

AT5G09790.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 51.3e-0440.82Show/hide
Query:  VTCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRC
        VTC+ C     + D +L+CD C++GFH+KC++ P    +P G W C  C
Subjt:  VTCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHCSRC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGAACCAATGGAGGAGACGGTTGTTGATGGGGAACCAACTGAGCCCACCGGGACTCACTCCGGCGACAAGAGGCCAATTGAGAATCAGGGAGACGATCAACTTGT
AGAGCCCCACTTGAGCAAGAAGCCGCGGAATGGTCGCGAACTGGGTCGTAATCTCAGAAGGGTCGCAGAGATTGTATTGGTCATGTCGACTATGACGGCAGTGCGTGGTG
GGAAAAATCCCAGTGATGCCGAGGTTGAATTGATGGCTGAGGCGAGGGCTAAGCTGGTTCAGATTTGTGAAGGTTTGGCTCCCAAGGATATTGTGGGAAGGGAAGGTATT
AGCTCAGTAATTGAAGATTTGGGGCTACATGGGAAGGCTAAGGATCAGAAGTTAGGGTTTCGGGGTCCCAGGTTGACAATAGCGGAGAAATTGGCGCAGGCAAAGAAGAA
GATGGAAGATTCCAAGAAGTATATCCCTCCGTCAGCTTATGGTTCTCATCCAACCCAAACAAACTTCACTTCATCCGTTGAGAGCCGTGGGGCATTGCCTACTGTAAGGA
TGTTTCCCTCAGATAAATCAAGTCATGTACCGACTTCTGTGGGAGGCACTGCAGCTGCTCTGCCTTCAGGTCATGTTTCTGTCACTGGTTCTGCATCTATACAGGTTCAA
GCTCAACTGCCAAGCAATGAAGTCAGAGCACATATTATTTCCAGTGGATTTCCTATTAGCCATCAAGGAAGGGATTCTTCCTCATTCTTGCATGGCGTTGAAAGACCACT
AAATGGGACATATGGATCTCCAATGCAAGTTAATTCTACAGTAAATCACCCTCTGGCGAGTGCTCCAACTTGGTCTGCTCAAAGTCAATCTGCCTTGTCAGCTAAAGGTG
GGCCAGAGCACAAGTTGCCGAATCATTCTGCTGTTAGTGTTCAGGGAATCACAGACTCAAGCACGTCACGATCGTCTTCTCAAGCAGCAAGGGACCAGAGCTTTAGACCT
TCAATTTCTCAAACTGTGACAGGAAATATGGCTGGTTTGCAGCCGCATTTACAGAGCATGAACTTTGTGCAAGGACCTTCACTTTCTAATAACCACAATGAAATTGTCAA
AATTATTCAGAAGCTCCTACAACCACAGCTTCCAGATCATCCTACTTGGAATCCTCCTTCTAGAGATTACATGAACAAGGCTGTGACTTGCCAAACTTGTCAAATTACCA
TCAATGAGATTGATAGTGTACTTATATGTGATGCTTGCGAGAAAGGATTTCACTTGAAATGTGTGCAATCACCTAATCAGAGAGCAATTCCTAGAGGCGAATGGCACTGC
TCAAGATGTTTAACTATAAGCAATGGGAAGCCTTTACCTCCTAAATATGGGCGTGTCATGAGGAGCAACCCCCCACCAAAATTATCTGTCAACACCAGTGGAACTCAGCC
ATCAGAGAAGAGATCAGGGTCCATAGAACAAAAGGCCAGTGCTGGTCAGCTGAAGTTAGTTTCTAATGGAGGTTCAGATTTGCAAAGTCAGCCTGCTGACCATGGAAGCA
ATGCCAATGAATCATCTGGTACCAAGATTCCAGATGTGGAAGAAATTCATGGAAATCATTTTCTACCAATTAGGAAAGACATAGATGAGAAACCAACCTCTTCGACATCC
CTGAATACACAAGCCAAATCCTTGGGGCTGGTTTGTGACCCCTCTTCTGCTGAGCTATCAAGTGAAAGATCTGCTCTGCACATTAAAAGTTCTCAATCACCAAAAGGTGA
AGATGGATCTCTGAATCAAAAGCAGAGCCTTCAGAAGAATCCCAAACAGTGGCTGACAATTCTAGCAAGTAATTTTGAAAATTTTGAAGCTAGCATCATAAATAGAGAGC
AGTCAGGGACTTCTTCAGATGACTTGCGTGATGTTGAATGGATTGGAAGTCCACAACTCCTTACTGACGGAAAGGCATATTACAAATCCTGTCGCATTGATGGTGTTACA
TATAAATCCATGTGCCATGATTTCAATAATGGGTCGAACTGGGCTGTTCTTAAGAAGTGCTACTTTTATGAGGACTTGCCAAAGACAGTTGCCCACCTCCGCCCATGCTC
CCCAGAACACAATGAGGTATATGCATCCGATGGCTATATTTGTTTAATGGTGGGCTTAATTCGAAGCCCATGTGAAATTCTTCCTGTTGCCAAGTATAAAGAAGAATATG
AAAGACGAAGGCAAATGGGTCATGAGGCAGATAATGGAATAAAGCCAATTTTCTTGTGCAAATGGTTTTATAGTGAAGTAGAAATGAGTTTGTACCTTTTACCGGTGTCA
TCTGTGAAAACTTCTCAGCGATTCATCCCATCAGACATTCCCATTGACAGTTCACACTGTTCTTCTGCCATTTCAAGTGCCCTGCAGACCTCAGTTCTACATTTCAACAG
CGTCTGCCAGAAAACTCAAGATGTGGGCAGCACAACTTCACATGCTTTGCAAGTTCGACGCATCCAGAAGGATCAACACCAGATTTTGCTGCCAAATTTTCGCACGGTTC
TCTCCGAAGCATCGGTGGGAAGCCTCGTAGACTTGGCGGATGAGAGACACAGCCTTCGTCTTCGAGACTGCCACGACTCGAACCTGGTCGAATTGCCGGCCAGACCGCTC
AGCAGCCTGCCGAACTCGGAACATAACCGATCGGAGAGCTGCGACGGCTGCGCCTTCCACCACCGGAGCAGCCATAGGATAGCCTTTTTGCTGGTCTACTCGGAGGATTT
TTCCCAGCTCGTGAGTCGTTCTTCTTCAGCGGAGGTGGAACAGAGAGTCCGTAGAGGAAGATTTGAGATTACAGTGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGAACCAATGGAGGAGACGGTTGTTGATGGGGAACCAACTGAGCCCACCGGGACTCACTCCGGCGACAAGAGGCCAATTGAGAATCAGGGAGACGATCAACTTGT
AGAGCCCCACTTGAGCAAGAAGCCGCGGAATGGTCGCGAACTGGGTCGTAATCTCAGAAGGGTCGCAGAGATTGTATTGGTCATGTCGACTATGACGGCAGTGCGTGGTG
GGAAAAATCCCAGTGATGCCGAGGTTGAATTGATGGCTGAGGCGAGGGCTAAGCTGGTTCAGATTTGTGAAGGTTTGGCTCCCAAGGATATTGTGGGAAGGGAAGGTATT
AGCTCAGTAATTGAAGATTTGGGGCTACATGGGAAGGCTAAGGATCAGAAGTTAGGGTTTCGGGGTCCCAGGTTGACAATAGCGGAGAAATTGGCGCAGGCAAAGAAGAA
GATGGAAGATTCCAAGAAGTATATCCCTCCGTCAGCTTATGGTTCTCATCCAACCCAAACAAACTTCACTTCATCCGTTGAGAGCCGTGGGGCATTGCCTACTGTAAGGA
TGTTTCCCTCAGATAAATCAAGTCATGTACCGACTTCTGTGGGAGGCACTGCAGCTGCTCTGCCTTCAGGTCATGTTTCTGTCACTGGTTCTGCATCTATACAGGTTCAA
GCTCAACTGCCAAGCAATGAAGTCAGAGCACATATTATTTCCAGTGGATTTCCTATTAGCCATCAAGGAAGGGATTCTTCCTCATTCTTGCATGGCGTTGAAAGACCACT
AAATGGGACATATGGATCTCCAATGCAAGTTAATTCTACAGTAAATCACCCTCTGGCGAGTGCTCCAACTTGGTCTGCTCAAAGTCAATCTGCCTTGTCAGCTAAAGGTG
GGCCAGAGCACAAGTTGCCGAATCATTCTGCTGTTAGTGTTCAGGGAATCACAGACTCAAGCACGTCACGATCGTCTTCTCAAGCAGCAAGGGACCAGAGCTTTAGACCT
TCAATTTCTCAAACTGTGACAGGAAATATGGCTGGTTTGCAGCCGCATTTACAGAGCATGAACTTTGTGCAAGGACCTTCACTTTCTAATAACCACAATGAAATTGTCAA
AATTATTCAGAAGCTCCTACAACCACAGCTTCCAGATCATCCTACTTGGAATCCTCCTTCTAGAGATTACATGAACAAGGCTGTGACTTGCCAAACTTGTCAAATTACCA
TCAATGAGATTGATAGTGTACTTATATGTGATGCTTGCGAGAAAGGATTTCACTTGAAATGTGTGCAATCACCTAATCAGAGAGCAATTCCTAGAGGCGAATGGCACTGC
TCAAGATGTTTAACTATAAGCAATGGGAAGCCTTTACCTCCTAAATATGGGCGTGTCATGAGGAGCAACCCCCCACCAAAATTATCTGTCAACACCAGTGGAACTCAGCC
ATCAGAGAAGAGATCAGGGTCCATAGAACAAAAGGCCAGTGCTGGTCAGCTGAAGTTAGTTTCTAATGGAGGTTCAGATTTGCAAAGTCAGCCTGCTGACCATGGAAGCA
ATGCCAATGAATCATCTGGTACCAAGATTCCAGATGTGGAAGAAATTCATGGAAATCATTTTCTACCAATTAGGAAAGACATAGATGAGAAACCAACCTCTTCGACATCC
CTGAATACACAAGCCAAATCCTTGGGGCTGGTTTGTGACCCCTCTTCTGCTGAGCTATCAAGTGAAAGATCTGCTCTGCACATTAAAAGTTCTCAATCACCAAAAGGTGA
AGATGGATCTCTGAATCAAAAGCAGAGCCTTCAGAAGAATCCCAAACAGTGGCTGACAATTCTAGCAAGTAATTTTGAAAATTTTGAAGCTAGCATCATAAATAGAGAGC
AGTCAGGGACTTCTTCAGATGACTTGCGTGATGTTGAATGGATTGGAAGTCCACAACTCCTTACTGACGGAAAGGCATATTACAAATCCTGTCGCATTGATGGTGTTACA
TATAAATCCATGTGCCATGATTTCAATAATGGGTCGAACTGGGCTGTTCTTAAGAAGTGCTACTTTTATGAGGACTTGCCAAAGACAGTTGCCCACCTCCGCCCATGCTC
CCCAGAACACAATGAGGTATATGCATCCGATGGCTATATTTGTTTAATGGTGGGCTTAATTCGAAGCCCATGTGAAATTCTTCCTGTTGCCAAGTATAAAGAAGAATATG
AAAGACGAAGGCAAATGGGTCATGAGGCAGATAATGGAATAAAGCCAATTTTCTTGTGCAAATGGTTTTATAGTGAAGTAGAAATGAGTTTGTACCTTTTACCGGTGTCA
TCTGTGAAAACTTCTCAGCGATTCATCCCATCAGACATTCCCATTGACAGTTCACACTGTTCTTCTGCCATTTCAAGTGCCCTGCAGACCTCAGTTCTACATTTCAACAG
CGTCTGCCAGAAAACTCAAGATGTGGGCAGCACAACTTCACATGCTTTGCAAGTTCGACGCATCCAGAAGGATCAACACCAGATTTTGCTGCCAAATTTTCGCACGGTTC
TCTCCGAAGCATCGGTGGGAAGCCTCGTAGACTTGGCGGATGAGAGACACAGCCTTCGTCTTCGAGACTGCCACGACTCGAACCTGGTCGAATTGCCGGCCAGACCGCTC
AGCAGCCTGCCGAACTCGGAACATAACCGATCGGAGAGCTGCGACGGCTGCGCCTTCCACCACCGGAGCAGCCATAGGATAGCCTTTTTGCTGGTCTACTCGGAGGATTT
TTCCCAGCTCGTGAGTCGTTCTTCTTCAGCGGAGGTGGAACAGAGAGTCCGTAGAGGAAGATTTGAGATTACAGTGAAATGA
Protein sequenceShow/hide protein sequence
MTEPMEETVVDGEPTEPTGTHSGDKRPIENQGDDQLVEPHLSKKPRNGRELGRNLRRVAEIVLVMSTMTAVRGGKNPSDAEVELMAEARAKLVQICEGLAPKDIVGREGI
SSVIEDLGLHGKAKDQKLGFRGPRLTIAEKLAQAKKKMEDSKKYIPPSAYGSHPTQTNFTSSVESRGALPTVRMFPSDKSSHVPTSVGGTAAALPSGHVSVTGSASIQVQ
AQLPSNEVRAHIISSGFPISHQGRDSSSFLHGVERPLNGTYGSPMQVNSTVNHPLASAPTWSAQSQSALSAKGGPEHKLPNHSAVSVQGITDSSTSRSSSQAARDQSFRP
SISQTVTGNMAGLQPHLQSMNFVQGPSLSNNHNEIVKIIQKLLQPQLPDHPTWNPPSRDYMNKAVTCQTCQITINEIDSVLICDACEKGFHLKCVQSPNQRAIPRGEWHC
SRCLTISNGKPLPPKYGRVMRSNPPPKLSVNTSGTQPSEKRSGSIEQKASAGQLKLVSNGGSDLQSQPADHGSNANESSGTKIPDVEEIHGNHFLPIRKDIDEKPTSSTS
LNTQAKSLGLVCDPSSAELSSERSALHIKSSQSPKGEDGSLNQKQSLQKNPKQWLTILASNFENFEASIINREQSGTSSDDLRDVEWIGSPQLLTDGKAYYKSCRIDGVT
YKSMCHDFNNGSNWAVLKKCYFYEDLPKTVAHLRPCSPEHNEVYASDGYICLMVGLIRSPCEILPVAKYKEEYERRRQMGHEADNGIKPIFLCKWFYSEVEMSLYLLPVS
SVKTSQRFIPSDIPIDSSHCSSAISSALQTSVLHFNSVCQKTQDVGSTTSHALQVRRIQKDQHQILLPNFRTVLSEASVGSLVDLADERHSLRLRDCHDSNLVELPARPL
SSLPNSEHNRSESCDGCAFHHRSSHRIAFLLVYSEDFSQLVSRSSSAEVEQRVRRGRFEITVK