CuGenDBv2

Lag0030191 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030191
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr8:45244814..45264766
RNA-Seq Expression: Lag0030191
Synteny: Lag0030191
Gene Ontology terms:
GO:0034470 - ncRNA processing (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0071013 - catalytic step 2 spliceosome (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR005344 - TMEM33/Pom33 family
IPR006568 - PSP, proline-rich
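The genome location string above follows a `chromosome:start..end` pattern. As a minimal sketch, it can be split into its parts as shown below. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced for illustration, and the length calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'chr8:45244814..45264766'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return GenomeLocation(chrom, int(start), int(end))


loc = parse_location("chr8:45244814..45264766")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 19953 bp on chr8.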


Homology
GenBank top hits (e value, %identity, alignment)
KAG7016075.1 Zinc finger CCHC domain-containing protein 8 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | %identity: 77.52
Query:  MGEEREDSQRLKRAAAAAYDYENDPRWADYWSNILIPPNMASRPDVVDHYKRKFYHRYIDAELVVEAMSSSSSTQSSRPSAASSTAPPPTNDRSRSRSSG
        M EERED QRLKRAAAAAYDYENDP+WADYWSNILIPP+MASRPDVVDHYKRKFY RYID +LVVEAMSSSSSTQSSRPSA SS APPPTNDRSR RSSG
Subjt:  MGEEREDSQRLKRAAAAAYDYENDPRWADYWSNILIPPNMASRPDVVDHYKRKFYHRYIDAELVVEAMSSSSSTQSSRPSAASSTAPPPTNDRSRSRSSG

Query:  STTRAAGSSASADPNSTPLRWDRQTIQFSVNAWVFIVAVLAIFPLIPKNLSQRAYRLSFMGTTCSSLYSLYSLYGKPRAWNLQALQVYFQSIIATKDFIY
        STTR +G+SASAD N +PLRWDRQTIQFSVNAWV IVAVLAIFPLIPKNLSQRAYRLSFMG TCSSLYSLYSLYGKPRAWNLQALQ YFQSIIATKDFIY
Subjt:  STTRAAGSSASADPNSTPLRWDRQTIQFSVNAWVFIVAVLAIFPLIPKNLSQRAYRLSFMGTTCSSLYSLYSLYGKPRAWNLQALQVYFQSIIATKDFIY

Query:  FTYCITFVTSNICLKFALIPILCRALEHVAKFLRRNFARSSLYRKYLEEPCVWVESNSTTLSILSSQAEIGLGFILIISLLSWQRNFLHTFMYWQLLKLM
        F YCITF+TSNICLKFALIPILCRALEHVAKFLRRNFARSSLYRKYLEEPCVWVESNSTTLSILSSQAEIGLGF+LIISLLS             LLKLM
Subjt:  FTYCITFVTSNICLKFALIPILCRALEHVAKFLRRNFARSSLYRKYLEEPCVWVESNSTTLSILSSQAEIGLGFILIISLLSWQRNFLHTFMYWQLLKLM

Query:  YHAPVTSGYHRSAWSNIGRIVSPLIYRYAPFLNTPLSMAQRWWF--RFLMVLVYMPSVYGDGGKNCP---------------------------------
        YHAPVTSGYHRSAWSNIGR VSPLIYRYAPFLNTPLSMAQRWWF  R  + L    S +       P                                 
Subjt:  YHAPVTSGYHRSAWSNIGRIVSPLIYRYAPFLNTPLSMAQRWWF--RFLMVLVYMPSVYGDGGKNCP---------------------------------

Query:  --LYPHFMGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDF
           + HFMGTEDFIALPASGD G+ENESN+SLS NE  EA SQSSVL+CKDN ASIEK ELADDVQ EDM CIPQSDLNDETQ S+SDMEIEDLNNLPD 
Subjt:  --LYPHFMGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDF

Query:  SKTRSRSENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS
        SKTRS SEN KI SEAEYLPVNS DENI PS EPLQQNELH R E+V H ESK   KDLVDNSSFSKT   LT+   VSI             ENG A S
Subjt:  SKTRSRSENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS

Query:  -HHGGPSKIHKSDAISGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWID
         HHGGP KIHKSDAI GVK+PR+ MDE+QPSVHIVY SLTR SKQKLDELLKQWSEW+AQ+GS +Q                             TFWID
Subjt:  -HHGGPSKIHKSDAISGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWID

Query:  NQRSEQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRG
        NQ SEQ QNF+P+DDNSVPLYDRGFTLGLTSANDSSN EGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVN+ARN Y++H++SGSRNSTRYYQ SRG
Subjt:  NQRSEQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRG

Query:  GKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADER
        GKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDP+DEDQPSGITI+ADEK  EQEDGEITEPEYRKP KKM+VEFPGINAPIPE+ADER
Subjt:  GKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADER

Query:  LWAAEPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSP
        LW+AEP SS LPR RS QRLNHH E+DGRGNDH+QQRWSRDYRD RPPGVDSVKSP + FTPRYGGH+ SY S+S R NFS SRS +LGR+HSDRGR+SP
Subjt:  LWAAEPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSP

Query:  LLDDDYSRYGSHSSSPFSPPRRR
         LDDDYSRYGS+SSSPFSPPRRR
Subjt:  LLDDDYSRYGSHSSSPFSPPRRR

XP_008459422.1 PREDICTED: uncharacterized protein LOC103498564 isoform X1 [Cucumis melo] | e value: 7.5e-300 | %identity: 85.81
Query:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS
        MGTEDFIALPASGDSGNE ESN+SL+ NE  EAYSQSSVLKCKD+DASIEK EL DDVQ EDMHC+PQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRS
Subjt:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS

Query:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS-HHGGPS
        EN++I S+AE LPVNS D NI PS EPLQQNELHTRYE+VCHVES+NF KDLVDNSSFSKTG QLTV N VSI+FN  NSG   P ENG ATS HHGGPS
Subjt:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS-HHGGPS

Query:  KIHKSDAISGVKRPRI---TMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS
        KI KSD ISGVKRPR+    MDE+QPSVHIVY SLTR SKQKLDELLKQWSEWHAQQGSL++DDK++ENLESGEETFFPALCVGTKK SAVTFW+DNQ+S
Subjt:  KIHKSDAISGVKRPRI---TMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS

Query:  EQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYD
        EQQQ F+P+DDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVN+ARNKY++ HNS SRNSTRYYQNSRGGKYD
Subjt:  EQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYD

Query:  DLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAA
        DLRPG LDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDP+DEDQPSGITI+ADEKTDEQEDGEITE EYRKPQKKM+VEFPGINAPIPENADERLWA 
Subjt:  DLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAA

Query:  EPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDD
        EPSSSGLPR+RS+QRLNH+ EYD RGNDH+QQRWSRDYRDDRPPGVDS+KSPP  FTPRYGGHDFSYDSQ+ RG+FS SRSP+LGR HSDRGRRSP  DD
Subjt:  EPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDD

Query:  DYSRY-GSHSSSPFSPPRRR
        DYSRY  S+SSS FSPPRRR
Subjt:  DYSRY-GSHSSSPFSPPRRR

XP_008459423.1 PREDICTED: uncharacterized protein LOC103498564 isoform X2 [Cucumis melo] | e value: 1.6e-294 | %identity: 84.84
Query:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS
        MGTEDFIALPASGDSGNE ESN+SL+ NE  EAYSQSSVLKCKD+DASIEK EL DDVQ EDMHC+PQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRS
Subjt:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS

Query:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS-HHGGPS
        EN++I S+AE LPVNS D NI PS EPLQQNELHTRYE+VCHVES+NF KDLVDNSSFSKTG QLTV N VSI+FN  NSG   P ENG ATS HHGGP 
Subjt:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS-HHGGPS

Query:  KIHKSDAISGVKRPRI---TMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS
               ISGVKRPR+    MDE+QPSVHIVY SLTR SKQKLDELLKQWSEWHAQQGSL++DDK++ENLESGEETFFPALCVGTKK SAVTFW+DNQ+S
Subjt:  KIHKSDAISGVKRPRI---TMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS

Query:  EQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYD
        EQQQ F+P+DDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVN+ARNKY++ HNS SRNSTRYYQNSRGGKYD
Subjt:  EQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYD

Query:  DLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAA
        DLRPG LDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDP+DEDQPSGITI+ADEKTDEQEDGEITE EYRKPQKKM+VEFPGINAPIPENADERLWA 
Subjt:  DLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAA

Query:  EPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDD
        EPSSSGLPR+RS+QRLNH+ EYD RGNDH+QQRWSRDYRDDRPPGVDS+KSPP  FTPRYGGHDFSYDSQ+ RG+FS SRSP+LGR HSDRGRRSP  DD
Subjt:  EPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDD

Query:  DYSRY-GSHSSSPFSPPRRR
        DYSRY  S+SSS FSPPRRR
Subjt:  DYSRY-GSHSSSPFSPPRRR

XP_022133813.1 uncharacterized protein LOC111006283 [Momordica charantia] | e value: 3.1e-298 | %identity: 84.93
Query:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS
        M TEDFIALPASGDSGNENE+N+ LSC+E  E  SQSSVLKCKD+DASIEK ELADDVQF+DM CIPQSDLNDE Q SDSDMEIEDLNNLPDF+K+RSRS
Subjt:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS

Query:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATSHHGGPSK
        ENN+I +EA+YLPVNS  ENIQPSREPLQQNELH RYENVCHV SKNF  DLVDNSSF KTG QLTVTN VSIE+NGFNSGV  P ENG ATS+HG   K
Subjt:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATSHHGGPSK

Query:  IHKSDAISGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRSEQQQ
         HKSDAISGVKRPR+ MDE+QPSVH++Y+SLTRASKQKLDELLKQWSEWHAQQG L+QDDKESENLESGEETFFPALC+GTKK+SAVTFWIDNQR EQQQ
Subjt:  IHKSDAISGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRSEQQQ

Query:  NFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKY--RRHHNSGSRNSTRYYQNSRGGKYDDL
        NFIPLDDNSVP YDRGFTLGLTSAND+SNVEGGQKIIDDASRCFNCGSYNH+L+DC KPRDN AVN+ARN+Y  +RH NSGSRNSTRYYQNSRGGKYDDL
Subjt:  NFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKY--RRHHNSGSRNSTRYYQNSRGGKYDDL

Query:  RPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAAEP
        RPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIF DE+ +EQEDGEITEPEYRKP++K +VEFPGINAPIPENADE LWAAEP
Subjt:  RPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAAEP

Query:  SSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDDDY
        SSSGLPRSRSHQRLNHHAEYDGRGND Y QRW RDYRDD PPGVDSVKSPPM +TPRYG +DF++DSQSSR N S SRSP+LGR+HSDRGRRSP  +DDY
Subjt:  SSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDDDY

Query:  SRYGSHSSSPFSPPRRR
        SRYGS+S+S FSPPRRR
Subjt:  SRYGSHSSSPFSPPRRR

XP_038890370.1 uncharacterized protein LOC120079961 [Benincasa hispida] | e value: 3.2e-303 | %identity: 87.52
Query:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS
        MGTEDFIALPASGDSGNE ESN+SLS +E  +  SQSSVLKCKD+DAS EKVELADDV  EDMH IPQSDL DETQRSDSDMEIEDLNNLPDF+KTRSRS
Subjt:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS

Query:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATSH-HGGPS
        ENNKI SEAEYLPVNS DENI PS EPLQQNELHTRYE+VCHVES+NF KDLVDNSSF KTG QLTVTN VSIEFN  NSGV  P ENGLA+SH HGGPS
Subjt:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATSH-HGGPS

Query:  KIHKSDAISGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRSEQQ
        KIHKSDAISGVKRPR+ MDE+QPSVHIVY SLTR SKQKLDELLKQWSEWHAQ+GSL+ DDK+SENLESGEETFFPALCVGTKK SAVTFWIDNQRSEQQ
Subjt:  KIHKSDAISGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRSEQQ

Query:  QNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYDDLR
        QNF+P+DDNSVPLYDRGFTLGLTSA+DSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVN+ARNKY++ H+SGSRNSTRYYQNSRGGKYDDLR
Subjt:  QNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYDDLR

Query:  PGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAAEPS
        PGALD ETRQLLGLKELDPPPWLNRMRELGYPPGYLDP+DEDQPSGITI+ADEKTDEQEDGEITE EYRKP+KKM+V FPGINAPIPENADERLWA EPS
Subjt:  PGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAAEPS

Query:  SSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDDDYS
        S GLPR+RS+QRLNH+ EYD RGNDH+QQRWSRDYRDDRPPGVDSVKSPP  FTPRYG HDFSYDSQ+ RGNFS SRSP+LGR HSDRGRRSPLLDDDYS
Subjt:  SSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDDDYS

Query:  RYG-SHSSSPFSPPRRR
        RYG S+SSS FSPPRRR
Subjt:  RYG-SHSSSPFSPPRRR

TrEMBL top hits (e value, %identity, alignment)
A0A1S3CAN3 uncharacterized protein LOC103498564 isoform X1 | e value: 3.6e-300 | %identity: 85.81
Query:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS
        MGTEDFIALPASGDSGNE ESN+SL+ NE  EAYSQSSVLKCKD+DASIEK EL DDVQ EDMHC+PQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRS
Subjt:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS

Query:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS-HHGGPS
        EN++I S+AE LPVNS D NI PS EPLQQNELHTRYE+VCHVES+NF KDLVDNSSFSKTG QLTV N VSI+FN  NSG   P ENG ATS HHGGPS
Subjt:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS-HHGGPS

Query:  KIHKSDAISGVKRPRI---TMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS
        KI KSD ISGVKRPR+    MDE+QPSVHIVY SLTR SKQKLDELLKQWSEWHAQQGSL++DDK++ENLESGEETFFPALCVGTKK SAVTFW+DNQ+S
Subjt:  KIHKSDAISGVKRPRI---TMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS

Query:  EQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYD
        EQQQ F+P+DDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVN+ARNKY++ HNS SRNSTRYYQNSRGGKYD
Subjt:  EQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYD

Query:  DLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAA
        DLRPG LDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDP+DEDQPSGITI+ADEKTDEQEDGEITE EYRKPQKKM+VEFPGINAPIPENADERLWA 
Subjt:  DLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAA

Query:  EPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDD
        EPSSSGLPR+RS+QRLNH+ EYD RGNDH+QQRWSRDYRDDRPPGVDS+KSPP  FTPRYGGHDFSYDSQ+ RG+FS SRSP+LGR HSDRGRRSP  DD
Subjt:  EPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDD

Query:  DYSRY-GSHSSSPFSPPRRR
        DYSRY  S+SSS FSPPRRR
Subjt:  DYSRY-GSHSSSPFSPPRRR

A0A1S3CBD3 uncharacterized protein LOC103498564 isoform X2 | e value: 7.8e-295 | %identity: 84.84
Query:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS
        MGTEDFIALPASGDSGNE ESN+SL+ NE  EAYSQSSVLKCKD+DASIEK EL DDVQ EDMHC+PQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRS
Subjt:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS

Query:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS-HHGGPS
        EN++I S+AE LPVNS D NI PS EPLQQNELHTRYE+VCHVES+NF KDLVDNSSFSKTG QLTV N VSI+FN  NSG   P ENG ATS HHGGP 
Subjt:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS-HHGGPS

Query:  KIHKSDAISGVKRPRI---TMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS
               ISGVKRPR+    MDE+QPSVHIVY SLTR SKQKLDELLKQWSEWHAQQGSL++DDK++ENLESGEETFFPALCVGTKK SAVTFW+DNQ+S
Subjt:  KIHKSDAISGVKRPRI---TMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS

Query:  EQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYD
        EQQQ F+P+DDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVN+ARNKY++ HNS SRNSTRYYQNSRGGKYD
Subjt:  EQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYD

Query:  DLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAA
        DLRPG LDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDP+DEDQPSGITI+ADEKTDEQEDGEITE EYRKPQKKM+VEFPGINAPIPENADERLWA 
Subjt:  DLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAA

Query:  EPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDD
        EPSSSGLPR+RS+QRLNH+ EYD RGNDH+QQRWSRDYRDDRPPGVDS+KSPP  FTPRYGGHDFSYDSQ+ RG+FS SRSP+LGR HSDRGRRSP  DD
Subjt:  EPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDD

Query:  DYSRY-GSHSSSPFSPPRRR
        DYSRY  S+SSS FSPPRRR
Subjt:  DYSRY-GSHSSSPFSPPRRR

A0A5D3BMZ1 Zinc finger CCHC domain-containing protein 8 isoform X2 | e value: 7.8e-295 | %identity: 84.84
Query:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS
        MGTEDFIALPASGDSGNE ESN+SL+ NE  EAYSQSSVLKCKD+DASIEK EL DDVQ EDMHC+PQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRS
Subjt:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS

Query:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS-HHGGPS
        EN++I S+AE LPVNS D NI PS EPLQQNELHTRYE+VCHVES+NF KDLVDNSSFSKTG QLTV N VSI+FN  NSG   P ENG ATS HHGGP 
Subjt:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS-HHGGPS

Query:  KIHKSDAISGVKRPRI---TMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS
               ISGVKRPR+    MDE+QPSVHIVY SLTR SKQKLDELLKQWSEWHAQQGSL++DDK++ENLESGEETFFPALCVGTKK SAVTFW+DNQ+S
Subjt:  KIHKSDAISGVKRPRI---TMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS

Query:  EQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYD
        EQQQ F+P+DDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVN+ARNKY++ HNS SRNSTRYYQNSRGGKYD
Subjt:  EQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYD

Query:  DLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAA
        DLRPG LDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDP+DEDQPSGITI+ADEKTDEQEDGEITE EYRKPQKKM+VEFPGINAPIPENADERLWA 
Subjt:  DLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAA

Query:  EPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDD
        EPSSSGLPR+RS+QRLNH+ EYD RGNDH+QQRWSRDYRDDRPPGVDS+KSPP  FTPRYGGHDFSYDSQ+ RG+FS SRSP+LGR HSDRGRRSP  DD
Subjt:  EPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDD

Query:  DYSRY-GSHSSSPFSPPRRR
        DYSRY  S+SSS FSPPRRR
Subjt:  DYSRY-GSHSSSPFSPPRRR

A0A6J1BX16 uncharacterized protein LOC111006283 | e value: 1.5e-298 | %identity: 84.93
Query:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS
        M TEDFIALPASGDSGNENE+N+ LSC+E  E  SQSSVLKCKD+DASIEK ELADDVQF+DM CIPQSDLNDE Q SDSDMEIEDLNNLPDF+K+RSRS
Subjt:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS

Query:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATSHHGGPSK
        ENN+I +EA+YLPVNS  ENIQPSREPLQQNELH RYENVCHV SKNF  DLVDNSSF KTG QLTVTN VSIE+NGFNSGV  P ENG ATS+HG   K
Subjt:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATSHHGGPSK

Query:  IHKSDAISGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRSEQQQ
         HKSDAISGVKRPR+ MDE+QPSVH++Y+SLTRASKQKLDELLKQWSEWHAQQG L+QDDKESENLESGEETFFPALC+GTKK+SAVTFWIDNQR EQQQ
Subjt:  IHKSDAISGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRSEQQQ

Query:  NFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKY--RRHHNSGSRNSTRYYQNSRGGKYDDL
        NFIPLDDNSVP YDRGFTLGLTSAND+SNVEGGQKIIDDASRCFNCGSYNH+L+DC KPRDN AVN+ARN+Y  +RH NSGSRNSTRYYQNSRGGKYDDL
Subjt:  NFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKY--RRHHNSGSRNSTRYYQNSRGGKYDDL

Query:  RPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAAEP
        RPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIF DE+ +EQEDGEITEPEYRKP++K +VEFPGINAPIPENADE LWAAEP
Subjt:  RPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAAEP

Query:  SSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDDDY
        SSSGLPRSRSHQRLNHHAEYDGRGND Y QRW RDYRDD PPGVDSVKSPPM +TPRYG +DF++DSQSSR N S SRSP+LGR+HSDRGRRSP  +DDY
Subjt:  SSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDDDY

Query:  SRYGSHSSSPFSPPRRR
        SRYGS+S+S FSPPRRR
Subjt:  SRYGSHSSSPFSPPRRR

E5GCT2 Nucleic acid binding protein | e value: 7.8e-295 | %identity: 84.84
Query:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS
        MGTEDFIALPASGDSGNE ESN+SL+ NE  EAYSQSSVLKCKD+DASIEK EL DDVQ EDMHC+PQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRS
Subjt:  MGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSDLNDETQRSDSDMEIEDLNNLPDFSKTRSRS

Query:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS-HHGGPS
        EN++I S+AE LPVNS D NI PS EPLQQNELHTRYE+VCHVES+NF KDLVDNSSFSKTG QLTV N VSI+FN  NSG   P ENG ATS HHGGP 
Subjt:  ENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNSGVPTPTENGLATS-HHGGPS

Query:  KIHKSDAISGVKRPRI---TMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS
               ISGVKRPR+    MDE+QPSVHIVY SLTR SKQKLDELLKQWSEWHAQQGSL++DDK++ENLESGEETFFPALCVGTKK SAVTFW+DNQ+S
Subjt:  KIHKSDAISGVKRPRI---TMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS

Query:  EQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYD
        EQQQ F+P+DDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVN+ARNKY++ HNS SRNSTRYYQNSRGGKYD
Subjt:  EQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYD

Query:  DLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAA
        DLRPG LDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDP+DEDQPSGITI+ADEKTDEQEDGEITE EYRKPQKKM+VEFPGINAPIPENADERLWA 
Subjt:  DLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAA

Query:  EPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDD
        EPSSSGLPR+RS+QRLNH+ EYD RGNDH+QQRWSRDYRDDRPPGVDS+KSPP  FTPRYGGHDFSYDSQ+ RG+FS SRSP+LGR HSDRGRRSP  DD
Subjt:  EPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDD

Query:  DYSRY-GSHSSSPFSPPRRR
        DYSRY  S+SSS FSPPRRR
Subjt:  DYSRY-GSHSSSPFSPPRRR

SwissProt top hits (e value, %identity, alignment)
Q5F3D1 Zinc finger CCHC domain-containing protein 8 | e value: 1.4e-14 | %identity: 29.67
Query:  SRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNS--TRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPD
        S CFNCGS  H +KDC KPR+ A ++  R ++   +   S  +   RY+      ++   +PG +  E +  LG+     PP++ RMR+LGYPPG+L  +
Subjt:  SRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNS--TRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPD

Query:  DEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAAEPSSSGLPRSR----SHQRLNHHA
         E + SG+ ++  +  +E ED    +P++        + +PG N   P    +  W    S    P  +    +H   N HA
Subjt:  DEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAAEPSSSGLPRSR----SHQRLNHHA

Q5R789 Zinc finger CCHC domain-containing protein 8 | e value: 1.7e-12 | %identity: 32.72
Query:  GQKIIDDASR----CFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNS--TRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMR
        GQ+I   A R    CFNCGS  H +KDC  PR+ A ++  R +Y       +  +   RY+      ++   +PG +  E +  LG+ +   PP++ RMR
Subjt:  GQKIIDDASR----CFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNS--TRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMR

Query:  ELGYPPGYLDPDDEDQPSGITIF--ADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIP
        +LGYPPG+L  + E + SG+ ++   D    E E GEI + +         V +PG N   P
Subjt:  ELGYPPGYLDPDDEDQPSGITIF--ADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIP

Q6DD45 Zinc finger CCHC domain-containing protein 8 | e value: 3.3e-16 | %identity: 31.75
Query:  GQKIIDDASR----CFNCGSYNHSLKDCRKPRDNAAVNSARNKY-RRHHNSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRE
        GQ+I   A R    CFNCGS  H ++DC KPRD A +N  R ++      +G++N  RY+      ++   +PG +  E ++ LG+ + + PP++ RMRE
Subjt:  GQKIIDDASR----CFNCGSYNHSLKDCRKPRDNAAVNSARNKY-RRHHNSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRE

Query:  LGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQK-----KMTVEFPGINAPIPENADERLWAAEPSSSGLPRSRSHQR
        LGYPPG+L  + E + SG++++  ++  +  DGEI + +    +         V +PG N   P +  +  W    S   +P  ++HQ+
Subjt:  LGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQK-----KMTVEFPGINAPIPENADERLWAAEPSSSGLPRSRSHQR

Q6NZY4 Zinc finger CCHC domain-containing protein 8 | e value: 1.7e-12 | %identity: 32.72
Query:  GQKIIDDASR----CFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNS--TRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMR
        GQ+I   A R    CFNCGS  H +KDC  PR+ A ++  R +Y       +  +   RY+      ++   +PG +  E +  LG+ +   PP++ RMR
Subjt:  GQKIIDDASR----CFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNS--TRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMR

Query:  ELGYPPGYLDPDDEDQPSGITIF--ADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIP
        +LGYPPG+L  + E + SG+ ++   D    E E GEI + +         V +PG N   P
Subjt:  ELGYPPGYLDPDDEDQPSGITIF--ADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIP

Q9CYA6 Zinc finger CCHC domain-containing protein 8 | e value: 1.0e-12 | %identity: 33.78
Query:  CFNCGSYNHSLKDCRKPRDNAAVNSARNKYRR--HHNSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDE
        CFNCGS  H +K+C  PR+ A ++  R +Y       SG     RY+      ++   +PG +  E +  LG+ +   PP++ RMR+LGYPPG+L  + E
Subjt:  CFNCGSYNHSLKDCRKPRDNAAVNSARNKYRR--HHNSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDE

Query:  DQPSGITIF--ADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIP
         + SG+ ++   D+   E E GEI          K+ V +PG N   P
Subjt:  DQPSGITIF--ADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIP

Arabidopsis top hits (e value, %identity, alignment)
AT1G67210.1 Proline-rich spliceosome-associated (PSP) family protein / zinc knuckle (CCHC-type) family protein | e value: 3.3e-88 | %identity: 55.1
Query:  SGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS-EQQQNFIPLD
        SGVKR R    E+QPSVH+ Y  LTR SKQKL+ LL+QWSEW A+Q SL++D  + + LE+G+ET+FPAL VG +K S+V+FW D Q      +  +P++
Subjt:  SGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS-EQQQNFIPLD

Query:  DNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNS--GSRNSTRYYQNSRGGKYDDLRPGALD
         ++ PLY+RGFT+GL S   S+NVEGG +IIDD  RCFNCG+Y+HS+++C +P D +AV++AR +++R  N   GSR  +RYYQ+ + GKYD L+PG+LD
Subjt:  DNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNS--GSRNSTRYYQNSRGGKYDDLRPGALD

Query:  AETRQLLGLKELDPPPWLNRMRELGYPPGYLD-PDDEDQPSGITIFADEKTDEQ-----EDGEITE-PEYRKPQKKMTVEFPGINAPIPENADERLWAAE
        AETR+LLGLKELDPPPWLNRMRE+GYPPGY    +D+D  S ITIF +E+T E+     E+GEI E    ++P+K MTV FPGINAPIPENAD  LW   
Subjt:  AETRQLLGLKELDPPPWLNRMRELGYPPGYLD-PDDEDQPSGITIFADEKTDEQ-----EDGEITE-PEYRKPQKKMTVEFPGINAPIPENADERLWAAE

Query:  PSSSGLPRSRSHQR
         S++G     +H R
Subjt:  PSSSGLPRSRSHQR

AT1G67210.2 Proline-rich spliceosome-associated (PSP) family protein / zinc knuckle (CCHC-type) family protein | e value: 1.8e-89 | %identity: 55.59
Query:  SGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS-EQQQNFIPLD
        SGVKR R    E+QPSVH+ Y  LTR SKQKL+ LL+QWSEW A+Q SL++D  + + LE+G+ET+FPAL VG +K S+V+FW D Q      +  +P++
Subjt:  SGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFWIDNQRS-EQQQNFIPLD

Query:  DNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNS--GSRNSTRYYQNSRGGKYDDLRPGALD
         ++ PLY+RGFT+GL S   S+NVEGG +IIDD  RCFNCG+Y+HS+++C +P D +AV++AR +++R  N   GSR  +RYYQ+ + GKYD L+PG+LD
Subjt:  DNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNS--GSRNSTRYYQNSRGGKYDDLRPGALD

Query:  AETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQ-----EDGEITE-PEYRKPQKKMTVEFPGINAPIPENADERLWAAEP
        AETR+LLGLKELDPPPWLNRMRE+GYPPGY + DD+D  S ITIF +E+T E+     E+GEI E    ++P+K MTV FPGINAPIPENAD  LW    
Subjt:  AETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQ-----EDGEITE-PEYRKPQKKMTVEFPGINAPIPENADERLWAAEP

Query:  SSSGLPRSRSHQR
        S++G     +H R
Subjt:  SSSGLPRSRSHQR

AT3G02420.1 unknown protein | e value: 3.1e-150 | %identity: 76.15
Query:  MGEEREDSQRLKRAAAAAYDYENDPRWADYWSNILIPPNMASRPDVVDHYKRKFYHRYIDAELVVEAMS-SSSSTQSSRPSA--ASSTAPPPTNDRSRSR
        M E  EDSQRLK+ AAAA+DYEND RWADYWSNILIPP+MASRP+VVDH+KRKFY RYID +LVVE MS SSSS+QS+RP+A  ASSTA    N++ RSR
Subjt:  MGEEREDSQRLKRAAAAAYDYENDPRWADYWSNILIPPNMASRPDVVDHYKRKFYHRYIDAELVVEAMS-SSSSTQSSRPSA--ASSTAPPPTNDRSRSR

Query:  SSGSTTRAAGSSASADPNSTPLRWDRQTIQFSVNAWVFIVAVLAIFPLIPKNLSQRAYRLSFMGTTCSSLYSLYSLYGKPRAWNLQALQVYFQSIIATKD
        +SGS  R +G SA+     + +RWD QTIQFSVNAWVF++AVLA+ PLIPKNLS RAYRLSFMGT CSSLYSLYSLYG+PRAWN+Q LQVYFQSI+A KD
Subjt:  SSGSTTRAAGSSASADPNSTPLRWDRQTIQFSVNAWVFIVAVLAIFPLIPKNLSQRAYRLSFMGTTCSSLYSLYSLYGKPRAWNLQALQVYFQSIIATKD

Query:  FIYFTYCITFVTSNICLKFALIPILCRALEHVAKFLRRNFARSSLYRKYLEEPCVWVESNSTTLSILSSQAEIGLGFILIISLLSWQRNFLHTFMYWQLL
        FIYF YC+TFVTS++CLKFALIPILCRALE VAKFLRRNF RS++YRKYLE+PCVWVESN+TTL+ILSSQAEI +GF+LIISLLSWQRN + TFMYWQLL
Subjt:  FIYFTYCITFVTSNICLKFALIPILCRALEHVAKFLRRNFARSSLYRKYLEEPCVWVESNSTTLSILSSQAEIGLGFILIISLLSWQRNFLHTFMYWQLL

Query:  KLMYHAPVTSGYHRSAWSNIGRIVSPLIYRYAPFLNTPLSMAQRWWFR
        KLMY APVT+GYH+S WS IGR V+P+I RYAPFLNTP+S  QRWWFR
Subjt:  KLMYHAPVTSGYHRSAWSNIGRIVSPLIYRYAPFLNTPLSMAQRWWFR

AT5G38600.1 Proline-rich spliceosome-associated (PSP) family protein / zinc knuckle (CCHC-type) family protein | e value: 6.1e-98 | %identity: 45.81
Query:  MEIEDLNNLPDFSKTRSRSENNKIPSEAEYLPVNS---TDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGF
        ME ED+ ++P  S   S  + N + S       NS    DEN++ +       +L    EN+  V  +  G+ L               T  VS  FN  
Subjt:  MEIEDLNNLPDFSKTRSRSENNKIPSEAEYLPVNS---TDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGF

Query:  NSGVPTPTENGLATSHHGGPSKIHKSDAISGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALC
           V    + G+        + +  S   +GVKRPR + DE+QP+VH+ Y  LTRASKQKL+ LL++WSEW A+  SLAQD  + +  ESGEET FPA+ 
Subjt:  NSGVPTPTENGLATSHHGGPSKIHKSDAISGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALC

Query:  VGTKKNSAVTFWIDNQRSEQQ-QNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKII-DDASRCFNCGSYNHSLKDCRKPRDNAAVNSAR--NKYRR
        VG +K S+V+FWIDNQ   +  ++F+ ++ ++ PLYDR F +GL SA+ S NVEGG +II DD  RCFNCG Y+HSL++C +P D +AVNSAR   K +R
Subjt:  VGTKKNSAVTFWIDNQRSEQQ-QNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKII-DDASRCFNCGSYNHSLKDCRKPRDNAAVNSAR--NKYRR

Query:  HHN-SGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADE----KTDEQEDGEITEPEYR-
        + N SG R  +RYYQ ++ GKYD L+PG LDAETRQLL L ELDPPPWLNRMRE+GYPPGYL P+D D  SGITIF +E    +  E EDGEI E     
Subjt:  HHN-SGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADE----KTDEQEDGEITEPEYR-

Query:  -KPQKKMTVEFPGINAPIPENADERLWAAEPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSR--DYRDDRPPGVDSVKSPPMPFTPRYGG-HDFSYD
         +PQ K TVEFPGINAP PENADE LW A PS     RS   Q                QQ+ SR  DYRDD P GV+     P  + PRYG  +D+ Y 
Subjt:  -KPQKKMTVEFPGINAPIPENADERLWAAEPSSSGLPRSRSHQRLNHHAEYDGRGNDHYQQRWSR--DYRDDRPPGVDSVKSPPMPFTPRYGG-HDFSYD

Query:  SQSSRGNFSASRSPHLGRVHSDRGRRSPLLDDDYSRY
        S       S SRSP + R  S+R +R      DYS Y
Subjt:  SQSSRGNFSASRSPHLGRVHSDRGRRSPLLDDDYSRY
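
The %identity reported for a hit can be roughly cross-checked against its alignment blocks. In BLAST-style pairwise output the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below is a minimal illustration, not the database's own method: the function name `block_identity` is hypothetical, and the short input strings are just the first eight columns of the AT5G38600.1 alignment above.

```python
def block_identity(query, midline):
    """Count identical columns in one alignment block.

    query   -- the residues of a 'Query:' line (gaps written as '-')
    midline -- the unlabeled middle line of the same block
    Returns (identical_columns, total_columns).
    """
    # Trailing spaces of the middle line are often lost when copying text,
    # so pad it back to the width of the query line.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline)
        if m.isalpha() and m == q  # a letter on the middle line marks an identity
    )
    return identical, len(query)


identical, total = block_identity("MEIEDLNN", "ME ED+ +")
print(f"{identical}/{total} identical columns ({100 * identical / total:.1f}%)")
```

To approximate the figure for a whole hit, sum the identical and total column counts over every block before dividing. Small differences from the reported value are possible, because the displayed blocks may not cover the full alignment.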


Sequences
CDS sequence
ATGGGAGAAGAACGAGAGGATTCTCAGAGGTTGAAGAGAGCAGCAGCAGCAGCTTATGACTACGAGAACGATCCCAGATGGGCCGATTACTGGTCCAACATTCTGATCCC
TCCTAACATGGCTTCTCGCCCCGATGTTGTTGACCACTACAAGCGCAAGTTCTACCACCGTTACATCGATGCCGAACTTGTGGTAGAGGCCATGTCTTCTAGTAGTTCAA
CTCAGTCATCTAGACCTTCAGCAGCATCTTCCACAGCACCCCCTCCTACTAATGATCGGAGTCGATCACGAAGCTCAGGGTCAACGACTAGAGCTGCAGGCTCATCTGCA
AGCGCAGATCCTAATTCAACTCCATTAAGATGGGATCGACAAACAATTCAGTTTTCTGTCAATGCATGGGTGTTTATTGTGGCTGTGCTGGCAATTTTCCCCCTAATACC
AAAAAATCTTTCACAGAGGGCATATAGGCTTTCTTTTATGGGCACAACTTGTTCTTCTTTATATTCTTTGTACTCGTTGTATGGGAAGCCCAGGGCGTGGAATTTGCAAG
CATTGCAAGTTTATTTCCAGTCCATAATTGCAACAAAAGATTTCATTTACTTCACTTACTGTATCACCTTTGTGACTTCAAATATTTGTCTAAAATTTGCTTTAATTCCT
ATACTGTGTCGGGCTCTTGAACATGTTGCAAAGTTTCTTAGGCGTAATTTTGCACGTTCGTCTTTATACAGGAAATATTTGGAAGAGCCTTGCGTATGGGTCGAGTCAAA
TTCAACTACTCTCAGCATCCTATCTTCGCAGGCTGAGATTGGACTTGGCTTCATTCTAATCATCTCTTTGCTCTCGTGGCAACGCAACTTCTTACATACATTCATGTATT
GGCAGCTGCTAAAGCTCATGTATCATGCTCCTGTCACTTCTGGGTATCATCGAAGCGCCTGGTCCAATATTGGGAGGATCGTTTCTCCGCTGATCTACCGTTATGCCCCA
TTCCTTAATACTCCTCTTTCAATGGCACAAAGATGGTGGTTCAGGTTTCTCATGGTGCTGGTTTATATGCCCTCTGTTTATGGTGATGGAGGCAAGAATTGTCCTTTGTA
TCCCCATTTTATGGGGACCGAGGATTTCATTGCACTGCCAGCTTCTGGTGATTCTGGAAATGAAAATGAGAGTAATGATTCTCTTAGTTGTAATGAACCAGGGGAAGCTT
ATTCTCAATCAAGTGTTTTGAAGTGCAAGGACAATGATGCAAGCATAGAGAAGGTTGAGCTTGCAGATGATGTACAGTTTGAAGATATGCATTGCATACCTCAGTCTGAC
CTTAATGATGAAACACAACGTTCTGATTCAGATATGGAAATTGAGGATTTGAATAACCTCCCAGATTTTAGTAAGACTAGAAGTAGAAGTGAGAATAATAAAATACCGAG
TGAAGCTGAATACCTGCCAGTCAACTCTACAGATGAGAACATACAACCAAGCAGAGAGCCCTTGCAGCAGAATGAACTTCATACGAGATATGAAAATGTTTGTCATGTTG
AAAGTAAAAATTTTGGAAAGGATTTGGTTGATAATTCATCCTTCTCGAAAACTGGTAGTCAATTGACTGTAACCAACAGTGTTTCAATTGAGTTCAACGGGTTTAACTCT
GGAGTGCCCACTCCCACTGAGAATGGCTTGGCCACATCCCATCATGGTGGCCCCAGTAAAATTCACAAGAGTGATGCAATATCAGGTGTCAAAAGACCAAGGATTACTAT
GGATGAGGAACAACCTTCAGTGCACATCGTGTATAATTCTTTAACCAGAGCTAGTAAACAAAAGCTCGATGAACTATTAAAGCAGTGGTCTGAGTGGCATGCTCAACAAG
GTTCTCTAGCTCAAGATGATAAAGAATCTGAAAATCTAGAATCTGGAGAAGAAACCTTCTTTCCTGCTCTATGTGTCGGCACAAAGAAGAATTCAGCAGTGACTTTCTGG
ATTGACAACCAAAGAAGTGAGCAGCAGCAAAATTTTATTCCTTTAGATGATAATTCTGTCCCACTATATGATCGGGGATTCACTTTGGGACTAACTTCAGCCAATGATTC
GAGTAATGTAGAAGGAGGCCAGAAGATAATTGATGATGCTAGCCGTTGTTTCAATTGTGGTTCTTACAATCATTCCTTAAAGGATTGCCGAAAGCCTCGAGATAATGCTG
CTGTTAATAGTGCTCGCAATAAGTATAGAAGACATCATAATTCTGGCTCCCGCAATTCAACTCGATATTATCAGAATTCACGTGGTGGGAAGTATGATGATTTGAGGCCA
GGAGCTCTTGATGCTGAAACACGGCAACTGTTAGGTCTCAAGGAGCTTGATCCGCCCCCTTGGCTAAACAGAATGAGAGAGCTGGGATACCCACCGGGATATTTAGATCC
GGATGATGAGGATCAACCATCAGGGATTACAATATTTGCTGATGAGAAAACTGACGAACAGGAAGATGGGGAAATTACTGAGCCAGAGTACCGTAAACCACAAAAGAAAA
TGACTGTTGAGTTTCCTGGCATAAATGCTCCAATCCCAGAAAACGCAGATGAAAGACTTTGGGCTGCTGAACCTTCAAGTTCAGGTCTTCCTAGAAGTCGTTCGCACCAG
CGCTTGAACCATCACGCAGAATATGATGGGAGGGGGAATGATCACTATCAACAACGATGGTCCCGGGATTACAGAGATGACAGACCTCCAGGTGTTGACTCAGTAAAAAG
TCCTCCCATGCCTTTCACTCCAAGGTATGGTGGTCATGATTTTAGTTATGACTCCCAAAGTTCAAGAGGTAATTTTTCAGCGTCAAGGAGCCCTCATTTGGGGAGGGTCC
ACTCAGATAGAGGTAGAAGAAGCCCATTGCTTGACGACGATTACTCAAGATATGGCTCTCACAGTTCTTCGCCGTTTTCACCACCAAGAAGACGCTGA
mRNA sequence
ATGGGAGAAGAACGAGAGGATTCTCAGAGGTTGAAGAGAGCAGCAGCAGCAGCTTATGACTACGAGAACGATCCCAGATGGGCCGATTACTGGTCCAACATTCTGATCCC
TCCTAACATGGCTTCTCGCCCCGATGTTGTTGACCACTACAAGCGCAAGTTCTACCACCGTTACATCGATGCCGAACTTGTGGTAGAGGCCATGTCTTCTAGTAGTTCAA
CTCAGTCATCTAGACCTTCAGCAGCATCTTCCACAGCACCCCCTCCTACTAATGATCGGAGTCGATCACGAAGCTCAGGGTCAACGACTAGAGCTGCAGGCTCATCTGCA
AGCGCAGATCCTAATTCAACTCCATTAAGATGGGATCGACAAACAATTCAGTTTTCTGTCAATGCATGGGTGTTTATTGTGGCTGTGCTGGCAATTTTCCCCCTAATACC
AAAAAATCTTTCACAGAGGGCATATAGGCTTTCTTTTATGGGCACAACTTGTTCTTCTTTATATTCTTTGTACTCGTTGTATGGGAAGCCCAGGGCGTGGAATTTGCAAG
CATTGCAAGTTTATTTCCAGTCCATAATTGCAACAAAAGATTTCATTTACTTCACTTACTGTATCACCTTTGTGACTTCAAATATTTGTCTAAAATTTGCTTTAATTCCT
ATACTGTGTCGGGCTCTTGAACATGTTGCAAAGTTTCTTAGGCGTAATTTTGCACGTTCGTCTTTATACAGGAAATATTTGGAAGAGCCTTGCGTATGGGTCGAGTCAAA
TTCAACTACTCTCAGCATCCTATCTTCGCAGGCTGAGATTGGACTTGGCTTCATTCTAATCATCTCTTTGCTCTCGTGGCAACGCAACTTCTTACATACATTCATGTATT
GGCAGCTGCTAAAGCTCATGTATCATGCTCCTGTCACTTCTGGGTATCATCGAAGCGCCTGGTCCAATATTGGGAGGATCGTTTCTCCGCTGATCTACCGTTATGCCCCA
TTCCTTAATACTCCTCTTTCAATGGCACAAAGATGGTGGTTCAGGTTTCTCATGGTGCTGGTTTATATGCCCTCTGTTTATGGTGATGGAGGCAAGAATTGTCCTTTGTA
TCCCCATTTTATGGGGACCGAGGATTTCATTGCACTGCCAGCTTCTGGTGATTCTGGAAATGAAAATGAGAGTAATGATTCTCTTAGTTGTAATGAACCAGGGGAAGCTT
ATTCTCAATCAAGTGTTTTGAAGTGCAAGGACAATGATGCAAGCATAGAGAAGGTTGAGCTTGCAGATGATGTACAGTTTGAAGATATGCATTGCATACCTCAGTCTGAC
CTTAATGATGAAACACAACGTTCTGATTCAGATATGGAAATTGAGGATTTGAATAACCTCCCAGATTTTAGTAAGACTAGAAGTAGAAGTGAGAATAATAAAATACCGAG
TGAAGCTGAATACCTGCCAGTCAACTCTACAGATGAGAACATACAACCAAGCAGAGAGCCCTTGCAGCAGAATGAACTTCATACGAGATATGAAAATGTTTGTCATGTTG
AAAGTAAAAATTTTGGAAAGGATTTGGTTGATAATTCATCCTTCTCGAAAACTGGTAGTCAATTGACTGTAACCAACAGTGTTTCAATTGAGTTCAACGGGTTTAACTCT
GGAGTGCCCACTCCCACTGAGAATGGCTTGGCCACATCCCATCATGGTGGCCCCAGTAAAATTCACAAGAGTGATGCAATATCAGGTGTCAAAAGACCAAGGATTACTAT
GGATGAGGAACAACCTTCAGTGCACATCGTGTATAATTCTTTAACCAGAGCTAGTAAACAAAAGCTCGATGAACTATTAAAGCAGTGGTCTGAGTGGCATGCTCAACAAG
GTTCTCTAGCTCAAGATGATAAAGAATCTGAAAATCTAGAATCTGGAGAAGAAACCTTCTTTCCTGCTCTATGTGTCGGCACAAAGAAGAATTCAGCAGTGACTTTCTGG
ATTGACAACCAAAGAAGTGAGCAGCAGCAAAATTTTATTCCTTTAGATGATAATTCTGTCCCACTATATGATCGGGGATTCACTTTGGGACTAACTTCAGCCAATGATTC
GAGTAATGTAGAAGGAGGCCAGAAGATAATTGATGATGCTAGCCGTTGTTTCAATTGTGGTTCTTACAATCATTCCTTAAAGGATTGCCGAAAGCCTCGAGATAATGCTG
CTGTTAATAGTGCTCGCAATAAGTATAGAAGACATCATAATTCTGGCTCCCGCAATTCAACTCGATATTATCAGAATTCACGTGGTGGGAAGTATGATGATTTGAGGCCA
GGAGCTCTTGATGCTGAAACACGGCAACTGTTAGGTCTCAAGGAGCTTGATCCGCCCCCTTGGCTAAACAGAATGAGAGAGCTGGGATACCCACCGGGATATTTAGATCC
GGATGATGAGGATCAACCATCAGGGATTACAATATTTGCTGATGAGAAAACTGACGAACAGGAAGATGGGGAAATTACTGAGCCAGAGTACCGTAAACCACAAAAGAAAA
TGACTGTTGAGTTTCCTGGCATAAATGCTCCAATCCCAGAAAACGCAGATGAAAGACTTTGGGCTGCTGAACCTTCAAGTTCAGGTCTTCCTAGAAGTCGTTCGCACCAG
CGCTTGAACCATCACGCAGAATATGATGGGAGGGGGAATGATCACTATCAACAACGATGGTCCCGGGATTACAGAGATGACAGACCTCCAGGTGTTGACTCAGTAAAAAG
TCCTCCCATGCCTTTCACTCCAAGGTATGGTGGTCATGATTTTAGTTATGACTCCCAAAGTTCAAGAGGTAATTTTTCAGCGTCAAGGAGCCCTCATTTGGGGAGGGTCC
ACTCAGATAGAGGTAGAAGAAGCCCATTGCTTGACGACGATTACTCAAGATATGGCTCTCACAGTTCTTCGCCGTTTTCACCACCAAGAAGACGCTGA
Protein sequence
MGEEREDSQRLKRAAAAAYDYENDPRWADYWSNILIPPNMASRPDVVDHYKRKFYHRYIDAELVVEAMSSSSSTQSSRPSAASSTAPPPTNDRSRSRSSGSTTRAAGSSA
SADPNSTPLRWDRQTIQFSVNAWVFIVAVLAIFPLIPKNLSQRAYRLSFMGTTCSSLYSLYSLYGKPRAWNLQALQVYFQSIIATKDFIYFTYCITFVTSNICLKFALIP
ILCRALEHVAKFLRRNFARSSLYRKYLEEPCVWVESNSTTLSILSSQAEIGLGFILIISLLSWQRNFLHTFMYWQLLKLMYHAPVTSGYHRSAWSNIGRIVSPLIYRYAP
FLNTPLSMAQRWWFRFLMVLVYMPSVYGDGGKNCPLYPHFMGTEDFIALPASGDSGNENESNDSLSCNEPGEAYSQSSVLKCKDNDASIEKVELADDVQFEDMHCIPQSD
LNDETQRSDSDMEIEDLNNLPDFSKTRSRSENNKIPSEAEYLPVNSTDENIQPSREPLQQNELHTRYENVCHVESKNFGKDLVDNSSFSKTGSQLTVTNSVSIEFNGFNS
GVPTPTENGLATSHHGGPSKIHKSDAISGVKRPRITMDEEQPSVHIVYNSLTRASKQKLDELLKQWSEWHAQQGSLAQDDKESENLESGEETFFPALCVGTKKNSAVTFW
IDNQRSEQQQNFIPLDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNSARNKYRRHHNSGSRNSTRYYQNSRGGKYDDLRP
GALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFADEKTDEQEDGEITEPEYRKPQKKMTVEFPGINAPIPENADERLWAAEPSSSGLPRSRSHQ
RLNHHAEYDGRGNDHYQQRWSRDYRDDRPPGVDSVKSPPMPFTPRYGGHDFSYDSQSSRGNFSASRSPHLGRVHSDRGRRSPLLDDDYSRYGSHSSSPFSPPRRR
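
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch under stated assumptions: the names `CODON_TABLE` and `translate` are illustrative, the standard (nuclear) code is assumed, and only the first and last few codons of the CDS are used as input here.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]  # KeyError on ambiguous bases such as N
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGGAGAAGAACGAGAGGATTCTCAGAGG"))  # first 10 codons of the CDS
print(translate("CCAAGAAGACGCTGA"))                 # last 5 codons, ending in TGA
```

The two fragments translate to `MGEEREDSQR` and `PRRR`, matching the start and end of the listed protein sequence. Passing the full wrapped CDS as a single string should reproduce the whole protein if the record is internally consistent.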