CuGenDBv2

Sgr023961 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023961
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: CCHC-type domain-containing protein
Genome location: tig00001047:1964308..1984833
RNA-Seq Expression: Sgr023961
Synteny: Sgr023961
Gene Ontology terms:
GO:0034470 - ncRNA processing (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0071013 - catalytic step 2 spliceosome (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR005344 - TMEM33/Pom33 family
IPR006568 - PSP, proline-rich
IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG7016075.1 Zinc finger CCHC domain-containing protein 8 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; % identity: 70.61)
Query:  MGEEREDLQRVKKAAAAAYDYENDPRWADYWSNILIPPHMASRPDVVDHYKRKFYQRYIVRTLSFNYRFGYRRFLGFRPFSRICALWWLMRIQVSSCIGF
        M EERED QR+K+AAAAAYDYENDP+WADYWSNILIPPHMASRPDVVDHYKRKFYQRYI                                         
Subjt:  MGEEREDLQRVKKAAAAAYDYENDPRWADYWSNILIPPHMASRPDVVDHYKRKFYQRYIVRTLSFNYRFGYRRFLGFRPFSRICALWWLMRIQVSSCIGF

Query:  KFLPYVALLGLLKQDPELVVEAMSSSSSTQSSRPSATSS-APPPTNDRSRSRSSGSTTRTSGTSASADPNPTPLRWDRQTIQFSVNAWVFIVAVLAIFPL
                      DP+LVVEAMSSSSSTQSSRPSATSS APPPTNDRSR RSSGSTTRTSGTSASAD NP+PLRWDRQTIQFSVNAWV IVAVLAIFPL
Subjt:  KFLPYVALLGLLKQDPELVVEAMSSSSSTQSSRPSATSS-APPPTNDRSRSRSSGSTTRTSGTSASADPNPTPLRWDRQTIQFSVNAWVFIVAVLAIFPL

Query:  IPKNLSQRAYRLSFMGTTCSSLYSLYSLYGKPRAWNLQALQVYFQSIIATKDFIYFTYCITFVTSNICLKFALIPILCRALEHVAKFLRRNFTRSSLYRK
        IPKNLSQRAYRLSFMG TCSSLYSLYSLYGKPRAWNLQALQ YFQSIIATKDFIYF YCITF+TSNICLKFALIPILCRALEHVAKFLRRNF RSSLYRK
Subjt:  IPKNLSQRAYRLSFMGTTCSSLYSLYSLYGKPRAWNLQALQVYFQSIIATKDFIYFTYCITFVTSNICLKFALIPILCRALEHVAKFLRRNFTRSSLYRK

Query:  YLEEPCVWVESNSTTLSILSSQAEIGLGFILVISLLSWQRNFLHTFMYWQLLKLMYHAPVTSGYHQSAWSNIGRVVSPLIYRYAPFLNTPLSMAQRWWFS
        YLEEPCVWVESNSTTLSILSSQAEIGLGF+L+ISLLS             LLKLMYHAPVTSGYH+SAWSNIGR VSPLIYRYAPFLNTPLSMAQRWWFS
Subjt:  YLEEPCVWVESNSTTLSILSSQAEIGLGFILVISLLSWQRNFLHTFMYWQLLKLMYHAPVTSGYHQSAWSNIGRVVSPLIYRYAPFLNTPLSMAQRWWFS

Query:  YLVDFLLHHCP----------------------------------------------LYVHFMGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSS
          V   L+                                                  + HFMGTEDFIALPASGD G++NE+NE LSF+ETREA SQSS
Subjt:  YLVDFLLHHCP----------------------------------------------LYVHFMGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSS

Query:  VLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRSENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYE
        VL+CKDN ASIEK ELAD VQ EDM CI QSDLNDETQ  +SDMEIEDLNNLPD SK RS SEN +I SEAEYLPVNS DENI PS EPL+QNEL++R E
Subjt:  VLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRSENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYE

Query:  DVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSA-TFHHGSPIKNHKNDAISGVKRPRTTMDEQQPSVHVVYSSLTRASKQKL
        DV H  S   +KDLVDNSSF KT   LT+            +GVSIENGSA + HHG P K HK+DAI GVK+PR  MDEQQPSVH+VY+SLTR SKQKL
Subjt:  DVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSA-TFHHGSPIKNHKNDAISGVKRPRTTMDEQQPSVHVVYSSLTRASKQKL

Query:  DELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAVKRTAAKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCG
        DELLKQWSEW+AQRGS SQ             TF+          +        F+P+DDN VP YDRGFTLGLTSANDSSN EGGQKIIDDASRCFNCG
Subjt:  DELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAVKRTAAKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCG

Query:  SYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSG
        SYNHSLKDC KPRDNAAVNNAR  Y  K+H +SGSR+STRYYQ SRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMR LGYPPGYLDP+DEDQPSG
Subjt:  SYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSG

Query:  ITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAEPSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPV
        ITIYADE+  EQEDGEITE EY KP +KMSVEFPGINAPIPE+ADERLW+AEP SS L R RS QRLNH+ EHDGRGNDH+ QRWSRDY+D RPPGVD V
Subjt:  ITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAEPSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPV

Query:  KSPPVPYASRYGGYDFNYDFQSPR
        KSP + +  RYGG++ +Y  +SPR
Subjt:  KSPPVPYASRYGGYDFNYDFQSPR

XP_008459422.1 PREDICTED: uncharacterized protein LOC103498564 isoform X1 [Cucumis melo] (e value: 3.0e-248; % identity: 78.4)
Query:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS
        MGTEDFIALPASGDSGN+ E+NE L+F+ETREAYSQSSVLKCKD+DASIEK EL D VQ EDMHC+ QSDL DETQ  DSDMEIEDLNNLPDFSK RSRS
Subjt:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS

Query:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSAT-FHHGSPIKN
        EN++I S+AE LPVNSAD NI PS EPL+QNEL+ RYEDV HV S NF+KDLVDNSSF KTG +LTV   V I+FN  NSG  +ENGSAT  HHG P K 
Subjt:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSAT-FHHGSPIKN

Query:  HKNDAISGVKRPR---TTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRT
         K+D ISGVKRPR     MDEQQPSVH+VY+SLTR SKQKLDELLKQWSEWHAQ+GSLS+D K++ENLESGEETFFPALCVGT+KTSAV           
Subjt:  HKNDAISGVKRPR---TTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRT

Query:  AAKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYD
           F+P+DDN VP YDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDC KPRDNAAVNNAR +YK K+H NS SR+STRYYQNSRGGKYD
Subjt:  AAKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYD

Query:  DLRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAA
        DLRPG LDAETRQLLGLKELDPPPWLNRMR LGYPPGYLDP+DEDQPSGITIYADE+ +EQEDGEITE EY KP++KMSVEFPGINAPIPENADERLWA 
Subjt:  DLRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAA

Query:  EPSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG
        EPSSSGL R+RS+QRLNHY E+D RGNDH+ QRWSRDY+DDRPPGVD +KSPP  +  RYGG+DF+YD Q+PRG
Subjt:  EPSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG

XP_008459423.1 PREDICTED: uncharacterized protein LOC103498564 isoform X2 [Cucumis melo] (e value: 3.1e-245; % identity: 77.84)
Query:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS
        MGTEDFIALPASGDSGN+ E+NE L+F+ETREAYSQSSVLKCKD+DASIEK EL D VQ EDMHC+ QSDL DETQ  DSDMEIEDLNNLPDFSK RSRS
Subjt:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS

Query:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNH
        EN++I S+AE LPVNSAD NI PS EPL+QNEL+ RYEDV HV S NF+KDLVDNSSF KTG +LTV   V I+FN  NSG  +ENGSAT HH      H
Subjt:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNH

Query:  KNDAISGVKRPR---TTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTA
            ISGVKRPR     MDEQQPSVH+VY+SLTR SKQKLDELLKQWSEWHAQ+GSLS+D K++ENLESGEETFFPALCVGT+KTSAV            
Subjt:  KNDAISGVKRPR---TTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTA

Query:  AKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDD
          F+P+DDN VP YDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDC KPRDNAAVNNAR +YK K+H NS SR+STRYYQNSRGGKYDD
Subjt:  AKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDD

Query:  LRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAE
        LRPG LDAETRQLLGLKELDPPPWLNRMR LGYPPGYLDP+DEDQPSGITIYADE+ +EQEDGEITE EY KP++KMSVEFPGINAPIPENADERLWA E
Subjt:  LRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAE

Query:  PSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG
        PSSSGL R+RS+QRLNHY E+D RGNDH+ QRWSRDY+DDRPPGVD +KSPP  +  RYGG+DF+YD Q+PRG
Subjt:  PSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG

XP_022133813.1 uncharacterized protein LOC111006283 [Momordica charantia] (e value: 4.2e-266; % identity: 82.57)
Query:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS
        M TEDFIALPASGDSGN+NENNE LS HETRE  SQSSVLKCKD+DASIEK ELAD VQF+DM CI QSDLNDE Q  DSDMEIEDLNNLPDF+K RSRS
Subjt:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS

Query:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNH
        ENN+I +EA+YLPVNSA ENIQPSREPL+QNEL+MRYE+V HV S NFE DLVDNSSFLKTG++LTVT  V IE+N FNSGV IENGSAT +HG+ IK+H
Subjt:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNH

Query:  KNDAISGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTAAKF
        K+DAISGVKRPR  MDEQQPSVHV+YSSLTRASKQKLDELLKQWSEWHAQ+G LSQD KESENLESGEETFFPALC+GT+K+SAV        +     F
Subjt:  KNDAISGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTAAKF

Query:  IPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRP
        IPLDDN VPRYDRGFTLGLTSAND+SNVEGGQKIIDDASRCFNCGSYNH+L+DC KPRDN AVNNAR +YKSKRHQNSGSR+STRYYQNSRGGKYDDLRP
Subjt:  IPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRP

Query:  GALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAEPSS
        GALDAETRQLLGLKELDPPPWLNRMR LGYPPGYLDPDDEDQPSGITI+ DEE NEQEDGEITE EY KPRRK SVEFPGINAPIPENADE LWAAEPSS
Subjt:  GALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAEPSS

Query:  SGLSRSRSHQRLNHYAEHDGRGNDHYQRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPR
        SGL RSRSHQRLNH+AE+DGRGND YQRW RDY+DD PPGVD VKSPP+ Y  RYG YDFN+D QS R
Subjt:  SGLSRSRSHQRLNHYAEHDGRGNDHYQRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPR

XP_038890370.1 uncharacterized protein LOC120079961 [Benincasa hispida] (e value: 4.9e-251; % identity: 80.21)
Query:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS
        MGTEDFIALPASGDSGN+ E+NE LSFHETR+  SQSSVLKCKD+DAS EKVELAD V  EDMH I QSDL DETQ  DSDMEIEDLNNLPDF+K RSRS
Subjt:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS

Query:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFH-HGSPIKN
        ENN+I SEAEYLPVNSADENI PS EPL+QNEL+ RYEDV HV S NF+KDLVDNSSFLKTG +LTVT  V IEFNR NSGV IENG A+ H HG P K 
Subjt:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFH-HGSPIKN

Query:  HKNDAISGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTAAK
        HK+DAISGVKRPR  MDEQQPSVH+VY+SLTR SKQKLDELLKQWSEWHAQRGSLS D K+SENLESGEETFFPALCVGT+KTSAV              
Subjt:  HKNDAISGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTAAK

Query:  FIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLR
        F+P+DDN VP YDRGFTLGLTSA+DSSNVEGGQKIIDDASRCFNCGSYNHSLKDC KPRDNAAVNNAR +YK K+H +SGSR+STRYYQNSRGGKYDDLR
Subjt:  FIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLR

Query:  PGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAEPS
        PGALD ETRQLLGLKELDPPPWLNRMR LGYPPGYLDP+DEDQPSGITIYADE+ +EQEDGEITE EY KPR+KMSV FPGINAPIPENADERLWA EPS
Subjt:  PGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAEPS

Query:  SSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG
        S GL R+RS+QRLNHY E+D RGNDH+ QRWSRDY+DDRPPGVD VKSPP  +  RYG +DF+YD Q+PRG
Subjt:  SSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3CAN3 uncharacterized protein LOC103498564 isoform X1 (e value: 1.5e-248; % identity: 78.4)
Query:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS
        MGTEDFIALPASGDSGN+ E+NE L+F+ETREAYSQSSVLKCKD+DASIEK EL D VQ EDMHC+ QSDL DETQ  DSDMEIEDLNNLPDFSK RSRS
Subjt:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS

Query:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSAT-FHHGSPIKN
        EN++I S+AE LPVNSAD NI PS EPL+QNEL+ RYEDV HV S NF+KDLVDNSSF KTG +LTV   V I+FN  NSG  +ENGSAT  HHG P K 
Subjt:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSAT-FHHGSPIKN

Query:  HKNDAISGVKRPR---TTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRT
         K+D ISGVKRPR     MDEQQPSVH+VY+SLTR SKQKLDELLKQWSEWHAQ+GSLS+D K++ENLESGEETFFPALCVGT+KTSAV           
Subjt:  HKNDAISGVKRPR---TTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRT

Query:  AAKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYD
           F+P+DDN VP YDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDC KPRDNAAVNNAR +YK K+H NS SR+STRYYQNSRGGKYD
Subjt:  AAKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYD

Query:  DLRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAA
        DLRPG LDAETRQLLGLKELDPPPWLNRMR LGYPPGYLDP+DEDQPSGITIYADE+ +EQEDGEITE EY KP++KMSVEFPGINAPIPENADERLWA 
Subjt:  DLRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAA

Query:  EPSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG
        EPSSSGL R+RS+QRLNHY E+D RGNDH+ QRWSRDY+DDRPPGVD +KSPP  +  RYGG+DF+YD Q+PRG
Subjt:  EPSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG

A0A1S3CBD3 uncharacterized protein LOC103498564 isoform X2 (e value: 1.5e-245; % identity: 77.84)
Query:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS
        MGTEDFIALPASGDSGN+ E+NE L+F+ETREAYSQSSVLKCKD+DASIEK EL D VQ EDMHC+ QSDL DETQ  DSDMEIEDLNNLPDFSK RSRS
Subjt:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS

Query:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNH
        EN++I S+AE LPVNSAD NI PS EPL+QNEL+ RYEDV HV S NF+KDLVDNSSF KTG +LTV   V I+FN  NSG  +ENGSAT HH      H
Subjt:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNH

Query:  KNDAISGVKRPR---TTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTA
            ISGVKRPR     MDEQQPSVH+VY+SLTR SKQKLDELLKQWSEWHAQ+GSLS+D K++ENLESGEETFFPALCVGT+KTSAV            
Subjt:  KNDAISGVKRPR---TTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTA

Query:  AKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDD
          F+P+DDN VP YDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDC KPRDNAAVNNAR +YK K+H NS SR+STRYYQNSRGGKYDD
Subjt:  AKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDD

Query:  LRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAE
        LRPG LDAETRQLLGLKELDPPPWLNRMR LGYPPGYLDP+DEDQPSGITIYADE+ +EQEDGEITE EY KP++KMSVEFPGINAPIPENADERLWA E
Subjt:  LRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAE

Query:  PSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG
        PSSSGL R+RS+QRLNHY E+D RGNDH+ QRWSRDY+DDRPPGVD +KSPP  +  RYGG+DF+YD Q+PRG
Subjt:  PSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG

A0A5D3BMZ1 Zinc finger CCHC domain-containing protein 8 isoform X2 (e value: 1.5e-245; % identity: 77.84)
Query:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS
        MGTEDFIALPASGDSGN+ E+NE L+F+ETREAYSQSSVLKCKD+DASIEK EL D VQ EDMHC+ QSDL DETQ  DSDMEIEDLNNLPDFSK RSRS
Subjt:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS

Query:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNH
        EN++I S+AE LPVNSAD NI PS EPL+QNEL+ RYEDV HV S NF+KDLVDNSSF KTG +LTV   V I+FN  NSG  +ENGSAT HH      H
Subjt:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNH

Query:  KNDAISGVKRPR---TTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTA
            ISGVKRPR     MDEQQPSVH+VY+SLTR SKQKLDELLKQWSEWHAQ+GSLS+D K++ENLESGEETFFPALCVGT+KTSAV            
Subjt:  KNDAISGVKRPR---TTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTA

Query:  AKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDD
          F+P+DDN VP YDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDC KPRDNAAVNNAR +YK K+H NS SR+STRYYQNSRGGKYDD
Subjt:  AKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDD

Query:  LRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAE
        LRPG LDAETRQLLGLKELDPPPWLNRMR LGYPPGYLDP+DEDQPSGITIYADE+ +EQEDGEITE EY KP++KMSVEFPGINAPIPENADERLWA E
Subjt:  LRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAE

Query:  PSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG
        PSSSGL R+RS+QRLNHY E+D RGNDH+ QRWSRDY+DDRPPGVD +KSPP  +  RYGG+DF+YD Q+PRG
Subjt:  PSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG

A0A6J1BX16 uncharacterized protein LOC111006283 (e value: 2.0e-266; % identity: 82.57)
Query:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS
        M TEDFIALPASGDSGN+NENNE LS HETRE  SQSSVLKCKD+DASIEK ELAD VQF+DM CI QSDLNDE Q  DSDMEIEDLNNLPDF+K RSRS
Subjt:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS

Query:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNH
        ENN+I +EA+YLPVNSA ENIQPSREPL+QNEL+MRYE+V HV S NFE DLVDNSSFLKTG++LTVT  V IE+N FNSGV IENGSAT +HG+ IK+H
Subjt:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNH

Query:  KNDAISGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTAAKF
        K+DAISGVKRPR  MDEQQPSVHV+YSSLTRASKQKLDELLKQWSEWHAQ+G LSQD KESENLESGEETFFPALC+GT+K+SAV        +     F
Subjt:  KNDAISGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTAAKF

Query:  IPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRP
        IPLDDN VPRYDRGFTLGLTSAND+SNVEGGQKIIDDASRCFNCGSYNH+L+DC KPRDN AVNNAR +YKSKRHQNSGSR+STRYYQNSRGGKYDDLRP
Subjt:  IPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRP

Query:  GALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAEPSS
        GALDAETRQLLGLKELDPPPWLNRMR LGYPPGYLDPDDEDQPSGITI+ DEE NEQEDGEITE EY KPRRK SVEFPGINAPIPENADE LWAAEPSS
Subjt:  GALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAEPSS

Query:  SGLSRSRSHQRLNHYAEHDGRGNDHYQRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPR
        SGL RSRSHQRLNH+AE+DGRGND YQRW RDY+DD PPGVD VKSPP+ Y  RYG YDFN+D QS R
Subjt:  SGLSRSRSHQRLNHYAEHDGRGNDHYQRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPR

E5GCT2 Nucleic acid binding protein (e value: 1.5e-245; % identity: 77.84)
Query:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS
        MGTEDFIALPASGDSGN+ E+NE L+F+ETREAYSQSSVLKCKD+DASIEK EL D VQ EDMHC+ QSDL DETQ  DSDMEIEDLNNLPDFSK RSRS
Subjt:  MGTEDFIALPASGDSGNDNENNELLSFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRS

Query:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNH
        EN++I S+AE LPVNSAD NI PS EPL+QNEL+ RYEDV HV S NF+KDLVDNSSF KTG +LTV   V I+FN  NSG  +ENGSAT HH      H
Subjt:  ENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNH

Query:  KNDAISGVKRPR---TTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTA
            ISGVKRPR     MDEQQPSVH+VY+SLTR SKQKLDELLKQWSEWHAQ+GSLS+D K++ENLESGEETFFPALCVGT+KTSAV            
Subjt:  KNDAISGVKRPR---TTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAV--------KRTA

Query:  AKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDD
          F+P+DDN VP YDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDC KPRDNAAVNNAR +YK K+H NS SR+STRYYQNSRGGKYDD
Subjt:  AKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDD

Query:  LRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAE
        LRPG LDAETRQLLGLKELDPPPWLNRMR LGYPPGYLDP+DEDQPSGITIYADE+ +EQEDGEITE EY KP++KMSVEFPGINAPIPENADERLWA E
Subjt:  LRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIPENADERLWAAE

Query:  PSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG
        PSSSGL R+RS+QRLNHY E+D RGNDH+ QRWSRDY+DDRPPGVD +KSPP  +  RYGG+DF+YD Q+PRG
Subjt:  PSSSGLSRSRSHQRLNHYAEHDGRGNDHY-QRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRG

SwissProt top hits (columns: e value, % identity, alignment)
Q5F3D1 Zinc finger CCHC domain-containing protein 8 (e value: 6.1e-18; % identity: 31.28)
Query:  VPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALDAET
        +P+Y + F+  L+       V+  +      S CFNCGS  H +KDCPKPR+ A ++  RKE+     + S      RY+      ++   +PG +  E 
Subjt:  VPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALDAET

Query:  RQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIP
        +  LG+     PP++ RMR LGYPPG+L  + E + SG+ +Y  ++ NE ED    + ++        + +PG N   P
Subjt:  RQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIP

Q5R789 Zinc finger CCHC domain-containing protein 8 (e value: 7.4e-16; % identity: 35.19)
Query:  GQKIIDDASR----CFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMR
        GQ+I   A R    CFNCGS  H +KDCP PR+ A ++  RKEY     + +      RY+      ++   +PG +  E +  LG+ +   PP++ RMR
Subjt:  GQKIIDDASR----CFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMR

Query:  ALGYPPGYLDPDDEDQPSGITIY--ADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIP
         LGYPPG+L  + E + SG+ +Y   D    E E GEI + +         V +PG N   P
Subjt:  ALGYPPGYLDPDDEDQPSGITIY--ADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIP

Q6DD45 Zinc finger CCHC domain-containing protein 8 (e value: 1.6e-18; % identity: 32.98)
Query:  VPRYDRGFTLGLTSANDSSNVEGGQKIIDDASR----CFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGAL
        +P+Y + FT  ++          GQ+I   A R    CFNCGS  H ++DCPKPRD A +N  RKE+     + +G+++  RY+      ++   +PG +
Subjt:  VPRYDRGFTLGLTSANDSSNVEGGQKIIDDASR----CFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGAL

Query:  DAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRR-----KMSVEFPGINAPIP
          E ++ LG+ + + PP++ RMR LGYPPG+L  + E + SG+++Y  +E  +  DGEI + +    +         V +PG N   P
Subjt:  DAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRR-----KMSVEFPGINAPIP

Q6NZY4 Zinc finger CCHC domain-containing protein 8 (e value: 7.4e-16; % identity: 35.19)
Query:  GQKIIDDASR----CFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMR
        GQ+I   A R    CFNCGS  H +KDCP PR+ A ++  RKEY     + +      RY+      ++   +PG +  E +  LG+ +   PP++ RMR
Subjt:  GQKIIDDASR----CFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMR

Query:  ALGYPPGYLDPDDEDQPSGITIY--ADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIP
         LGYPPG+L  + E + SG+ +Y   D    E E GEI + +         V +PG N   P
Subjt:  ALGYPPGYLDPDDEDQPSGITIY--ADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIP

Q9CYA6 Zinc finger CCHC domain-containing protein 8 (e value: 1.5e-16; % identity: 36.49)
Query:  CFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDE
        CFNCGS  H +K+CP PR+ A ++  RKEY     + SG     RY+      ++   +PG +  E +  LG+ +   PP++ RMR LGYPPG+L  + E
Subjt:  CFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDE

Query:  DQPSGITIY--ADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIP
         + SG+ +Y   D+   E E GEI          K+ V +PG N   P
Subjt:  DQPSGITIY--ADEEPNEQEDGEITEVEYPKPRRKMSVEFPGINAPIP

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G67210.1 Proline-rich spliceosome-associated (PSP) family protein / zinc knuckle (CCHC-type) family protein (e value: 6.7e-89; % identity: 55.73)
Query:  SGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAVK---------RTAAKFIPLD
        SGVKR RT   EQQPSVHV Y  LTR SKQKL+ LL+QWSEW A++ SLS+D  + + LE+G+ET+FPAL VG +KTS+V           ++ K +P++
Subjt:  SGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAVK---------RTAAKFIPLD

Query:  DNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALD
         +  P Y+RGFT+GL S   S+NVEGG +IIDD  RCFNCG+Y+HS+++CP+P D +AV+NAR+++K KR+Q  GSR  +RYYQ+ + GKYD L+PG+LD
Subjt:  DNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALD

Query:  AETRQLLGLKELDPPPWLNRMRALGYPPGYLD-PDDEDQPSGITIYADEEPNEQ-----EDGEITEVEYP-KPRRKMSVEFPGINAPIPENADERLWAAE
        AETR+LLGLKELDPPPWLNRMR +GYPPGY    +D+D  S ITI+ +EE  E+     E+GEI E   P +PR+ M+V FPGINAPIPENAD  LW   
Subjt:  AETRQLLGLKELDPPPWLNRMRALGYPPGYLD-PDDEDQPSGITIYADEEPNEQ-----EDGEITEVEYP-KPRRKMSVEFPGINAPIPENADERLWAAE

Query:  PSSSGLSRSRSHQR
         S++G +   +H R
Subjt:  PSSSGLSRSRSHQR

AT1G67210.2 Proline-rich spliceosome-associated (PSP) family protein / zinc knuckle (CCHC-type) family protein (e value: 4.7e-90; % identity: 56.23)
Query:  SGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAVK---------RTAAKFIPLD
        SGVKR RT   EQQPSVHV Y  LTR SKQKL+ LL+QWSEW A++ SLS+D  + + LE+G+ET+FPAL VG +KTS+V           ++ K +P++
Subjt:  SGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAVK---------RTAAKFIPLD

Query:  DNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALD
         +  P Y+RGFT+GL S   S+NVEGG +IIDD  RCFNCG+Y+HS+++CP+P D +AV+NAR+++K KR+Q  GSR  +RYYQ+ + GKYD L+PG+LD
Subjt:  DNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALD

Query:  AETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQ-----EDGEITEVEYP-KPRRKMSVEFPGINAPIPENADERLWAAEP
        AETR+LLGLKELDPPPWLNRMR +GYPPGY + DD+D  S ITI+ +EE  E+     E+GEI E   P +PR+ M+V FPGINAPIPENAD  LW    
Subjt:  AETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQ-----EDGEITEVEYP-KPRRKMSVEFPGINAPIPENADERLWAAEP

Query:  SSSGLSRSRSHQR
        S++G +   +H R
Subjt:  SSSGLSRSRSHQR

AT3G02420.1 unknown protein (e value: 1.1e-144; % identity: 66.42)
Query:  MGEEREDLQRVKKAAAAAYDYENDPRWADYWSNILIPPHMASRPDVVDHYKRKFYQRYIVRTLSFNYRFGYRRFLGFRPFSRICALWWLMRIQVSSCIGF
        M E  ED QR+KK AAAA+DYEND RWADYWSNILIPPHMASRP+VVDH+KRKFYQRYI                                         
Subjt:  MGEEREDLQRVKKAAAAAYDYENDPRWADYWSNILIPPHMASRPDVVDHYKRKFYQRYIVRTLSFNYRFGYRRFLGFRPFSRICALWWLMRIQVSSCIGF

Query:  KFLPYVALLGLLKQDPELVVEAMS-SSSSTQSSRPSAT---SSAPPPTNDRSRSRSSGSTTRTSGTSASADPNPTPLRWDRQTIQFSVNAWVFIVAVLAI
                      DP+LVVE MS SSSS+QS+RP+AT   S+A    N++ RSR+SGS  RTSG SA+    P+ +RWD QTIQFSVNAWVF++AVLA+
Subjt:  KFLPYVALLGLLKQDPELVVEAMS-SSSSTQSSRPSAT---SSAPPPTNDRSRSRSSGSTTRTSGTSASADPNPTPLRWDRQTIQFSVNAWVFIVAVLAI

Query:  FPLIPKNLSQRAYRLSFMGTTCSSLYSLYSLYGKPRAWNLQALQVYFQSIIATKDFIYFTYCITFVTSNICLKFALIPILCRALEHVAKFLRRNFTRSSL
         PLIPKNLS RAYRLSFMGT CSSLYSLYSLYG+PRAWN+Q LQVYFQSI+A KDFIYF YC+TFVTS++CLKFALIPILCRALE VAKFLRRNF RS++
Subjt:  FPLIPKNLSQRAYRLSFMGTTCSSLYSLYSLYGKPRAWNLQALQVYFQSIIATKDFIYFTYCITFVTSNICLKFALIPILCRALEHVAKFLRRNFTRSSL

Query:  YRKYLEEPCVWVESNSTTLSILSSQAEIGLGFILVISLLSWQRNFLHTFMYWQLLKLMYHAPVTSGYHQSAWSNIGRVVSPLIYRYAPFLNTPLSMAQRW
        YRKYLE+PCVWVESN+TTL+ILSSQAEI +GF+L+ISLLSWQRN + TFMYWQLLKLMY APVT+GYHQS WS IGR V+P+I RYAPFLNTP+S  QRW
Subjt:  YRKYLEEPCVWVESNSTTLSILSSQAEIGLGFILVISLLSWQRNFLHTFMYWQLLKLMYHAPVTSGYHQSAWSNIGRVVSPLIYRYAPFLNTPLSMAQRW

Query:  WF
        WF
Subjt:  WF

AT5G38600.1 Proline-rich spliceosome-associated (PSP) family protein / zinc knuckle (CCHC-type) family protein (e value: 4.5e-93; % identity: 45.88)
Query:  MEIEDLNNLPDFSKPRSRSENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRL--TVTKAVPIEFNRFN
        ME ED+ ++P  S   S  + N + S            N  P     E N L    E+V      N + DL + +  +  G      +T+ V   FN   
Subjt:  MEIEDLNNLPDFSKPRSRSENNQIPSEAEYLPVNSADENIQPSREPLEQNELYMRYEDVGHVTSTNFEKDLVDNSSFLKTGNRL--TVTKAVPIEFNRFN

Query:  SGVSIEN----GSATFHHGSPIKNHKNDAISGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPAL
          V+++        T  H + +        +GVKRPRT+ DEQQP+VHV Y  LTRASKQKL+ LL++WSEW A+  SL+QD  + +  ESGEET FPA+
Subjt:  SGVSIEN----GSATFHHGSPIKNHKNDAISGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWSEWHAQRGSLSQDAKESENLESGEETFFPAL

Query:  CVGTEKTSAVK---------RTAAKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKII-DDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSK
         VG +KTS+V          +    F+ ++ +  P YDR F +GL SA+ S NVEGG +II DD  RCFNCG Y+HSL++CP+P D +AVN+ARK  KSK
Subjt:  CVGTEKTSAVK---------RTAAKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKII-DDASRCFNCGSYNHSLKDCPKPRDNAAVNNARKEYKSK

Query:  RHQN-SGSRSSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADE----EPNEQEDGEITEV--E
        R+QN SG R  +RYYQ ++ GKYD L+PG LDAETRQLL L ELDPPPWLNRMR +GYPPGYL P+D D  SGITI+ +E    E  E EDGEI E    
Subjt:  RHQN-SGSRSSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADE----EPNEQEDGEITEV--E

Query:  YPKPRRKMSVEFPGINAPIPENADERLWAAEPSSSGLSRSRSHQRLNHYAEHDGRGNDHYQRWSRDYKDDRPPGVDPVKSPPVPYASRYGG-YDFNY
         P+P+ K +VEFPGINAP PENADE LW A PS    SRS             GR          DY+DD P GV+P   PP     RYG  YD+ Y
Subjt:  YPKPRRKMSVEFPGINAPIPENADERLWAAEPSSSGLSRSRSHQRLNHYAEHDGRGNDHYQRWSRDYKDDRPPGVDPVKSPPVPYASRYGG-YDFNY


Sequences
CDS sequence
ATGGGAGAAGAAAGAGAGGATCTACAGAGGGTGAAGAAAGCAGCGGCGGCAGCATACGACTACGAGAACGATCCAAGATGGGCCGATTACTGGTCCAACATTCTGATCCC
TCCTCACATGGCTTCTCGCCCCGATGTTGTTGACCATTACAAGCGCAAGTTCTACCAGCGTTACATCGTGCGTACTCTCTCTTTCAATTATCGATTTGGGTATCGGCGTT
TTCTTGGATTTAGACCCTTTAGTCGAATTTGTGCTCTTTGGTGGCTAATGAGAATTCAAGTGTCTTCTTGCATTGGTTTTAAATTCTTACCTTATGTTGCCTTACTCGGT
TTGTTAAAACAGGACCCCGAACTTGTGGTAGAGGCCATGTCTTCTAGTAGTTCAACTCAGTCATCTAGACCTTCAGCAACATCTTCCGCACCCCCTCCTACAAATGATCG
GAGTCGATCACGTAGCTCAGGGTCAACGACTAGGACTTCAGGTACATCTGCCAGTGCAGATCCTAATCCAACTCCCTTACGCTGGGATCGACAAACAATTCAGTTTTCTG
TCAATGCATGGGTGTTTATTGTGGCCGTGCTGGCAATTTTCCCCCTAATACCCAAAAATCTTTCACAGAGGGCATATAGGCTTTCTTTTATGGGCACAACTTGTTCCTCT
TTATATTCTTTGTACTCGTTGTATGGGAAGCCCAGGGCGTGGAATTTGCAAGCATTGCAAGTTTATTTCCAGTCCATAATTGCAACAAAAGATTTCATTTACTTCACTTA
CTGTATCACCTTTGTGACTTCAAATATTTGTCTTAAATTTGCTTTAATTCCTATCCTGTGTCGGGCTCTTGAACATGTTGCAAAGTTCCTTAGGCGTAATTTCACACGTT
CATCCTTATACAGGAAATATTTGGAAGAGCCTTGTGTATGGGTGGAGTCAAATTCAACTACTCTCAGCATCCTATCTTCGCAGGCTGAGATTGGACTTGGCTTCATTCTA
GTCATCTCTTTGCTCTCGTGGCAACGCAACTTCTTACATACATTCATGTACTGGCAGCTGCTAAAGCTCATGTATCATGCTCCTGTCACTTCTGGGTATCATCAAAGCGC
CTGGTCCAATATTGGGAGGGTCGTTTCCCCGCTTATCTACCGTTATGCCCCGTTCCTTAATACTCCTCTTTCAATGGCACAAAGATGGTGGTTCAGTTACTTAGTGGACT
TTTTGCTACACCATTGTCCATTGTATGTCCATTTTATGGGAACCGAGGATTTCATTGCACTGCCAGCTTCTGGTGATTCTGGAAATGACAATGAGAATAATGAACTCCTT
AGTTTTCATGAAACAAGAGAAGCTTATTCTCAATCAAGTGTTTTGAAGTGTAAGGACAATGATGCAAGCATAGAGAAAGTCGAGCTTGCAGATCATGTGCAGTTTGAAGA
CATGCATTGCATAACTCAATCTGACCTTAATGATGAAACGCAGACTTGTGATTCAGATATGGAAATTGAGGATTTGAATAACCTCCCAGATTTTAGTAAGCCCAGAAGTA
GAAGTGAGAATAATCAAATACCAAGTGAAGCTGAATACCTGCCAGTCAACTCTGCAGATGAAAACATACAACCAAGCAGAGAGCCCTTGGAGCAGAATGAACTTTATATG
CGATATGAAGATGTTGGTCATGTTACAAGTACAAATTTTGAAAAGGATTTGGTTGACAATTCATCCTTCTTGAAAACTGGTAATCGATTGACTGTGACCAAGGCAGTTCC
AATTGAGTTCAATAGATTCAACTCTGGAGTGTCCATTGAGAATGGTTCGGCCACATTCCATCATGGAAGCCCCATTAAGAATCACAAGAATGATGCAATATCAGGTGTCA
AAAGACCGAGGACGACCATGGATGAGCAACAACCTTCAGTGCACGTTGTGTATAGTTCTTTAACGAGAGCTAGTAAACAAAAGCTTGACGAACTATTAAAGCAGTGGTCT
GAGTGGCATGCTCAACGAGGTTCTTTGTCTCAAGATGCTAAAGAATCTGAAAATCTAGAATCTGGAGAAGAGACCTTCTTTCCTGCTCTATGTGTTGGCACAGAGAAGAC
CTCAGCAGTGAAGAGAACAGCAGCAAAATTTATTCCTTTAGATGACAATTTTGTGCCACGATATGATCGGGGATTCACTTTGGGACTGACTTCAGCCAATGATTCGAGTA
ATGTGGAAGGAGGCCAGAAGATAATTGATGATGCTAGCCGCTGTTTCAATTGTGGTTCTTACAATCATTCCTTAAAAGATTGCCCAAAGCCTCGAGATAATGCTGCTGTT
AATAATGCTCGCAAAGAGTATAAGTCTAAAAGACATCAAAATTCTGGCTCCCGCAGTTCAACCCGGTACTATCAGAATTCACGTGGTGGGAAGTATGATGATTTGAGGCC
AGGAGCTCTTGATGCTGAAACACGGCAACTGTTAGGTCTCAAGGAGCTTGATCCACCCCCTTGGCTTAACAGAATGCGAGCGCTGGGATACCCACCGGGATATCTAGATC
CGGATGATGAGGATCAGCCATCGGGGATAACAATATATGCTGATGAGGAACCCAATGAGCAGGAAGATGGAGAAATTACTGAAGTGGAGTACCCTAAACCACGGAGGAAA
ATGAGTGTTGAATTTCCTGGTATAAATGCTCCAATCCCAGAAAATGCAGATGAAAGACTCTGGGCTGCTGAACCTTCAAGTTCAGGTCTCTCTAGAAGTCGATCACACCA
GCGTTTGAACCACTATGCCGAACATGATGGGAGGGGGAATGATCATTACCAACGATGGTCCCGCGATTACAAAGACGACAGACCTCCAGGCGTTGACCCAGTAAAAAGTC
CACCTGTGCCTTACGCTTCAAGGTATGGTGGTTATGATTTTAATTACGACTTCCAAAGCCCAAGAGGTATTTCAGGGTATTTGATCGGCTGCCTGGACGGTGGTAGTTCA
GAATTCAGATTAAAAGAGAATCTCGATTTTCAATTGCCTTTGAAGCAGGGGTTTTTTAATGACAGACACCTCATCAGAAAGGCAGAAGTGGGAGCAGAAGGACTATTGAG
AAGGATAAAGATCGCCGGAGATTGTATCGTTTTTCAGGAGCTTGCACAGCTGGGACCGCCCTCTGCCGCCTTTGCGTGGTTGCCCTTCCACGACACTACCAGAGCCAAGA
CTAGATTCATCCCGACTCTCATTATTCAGCTGACTAGAATCCTCGGCATTTCCATTTGGATACTCAACTTGATGAGTTTGCTGAAGTTCCCCAATTATAGCCTTAGCATC
TTCAACAACTGCCTTCACAGACCGGGTTCTACTGATTTTAGGCTTTCCTCTCCTGGGACGTTGTTTTGGCTGATTTCCCCTTACATCAGAAGGTTGAGAATCGACCGCAA
CTTCTGGGAATCACTTGCAATTGCAAGAGATATCTCTGCCTCATCTTCACCTGAAGACATCCTCCTCTTAGATGACTCTGCCCTCCAAGAAATAGTTCCACCAGAAATGG
GAGACTTTGGACCAGCTACGCCTGGAGTCAGTTCACCATTCTGCATATCAGAAGCTAGATTCCCGCCAGGAGACACCTGGAGTCCTGGAATTTCCAAATATCTATCAGGC
AGTCCAGGCAGATTCAGGATATCGGCATTCTCAATACCATCTAAAGAGCCTTCTGCTCTTTCTCAACCTGAGCTCTTTTCTCATCCAGCTCTTCCCATTCTCTCTCGAAA
GTCTCTCTCTGCTGCTTCAAATCCTCTGCTTCCTTCAGCAGTAGTTCTTTCTGAAGCCTATACTTCTCTATCTCTTGTTTTAATTCTGACTGCAAACGAAGATAATCAGA
CCTCTCTGACTCAGTCACTTTAA
mRNA sequence
ATGGGAGAAGAAAGAGAGGATCTACAGAGGGTGAAGAAAGCAGCGGCGGCAGCATACGACTACGAGAACGATCCAAGATGGGCCGATTACTGGTCCAACATTCTGATCCC
TCCTCACATGGCTTCTCGCCCCGATGTTGTTGACCATTACAAGCGCAAGTTCTACCAGCGTTACATCGTGCGTACTCTCTCTTTCAATTATCGATTTGGGTATCGGCGTT
TTCTTGGATTTAGACCCTTTAGTCGAATTTGTGCTCTTTGGTGGCTAATGAGAATTCAAGTGTCTTCTTGCATTGGTTTTAAATTCTTACCTTATGTTGCCTTACTCGGT
TTGTTAAAACAGGACCCCGAACTTGTGGTAGAGGCCATGTCTTCTAGTAGTTCAACTCAGTCATCTAGACCTTCAGCAACATCTTCCGCACCCCCTCCTACAAATGATCG
GAGTCGATCACGTAGCTCAGGGTCAACGACTAGGACTTCAGGTACATCTGCCAGTGCAGATCCTAATCCAACTCCCTTACGCTGGGATCGACAAACAATTCAGTTTTCTG
TCAATGCATGGGTGTTTATTGTGGCCGTGCTGGCAATTTTCCCCCTAATACCCAAAAATCTTTCACAGAGGGCATATAGGCTTTCTTTTATGGGCACAACTTGTTCCTCT
TTATATTCTTTGTACTCGTTGTATGGGAAGCCCAGGGCGTGGAATTTGCAAGCATTGCAAGTTTATTTCCAGTCCATAATTGCAACAAAAGATTTCATTTACTTCACTTA
CTGTATCACCTTTGTGACTTCAAATATTTGTCTTAAATTTGCTTTAATTCCTATCCTGTGTCGGGCTCTTGAACATGTTGCAAAGTTCCTTAGGCGTAATTTCACACGTT
CATCCTTATACAGGAAATATTTGGAAGAGCCTTGTGTATGGGTGGAGTCAAATTCAACTACTCTCAGCATCCTATCTTCGCAGGCTGAGATTGGACTTGGCTTCATTCTA
GTCATCTCTTTGCTCTCGTGGCAACGCAACTTCTTACATACATTCATGTACTGGCAGCTGCTAAAGCTCATGTATCATGCTCCTGTCACTTCTGGGTATCATCAAAGCGC
CTGGTCCAATATTGGGAGGGTCGTTTCCCCGCTTATCTACCGTTATGCCCCGTTCCTTAATACTCCTCTTTCAATGGCACAAAGATGGTGGTTCAGTTACTTAGTGGACT
TTTTGCTACACCATTGTCCATTGTATGTCCATTTTATGGGAACCGAGGATTTCATTGCACTGCCAGCTTCTGGTGATTCTGGAAATGACAATGAGAATAATGAACTCCTT
AGTTTTCATGAAACAAGAGAAGCTTATTCTCAATCAAGTGTTTTGAAGTGTAAGGACAATGATGCAAGCATAGAGAAAGTCGAGCTTGCAGATCATGTGCAGTTTGAAGA
CATGCATTGCATAACTCAATCTGACCTTAATGATGAAACGCAGACTTGTGATTCAGATATGGAAATTGAGGATTTGAATAACCTCCCAGATTTTAGTAAGCCCAGAAGTA
GAAGTGAGAATAATCAAATACCAAGTGAAGCTGAATACCTGCCAGTCAACTCTGCAGATGAAAACATACAACCAAGCAGAGAGCCCTTGGAGCAGAATGAACTTTATATG
CGATATGAAGATGTTGGTCATGTTACAAGTACAAATTTTGAAAAGGATTTGGTTGACAATTCATCCTTCTTGAAAACTGGTAATCGATTGACTGTGACCAAGGCAGTTCC
AATTGAGTTCAATAGATTCAACTCTGGAGTGTCCATTGAGAATGGTTCGGCCACATTCCATCATGGAAGCCCCATTAAGAATCACAAGAATGATGCAATATCAGGTGTCA
AAAGACCGAGGACGACCATGGATGAGCAACAACCTTCAGTGCACGTTGTGTATAGTTCTTTAACGAGAGCTAGTAAACAAAAGCTTGACGAACTATTAAAGCAGTGGTCT
GAGTGGCATGCTCAACGAGGTTCTTTGTCTCAAGATGCTAAAGAATCTGAAAATCTAGAATCTGGAGAAGAGACCTTCTTTCCTGCTCTATGTGTTGGCACAGAGAAGAC
CTCAGCAGTGAAGAGAACAGCAGCAAAATTTATTCCTTTAGATGACAATTTTGTGCCACGATATGATCGGGGATTCACTTTGGGACTGACTTCAGCCAATGATTCGAGTA
ATGTGGAAGGAGGCCAGAAGATAATTGATGATGCTAGCCGCTGTTTCAATTGTGGTTCTTACAATCATTCCTTAAAAGATTGCCCAAAGCCTCGAGATAATGCTGCTGTT
AATAATGCTCGCAAAGAGTATAAGTCTAAAAGACATCAAAATTCTGGCTCCCGCAGTTCAACCCGGTACTATCAGAATTCACGTGGTGGGAAGTATGATGATTTGAGGCC
AGGAGCTCTTGATGCTGAAACACGGCAACTGTTAGGTCTCAAGGAGCTTGATCCACCCCCTTGGCTTAACAGAATGCGAGCGCTGGGATACCCACCGGGATATCTAGATC
CGGATGATGAGGATCAGCCATCGGGGATAACAATATATGCTGATGAGGAACCCAATGAGCAGGAAGATGGAGAAATTACTGAAGTGGAGTACCCTAAACCACGGAGGAAA
ATGAGTGTTGAATTTCCTGGTATAAATGCTCCAATCCCAGAAAATGCAGATGAAAGACTCTGGGCTGCTGAACCTTCAAGTTCAGGTCTCTCTAGAAGTCGATCACACCA
GCGTTTGAACCACTATGCCGAACATGATGGGAGGGGGAATGATCATTACCAACGATGGTCCCGCGATTACAAAGACGACAGACCTCCAGGCGTTGACCCAGTAAAAAGTC
CACCTGTGCCTTACGCTTCAAGGTATGGTGGTTATGATTTTAATTACGACTTCCAAAGCCCAAGAGGTATTTCAGGGTATTTGATCGGCTGCCTGGACGGTGGTAGTTCA
GAATTCAGATTAAAAGAGAATCTCGATTTTCAATTGCCTTTGAAGCAGGGGTTTTTTAATGACAGACACCTCATCAGAAAGGCAGAAGTGGGAGCAGAAGGACTATTGAG
AAGGATAAAGATCGCCGGAGATTGTATCGTTTTTCAGGAGCTTGCACAGCTGGGACCGCCCTCTGCCGCCTTTGCGTGGTTGCCCTTCCACGACACTACCAGAGCCAAGA
CTAGATTCATCCCGACTCTCATTATTCAGCTGACTAGAATCCTCGGCATTTCCATTTGGATACTCAACTTGATGAGTTTGCTGAAGTTCCCCAATTATAGCCTTAGCATC
TTCAACAACTGCCTTCACAGACCGGGTTCTACTGATTTTAGGCTTTCCTCTCCTGGGACGTTGTTTTGGCTGATTTCCCCTTACATCAGAAGGTTGAGAATCGACCGCAA
CTTCTGGGAATCACTTGCAATTGCAAGAGATATCTCTGCCTCATCTTCACCTGAAGACATCCTCCTCTTAGATGACTCTGCCCTCCAAGAAATAGTTCCACCAGAAATGG
GAGACTTTGGACCAGCTACGCCTGGAGTCAGTTCACCATTCTGCATATCAGAAGCTAGATTCCCGCCAGGAGACACCTGGAGTCCTGGAATTTCCAAATATCTATCAGGC
AGTCCAGGCAGATTCAGGATATCGGCATTCTCAATACCATCTAAAGAGCCTTCTGCTCTTTCTCAACCTGAGCTCTTTTCTCATCCAGCTCTTCCCATTCTCTCTCGAAA
GTCTCTCTCTGCTGCTTCAAATCCTCTGCTTCCTTCAGCAGTAGTTCTTTCTGAAGCCTATACTTCTCTATCTCTTGTTTTAATTCTGACTGCAAACGAAGATAATCAGA
CCTCTCTGACTCAGTCACTTTAA
Protein sequence
MGEEREDLQRVKKAAAAAYDYENDPRWADYWSNILIPPHMASRPDVVDHYKRKFYQRYIVRTLSFNYRFGYRRFLGFRPFSRICALWWLMRIQVSSCIGFKFLPYVALLG
LLKQDPELVVEAMSSSSSTQSSRPSATSSAPPPTNDRSRSRSSGSTTRTSGTSASADPNPTPLRWDRQTIQFSVNAWVFIVAVLAIFPLIPKNLSQRAYRLSFMGTTCSS
LYSLYSLYGKPRAWNLQALQVYFQSIIATKDFIYFTYCITFVTSNICLKFALIPILCRALEHVAKFLRRNFTRSSLYRKYLEEPCVWVESNSTTLSILSSQAEIGLGFIL
VISLLSWQRNFLHTFMYWQLLKLMYHAPVTSGYHQSAWSNIGRVVSPLIYRYAPFLNTPLSMAQRWWFSYLVDFLLHHCPLYVHFMGTEDFIALPASGDSGNDNENNELL
SFHETREAYSQSSVLKCKDNDASIEKVELADHVQFEDMHCITQSDLNDETQTCDSDMEIEDLNNLPDFSKPRSRSENNQIPSEAEYLPVNSADENIQPSREPLEQNELYM
RYEDVGHVTSTNFEKDLVDNSSFLKTGNRLTVTKAVPIEFNRFNSGVSIENGSATFHHGSPIKNHKNDAISGVKRPRTTMDEQQPSVHVVYSSLTRASKQKLDELLKQWS
EWHAQRGSLSQDAKESENLESGEETFFPALCVGTEKTSAVKRTAAKFIPLDDNFVPRYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCPKPRDNAAV
NNARKEYKSKRHQNSGSRSSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRALGYPPGYLDPDDEDQPSGITIYADEEPNEQEDGEITEVEYPKPRRK
MSVEFPGINAPIPENADERLWAAEPSSSGLSRSRSHQRLNHYAEHDGRGNDHYQRWSRDYKDDRPPGVDPVKSPPVPYASRYGGYDFNYDFQSPRGISGYLIGCLDGGSS
EFRLKENLDFQLPLKQGFFNDRHLIRKAEVGAEGLLRRIKIAGDCIVFQELAQLGPPSAAFAWLPFHDTTRAKTRFIPTLIIQLTRILGISIWILNLMSLLKFPNYSLSI
FNNCLHRPGSTDFRLSSPGTLFWLISPYIRRLRIDRNFWESLAIARDISASSSPEDILLLDDSALQEIVPPEMGDFGPATPGVSSPFCISEARFPPGDTWSPGISKYLSG
SPGRFRISAFSIPSKEPSALSQPELFSHPALPILSRKSLSAASNPLLPSAVVLSEAYTSLSLVLILTANEDNQTSLTQSL
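
The protein sequence above should correspond to a translation of the CDS sequence. As a minimal sketch for checking that, the snippet below translates a nucleotide string codon by codon. It assumes the standard genetic code (NCBI translation table 1) and that the CDS is in frame from its first base. The names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence can be
    pasted in as-is. Codons containing anything other than A, C, G or T are
    reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the CDS listed above.
    print(translate("ATGGGAGAAGAAAGAGAGGATCTACAGAGG"))
```

To check the whole record, paste the complete CDS into `translate` and compare the result with the protein sequence after removing its line breaks.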