CuGenDBv2

Tan0018268 (gene) of Snake gourd v1 genome

Gene ID: Tan0018268
Organism: Trichosanthes anguina (Snake gourd v1)
Description: zinc finger CCHC domain-containing protein 7-like isoform X2
Genome location: LG09:69452146..69456868
RNA-Seq Expression: Tan0018268
Synteny: Tan0018268
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR036875 - Zinc finger, CCHC-type superfamily


Homology

GenBank top hits (e-value, % identity, alignment)
KAG6587697.1 DNA-binding protein HEXBP, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 1.8e-281 | identity: 74.06%
Query:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA
        MGRR KQKKFKFE+DEDEL   KE ASK+   VSSDD+EANEDLSLKIVEKALRLRSGKLV   ++N+ GNR QSGN  V  VG VEV  S L DA+ GA
Subjt:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA

Query:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG
         TSG   D IAAASE RELGSKK+ KKRKRKVKKL TEE  VVVTEGEK                   IETT +IDQVDS EPN T+T NNNVFRKLLRG
Subjt:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG

Query:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD
        PRYFDPPDSWG CYNCGEEGHNAV+CTS KRKKPCFVCGSLEHNARSCLKARDC+IC KVGHRAKDCPEKH  VSSSSKICLKCGDPGHDMFSCQ  YPD
Subjt:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD

Query:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRC
        DDLKNIQCYICK  GHLCCVNSTSDTSIDISCYKCG+ GH+GLACSRLRGEAS AVS+S CYRCGEEGHFARECTS  KGGK+NREEASGA S  PCYRC
Subjt:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRC

Query:  GEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEASGAT
        GEEGHFAREC  S K  K+                    G   RECT S KGGK+  E+ASGAAS + CYRCGE GHFSREC SSTK  KRN+E ASG  
Subjt:  GEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEASGAT

Query:  STSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNGTFM
        ST+ CYRCG +GHF+RECTSSTKG KRNRELSN K RS+ EE  HMGSKS P +LAKAH+KKKKIN++EKYT++PR SGQKGRWMMED G  NF NGT M
Subjt:  STSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNGTFM

Query:  RNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW
         NNWNSPVTPSRWDH YY EY+GHY SPQFSG YAS  +SSG+Y SPQS ++ RTLHPGTP+S  SSIAPQN FSASRFGGFSNEGRRKSYGWW
Subjt:  RNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW

XP_022933989.1 uncharacterized protein LOC111441232 isoform X2 [Cucurbita moschata] | e-value: 9.1e-281 | identity: 74.03%
Query:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA
        MGRR KQKKFKFE+DEDEL   KE ASK+   VSSDD+EANEDLSLKIVEKALRLRSGKLV   ++N+ GNR QSGN  V  VG VEV  S L DAD GA
Subjt:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA

Query:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG
        GTSG S D IAAASE RELGSKK+ KKRKRKVKKL TEE  VVVTEGEK                   IETT +IDQVDS  PN T+T NNNVFRKLLRG
Subjt:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG

Query:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD
        PRYFDPPDSWG CYNCGEEGHNAV+CTS KRKKPCFVCGSLEHNARSC KARDC+IC KVGHRAKDCPEKH  VSSSSKICLKCGDPGHDMFSCQ+ YPD
Subjt:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD

Query:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRC
        DDLKNIQCYICK  GHLCCVNSTSDTSIDISCYKCG+ GH+GLACSRLRGEAS AVS+S CYRCGEEGHFARECTS  KGGK+NREEASGA S  PCYRC
Subjt:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRC

Query:  GEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTK---GGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS
        GEEGHFAREC  S K  K+                    G   RECT S K   GGK+  E+ASGAAS + CYRCGE GHFSREC S TK  KRN+E AS
Subjt:  GEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTK---GGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS

Query:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG
        GA ST+ CYRCG +GHF+RECTSSTKG KRNRELSN K RS+ EE  HMGSKS P +LAKAH+KKKKIN++EKYT++PR SGQKGRWMMED G  NF NG
Subjt:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG

Query:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW
        T M NNWNSPVTPSRWDH YY EY GHY SPQFSG YAS  +SSG+Y SPQS ++ RTLHPGTPMS  SSIAPQN FSASRFGGFSNEGRRKSYGWW
Subjt:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW

XP_022933990.1 uncharacterized protein LOC111441232 isoform X3 [Cucurbita moschata] | e-value: 9.1e-281 | identity: 74.03%
Query:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA
        MGRR KQKKFKFE+DEDEL   KE ASK+   VSSDD+EANEDLSLKIVEKALRLRSGKLV   ++N+ GNR QSGN  V  VG VEV  S L DAD GA
Subjt:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA

Query:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG
        GTSG S D IAAASE RELGSKK+ KKRKRKVKKL TEE  VVVTEGEK                   IETT +IDQVDS  PN T+T NNNVFRKLLRG
Subjt:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG

Query:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD
        PRYFDPPDSWG CYNCGEEGHNAV+CTS KRKKPCFVCGSLEHNARSC KARDC+IC KVGHRAKDCPEKH  VSSSSKICLKCGDPGHDMFSCQ+ YPD
Subjt:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD

Query:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTK---GGKRNREEASGAVSSIPC
        DDLKNIQCYICK  GHLCCVNSTSDTSIDISCYKCG+ GH+GLACSRLRGEAS AVS+S CYRCGEEGHFARECTS  K   GGK+NREEASGA S  PC
Subjt:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTK---GGKRNREEASGAVSSIPC

Query:  YRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS
        YRCGEEGHFAREC  S K  K+                    G   RECT S KGGK+  E+ASGAAS + CYRCGE GHFSREC S TK  KRN+E AS
Subjt:  YRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS

Query:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG
        GA ST+ CYRCG +GHF+RECTSSTKG KRNRELSN K RS+ EE  HMGSKS P +LAKAH+KKKKIN++EKYT++PR SGQKGRWMMED G  NF NG
Subjt:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG

Query:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW
        T M NNWNSPVTPSRWDH YY EY GHY SPQFSG YAS  +SSG+Y SPQS ++ RTLHPGTPMS  SSIAPQN FSASRFGGFSNEGRRKSYGWW
Subjt:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW

XP_023531109.1 zinc finger CCHC domain-containing protein 7-like isoform X2 [Cucurbita pepo subsp. pepo] | e-value: 2.4e-281 | identity: 74.03%
Query:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA
        MGRR K KKFKFE+DEDEL   KE ASK+   VSSDD+EANED SLKIVEKALRLRSGKLV   ++N+ GNR QSGN DV  VG VEV  S L DAD GA
Subjt:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA

Query:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG
        GTSG S D IAA SE RELGSKK+ KKRKRKVKKL TEE  VVVTEGEK                   IETT +IDQVDS EPN T+T NNNVFRKLLRG
Subjt:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG

Query:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD
        PRYFDPPDSWG CYNCGEEGHNAV+CTS KRKKPCFVCGSLEHNARSCLKARDC+IC KVGHRAKDCPEKH  VSSSSKICLKCGDPGHDMFSCQ+ YPD
Subjt:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD

Query:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRC
        DDL+NIQCYICK  GHLCCVNSTSDTSIDISCYKCGK GH+GLACSRLRGEAS AVS+S CYRCGEEGHFARECTS  KGGK+NREEASGA S  PCYRC
Subjt:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRC

Query:  GEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTK---GGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS
        GEEGHFAREC  S K  K+                    G   RECT S K   GGK+  E+ASGAAS + CYRCGE GHFSREC SSTK  KRN+E AS
Subjt:  GEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTK---GGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS

Query:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG
        GA ST+ CYRCG +GHF+RECTSSTKG KRNRELS  K RS+ EE  HMGSKS P +LAKAH+KKKKIN++EKYT++PR SGQKGRWMMED G  NF NG
Subjt:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG

Query:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW
        T M NNWNSPVTPSRWDH+YY EY GHY SPQFSG YAS  +SSG+Y SPQS ++ RTLHPGTPMS  SSIAPQN FSASRFGGFSNEGRRKSYGWW
Subjt:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW

XP_023531111.1 zinc finger CCHC domain-containing protein 7-like isoform X3 [Cucurbita pepo subsp. pepo] | e-value: 2.4e-281 | identity: 74.03%
Query:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA
        MGRR K KKFKFE+DEDEL   KE ASK+   VSSDD+EANED SLKIVEKALRLRSGKLV   ++N+ GNR QSGN DV  VG VEV  S L DAD GA
Subjt:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA

Query:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG
        GTSG S D IAA SE RELGSKK+ KKRKRKVKKL TEE  VVVTEGEK                   IETT +IDQVDS EPN T+T NNNVFRKLLRG
Subjt:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG

Query:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD
        PRYFDPPDSWG CYNCGEEGHNAV+CTS KRKKPCFVCGSLEHNARSCLKARDC+IC KVGHRAKDCPEKH  VSSSSKICLKCGDPGHDMFSCQ+ YPD
Subjt:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD

Query:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTK---GGKRNREEASGAVSSIPC
        DDL+NIQCYICK  GHLCCVNSTSDTSIDISCYKCGK GH+GLACSRLRGEAS AVS+S CYRCGEEGHFARECTS  K   GGK+NREEASGA S  PC
Subjt:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTK---GGKRNREEASGAVSSIPC

Query:  YRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS
        YRCGEEGHFAREC  S K  K+                    G   RECT S KGGK+  E+ASGAAS + CYRCGE GHFSREC SSTK  KRN+E AS
Subjt:  YRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS

Query:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG
        GA ST+ CYRCG +GHF+RECTSSTKG KRNRELS  K RS+ EE  HMGSKS P +LAKAH+KKKKIN++EKYT++PR SGQKGRWMMED G  NF NG
Subjt:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG

Query:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW
        T M NNWNSPVTPSRWDH+YY EY GHY SPQFSG YAS  +SSG+Y SPQS ++ RTLHPGTPMS  SSIAPQN FSASRFGGFSNEGRRKSYGWW
Subjt:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1F1A8 uncharacterized protein LOC111441232 isoform X2 | e-value: 4.4e-281 | identity: 74.03%
Query:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA
        MGRR KQKKFKFE+DEDEL   KE ASK+   VSSDD+EANEDLSLKIVEKALRLRSGKLV   ++N+ GNR QSGN  V  VG VEV  S L DAD GA
Subjt:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA

Query:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG
        GTSG S D IAAASE RELGSKK+ KKRKRKVKKL TEE  VVVTEGEK                   IETT +IDQVDS  PN T+T NNNVFRKLLRG
Subjt:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG

Query:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD
        PRYFDPPDSWG CYNCGEEGHNAV+CTS KRKKPCFVCGSLEHNARSC KARDC+IC KVGHRAKDCPEKH  VSSSSKICLKCGDPGHDMFSCQ+ YPD
Subjt:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD

Query:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRC
        DDLKNIQCYICK  GHLCCVNSTSDTSIDISCYKCG+ GH+GLACSRLRGEAS AVS+S CYRCGEEGHFARECTS  KGGK+NREEASGA S  PCYRC
Subjt:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRC

Query:  GEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTK---GGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS
        GEEGHFAREC  S K  K+                    G   RECT S K   GGK+  E+ASGAAS + CYRCGE GHFSREC S TK  KRN+E AS
Subjt:  GEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTK---GGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS

Query:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG
        GA ST+ CYRCG +GHF+RECTSSTKG KRNRELSN K RS+ EE  HMGSKS P +LAKAH+KKKKIN++EKYT++PR SGQKGRWMMED G  NF NG
Subjt:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG

Query:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW
        T M NNWNSPVTPSRWDH YY EY GHY SPQFSG YAS  +SSG+Y SPQS ++ RTLHPGTPMS  SSIAPQN FSASRFGGFSNEGRRKSYGWW
Subjt:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW

A0A6J1F6D8 uncharacterized protein LOC111441232 isoform X3 | e-value: 4.4e-281 | identity: 74.03%
Query:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA
        MGRR KQKKFKFE+DEDEL   KE ASK+   VSSDD+EANEDLSLKIVEKALRLRSGKLV   ++N+ GNR QSGN  V  VG VEV  S L DAD GA
Subjt:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA

Query:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG
        GTSG S D IAAASE RELGSKK+ KKRKRKVKKL TEE  VVVTEGEK                   IETT +IDQVDS  PN T+T NNNVFRKLLRG
Subjt:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG

Query:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD
        PRYFDPPDSWG CYNCGEEGHNAV+CTS KRKKPCFVCGSLEHNARSC KARDC+IC KVGHRAKDCPEKH  VSSSSKICLKCGDPGHDMFSCQ+ YPD
Subjt:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD

Query:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTK---GGKRNREEASGAVSSIPC
        DDLKNIQCYICK  GHLCCVNSTSDTSIDISCYKCG+ GH+GLACSRLRGEAS AVS+S CYRCGEEGHFARECTS  K   GGK+NREEASGA S  PC
Subjt:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTK---GGKRNREEASGAVSSIPC

Query:  YRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS
        YRCGEEGHFAREC  S K  K+                    G   RECT S KGGK+  E+ASGAAS + CYRCGE GHFSREC S TK  KRN+E AS
Subjt:  YRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS

Query:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG
        GA ST+ CYRCG +GHF+RECTSSTKG KRNRELSN K RS+ EE  HMGSKS P +LAKAH+KKKKIN++EKYT++PR SGQKGRWMMED G  NF NG
Subjt:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG

Query:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW
        T M NNWNSPVTPSRWDH YY EY GHY SPQFSG YAS  +SSG+Y SPQS ++ RTLHPGTPMS  SSIAPQN FSASRFGGFSNEGRRKSYGWW
Subjt:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW

A0A6J1F6E2 uncharacterized protein LOC111441232 isoform X1 | e-value: 1.8e-279 | identity: 73.71%
Query:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA
        MGRR KQKKFKFE+DEDEL   KE ASK+   VSSDD+EANEDLSLKIVEKALRLRSGKLV   ++N+ GNR QSGN  V  VG VEV  S L DAD GA
Subjt:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA

Query:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG
        GTSG S D IAAASE RELGSKK+ KKRKRKVKKL TEE  VVVTEGEK                   IETT +IDQVDS  PN T+T NNNVFRKLLRG
Subjt:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG

Query:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD
        PRYFDPPDSWG CYNCGEEGHNAV+CTS KRKKPCFVCGSLEHNARSC KARDC+IC KVGHRAKDCPEKH  VSSSSKICLKCGDPGHDMFSCQ+ YPD
Subjt:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD

Query:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTK---GGKRNREEASGAVSSIPC
        DDLKNIQCYICK  GHLCCVNSTSDTSIDISCYKCG+ GH+GLACSRLRGEAS AVS+S CYRCGEEGHFARECTS  K   GGK+NREEASGA S  PC
Subjt:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTK---GGKRNREEASGAVSSIPC

Query:  YRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTK---GGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLE
        YRCGEEGHFAREC  S K  K+                    G   RECT S K   GGK+  E+ASGAAS + CYRCGE GHFSREC S TK  KRN+E
Subjt:  YRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTK---GGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLE

Query:  EASGATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNF
         ASGA ST+ CYRCG +GHF+RECTSSTKG KRNRELSN K RS+ EE  HMGSKS P +LAKAH+KKKKIN++EKYT++PR SGQKGRWMMED G  NF
Subjt:  EASGATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNF

Query:  SNGTFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW
         NGT M NNWNSPVTPSRWDH YY EY GHY SPQFSG YAS  +SSG+Y SPQS ++ RTLHPGTPMS  SSIAPQN FSASRFGGFSNEGRRKSYGWW
Subjt:  SNGTFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW

A0A6J1IF04 uncharacterized protein LOC111472161 isoform X1 | e-value: 4.1e-279 | identity: 73.6%
Query:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA
        MGRR KQKKFKFE+DEDE   +KE ASK+   VSSDD+EANED SLKIVEKALRLRSGKLV   ++N+ GNR +SGN DV  VG VEV  S L DAD GA
Subjt:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA

Query:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG
        GTSG S D IAAASE RELGSKK+ KKRKRKVKKL TE+  VVVTEGEK                   IETT +IDQVDS EPN T+T NNNVFRKLLRG
Subjt:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG

Query:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD
        PRYFDPPDSWG CYNCGEEGHNAV+CTS KRKKPCFVCGSLEHNARSCLKARDC+IC KVGHRAKDCPEKH  VSSSSKICLKCGDPGHDMFSCQ+ YPD
Subjt:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD

Query:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRC
        DDLKNIQCYICK  GHLCCVNSTSDTS+DISCYKCG+ GH+GLACSRLRGEAS AVS+S CYRCGEEGHFARECTS  KGGK+NREEASGA S  PCYRC
Subjt:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRC

Query:  GEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTK---GGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS
        GEEGHFAREC  S K  K+                    G   RECT S K   GGK+  E+ASGAAS + CYRCGE GHFSREC SSTK  KRN+E AS
Subjt:  GEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTK---GGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS

Query:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG
        GA ST+ CYRCG +GHF+RECTSSTKG K+ R+LSNPK RS+  EM HMGSKS P +LAKAH+KKKKIN+EEKYT++PR SGQKGRWMMED G  NF NG
Subjt:  GATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNG

Query:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW
        T M NNWNSPVTPSRWDH YY EY G Y SPQFSG YAS  +SSG+Y SPQS ++ RTLHPGTPMS  SSIAPQN FSASRFGGFSNEGRRKSYGWW
Subjt:  TFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW

A0A6J1IF09 zinc finger CCHC domain-containing protein 7-like isoform X2 | e-value: 9.8e-281 | identity: 73.92%
Query:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA
        MGRR KQKKFKFE+DEDE   +KE ASK+   VSSDD+EANED SLKIVEKALRLRSGKLV   ++N+ GNR +SGN DV  VG VEV  S L DAD GA
Subjt:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA

Query:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG
        GTSG S D IAAASE RELGSKK+ KKRKRKVKKL TE+  VVVTEGEK                   IETT +IDQVDS EPN T+T NNNVFRKLLRG
Subjt:  GTSGASADGIAAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEK-------------------IETTGMIDQVDSVEPNSTETPNNNVFRKLLRG

Query:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD
        PRYFDPPDSWG CYNCGEEGHNAV+CTS KRKKPCFVCGSLEHNARSCLKARDC+IC KVGHRAKDCPEKH  VSSSSKICLKCGDPGHDMFSCQ+ YPD
Subjt:  PRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPD

Query:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRC
        DDLKNIQCYICK  GHLCCVNSTSDTS+DISCYKCG+ GH+GLACSRLRGEAS AVS+S CYRCGEEGHFARECTS  KGGK+NREEASGA S  PCYRC
Subjt:  DDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRC

Query:  GEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEASGAT
        GEEGHFAREC  S K  K+                    G   RECT S KGGK+  E+ASGAAS + CYRCGE GHFSREC SSTK  KRN+E ASGA 
Subjt:  GEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEASGAT

Query:  STSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNGTFM
        ST+ CYRCG +GHF+RECTSSTKG K+ R+LSNPK RS+  EM HMGSKS P +LAKAH+KKKKIN+EEKYT++PR SGQKGRWMMED G  NF NGT M
Subjt:  STSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNGTFM

Query:  RNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW
         NNWNSPVTPSRWDH YY EY G Y SPQFSG YAS  +SSG+Y SPQS ++ RTLHPGTPMS  SSIAPQN FSASRFGGFSNEGRRKSYGWW
Subjt:  RNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYAS-HQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFGGFSNEGRRKSYGWW

SwissProt top hits (e-value, % identity, alignment)
O42395 Cellular nucleic acid-binding protein | e-value: 6.8e-21 | identity: 35.78%
Query:  CYKCGKTGHSGLAC---------SRLRGEASGAVSSSP----CYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHFARECPISTKGGK
        C+KCG+TGH    C          R RG A     SS     CYRCGE GH A++C          +E+ +       CY CG  GH A++C    K  K
Subjt:  CYKCGKTGHSGLAC---------SRLRGEASGAVSSSP----CYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHFARECPISTKGGK

Query:  RNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECT--SSTKAGKR-NLEEASGATSTSLCYRCGEQGHFS
        R RE+         CY CG+ GHLAR+C                 A    CY CGE GH  ++CT     + G+  ++      TS   CYRCGE GH +
Subjt:  RNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECT--SSTKAGKR-NLEEASGATSTSLCYRCGEQGHFS

Query:  RECT
        RECT
Subjt:  RECT

O65639 Cold shock protein 1 | e-value: 2.0e-25 | identity: 33.98%
Query:  CYKCGKTGHSGLACSRLRGEASGAVSS---SPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHFARECPISTKGGKRNREEASGAA
        CY CG+ GH    C    G   G   S     CY CG+ GHFAR+CTS   G +R   +      +  CY CG+ GH AR+C   + G    R    G  
Subjt:  CYKCGKTGHSGLACSRLRGEASGAVSS---SPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHFARECPISTKGGKRNREEASGAA

Query:  SPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGK------------RNLEE--ASGATSTSLCYRCGEQGHF
            CY CG+ GH AR+CT     G       SG   + +CY CG  GH +R+C +  +  +            R+ ++  + G  + + CY+CG++GHF
Subjt:  SPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGK------------RNLEE--ASGATSTSLCYRCGEQGHF

Query:  SRECTS
        +REC+S
Subjt:  SRECTS

Q04832 DNA-binding protein HEXBP | e-value: 2.6e-28 | identity: 34.82%
Query:  CYICKTFGHLCCVNSTSDTSID---ISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEG
        C  C   GH       +D+  D    +C++CG+ GH    C       SGA  +  C+RCGE GH +R+C +  K          GA     CY+CG+EG
Subjt:  CYICKTFGHLCCVNSTSDTSID---ISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEG

Query:  HFARECPIS---TKGG---KRNREEASGAAS-PSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS
        H +R+CP S   ++GG   KR R  A G  S   +CY+CG+ GH++R+C +   G           A   +CY+CG+ GH SR+C         N +   
Subjt:  HFARECPIS---TKGG---KRNREEASGAAS-PSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEAS

Query:  GATSTSLCYRCGEQGHFSRECTSS
               CY+CGE GH SREC S+
Subjt:  GATSTSLCYRCGEQGHFSRECTSS

Q3T0Q6 Cellular nucleic acid-binding protein | e-value: 4.0e-21 | identity: 36.45%
Query:  CYKCGKTGHSGLAC---------SRLRGEASGAVSSS---PCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHFARECPISTKGGKR
        C+KCG++GH    C          R RG     VSSS    CYRCGE GH A++C          +E+A        CY CG  GH A++C    K  KR
Subjt:  CYKCGKTGHSGLAC---------SRLRGEASGAVSSS---PCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHFARECPISTKGGKR

Query:  NREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECT--SSTKAGKR-NLEEASGATSTSLCYRCGEQGHFSR
         RE+         CY CG+ GHLAR+C                 A    CY CGE GH  ++CT     + G+  ++      TS   CYRCGE GH +R
Subjt:  NREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECT--SSTKAGKR-NLEEASGATSTSLCYRCGEQGHFSR

Query:  ECT
        ECT
Subjt:  ECT

Q94C69 Cold shock domain-containing protein 3 | e-value: 6.6e-24 | identity: 32.6%
Query:  CYICKTFGHLC--CVNSTSDTSI----------DISCYKCGKTGHSGLACSRLRGEASGAVSSS--PCYRCGEEGHFARECTSFTKGGKRNREEASGAVS
        C+ C   GH+   C   +   S           +  CY CG  GH    C +  G  SG       PCY CGE GH A++C     GG R          
Subjt:  CYICKTFGHLC--CVNSTSDTSI----------DISCYKCGKTGHSGLACSRLRGEASGAVSSS--PCYRCGEEGHFARECTSFTKGGKRNREEASGAVS

Query:  SIPCYRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNL
           CY CG  GHFAR+C        R     +     S+CY CG  GH+A+ CTS    G        G     +CY CG  GH +R+C       +R  
Subjt:  SIPCYRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNL

Query:  EEASGATSTSLCYRCGEQGHFSRECTS
          + G   ++ C+ CG++GHF+RECTS
Subjt:  EEASGATSTSLCYRCGEQGHFSRECTS

Arabidopsis top hits (e-value, % identity, alignment)
AT2G17870.1 cold shock domain protein 3 | e-value: 4.7e-25 | identity: 32.6%
Query:  CYICKTFGHLC--CVNSTSDTSI----------DISCYKCGKTGHSGLACSRLRGEASGAVSSS--PCYRCGEEGHFARECTSFTKGGKRNREEASGAVS
        C+ C   GH+   C   +   S           +  CY CG  GH    C +  G  SG       PCY CGE GH A++C     GG R          
Subjt:  CYICKTFGHLC--CVNSTSDTSI----------DISCYKCGKTGHSGLACSRLRGEASGAVSSS--PCYRCGEEGHFARECTSFTKGGKRNREEASGAVS

Query:  SIPCYRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNL
           CY CG  GHFAR+C        R     +     S+CY CG  GH+A+ CTS    G        G     +CY CG  GH +R+C       +R  
Subjt:  SIPCYRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNL

Query:  EEASGATSTSLCYRCGEQGHFSRECTS
          + G   ++ C+ CG++GHF+RECTS
Subjt:  EEASGATSTSLCYRCGEQGHFSRECTS

AT3G42860.1 zinc knuckle (CCHC-type) family protein | e-value: 3.4e-23 | identity: 33.33%
Query:  QCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHF
        Q Y C  F   C      + ++  S      T +S    S  RG    A + +PCY+CG+EGH+AR+CT  +  G      A+G      C++CG+ GH+
Subjt:  QCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHF

Query:  ARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKA----GKR
        +R+C   +   K    +   ++S   CY+CG+QGH +R+CT  +   +    +A   +ST  CY+CG+ GH+SR+CTS  +     GKR
Subjt:  ARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKA----GKR

AT3G43590.1 zinc knuckle (CCHC-type) family protein | e-value: 1.7e-88 | identity: 36.48%
Query:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA
        M R++K +KF F+D EDE +      ++  N    DDDEANEDLSLKI+EKAL  R        N         SG V   +V  V+             
Subjt:  MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGA

Query:  GTSGASADGIAAASEVRELGSKKTVKKRK----RKVKKLGTEEHDVVVTEGEKIETTGMIDQVD-SVEPNSTETPNNNVFRKLLRGPRYFDPPDS-WGMC
                     S+V++  S K +K+ K     ++  +  ++ +  V E E ++  G  D+V+ S EP + ET +N V +KLLRG RYFDPPD+ W  C
Subjt:  GTSGASADGIAAASEVRELGSKKTVKKRK----RKVKKLGTEEHDVVVTEGEKIETTGMIDQVD-SVEPNSTETPNNNVFRKLLRGPRYFDPPDS-WGMC

Query:  YNCGEEGHNAVNC-TSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPDDDLKNIQCYICK
        Y+CGE+GH + NC T  KR+KPCF+CGSLEH A+ C K  DC+ICKK GHRAKDCP+K+++  S   +CL+CGD GHDM  C+  Y  +DLK++QCYICK
Subjt:  YNCGEEGHNAVNC-TSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPDDDLKNIQCYICK

Query:  TFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHFARECPI
        +FGHLCCV   +  S  +SCY+CG+ GHSGLAC R   E++   S++P      E  F             N  EAS       CYRCGEEGHFARECP 
Subjt:  TFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHFARECPI

Query:  STKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEASGATSTSLCYRCGEQG
        S+                                          +  + G  S + CYRC   GHF+REC +S++  KR+ E +                
Subjt:  STKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEASGATSTSLCYRCGEQG

Query:  HFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNGTFMRNNWNSPVTPSR
              T+S K  K+N+E  N +  S   E N    K           KKKK + EE+  + PR    +G W+ E+    +F  G   R    SP+TPS 
Subjt:  HFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEEKYTSMPRNSGQKGRWMMEDLGRGNFSNGTFMRNNWNSPVTPSR

Query:  WDHNYYMEYTGHYESPQFS--GHYASHQSSGHYGSP
        ++ +       +Y SP+F+  GHY   QSS H+  P
Subjt:  WDHNYYMEYTGHYESPQFS--GHYASHQSSGHYGSP

AT4G36020.1 cold shock domain protein 1    e value: 1.5e-26    % identity: 33.98
Query:  CYKCGKTGHSGLACSRLRGEASGAVSS---SPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHFARECPISTKGGKRNREEASGAA
        CY CG+ GH    C    G   G   S     CY CG+ GHFAR+CTS   G +R   +      +  CY CG+ GH AR+C   + G    R    G  
Subjt:  CYKCGKTGHSGLACSRLRGEASGAVSS---SPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHFARECPISTKGGKRNREEASGAA

Query:  SPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGK------------RNLEE--ASGATSTSLCYRCGEQGHF
            CY CG+ GH AR+CT     G       SG   + +CY CG  GH +R+C +  +  +            R+ ++  + G  + + CY+CG++GHF
Subjt:  SPSSCYRCGEQGHLARECTSSTKGGKRILEEASGAASTSSCYRCGEQGHFSRECTSSTKAGK------------RNLEE--ASGATSTSLCYRCGEQGHF

Query:  SRECTS
        +REC+S
Subjt:  SRECTS

AT5G36240.1 zinc knuckle (CCHC-type) family protein    e value: 9.1e-29    % identity: 36.28
Query:  KHRDVSSSSKICLKCGDPGHDMFSCQNLYPDDDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGH
        +H D    +++CL+CG  GHDM  C+  Y  +DLKNI+CY+C + GHLCC+      S  +SCY+CG+ GH+GLAC R       +VS S C+ CG EGH
Subjt:  KHRDVSSSSKICLKCGDPGHDMFSCQNLYPDDDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLRGEASGAVSSSPCYRCGEEGH

Query:  FAREC-TSFTK--GGKRNREEASGAVSSIPCY----RCGEEGHFARECP---------ISTKGG-----KRNREEASGAASPSSCYRCGEQGHLARECTS
        F  +C  SF+       + +E  G  SS   +    R  EEGHF  +CP         IS + G       ++  + G  +   CY C  +GH+AR+C +
Subjt:  FAREC-TSFTK--GGKRNREEASGAVSSIPCY----RCGEEGHFARECP---------ISTKGG-----KRNREEASGAASPSSCYRCGEQGHLARECTS

Query:  STKGGKRILEE----ASGAASTSSCY
        S++     L      +S   S  SCY
Subjt:  STKGGKRILEE----ASGAASTSSCY


Sequences
CDS sequence
ATGGGGAGGAGAGACAAGCAGAAGAAGTTCAAGTTTGAGGACGACGAAGACGAACTTCAGGGTGTGAAAGAGGGGGCTTCGAAGTCTGCCAATTCGGTGAGCAGTGACGA
CGATGAGGCCAATGAAGATCTTAGCCTCAAGATTGTTGAAAAGGCTTTGCGATTGCGCTCCGGAAAGTTGGTCATCGCTGCCAACAATAATAAGGATGGTAACCGTAGAC
AGAGTGGTAATGTTGATGTTGCTGTCGTTGGAGCCGTCGAAGTGTTTCCCTCGTCTTTGGAGGATGCTGACGCTGGTGCCGGAACTAGCGGAGCCAGCGCCGATGGCATC
GCTGCTGCTTCTGAGGTGCGGGAATTGGGGAGTAAGAAGACAGTGAAGAAGAGGAAGAGGAAAGTTAAGAAGCTGGGAACTGAAGAGCACGATGTGGTTGTCACAGAAGG
AGAGAAGATCGAGACAACTGGAATGATTGATCAAGTTGATTCGGTGGAACCAAACTCTACTGAGACACCTAATAATAACGTGTTTAGGAAGCTTCTTCGTGGACCAAGAT
ACTTTGACCCTCCAGATAGTTGGGGAATGTGCTATAATTGTGGCGAGGAAGGTCATAATGCTGTGAATTGCACATCAGTTAAACGGAAGAAACCATGTTTTGTCTGTGGA
AGTCTGGAGCACAATGCAAGGAGTTGCTTAAAGGCACGAGACTGCCATATTTGTAAGAAAGTTGGGCATCGTGCAAAAGACTGTCCAGAGAAGCACAGGGATGTTTCTTC
AAGCTCAAAAATATGTTTAAAATGTGGAGATCCTGGGCATGATATGTTTTCTTGTCAAAATCTTTATCCAGATGATGATCTTAAGAACATACAATGCTACATTTGTAAGA
CATTTGGTCATTTGTGTTGCGTGAATTCCACCAGCGATACGTCAATAGACATTTCCTGCTATAAATGTGGAAAGACTGGACATTCTGGTCTGGCATGCTCAAGATTACGG
GGGGAAGCCTCTGGTGCTGTATCATCTAGCCCATGCTATAGATGTGGTGAAGAAGGGCATTTTGCCCGTGAATGCACAAGTTTCACCAAGGGTGGCAAGAGGAATCGAGA
GGAAGCCTCTGGTGCTGTATCATCTATCCCATGCTATAGATGTGGTGAAGAAGGACATTTTGCCCGCGAGTGCCCAATTTCCACCAAGGGTGGCAAGAGGAATCGTGAGG
AAGCCTCTGGTGCTGCGTCACCTAGCTCATGCTATAGATGTGGTGAACAAGGGCATTTAGCCCGTGAGTGCACAAGTTCCACTAAGGGTGGCAAGAGGATTCTTGAGGAA
GCCTCTGGTGCTGCATCAACTAGCTCATGCTATAGATGTGGTGAACAAGGGCATTTTTCCCGTGAGTGCACAAGTTCCACCAAGGCTGGCAAGAGGAATCTTGAGGAAGC
CTCTGGTGCTACATCAACTAGCTTATGCTATAGATGTGGTGAACAAGGGCATTTTTCACGTGAGTGCACAAGTTCCACCAAGGGTGACAAGAGGAATCGTGAGTTATCAA
ACCCAAAATTGAGGTCACAAATAGAAGAAATGAACCATATGGGATCAAAATCTGTGCCTCGTGATCTTGCAAAGGCTCATAATAAGAAGAAAAAGATAAACCACGAAGAA
AAGTACACTAGCATGCCTAGGAATTCAGGACAAAAGGGTCGCTGGATGATGGAAGATCTGGGTCGTGGTAACTTCTCCAATGGCACATTCATGAGAAATAACTGGAATTC
TCCCGTTACACCATCTCGGTGGGATCATAATTATTATATGGAATATACCGGTCACTATGAAAGCCCTCAATTTTCTGGTCACTATGCAAGTCATCAATCTTCCGGTCACT
ATGGAAGTCCTCAATCTTTCTCAAAAGGGCGCACATTACATCCAGGGACTCCAATGTCATTAGGATCCAGTATAGCTCCTCAGAACAGATTCTCGGCATCTAGATTTGGG
GGCTTTAGCAATGAAGGAAGGAGGAAAAGTTATGGATGGTGGTAG
mRNA sequence
ATGGGGAGGAGAGACAAGCAGAAGAAGTTCAAGTTTGAGGACGACGAAGACGAACTTCAGGGTGTGAAAGAGGGGGCTTCGAAGTCTGCCAATTCGGTGAGCAGTGACGA
CGATGAGGCCAATGAAGATCTTAGCCTCAAGATTGTTGAAAAGGCTTTGCGATTGCGCTCCGGAAAGTTGGTCATCGCTGCCAACAATAATAAGGATGGTAACCGTAGAC
AGAGTGGTAATGTTGATGTTGCTGTCGTTGGAGCCGTCGAAGTGTTTCCCTCGTCTTTGGAGGATGCTGACGCTGGTGCCGGAACTAGCGGAGCCAGCGCCGATGGCATC
GCTGCTGCTTCTGAGGTGCGGGAATTGGGGAGTAAGAAGACAGTGAAGAAGAGGAAGAGGAAAGTTAAGAAGCTGGGAACTGAAGAGCACGATGTGGTTGTCACAGAAGG
AGAGAAGATCGAGACAACTGGAATGATTGATCAAGTTGATTCGGTGGAACCAAACTCTACTGAGACACCTAATAATAACGTGTTTAGGAAGCTTCTTCGTGGACCAAGAT
ACTTTGACCCTCCAGATAGTTGGGGAATGTGCTATAATTGTGGCGAGGAAGGTCATAATGCTGTGAATTGCACATCAGTTAAACGGAAGAAACCATGTTTTGTCTGTGGA
AGTCTGGAGCACAATGCAAGGAGTTGCTTAAAGGCACGAGACTGCCATATTTGTAAGAAAGTTGGGCATCGTGCAAAAGACTGTCCAGAGAAGCACAGGGATGTTTCTTC
AAGCTCAAAAATATGTTTAAAATGTGGAGATCCTGGGCATGATATGTTTTCTTGTCAAAATCTTTATCCAGATGATGATCTTAAGAACATACAATGCTACATTTGTAAGA
CATTTGGTCATTTGTGTTGCGTGAATTCCACCAGCGATACGTCAATAGACATTTCCTGCTATAAATGTGGAAAGACTGGACATTCTGGTCTGGCATGCTCAAGATTACGG
GGGGAAGCCTCTGGTGCTGTATCATCTAGCCCATGCTATAGATGTGGTGAAGAAGGGCATTTTGCCCGTGAATGCACAAGTTTCACCAAGGGTGGCAAGAGGAATCGAGA
GGAAGCCTCTGGTGCTGTATCATCTATCCCATGCTATAGATGTGGTGAAGAAGGACATTTTGCCCGCGAGTGCCCAATTTCCACCAAGGGTGGCAAGAGGAATCGTGAGG
AAGCCTCTGGTGCTGCGTCACCTAGCTCATGCTATAGATGTGGTGAACAAGGGCATTTAGCCCGTGAGTGCACAAGTTCCACTAAGGGTGGCAAGAGGATTCTTGAGGAA
GCCTCTGGTGCTGCATCAACTAGCTCATGCTATAGATGTGGTGAACAAGGGCATTTTTCCCGTGAGTGCACAAGTTCCACCAAGGCTGGCAAGAGGAATCTTGAGGAAGC
CTCTGGTGCTACATCAACTAGCTTATGCTATAGATGTGGTGAACAAGGGCATTTTTCACGTGAGTGCACAAGTTCCACCAAGGGTGACAAGAGGAATCGTGAGTTATCAA
ACCCAAAATTGAGGTCACAAATAGAAGAAATGAACCATATGGGATCAAAATCTGTGCCTCGTGATCTTGCAAAGGCTCATAATAAGAAGAAAAAGATAAACCACGAAGAA
AAGTACACTAGCATGCCTAGGAATTCAGGACAAAAGGGTCGCTGGATGATGGAAGATCTGGGTCGTGGTAACTTCTCCAATGGCACATTCATGAGAAATAACTGGAATTC
TCCCGTTACACCATCTCGGTGGGATCATAATTATTATATGGAATATACCGGTCACTATGAAAGCCCTCAATTTTCTGGTCACTATGCAAGTCATCAATCTTCCGGTCACT
ATGGAAGTCCTCAATCTTTCTCAAAAGGGCGCACATTACATCCAGGGACTCCAATGTCATTAGGATCCAGTATAGCTCCTCAGAACAGATTCTCGGCATCTAGATTTGGG
GGCTTTAGCAATGAAGGAAGGAGGAAAAGTTATGGATGGTGGTAG
Protein sequence
MGRRDKQKKFKFEDDEDELQGVKEGASKSANSVSSDDDEANEDLSLKIVEKALRLRSGKLVIAANNNKDGNRRQSGNVDVAVVGAVEVFPSSLEDADAGAGTSGASADGI
AAASEVRELGSKKTVKKRKRKVKKLGTEEHDVVVTEGEKIETTGMIDQVDSVEPNSTETPNNNVFRKLLRGPRYFDPPDSWGMCYNCGEEGHNAVNCTSVKRKKPCFVCG
SLEHNARSCLKARDCHICKKVGHRAKDCPEKHRDVSSSSKICLKCGDPGHDMFSCQNLYPDDDLKNIQCYICKTFGHLCCVNSTSDTSIDISCYKCGKTGHSGLACSRLR
GEASGAVSSSPCYRCGEEGHFARECTSFTKGGKRNREEASGAVSSIPCYRCGEEGHFARECPISTKGGKRNREEASGAASPSSCYRCGEQGHLARECTSSTKGGKRILEE
ASGAASTSSCYRCGEQGHFSRECTSSTKAGKRNLEEASGATSTSLCYRCGEQGHFSRECTSSTKGDKRNRELSNPKLRSQIEEMNHMGSKSVPRDLAKAHNKKKKINHEE
KYTSMPRNSGQKGRWMMEDLGRGNFSNGTFMRNNWNSPVTPSRWDHNYYMEYTGHYESPQFSGHYASHQSSGHYGSPQSFSKGRTLHPGTPMSLGSSIAPQNRFSASRFG
GFSNEGRRKSYGWW
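
The CCHC-type zinc finger annotation (IPR001878) listed for this gene can be spot-checked against the protein sequence. The sketch below is only an illustration, not the method InterPro uses: it assumes the classic zinc-knuckle spacing C-X2-C-X4-H-X4-C, which is stricter than the InterPro profile, and the function name `find_cchc_knuckles` is hypothetical. The test fragment is copied from the protein sequence above.

```python
import re

# Assumed consensus for a CCHC zinc knuckle: C-X2-C-X4-H-X4-C.
# The lookahead lets overlapping candidates be reported as well.
CCHC_KNUCKLE = re.compile(r"(?=(C.{2}C.{4}H.{4}C))")


def find_cchc_knuckles(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched residues) for each CCHC-like motif."""
    return [(m.start() + 1, m.group(1)) for m in CCHC_KNUCKLE.finditer(protein)]


# Fragment of the Tan0018268 protein sequence (residues copied from above).
fragment = "WGMCYNCGEEGHNAVNCTSVKRKKPCFVCGSLEHNARSCLKARDCHICKKVGHRAKDCPEK"

for start, motif in find_cchc_knuckles(fragment):
    print(start, motif)
```

To scan the whole protein, join the sequence lines into one string before passing it to the function, since the pattern does not match across line breaks.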