; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G020660 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G020660
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionzinc knuckle (CCHC-type) family protein
Genome locationCG_Chr09:37656702..37665322
RNA-Seq ExpressionClCG09G020660
SyntenyClCG09G020660
Gene Ontology termsGO:2000767 - positive regulation of cytoplasmic translation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0003727 - single-stranded RNA binding (molecular function)
GO:0003729 - mRNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0045182 - translation regulator activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025829 - Zinc knuckle CX2CX3GHX4C
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585819.1 DNA-binding protein HEXBP, partial [Cucurbita argyrosperma subsp. sororia]9.1e-20673.75Show/hide
Query:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVAC-SNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKEL
        MSS LCSSIR L P  WRR N+RFF+LLQS R + FAPRFVAC S+NDDSVAIP+P PLAFDP E++YGLG+DLKPR   S APEPRSWFGPNGQYI+EL
Subjt:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVAC-SNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC+GKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAK
        SRSLKSLNAKTG+FSKRMKIIHRDP LHAQRVAAIKKAKGSAEARKRTSEALKAFF+DP+NR+KRSIAMK                              
Subjt:  SRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAK

Query:  EGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELK-EAIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTN
                     G+KFYCKNCGREGHRRHYCPELK ++IDRRFRCRVCGEKGHNRRTC+KSR N  PM+ATIQRHC ICG KGH+ RNC KS++ N +N
Subjt:  EGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELK-EAIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTN

Query:  LRINYAHRRSSRNVLRLNLI------SENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ
          I+Y   ++  NVL LNLI      +E+SRVRQY CRIC +NGH+++NCPNID K N  T RR+YSCKLCHEKGHN RTCP   MNNLQKN P A+NQ
Subjt:  LRINYAHRRSSRNVLRLNLI------SENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ

XP_004143799.1 uncharacterized protein LOC101211176 [Cucumis sativus]9.4e-21978.31Show/hide
Query:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELP
        MSSSLC SI  L PHYWR KN RFFLLLQSNR ICFAPRFVA  NNDDSVAIP+P PLAFDPAE+LYGL +DLKPR  AS APEPRSWFGPNGQYIKELP
Subjt:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVC+GKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKIS

Query:  RSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKE
        RSLK LNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFF+DPENRRKRS AMK                               
Subjt:  RSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKE

Query:  GILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNL
                    GVKFYCKNCGREGHRRHYCPELKE +IDRRFRCRVCGEKGHNR+TCEKS  N  P+TATIQRHCGICG+KGHN+RNCQKS        
Subjt:  GILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNL

Query:  RINYAHRRSSRNVLRLNLIS------ENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ
            AHR+S +NVLR NLIS      EN RVRQYHCRIC ++GHSQ+NCP+ DR+GNGL+TRRSYSCKLCHEKGHNIRTCPNR+ NNLQKN PVALNQ
Subjt:  RINYAHRRSSRNVLRLNLIS------ENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ

XP_008465678.1 PREDICTED: uncharacterized protein LOC103503314 isoform X1 [Cucumis melo]7.2e-21173.8Show/hide
Query:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTP-------------------------LAFDPAEDLYGLGIDLKP
        MSSSLC SI  L PHYWR KN  FFLLLQSNR ICFAPRFVAC  NDDSVAIP+P P                         LAFDPAE+LYGL +DLKP
Subjt:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTP-------------------------LAFDPAEDLYGLGIDLKP

Query:  RYVASRAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLE
        R  AS APEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIERSRADCSVC+GKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLE
Subjt:  RYVASRAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLE

Query:  IKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEA
        IKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNAKTGIFSKRM+IIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFF+DPENRRKRS AMK      
Subjt:  IKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEA

Query:  WAKTSGFQIHAVMTTSNLQVLVAKEGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRH
                                             GVKFYCKNCGREGHRRHYCPELKE +IDRRFRCRVCGEKGHNRRTCEKS  N  PM  TIQRH
Subjt:  WAKTSGFQIHAVMTTSNLQVLVAKEGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRH

Query:  CGICGVKGHNRRNCQKSKLSNPTNLRINYAHRRSSRNVLRLNLIS------ENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGH
        CGICG+KGHNRRNCQKS +           HR+S +N+LR NLIS      EN RVRQ HCRIC K GHSQ  CP+IDR+GNGLTTRR Y+CKLCHEKGH
Subjt:  CGICGVKGHNRRNCQKSKLSNPTNLRINYAHRRSSRNVLRLNLIS------ENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGH

Query:  NIRTCPNRNMNNLQKNSPVALNQ
        NIRTCP R+MN+LQKN PVALNQ
Subjt:  NIRTCPNRNMNNLQKNSPVALNQ

XP_008465679.1 PREDICTED: uncharacterized protein LOC103503314 isoform X2 [Cucumis melo]4.8e-21577.51Show/hide
Query:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELP
        MSSSLC SI  L PHYWR KN  FFLLLQSNR ICFAPRFVAC  NDDSVAIP+P PLAFDPAE+LYGL +DLKPR  AS APEPRSWFGPNGQYIKELP
Subjt:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVC+GKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKIS

Query:  RSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKE
        RSLKSLNAKTGIFSKRM+IIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFF+DPENRRKRS AMK                               
Subjt:  RSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKE

Query:  GILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNL
                    GVKFYCKNCGREGHRRHYCPELKE +IDRRFRCRVCGEKGHNRRTCEKS  N  PM  TIQRHCGICG+KGHNRRNCQKS +      
Subjt:  GILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNL

Query:  RINYAHRRSSRNVLRLNLIS------ENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ
             HR+S +N+LR NLIS      EN RVRQ HCRIC K GHSQ  CP+IDR+GNGLTTRR Y+CKLCHEKGHNIRTCP R+MN+LQKN PVALNQ
Subjt:  RINYAHRRSSRNVLRLNLIS------ENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ

XP_038889908.1 uncharacterized protein LOC120079678 isoform X2 [Benincasa hispida]1.4e-23381.96Show/hide
Query:  MSSSLCSSIRPLPPHYWRRKNVRFFLLL-QSNRGICFAPRFVACSNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKEL
        MSSSLCSSIR L PHYWRRKN RFFLLL QSNR ICFAPRFVAC NNDDSVAIPRP PLAFDPAE+LYGLG+DLKPR  AS APEPRSWFGPNGQYIKEL
Subjt:  MSSSLCSSIRPLPPHYWRRKNVRFFLLL-QSNRGICFAPRFVACSNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERS+ADCSVC+GKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAK
        SRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKG+AEARKRTSEALKAFF DPENR+KRSIAMK                              
Subjt:  SRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAK

Query:  EGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTN
                     GVKFYCKNCGREGHRRHYCPELKE + DRRFRCRVCGEKGHNRRTCEKSR +SIPMTATIQR CGICG KGHN+RNCQKS+LSNPTN
Subjt:  EGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTN

Query:  LRINYAHRRSSRNVLRLNLISE------NSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ
        +RINYAHR+S RN LRLNLI++      N RVRQYHCRICNKNGHS++NCPN DR+GNGLTTRRSY+CKLCHEKGHNIRTCPNR+MNNLQ N PVALNQ
Subjt:  LRINYAHRRSSRNVLRLNLISE------NSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ

TrEMBL top hitse value%identityAlignment
A0A0A0KS13 Uncharacterized protein4.6e-21978.31Show/hide
Query:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELP
        MSSSLC SI  L PHYWR KN RFFLLLQSNR ICFAPRFVA  NNDDSVAIP+P PLAFDPAE+LYGL +DLKPR  AS APEPRSWFGPNGQYIKELP
Subjt:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVC+GKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKIS

Query:  RSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKE
        RSLK LNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFF+DPENRRKRS AMK                               
Subjt:  RSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKE

Query:  GILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNL
                    GVKFYCKNCGREGHRRHYCPELKE +IDRRFRCRVCGEKGHNR+TCEKS  N  P+TATIQRHCGICG+KGHN+RNCQKS        
Subjt:  GILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNL

Query:  RINYAHRRSSRNVLRLNLIS------ENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ
            AHR+S +NVLR NLIS      EN RVRQYHCRIC ++GHSQ+NCP+ DR+GNGL+TRRSYSCKLCHEKGHNIRTCPNR+ NNLQKN PVALNQ
Subjt:  RINYAHRRSSRNVLRLNLIS------ENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ

A0A1S3CPF7 uncharacterized protein LOC103503314 isoform X22.3e-21577.51Show/hide
Query:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELP
        MSSSLC SI  L PHYWR KN  FFLLLQSNR ICFAPRFVAC  NDDSVAIP+P PLAFDPAE+LYGL +DLKPR  AS APEPRSWFGPNGQYIKELP
Subjt:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVC+GKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKIS

Query:  RSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKE
        RSLKSLNAKTGIFSKRM+IIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFF+DPENRRKRS AMK                               
Subjt:  RSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKE

Query:  GILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNL
                    GVKFYCKNCGREGHRRHYCPELKE +IDRRFRCRVCGEKGHNRRTCEKS  N  PM  TIQRHCGICG+KGHNRRNCQKS +      
Subjt:  GILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNL

Query:  RINYAHRRSSRNVLRLNLIS------ENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ
             HR+S +N+LR NLIS      EN RVRQ HCRIC K GHSQ  CP+IDR+GNGLTTRR Y+CKLCHEKGHNIRTCP R+MN+LQKN PVALNQ
Subjt:  RINYAHRRSSRNVLRLNLIS------ENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ

A0A1S3CQW6 uncharacterized protein LOC103503314 isoform X13.5e-21173.8Show/hide
Query:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTP-------------------------LAFDPAEDLYGLGIDLKP
        MSSSLC SI  L PHYWR KN  FFLLLQSNR ICFAPRFVAC  NDDSVAIP+P P                         LAFDPAE+LYGL +DLKP
Subjt:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTP-------------------------LAFDPAEDLYGLGIDLKP

Query:  RYVASRAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLE
        R  AS APEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIERSRADCSVC+GKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLE
Subjt:  RYVASRAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLE

Query:  IKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEA
        IKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNAKTGIFSKRM+IIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFF+DPENRRKRS AMK      
Subjt:  IKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEA

Query:  WAKTSGFQIHAVMTTSNLQVLVAKEGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRH
                                             GVKFYCKNCGREGHRRHYCPELKE +IDRRFRCRVCGEKGHNRRTCEKS  N  PM  TIQRH
Subjt:  WAKTSGFQIHAVMTTSNLQVLVAKEGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKE-AIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRH

Query:  CGICGVKGHNRRNCQKSKLSNPTNLRINYAHRRSSRNVLRLNLIS------ENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGH
        CGICG+KGHNRRNCQKS +           HR+S +N+LR NLIS      EN RVRQ HCRIC K GHSQ  CP+IDR+GNGLTTRR Y+CKLCHEKGH
Subjt:  CGICGVKGHNRRNCQKSKLSNPTNLRINYAHRRSSRNVLRLNLIS------ENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGH

Query:  NIRTCPNRNMNNLQKNSPVALNQ
        NIRTCP R+MN+LQKN PVALNQ
Subjt:  NIRTCPNRNMNNLQKNSPVALNQ

A0A6J1GJ81 uncharacterized protein LOC111454354 isoform X11.1e-20173.82Show/hide
Query:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVAC-SNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKEL
        MSS LCSSIR L P  WRR N+RFF+LLQS R + FAPRFVAC S+NDDSVAIP+P PLAFDP E++YGLG+DLKPR   S APEPRSWFGPNGQYI+EL
Subjt:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVAC-SNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC+GKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAK
        SRSLKSLNAKTG+FSKRMKIIHRDP LHAQRVAAIKKAKGSAEARKRTSEALKAFF+DP+NR+KRSIAMK                              
Subjt:  SRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAK

Query:  EGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELK-EAIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTN
                     G+KFYCKNCGREGHRRHYCPELK ++IDRRFRCRVCGEKGHNRRTC+KSR N  PM+ATIQ HC ICG KGH+ RNC KS++ N +N
Subjt:  EGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELK-EAIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTN

Query:  LRINYAHRRSSRNVLRLNLI------SENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNL
          I+Y   ++  NVL LNLI      +E+SRVRQY CRIC +NGH++QNCPNID K N  T RR+YSCKLCHEKGHN RTCP   MNNL
Subjt:  LRINYAHRRSSRNVLRLNLI------SENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNL

A0A6J1KMZ5 uncharacterized protein LOC111496065 isoform X19.5e-20172.55Show/hide
Query:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVAC-SNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKEL
        MSS LCSSIR L P  WRR N+RFF+LLQS R + FAPRFVAC S+NDDSVAIP+P PLAFDP E++YGLG+DLKPR   S APEPRSWFGPNGQYI+EL
Subjt:  MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVAC-SNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC+GKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAK
        SRSLKSLNAKTG+FSKRMKIIHRDP LHAQRVAAIKKAKGSAEARKRTSEALKAFF+DP+NR+KRSI+MK                              
Subjt:  SRSLKSLNAKTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAK

Query:  EGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELK-EAIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTN
                     G+KFYCKNCGREGHRRHYCPELK ++IDRRFRCRVCGEKGHNRRTC+KSR NS PM+A ++        KGH+RRNC KS++ NP N
Subjt:  EGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELK-EAIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTN

Query:  LRINYAHRRSSRNVLRLNLI------SENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ
           +Y   ++  NVL LNLI      +E+SRVRQY CRIC +NGH++QNCPNID K N  T R++YSCKLCHEKGHN RTCP  NMNNLQKNSP A+NQ
Subjt:  LRINYAHRRSSRNVLRLNLI------SENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ

SwissProt top hitse value%identityAlignment
Q04832 DNA-binding protein HEXBP6.3e-0829.12Show/hide
Query:  CKNCGREGHRRHYCPELKEAIDRR-FRCRVCGEKGHNRRTC-EKSRS----------------------NSIPMTATIQRHCGICGVKGHNRRNCQKSKL
        C+NCG+EGH    CPE     D R   C  CGE+GH  R C  ++RS                      NS    A     C  CG +GH  R+C  S+ 
Subjt:  CKNCGREGHRRHYCPELKEAIDRR-FRCRVCGEKGHNRRTC-EKSRS----------------------NSIPMTATIQRHCGICGVKGHNRRNCQKSKL

Query:  SNPTNLRINYAHRRSSRNVLRLNLISENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPN
         +    R  Y  +R           ++        C  C   GH  ++CPN     +G   R   +C  C + GH  R CPN
Subjt:  SNPTNLRINYAHRRSSRNVLRLNLISENSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPN

Q4P0H7 Branchpoint-bridging protein7.2e-0427.07Show/hide
Query:  QSPPPEVGLKISR--SLKSLNAKTGIFSKRMKIIHRDPA--LHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKT-SGFQ
        Q P  +V L I     +K      G     +K + R     +  +   ++K  KG  +A +   E       D E   K+ I + + ++E  A T  G  
Subjt:  QSPPPEVGLKISR--SLKSLNAKTGIFSKRMKIIHRDPA--LHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKT-SGFQ

Query:  IHAVMTTSNLQVLVAKEGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKEAIDRRFRCRVCGEKGHNRRTCEKSRSNS
         H     + L+ L A  G L     QL       CKNCG +GHR   CPE +        C  CG +GH  R C + R+ +
Subjt:  IHAVMTTSNLQVLVAKEGILIGSPTQLRAGVKFYCKNCGREGHRRHYCPELKEAIDRRFRCRVCGEKGHNRRTCEKSRSNS

Arabidopsis top hitse value%identityAlignment
AT5G20220.1 zinc knuckle (CCHC-type) family protein1.9e-12453.44Show/hide
Query:  RRKNVRFFLLLQSNRGICFAPRFVACS-------NNDDSVAIPRPT--PLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELPCPSCRGRG
        R+K  RF  LL  N    F PR ++ S       NND SV+  R     + +DP+E+L+  G+D KPR+++  + EPRSWFGPNGQYI+ELPCP+CRGRG
Subjt:  RRKNVRFFLLLQSNRGICFAPRFVACS-------NNDDSVAIPRPT--PLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELPCPSCRGRG

Query:  YAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNA
        Y  C+ CGIERSR DC  C GKGI+TC +CLGD VIWEESIDERPWEKARS+SP R+KEDDEVDNLEIK  +++KSKR+YQSP PEVG KISRSLKSLNA
Subjt:  YAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNA

Query:  KTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKEGILIGSPT
        KTG+FSKRMKIIHRDP LHAQRVAAIKKAKG+  ARK  SE++KAFF++P NR +RS++MK                                       
Subjt:  KTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKEGILIGSPT

Query:  QLRAGVKFYCKNCGREGHRRHYCPELKEAIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNLRINYAHRRS
            G KFYCKNCG+EGHRRHYCPEL    DR+FRCR CG KGHNRRTC KS+S      +T    CGICG +GHN R C+K     PT +  + +   S
Subjt:  QLRAGVKFYCKNCGREGHRRHYCPELKEAIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNLRINYAHRRS

Query:  SRNVLRLNLISENSRVRQYHCRICNKNGHSQQNCPN
                   +      Y C  C K GH+ + CP+
Subjt:  SRNVLRLNLISENSRVRQYHCRICNKNGHSQQNCPN

AT5G20220.2 zinc knuckle (CCHC-type) family protein1.9e-12453.44Show/hide
Query:  RRKNVRFFLLLQSNRGICFAPRFVACS-------NNDDSVAIPRPT--PLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELPCPSCRGRG
        R+K  RF  LL  N    F PR ++ S       NND SV+  R     + +DP+E+L+  G+D KPR+++  + EPRSWFGPNGQYI+ELPCP+CRGRG
Subjt:  RRKNVRFFLLLQSNRGICFAPRFVACS-------NNDDSVAIPRPT--PLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELPCPSCRGRG

Query:  YAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNA
        Y  C+ CGIERSR DC  C GKGI+TC +CLGD VIWEESIDERPWEKARS+SP R+KEDDEVDNLEIK  +++KSKR+YQSP PEVG KISRSLKSLNA
Subjt:  YAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNA

Query:  KTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKEGILIGSPT
        KTG+FSKRMKIIHRDP LHAQRVAAIKKAKG+  ARK  SE++KAFF++P NR +RS++MK                                       
Subjt:  KTGIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKEGILIGSPT

Query:  QLRAGVKFYCKNCGREGHRRHYCPELKEAIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNLRINYAHRRS
            G KFYCKNCG+EGHRRHYCPEL    DR+FRCR CG KGHNRRTC KS+S      +T    CGICG +GHN R C+K     PT +  + +   S
Subjt:  QLRAGVKFYCKNCGREGHRRHYCPELKEAIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNLRINYAHRRS

Query:  SRNVLRLNLISENSRVRQYHCRICNKNGHSQQNCPN
                   +      Y C  C K GH+ + CP+
Subjt:  SRNVLRLNLISENSRVRQYHCRICNKNGHSQQNCPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCATCGCTCTGTTCCTCAATTCGGCCATTGCCACCGCACTACTGGCGGCGGAAGAACGTCAGATTCTTCCTATTGCTTCAATCAAACAGAGGCATTTGC
TTCGCACCTCGCTTCGTTGCTTGCTCGAACAACGACGACTCTGTTGCAATACCCAGACCTACGCCGCTGGCTTTCGATCCTGCGGAGGATCTGTACGGACTTGGC
ATCGATTTAAAACCTAGGTATGTAGCTTCTAGGGCACCTGAACCCAGGTCCTGGTTTGGCCCAAATGGTCAGTATATTAAAGAGTTACCATGTCCAAGTTGCCGA
GGTAGGGGCTATGCGCCGTGTACGGAATGTGGAATTGAAAGATCCCGTGCTGATTGTTCCGTGTGTGATGGAAAGGGTATAGTGACCTGCCACCAATGTCTGGGG
GATCGTGTCATATGGGAAGAGTCTATTGATGAACGACCATGGGAGAAAGCACGCTCCACTTCTCCATTAAGAATGAAGGAAGATGATGAAGTCGATAACTTGGAA
ATAAAGCTGGAAGAAAAGAAGAAATCAAAGCGTGTTTACCAATCCCCTCCTCCTGAAGTGGGTTTAAAGATCAGTCGATCATTAAAAAGTCTCAATGCCAAAACA
GGTATATTTAGTAAGAGAATGAAGATTATCCATCGTGACCCTGCTCTTCATGCCCAGAGAGTGGCTGCAATTAAGAAAGCTAAAGGAAGTGCTGAAGCAAGAAAA
CGTACTTCTGAAGCTTTGAAAGCATTCTTTAACGATCCAGAAAATCGTCGAAAGCGGAGCATTGCAATGAAAGATCCAATGTTAGAAGCTTGGGCCAAAACCTCT
GGATTCCAGATTCATGCTGTCATGACAACTTCAAACTTACAAGTCTTAGTTGCTAAGGAGGGGATACTCATAGGTTCACCTACTCAACTGCGTGCAGGAGTAAAA
TTTTACTGCAAAAACTGCGGCCGTGAAGGACATCGAAGACACTATTGTCCAGAACTTAAGGAAGCAATTGATAGGCGATTCAGATGTCGAGTATGTGGAGAGAAG
GGCCATAACCGGAGGACTTGTGAGAAGTCTAGGTCGAATAGTATACCGATGACCGCAACAATTCAGCGCCATTGCGGGATATGCGGTGTGAAGGGCCACAACAGG
AGGAACTGTCAGAAGTCCAAGTTGAGCAATCCAACTAACCTACGAATAAATTATGCGCATCGTCGGAGCAGTCGCAATGTGTTGAGGCTGAATTTGATCAGTGAA
AATAGCAGAGTGAGGCAGTATCATTGTAGGATCTGCAATAAAAACGGTCACAGTCAGCAGAACTGTCCCAATATTGATAGGAAAGGTAACGGTCTGACTACTAGA
AGATCATACAGCTGCAAACTATGCCATGAAAAAGGGCATAATATTAGGACATGCCCAAATAGAAATATGAACAATCTACAGAAAAATTCTCCTGTTGCTTTAAAC
CAATAA
mRNA sequenceShow/hide mRNA sequence
TCCATTTTGGTTCTTCAGTTCGCGCTGTGGCCGTTGTTACTTGTTAGTCGCATATCATCTTCTTCTTATGTAAGCATTCTTCACAATAATCCAACAGATTTACCA
GATGTCTTCATCGCTCTGTTCCTCAATTCGGCCATTGCCACCGCACTACTGGCGGCGGAAGAACGTCAGATTCTTCCTATTGCTTCAATCAAACAGAGGCATTTG
CTTCGCACCTCGCTTCGTTGCTTGCTCGAACAACGACGACTCTGTTGCAATACCCAGACCTACGCCGCTGGCTTTCGATCCTGCGGAGGATCTGTACGGACTTGG
CATCGATTTAAAACCTAGGTATGTAGCTTCTAGGGCACCTGAACCCAGGTCCTGGTTTGGCCCAAATGGTCAGTATATTAAAGAGTTACCATGTCCAAGTTGCCG
AGGTAGGGGCTATGCGCCGTGTACGGAATGTGGAATTGAAAGATCCCGTGCTGATTGTTCCGTGTGTGATGGAAAGGGTATAGTGACCTGCCACCAATGTCTGGG
GGATCGTGTCATATGGGAAGAGTCTATTGATGAACGACCATGGGAGAAAGCACGCTCCACTTCTCCATTAAGAATGAAGGAAGATGATGAAGTCGATAACTTGGA
AATAAAGCTGGAAGAAAAGAAGAAATCAAAGCGTGTTTACCAATCCCCTCCTCCTGAAGTGGGTTTAAAGATCAGTCGATCATTAAAAAGTCTCAATGCCAAAAC
AGGTATATTTAGTAAGAGAATGAAGATTATCCATCGTGACCCTGCTCTTCATGCCCAGAGAGTGGCTGCAATTAAGAAAGCTAAAGGAAGTGCTGAAGCAAGAAA
ACGTACTTCTGAAGCTTTGAAAGCATTCTTTAACGATCCAGAAAATCGTCGAAAGCGGAGCATTGCAATGAAAGATCCAATGTTAGAAGCTTGGGCCAAAACCTC
TGGATTCCAGATTCATGCTGTCATGACAACTTCAAACTTACAAGTCTTAGTTGCTAAGGAGGGGATACTCATAGGTTCACCTACTCAACTGCGTGCAGGAGTAAA
ATTTTACTGCAAAAACTGCGGCCGTGAAGGACATCGAAGACACTATTGTCCAGAACTTAAGGAAGCAATTGATAGGCGATTCAGATGTCGAGTATGTGGAGAGAA
GGGCCATAACCGGAGGACTTGTGAGAAGTCTAGGTCGAATAGTATACCGATGACCGCAACAATTCAGCGCCATTGCGGGATATGCGGTGTGAAGGGCCACAACAG
GAGGAACTGTCAGAAGTCCAAGTTGAGCAATCCAACTAACCTACGAATAAATTATGCGCATCGTCGGAGCAGTCGCAATGTGTTGAGGCTGAATTTGATCAGTGA
AAATAGCAGAGTGAGGCAGTATCATTGTAGGATCTGCAATAAAAACGGTCACAGTCAGCAGAACTGTCCCAATATTGATAGGAAAGGTAACGGTCTGACTACTAG
AAGATCATACAGCTGCAAACTATGCCATGAAAAAGGGCATAATATTAGGACATGCCCAAATAGAAATATGAACAATCTACAGAAAAATTCTCCTGTTGCTTTAAA
CCAATAATTTTTCTTGTATGTATAATGTATGGCATCCAAATTCAACTAAATTTATCACATGAAGTGGGTAACAATTTTTTTTTT
Protein sequenceShow/hide protein sequence
MSSSLCSSIRPLPPHYWRRKNVRFFLLLQSNRGICFAPRFVACSNNDDSVAIPRPTPLAFDPAEDLYGLGIDLKPRYVASRAPEPRSWFGPNGQYIKELPCPSCR
GRGYAPCTECGIERSRADCSVCDGKGIVTCHQCLGDRVIWEESIDERPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNAKT
GIFSKRMKIIHRDPALHAQRVAAIKKAKGSAEARKRTSEALKAFFNDPENRRKRSIAMKDPMLEAWAKTSGFQIHAVMTTSNLQVLVAKEGILIGSPTQLRAGVK
FYCKNCGREGHRRHYCPELKEAIDRRFRCRVCGEKGHNRRTCEKSRSNSIPMTATIQRHCGICGVKGHNRRNCQKSKLSNPTNLRINYAHRRSSRNVLRLNLISE
NSRVRQYHCRICNKNGHSQQNCPNIDRKGNGLTTRRSYSCKLCHEKGHNIRTCPNRNMNNLQKNSPVALNQ