; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004910 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004910
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold9:20503138..20511289
RNA-Seq ExpressionSpg004910
SyntenySpg004910
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus]4.5e-2632.3Show/hide
Query:  PKGKSPKAASSRSPFPEVFKDANFQERMEI-MKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMV
        PK K  +  SS S     F   + +ER    +  +  + E+GF  +  A  E++   + + KW+ F A P+  V+PLVREFYA   E      +V+G+ V
Subjt:  PKGKSPKAASSRSPFPEVFKDANFQERMEI-MKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMV

Query:  SFSSVDINRVYRIKAPLNPRGNDVIRN--PSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLM
         F SV IN +Y I     P   D   N   +    ++  + +   G QWK ++ +  S   + L   + +WL FI  R++PT H   ++ DR +LLYC+M
Subjt:  SFSSVDINRVYRIKAPLNPRGNDVIRN--PSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLM

Query:  KGLEINVGSIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEERHFFKPTID
         G   +VG II D I+         L+F SLIT++C R  +   + EE  F +  ID
Subjt:  KGLEINVGSIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEERHFFKPTID

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.0e-3433.45Show/hide
Query:  KSPKAASSRSPFPEVFKDANFQERMEIMKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMVSFSS
        K+ KA    +   E   + N Q R  +  ++GF+ +   S   G LP +++++I+Q+ W++FCAHP++ +VPLVREFYA L +   +   V+G  VS+S 
Subjt:  KSPKAASSRSPFPEVFKDANFQERMEIMKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMVSFSS

Query:  VDINRVYRIKAPLNPRGNDVIRNPSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEIN
          IN V+ +  P++   ++ I N +   +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  IN
Subjt:  VDINRVYRIKAPLNPRGNDVIRNPSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEIN

Query:  VGSIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEERHFFKPTIDLSLIGKL-QQNNIQRKDKASTSQVTPQSGS
        VG +I  EI AC  ++   LFF SLIT++C+  +     +EE+      ID   + ++ Q+   +   + S+S+    S S
Subjt:  VGSIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEERHFFKPTIDLSLIGKL-QQNNIQRKDKASTSQVTPQSGS

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.0e-3434.15Show/hide
Query:  EKGF----SNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKD
        EKGF    S   G LP +++++I+Q+ W++FCAHP++ +VPLVREFYA L +   +   V+G  VS+S   IN V+ +  P++   ++ I+N + + +  
Subjt:  EKGF----SNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKD

Query:  ALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRASKLFFGSLITQICQ
         L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++   LFF SLIT++C+
Subjt:  ALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRASKLFFGSLITQICQ

Query:  RVKIVPGKDEERHFFKPTIDLSLIGKLQQNNIQRKDKASTSQVTPQSGSNVASPSQHTPFTGPSPASEALEHLRERVKEEEEEE
          +     +EE+      ID   + ++ Q         ST Q  P S     + S  T         + L+ L +R+ ++E ++
Subjt:  RVKIVPGKDEERHFFKPTIDLSLIGKLQQNNIQRKDKASTSQVTPQSGSNVASPSQHTPFTGPSPASEALEHLRERVKEEEEEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]4.6e-3133.2Show/hide
Query:  KAASSRSPFPEVFKDANFQER-MEIMKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMVSFSSVD
        KA    S   E+  + N Q R + + K+  + N K         P +++ +I Q+ WQ FCAHP++ +VPLVREFY  +         ++G  V  S   
Subjt:  KAASSRSPFPEVFKDANFQER-MEIMKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMVSFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRIKAPLNPRGNDVIRNPSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVG

Query:  SIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEER
         +I  EI AC  +++  LFF SLIT +C+  +     +EE+
Subjt:  SIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEER

TYK11835.1 uncharacterized protein E5676_scaffold152G00520 [Cucumis melo var. makuwa]6.9e-2764.71Show/hide
Query:  LRPRGNRSSLQTGELDKTV---------------EIELPVPDTLPTSAESSRSSSNTWLELYIESV------TFQTR--------FNVSVGSTGIVRGDD
        L PRGNR SLQTGELDKTV                IELPVPDTLPTSAESS S+S+TWLELY ESV      +F  +        FNVSVGSTGIVRGDD
Subjt:  LRPRGNRSSLQTGELDKTV---------------EIELPVPDTLPTSAESSRSSSNTWLELYIESV------TFQTR--------FNVSVGSTGIVRGDD

Query:  VCWLHAVFRAKLAGGPGGG
        VCWLHAVFRAK  GGPGGG
Subjt:  VCWLHAVFRAKLAGGPGGG

TrEMBL top hitse value%identityAlignment
A0A2G9G807 Uncharacterized protein2.2e-2632.3Show/hide
Query:  PKGKSPKAASSRSPFPEVFKDANFQERMEI-MKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMV
        PK K  +  SS S     F   + +ER    +  +  + E+GF  +  A  E++   + + KW+ F A P+  V+PLVREFYA   E      +V+G+ V
Subjt:  PKGKSPKAASSRSPFPEVFKDANFQERMEI-MKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMV

Query:  SFSSVDINRVYRIKAPLNPRGNDVIRN--PSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLM
         F SV IN +Y I     P   D   N   +    ++  + +   G QWK ++ +  S   + L   + +WL FI  R++PT H   ++ DR +LLYC+M
Subjt:  SFSSVDINRVYRIKAPLNPRGNDVIRN--PSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLM

Query:  KGLEINVGSIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEERHFFKPTID
         G   +VG II D I+         L+F SLIT++C R  +   + EE  F +  ID
Subjt:  KGLEINVGSIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEERHFFKPTID

A0A2P5AGA5 Uncharacterized protein (Fragment)9.8e-3533.45Show/hide
Query:  KSPKAASSRSPFPEVFKDANFQERMEIMKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMVSFSS
        K+ KA    +   E   + N Q R  +  ++GF+ +   S   G LP +++++I+Q+ W++FCAHP++ +VPLVREFYA L +   +   V+G  VS+S 
Subjt:  KSPKAASSRSPFPEVFKDANFQERMEIMKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMVSFSS

Query:  VDINRVYRIKAPLNPRGNDVIRNPSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEIN
          IN V+ +  P++   ++ I N +   +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  IN
Subjt:  VDINRVYRIKAPLNPRGNDVIRNPSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEIN

Query:  VGSIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEERHFFKPTIDLSLIGKL-QQNNIQRKDKASTSQVTPQSGS
        VG +I  EI AC  ++   LFF SLIT++C+  +     +EE+      ID   + ++ Q+   +   + S+S+    S S
Subjt:  VGSIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEERHFFKPTIDLSLIGKL-QQNNIQRKDKASTSQVTPQSGS

A0A2P5BCG4 Uncharacterized protein (Fragment)9.8e-3534.15Show/hide
Query:  EKGF----SNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKD
        EKGF    S   G LP +++++I+Q+ W++FCAHP++ +VPLVREFYA L +   +   V+G  VS+S   IN V+ +  P++   ++ I+N + + +  
Subjt:  EKGF----SNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKD

Query:  ALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRASKLFFGSLITQICQ
         L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++   LFF SLIT++C+
Subjt:  ALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRASKLFFGSLITQICQ

Query:  RVKIVPGKDEERHFFKPTIDLSLIGKLQQNNIQRKDKASTSQVTPQSGSNVASPSQHTPFTGPSPASEALEHLRERVKEEEEEE
          +     +EE+      ID   + ++ Q         ST Q  P S     + S  T         + L+ L +R+ ++E ++
Subjt:  RVKIVPGKDEERHFFKPTIDLSLIGKLQQNNIQRKDKASTSQVTPQSGSNVASPSQHTPFTGPSPASEALEHLRERVKEEEEEE

A0A2P5DAQ2 Uncharacterized protein2.2e-3133.2Show/hide
Query:  KAASSRSPFPEVFKDANFQER-MEIMKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMVSFSSVD
        KA    S   E+  + N Q R + + K+  + N K         P +++ +I Q+ WQ FCAHP++ +VPLVREFY  +         ++G  V  S   
Subjt:  KAASSRSPFPEVFKDANFQER-MEIMKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVKGKMVSFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRIKAPLNPRGNDVIRNPSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVG

Query:  SIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEER
         +I  EI AC  +++  LFF SLIT +C+  +     +EE+
Subjt:  SIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEER

A0A5D3CJX3 CCHC-type domain-containing protein3.4e-2764.71Show/hide
Query:  LRPRGNRSSLQTGELDKTV---------------EIELPVPDTLPTSAESSRSSSNTWLELYIESV------TFQTR--------FNVSVGSTGIVRGDD
        L PRGNR SLQTGELDKTV                IELPVPDTLPTSAESS S+S+TWLELY ESV      +F  +        FNVSVGSTGIVRGDD
Subjt:  LRPRGNRSSLQTGELDKTV---------------EIELPVPDTLPTSAESSRSSSNTWLELYIESV------TFQTR--------FNVSVGSTGIVRGDD

Query:  VCWLHAVFRAKLAGGPGGG
        VCWLHAVFRAK  GGPGGG
Subjt:  VCWLHAVFRAKLAGGPGGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAACACTCAAGGATCATCATCCTCACCCAAGAACACTCGATCTCAGAGTGCTCAAACAACCCACAAAGCTGAAGCAAGCAATCGACAGCAAGAAGAGAGCCCCGT
TACGCCCATGCAAGGCACACGAAGGACGAGACCCACGGGATTCTCGCCGGCGGTCGTGAACCAAGCGCCCAACGCTCAAGCTCCATCTTCTTCGGCAATGCCAGCCACGT
CGAGGGAGATGCTGAGTTCGTCTACACCAAAACGGTTCACGCGCTCCACTACTGTCCGTCAAATGCAAAAACCCGCCACTCAACAATACAGAAAACGTTCGCGGGAGTGG
TTTGAGATGATCCGTGAGATGGGTACCAAGAGACGAGCTGCCCTTGAAGAAGAAGGGAATCGGCAAGACAAAGAAAAAGTCGTCAAGGCAGCTGAAAGCTCTCGGCAAGG
TGAAGCTTCACTGGGTAAGGTTTCCGAACCTTCAACTAACCCTTCTCTCTCTTGCAGGACCAAGCCCGTTGTTACTTACAACGCAAGAAAGAGGAGCCCAAAGAAAAATG
TGTCTGAAAAGCCGCTTGAAATTCAGCCCCTAGAAACCGCAAGGATGCCTCCTGATGTATTCGAAGGAATAATTTGCCAAGCAGTGGCAAAGACCCTTGAGATTGCAGAG
GGGTATAAGGCTGAACAAGATGCTTTGAAAGAGGTTGAAGCGGAGAGAGAGATGGAAAATCAGAAAATGACTGAGGAAGACAATTTTGCAAAGGAAAGAGATGAGAGGGA
TGAGAAAAGAAAAAGAGAAGAAGAGCAAGAGGCCGAGAGGGCCTTAGAAGATGAGGAAAAGAGAAAATATGAGGAAAACCTCAGGAGGGCAGCTATGGAGTTGCAACTCC
TTGAGGAAGAGAAAAAGAGAAGAGTAGAAATAAGAGAAGATGAAAGAAGAAGGAAGGAAGCTGAAGACTTCCTTGCAGCCTTTGAACCACTTCACAAGGCTCAAAAAAAA
GAAATAGAAGTGGAGAATGAAGGCCAGAATGCGACCGCATTTGGGTCGCATTCTGAGGAAGGCCTAGCTGAGGCCACCGTTGATCAGCCAGCTGAAGAGGTTCTTGAACC
TCTATTCACACATGACCCACCAGCTGCTGATAGCACCTCTTCGGGAGAGAAGAGGGTTGAAGAGGAAAAAGAAGACGAGGAGGCTGAGACCTCCAGTGATTCTGACTCTG
ACACAGAATCTGATTCAGAGATTAGGGAGCTAGATGGCGACCAAGTCCCTATCTCTGCAGCGTTGAGAAGAAAGAGGAAGAGAGAGATAAAGGTTGAGATGAGGACAAAG
AACAAGAATGACCCGATATTTGCCAAGAGGCCGAGGACGAGGTCCATGGACGCCTCTCCTGTAGTTCCTCCTACCATTTCACCCGCCAAGCCAAAGGGCAAATCACCCAA
GGCCGCATCTTCTAGAAGTCCATTCCCTGAGGTATTTAAAGATGCTAATTTTCAGGAACGGATGGAGATCATGAAGAAAAGAGGTTTCCTCAATGAGAAAGGATTCTCTA
ATAGAGCAGGAGCACTGCCAGAGTACGTGAGCAAGATCATATCTCAATACAAGTGGCAGGAGTTCTGTGCTCACCCTCAAGAGGTTGTTGTGCCTCTAGTTCGTGAATTT
TACGCCGGTCTGAGGGAGGAAAGCATTAGCATGGCGGTAGTGAAGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATTAATAGGGTGTACAGGATCAAGGCACCCTTGAA
CCCAAGAGGGAATGATGTGATCAGGAACCCTTCGGCCAAACAGATGAAAGACGCATTGAAGCTTGTGGCCAACAAGGGGGTCCAGTGGAAAGAATCACAGACCAAAGTGA
AGTCTTTGGTGCCAAGCGACCTAAAGCCAGAATCGACAGTTTGGCTTCACTTCATCAAGAACCGTTTGATGCCAACCACCCACGACAGCACAATTTCAGTGGATAGAGTG
ATGCTACTCTATTGTCTTATGAAGGGGTTGGAAATCAATGTAGGGAGCATCATCAGGGATGAGATCTTAGCCTGTGGACGGAAAAGGGCAAGCAAGCTTTTCTTTGGCTC
ACTCATCACCCAGATCTGTCAGAGGGTGAAGATTGTGCCAGGCAAGGACGAGGAACGCCATTTCTTTAAACCAACCATTGACTTGTCCTTGATAGGGAAGCTTCAGCAGA
ACAACATCCAGAGGAAGGATAAAGCCTCTACATCACAGGTCACTCCTCAATCAGGGTCGAATGTAGCCTCTCCATCCCAGCACACTCCTTTTACAGGGCCTTCACCAGCA
TCGGAAGCCCTAGAGCACTTGAGGGAGAGAGTGAAGGAAGAAGAAGAAGAAGAAGAATTTCGTGGTTTGAGGCCAAGAGGAAATAGGTCGAGTCTACAAACCGGGGAACT
AGATAAGACTGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCTAGATCAAGCTCCAATACGTGGTTGGAGTTGTATATTGAGTCCGTTA
CTTTCCAAACCAGGTTCAACGTTTCAGTAGGGTCAACAGGGATCGTTAGAGGTGACGATGTCTGTTGGCTTCACGCCGTCTTTCGGGCTAAGCTAGCAGGTGGTCCGGGA
GGGGGGGCATTCATGACACCCATGGTTTGCCACTGCATGAAAGGTTTACCTAGGGAAAAAAACAGAGGAAAAGCTGGAATTCCCCAGAAATGCGACCGCATTTCTGGGAA
GGCAAAAATGGAATGCGACCGCATTTCTGGAAAAACAGAGGCCGTTCCGAGTCGCACGTGGGGCAAGCTGCCTAAGGCACCCTATATCGACTATAGGGTTATAATAGGTT
GCATTGGTGAATTATCTATGTCGGAGTCGAGTCACAGAAATGTGCATAATGGAGTTCACGGTGTTACGAATCCTATGATCGTGCCCTCCGAAGGGATGGTTGCTCCAGGA
CAACACGCAGGCGGATCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAACACTCAAGGATCATCATCCTCACCCAAGAACACTCGATCTCAGAGTGCTCAAACAACCCACAAAGCTGAAGCAAGCAATCGACAGCAAGAAGAGAGCCCCGT
TACGCCCATGCAAGGCACACGAAGGACGAGACCCACGGGATTCTCGCCGGCGGTCGTGAACCAAGCGCCCAACGCTCAAGCTCCATCTTCTTCGGCAATGCCAGCCACGT
CGAGGGAGATGCTGAGTTCGTCTACACCAAAACGGTTCACGCGCTCCACTACTGTCCGTCAAATGCAAAAACCCGCCACTCAACAATACAGAAAACGTTCGCGGGAGTGG
TTTGAGATGATCCGTGAGATGGGTACCAAGAGACGAGCTGCCCTTGAAGAAGAAGGGAATCGGCAAGACAAAGAAAAAGTCGTCAAGGCAGCTGAAAGCTCTCGGCAAGG
TGAAGCTTCACTGGGTAAGGTTTCCGAACCTTCAACTAACCCTTCTCTCTCTTGCAGGACCAAGCCCGTTGTTACTTACAACGCAAGAAAGAGGAGCCCAAAGAAAAATG
TGTCTGAAAAGCCGCTTGAAATTCAGCCCCTAGAAACCGCAAGGATGCCTCCTGATGTATTCGAAGGAATAATTTGCCAAGCAGTGGCAAAGACCCTTGAGATTGCAGAG
GGGTATAAGGCTGAACAAGATGCTTTGAAAGAGGTTGAAGCGGAGAGAGAGATGGAAAATCAGAAAATGACTGAGGAAGACAATTTTGCAAAGGAAAGAGATGAGAGGGA
TGAGAAAAGAAAAAGAGAAGAAGAGCAAGAGGCCGAGAGGGCCTTAGAAGATGAGGAAAAGAGAAAATATGAGGAAAACCTCAGGAGGGCAGCTATGGAGTTGCAACTCC
TTGAGGAAGAGAAAAAGAGAAGAGTAGAAATAAGAGAAGATGAAAGAAGAAGGAAGGAAGCTGAAGACTTCCTTGCAGCCTTTGAACCACTTCACAAGGCTCAAAAAAAA
GAAATAGAAGTGGAGAATGAAGGCCAGAATGCGACCGCATTTGGGTCGCATTCTGAGGAAGGCCTAGCTGAGGCCACCGTTGATCAGCCAGCTGAAGAGGTTCTTGAACC
TCTATTCACACATGACCCACCAGCTGCTGATAGCACCTCTTCGGGAGAGAAGAGGGTTGAAGAGGAAAAAGAAGACGAGGAGGCTGAGACCTCCAGTGATTCTGACTCTG
ACACAGAATCTGATTCAGAGATTAGGGAGCTAGATGGCGACCAAGTCCCTATCTCTGCAGCGTTGAGAAGAAAGAGGAAGAGAGAGATAAAGGTTGAGATGAGGACAAAG
AACAAGAATGACCCGATATTTGCCAAGAGGCCGAGGACGAGGTCCATGGACGCCTCTCCTGTAGTTCCTCCTACCATTTCACCCGCCAAGCCAAAGGGCAAATCACCCAA
GGCCGCATCTTCTAGAAGTCCATTCCCTGAGGTATTTAAAGATGCTAATTTTCAGGAACGGATGGAGATCATGAAGAAAAGAGGTTTCCTCAATGAGAAAGGATTCTCTA
ATAGAGCAGGAGCACTGCCAGAGTACGTGAGCAAGATCATATCTCAATACAAGTGGCAGGAGTTCTGTGCTCACCCTCAAGAGGTTGTTGTGCCTCTAGTTCGTGAATTT
TACGCCGGTCTGAGGGAGGAAAGCATTAGCATGGCGGTAGTGAAGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATTAATAGGGTGTACAGGATCAAGGCACCCTTGAA
CCCAAGAGGGAATGATGTGATCAGGAACCCTTCGGCCAAACAGATGAAAGACGCATTGAAGCTTGTGGCCAACAAGGGGGTCCAGTGGAAAGAATCACAGACCAAAGTGA
AGTCTTTGGTGCCAAGCGACCTAAAGCCAGAATCGACAGTTTGGCTTCACTTCATCAAGAACCGTTTGATGCCAACCACCCACGACAGCACAATTTCAGTGGATAGAGTG
ATGCTACTCTATTGTCTTATGAAGGGGTTGGAAATCAATGTAGGGAGCATCATCAGGGATGAGATCTTAGCCTGTGGACGGAAAAGGGCAAGCAAGCTTTTCTTTGGCTC
ACTCATCACCCAGATCTGTCAGAGGGTGAAGATTGTGCCAGGCAAGGACGAGGAACGCCATTTCTTTAAACCAACCATTGACTTGTCCTTGATAGGGAAGCTTCAGCAGA
ACAACATCCAGAGGAAGGATAAAGCCTCTACATCACAGGTCACTCCTCAATCAGGGTCGAATGTAGCCTCTCCATCCCAGCACACTCCTTTTACAGGGCCTTCACCAGCA
TCGGAAGCCCTAGAGCACTTGAGGGAGAGAGTGAAGGAAGAAGAAGAAGAAGAAGAATTTCGTGGTTTGAGGCCAAGAGGAAATAGGTCGAGTCTACAAACCGGGGAACT
AGATAAGACTGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCTAGATCAAGCTCCAATACGTGGTTGGAGTTGTATATTGAGTCCGTTA
CTTTCCAAACCAGGTTCAACGTTTCAGTAGGGTCAACAGGGATCGTTAGAGGTGACGATGTCTGTTGGCTTCACGCCGTCTTTCGGGCTAAGCTAGCAGGTGGTCCGGGA
GGGGGGGCATTCATGACACCCATGGTTTGCCACTGCATGAAAGGTTTACCTAGGGAAAAAAACAGAGGAAAAGCTGGAATTCCCCAGAAATGCGACCGCATTTCTGGGAA
GGCAAAAATGGAATGCGACCGCATTTCTGGAAAAACAGAGGCCGTTCCGAGTCGCACGTGGGGCAAGCTGCCTAAGGCACCCTATATCGACTATAGGGTTATAATAGGTT
GCATTGGTGAATTATCTATGTCGGAGTCGAGTCACAGAAATGTGCATAATGGAGTTCACGGTGTTACGAATCCTATGATCGTGCCCTCCGAAGGGATGGTTGCTCCAGGA
CAACACGCAGGCGGATCATAA
Protein sequenceShow/hide protein sequence
MKNTQGSSSSPKNTRSQSAQTTHKAEASNRQQEESPVTPMQGTRRTRPTGFSPAVVNQAPNAQAPSSSAMPATSREMLSSSTPKRFTRSTTVRQMQKPATQQYRKRSREW
FEMIREMGTKRRAALEEEGNRQDKEKVVKAAESSRQGEASLGKVSEPSTNPSLSCRTKPVVTYNARKRSPKKNVSEKPLEIQPLETARMPPDVFEGIICQAVAKTLEIAE
GYKAEQDALKEVEAEREMENQKMTEEDNFAKERDERDEKRKREEEQEAERALEDEEKRKYEENLRRAAMELQLLEEEKKRRVEIREDERRRKEAEDFLAAFEPLHKAQKK
EIEVENEGQNATAFGSHSEEGLAEATVDQPAEEVLEPLFTHDPPAADSTSSGEKRVEEEKEDEEAETSSDSDSDTESDSEIRELDGDQVPISAALRRKRKREIKVEMRTK
NKNDPIFAKRPRTRSMDASPVVPPTISPAKPKGKSPKAASSRSPFPEVFKDANFQERMEIMKKRGFLNEKGFSNRAGALPEYVSKIISQYKWQEFCAHPQEVVVPLVREF
YAGLREESISMAVVKGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKDALKLVANKGVQWKESQTKVKSLVPSDLKPESTVWLHFIKNRLMPTTHDSTISVDRV
MLLYCLMKGLEINVGSIIRDEILACGRKRASKLFFGSLITQICQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNNIQRKDKASTSQVTPQSGSNVASPSQHTPFTGPSPA
SEALEHLRERVKEEEEEEEFRGLRPRGNRSSLQTGELDKTVEIELPVPDTLPTSAESSRSSSNTWLELYIESVTFQTRFNVSVGSTGIVRGDDVCWLHAVFRAKLAGGPG
GGAFMTPMVCHCMKGLPREKNRGKAGIPQKCDRISGKAKMECDRISGKTEAVPSRTWGKLPKAPYIDYRVIIGCIGELSMSESSHRNVHNGVHGVTNPMIVPSEGMVAPG
QHAGGS