; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg007252 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg007252
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold9:46326624..46329108
RNA-Seq ExpressionSpg007252
SyntenySpg007252
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense]6.6e-3332.15Show/hide
Query:  DMSRLLEKLKLE-EDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDA
        ++++L E L LE ED  V E+ +D I++ + D +  +  K+LT   +N E F G++ +IW   G V VE  G N ++  F   + + ++   GPW++  +
Subjt:  DMSRLLEKLKLE-EDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDA

Query:  IIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYE
        +IV ++PK    +  L F    FWV  H +P +C  ++    L   IGE       + E   G+ +RVKV+++I + LKR   +K G   E   + + YE
Subjt:  IIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYE

Query:  KLPDFCYYCGRLGHVVHECHEEGVERS----KERNFGVDLRETNGSKGFYKSWRSET--KQFRGRGFRGRGRGLFGRGAREDFKETGKSESDDEPIDNQS
        +LPDFC+ CGR+GH V EC +E  +R+    ++  FG  +R T   K   KS    T     RGR   G  R L G G+      +G         D+ S
Subjt:  KLPDFCYYCGRLGHVVHECHEEGVERS----KERNFGVDLRETNGSKGFYKSWRSET--KQFRGRGFRGRGRGLFGRGAREDFKETGKSESDDEPIDNQS

Query:  QVSKGPDTSIE
        QV+    T+ E
Subjt:  QVSKGPDTSIE

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense]2.9e-3328.8Show/hide
Query:  ENEDMSRLLEKLKLEEDNRVVE-VEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY
        E++D+SR  EKL L++DN  +E ++    E   +     +  K +T+  IN E F   +  IW  +  V +E  G N++  +F+   D+ RI  GGPW++
Subjt:  ENEDMSRLLEKLKLEEDNRVVE-VEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY

Query:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI
        D  ++V  E   +  V  L F+YV FW+  H LP  C  R+  + LG  +G+ +  + G + +  G+ +R++V I++   LKRG  V  G   +   + I
Subjt:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI

Query:  TYEKLPDFCYYCGRLGHVVHECHEEGVERSKERNFGVDLRETNGSKGF-YKSWRSETKQFRGRGFRGRGRGLFGRGAREDFKETGKSES-DDEPIDNQSQ
         YE+LP+FCYYCG++GH+V +C              ++ +E   S  F +  W     + R +G   +      + + E  +E G S++ ++  +   ++
Subjt:  TYEKLPDFCYYCGRLGHVVHECHEEGVERSKERNFGVDLRETNGSKGF-YKSWRSETKQFRGRGFRGRGRGLFGRGAREDFKETGKSES-DDEPIDNQSQ

Query:  VSKGPDTSI
         + G D+S+
Subjt:  VSKGPDTSI

TXG64850.1 hypothetical protein EZV62_011844 [Acer yangbiense]2.8e-3131.34Show/hide
Query:  EDMSRLLEKLKLEEDNRVVEVEDDDI-EEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDD
        E++++L + L + E        +  + +   +     I  K+LTS  +  EVF  +M KIW + G V +E    N++   F   +D+ R+ RGGPW +D 
Subjt:  EDMSRLLEKLKLEEDNRVVEVEDDDI-EEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDD

Query:  AIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITY
        AII+FDEP     +  L F YV FWV  H LP +C   +  + LG+ IGE    + G      G  +RV+V I + + L+R   V      +   + + Y
Subjt:  AIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITY

Query:  EKLPDFCYYCGRLGHVVHECHEE----GVERSKERNFGVDLRETNGSKGFYKSWRSETKQFRGRGFRG
        E+L D+C+ CGRLGH++ EC E+     +     R  GV LR  +  K  +  +    K+   R   G
Subjt:  EKLPDFCYYCGRLGHVVHECHEE----GVERSKERNFGVDLRETNGSKGFYKSWRSETKQFRGRGFRG

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]5.9e-3437.34Show/hide
Query:  SRLLE-----KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLE-GAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY
        S LLE     KL  EED   V+++   +E   +  E  + CK+L+   I+  V    +   W L+  A  V+  G N++L  F R+ D+ RI R GPW +
Subjt:  SRLLE-----KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLE-GAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY

Query:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI
        D A+I+ D P +      +DF+ VS WVHF  L   C  +  A  LGN+IG FE  E  AN    G  LRV+V+ ++ + L RG  +         WIPI
Subjt:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI

Query:  TYEKLPDFCYYCGRLGHVVHECHEEGVER-SKERNFGVDLR
         YE+LPDF Y+CGRL H++ +C +  V+  SK   +G  LR
Subjt:  TYEKLPDFCYYCGRLGHVVHECHEEGVER-SKERNFGVDLR

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]6.6e-3336.62Show/hide
Query:  KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPK
        KL  EED   ++V+ D ++ A +     +  K+L    I+ +V S ++   W +E  + VE  G N++L  F R  D  R+ + GPW +D A+IV  +P 
Subjt:  KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPK

Query:  ANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEK--IEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFC
        ++  +  L+F  V+FW+H   LP     +  A+ LGN+IG F   +V  NEK    G +LR++V I+I + L+RG  +         WIPI YE+LPDFC
Subjt:  ANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEK--IEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFC

Query:  YYCGRLGHVVHEC
        Y+CG +GH  H+C
Subjt:  YYCGRLGHVVHEC

TrEMBL top hitse value%identityAlignment
A0A5C7GU64 CCHC-type domain-containing protein3.2e-3332.15Show/hide
Query:  DMSRLLEKLKLE-EDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDA
        ++++L E L LE ED  V E+ +D I++ + D +  +  K+LT   +N E F G++ +IW   G V VE  G N ++  F   + + ++   GPW++  +
Subjt:  DMSRLLEKLKLE-EDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDA

Query:  IIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYE
        +IV ++PK    +  L F    FWV  H +P +C  ++    L   IGE       + E   G+ +RVKV+++I + LKR   +K G   E   + + YE
Subjt:  IIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYE

Query:  KLPDFCYYCGRLGHVVHECHEEGVERS----KERNFGVDLRETNGSKGFYKSWRSET--KQFRGRGFRGRGRGLFGRGAREDFKETGKSESDDEPIDNQS
        +LPDFC+ CGR+GH V EC +E  +R+    ++  FG  +R T   K   KS    T     RGR   G  R L G G+      +G         D+ S
Subjt:  KLPDFCYYCGRLGHVVHECHEEGVERS----KERNFGVDLRETNGSKGFYKSWRSET--KQFRGRGFRGRGRGLFGRGAREDFKETGKSESDDEPIDNQS

Query:  QVSKGPDTSIE
        QV+    T+ E
Subjt:  QVSKGPDTSIE

A0A5C7H9Y2 CCHC-type domain-containing protein1.4e-3328.8Show/hide
Query:  ENEDMSRLLEKLKLEEDNRVVE-VEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY
        E++D+SR  EKL L++DN  +E ++    E   +     +  K +T+  IN E F   +  IW  +  V +E  G N++  +F+   D+ RI  GGPW++
Subjt:  ENEDMSRLLEKLKLEEDNRVVE-VEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY

Query:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI
        D  ++V  E   +  V  L F+YV FW+  H LP  C  R+  + LG  +G+ +  + G + +  G+ +R++V I++   LKRG  V  G   +   + I
Subjt:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI

Query:  TYEKLPDFCYYCGRLGHVVHECHEEGVERSKERNFGVDLRETNGSKGF-YKSWRSETKQFRGRGFRGRGRGLFGRGAREDFKETGKSES-DDEPIDNQSQ
         YE+LP+FCYYCG++GH+V +C              ++ +E   S  F +  W     + R +G   +      + + E  +E G S++ ++  +   ++
Subjt:  TYEKLPDFCYYCGRLGHVVHECHEEGVERSKERNFGVDLRETNGSKGF-YKSWRSETKQFRGRGFRGRGRGLFGRGAREDFKETGKSES-DDEPIDNQSQ

Query:  VSKGPDTSI
         + G D+S+
Subjt:  VSKGPDTSI

A0A5C7I6E5 CCHC-type domain-containing protein1.3e-3131.34Show/hide
Query:  EDMSRLLEKLKLEEDNRVVEVEDDDI-EEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDD
        E++++L + L + E        +  + +   +     I  K+LTS  +  EVF  +M KIW + G V +E    N++   F   +D+ R+ RGGPW +D 
Subjt:  EDMSRLLEKLKLEEDNRVVEVEDDDI-EEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDD

Query:  AIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITY
        AII+FDEP     +  L F YV FWV  H LP +C   +  + LG+ IGE    + G      G  +RV+V I + + L+R   V      +   + + Y
Subjt:  AIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITY

Query:  EKLPDFCYYCGRLGHVVHECHEE----GVERSKERNFGVDLRETNGSKGFYKSWRSETKQFRGRGFRG
        E+L D+C+ CGRLGH++ EC E+     +     R  GV LR  +  K  +  +    K+   R   G
Subjt:  EKLPDFCYYCGRLGHVVHECHEE----GVERSKERNFGVDLRETNGSKGFYKSWRSETKQFRGRGFRG

A0A6J1BSZ1 uncharacterized protein LOC1110054812.9e-3437.34Show/hide
Query:  SRLLE-----KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLE-GAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY
        S LLE     KL  EED   V+++   +E   +  E  + CK+L+   I+  V    +   W L+  A  V+  G N++L  F R+ D+ RI R GPW +
Subjt:  SRLLE-----KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLE-GAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY

Query:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI
        D A+I+ D P +      +DF+ VS WVHF  L   C  +  A  LGN+IG FE  E  AN    G  LRV+V+ ++ + L RG  +         WIPI
Subjt:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI

Query:  TYEKLPDFCYYCGRLGHVVHECHEEGVER-SKERNFGVDLR
         YE+LPDF Y+CGRL H++ +C +  V+  SK   +G  LR
Subjt:  TYEKLPDFCYYCGRLGHVVHECHEEGVER-SKERNFGVDLR

A0A6J1DU55 uncharacterized protein LOC1110231353.2e-3336.62Show/hide
Query:  KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPK
        KL  EED   ++V+ D ++ A +     +  K+L    I+ +V S ++   W +E  + VE  G N++L  F R  D  R+ + GPW +D A+IV  +P 
Subjt:  KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPK

Query:  ANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEK--IEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFC
        ++  +  L+F  V+FW+H   LP     +  A+ LGN+IG F   +V  NEK    G +LR++V I+I + L+RG  +         WIPI YE+LPDFC
Subjt:  ANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEK--IEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFC

Query:  YYCGRLGHVVHEC
        Y+CG +GH  H+C
Subjt:  YYCGRLGHVVHEC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding4.6e-0820.77Show/hide
Query:  EEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPKANCC
        E++  V+ + ++ +E  N  ++  +  K+L  S I   V +  + ++W   G + V       ++ +F+  ++      GGPW      ++  +  +   
Subjt:  EEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPKANCC

Query:  VEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFCYYCGRL
            D      WV    +P   + R   + +   +G     ++      +G   RV +++N+ + LK GT +  G         + YE L   C  CG  
Subjt:  VEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFCYYCGRL

Query:  GHVVHEC
        GH+VH C
Subjt:  GHVVHEC

AT3G42140.1 zinc ion binding;nucleic acid binding3.1e-0421.38Show/hide
Query:  FKRTKDKARICRGGPWIYDDAIIVFDE-PKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESL
        F+  +    I R GPW ++D + V     K +   E   F+ + FW+    +P      +   ++G  +G F    +G +  +                 
Subjt:  FKRTKDKARICRGGPWIYDDAIIVFDE-PKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESL

Query:  KRGTNVKTGSMAEKKWIPITYEKLPDFCYYCGRLGHVVHECHEEG
                        +   YEKL +FC  CG L H   EC   G
Subjt:  KRGTNVKTGSMAEKKWIPITYEKLPDFCYYCGRLGHVVHECHEEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGGGGGAAACCAGGAGGGGCAGCGAGCAAGCTTGCCGAACGGGGGAAGGATCCATGGAAGGAAAAGAACAATCCCAAACGATAAAGAAGGATTGTCTGGGAGG
ATGTCAGATTGGCTGGGAAGAAGGAAGAACCCAGATCACTCACAACAGTAACCAACCTCAGGCGGAAAAGGAAAACCAAACAAGCAAGAGGGAGAGTACAGAAGACACAG
AAGAAGAAGCAATGGAAGGAATAACAGAAAACGAAGATATGTCCAGGCTTCTGGAGAAGTTAAAGTTGGAAGAAGATAACAGAGTTGTGGAGGTTGAAGATGATGACATA
GAAGAAGCCAACAGAGATTTTGAGAATGTAATAGCGTGCAAGATACTTACATCCAGCTATATCAACCCTGAAGTCTTCTCTGGAATAATGCCAAAAATTTGGGGCCTGGA
AGGAGCCGTGAGAGTAGAAAAAGCGGGAACTAACGTGTATCTCTGCAAGTTCAAGAGGACTAAAGACAAAGCCAGAATCTGTCGAGGGGGGCCATGGATTTATGATGATG
CTATAATAGTGTTCGATGAGCCAAAAGCAAATTGTTGCGTGGAAGCTCTAGATTTCCAATATGTTTCTTTTTGGGTTCACTTTCATAAATTACCCCGTGTTTGTTTTTGC
AGGAAATATGCGGTGGCGTTGGGAAATTCGATAGGTGAATTCGAAGCAGCGGAAGTTGGCGCTAATGAAAAAATAGAGGGAGAAACTTTAAGGGTAAAGGTAAAGATCAA
TATTAAAGAGTCGCTGAAGAGAGGAACTAATGTGAAAACAGGGTCCATGGCTGAGAAAAAGTGGATTCCAATCACATATGAGAAACTCCCAGATTTTTGCTACTACTGTG
GGAGATTGGGTCATGTAGTTCATGAGTGTCATGAGGAAGGTGTGGAAAGAAGCAAAGAAAGAAACTTTGGAGTAGACCTGAGAGAGACCAATGGAAGCAAAGGATTCTAC
AAAAGCTGGAGGAGCGAAACCAAACAGTTCAGAGGAAGGGGTTTCAGAGGCAGAGGGAGAGGGCTCTTCGGCAGGGGTGCGAGAGAAGATTTCAAAGAAACTGGTAAATC
AGAATCTGACGACGAACCCATTGACAACCAAAGTCAGGTCTCCAAAGGACCAGATACATCGATAGAGGAAGAAGGAAACCGGCGAGAGGTGAACAGAAATGAAGACGCTG
GCCGGAAAAGGGATGAAGCCGGAGGAGAAGAAGGTTCAGCGGCGGAAGTGTCAGACCGGGTTGTCGGGTCAAATCAAACTGAAACAGACAGGAATGTGCTAGACCAGATG
TCAGAGTTACCAAGAGAATACGACAAAATAGGAAAAAAGAAAGAGCCGTTGGAGGAATCCTATGAAAAGTGTACAGATATTGACATGGGCCAAAGCGGTGATCCAAGTCT
CGAGGGGCCCACGGAAATAAAAATCTTGAGGAAAATAGAAAAAGTAAAAGAAAAGATTTTGGGAGGAAAATCGAAGGCAGTCACGGAAGGAGATGGCAAGAAAGAAAAAC
AAAAGGCAGAATGTAAAGTTGATCTCAGGAAAGGGAAAGACAGCCCAAAAAAGGCAAAAGGAGGAAATGTTAAAACTTGGAAAAGGATGGCTAGAAACGAGCAAGAAATC
AAATACTGGGACCAAAACGACATAGTGAATTACAGAGAAAAAAGGAAAGCTGAAGATGAAAAACTTGAACCAGGTTTTCAAGAAGATCTCGGTCAGACTGTTGCCCAAAC
ACAGATCAGACCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGACATGGGGGAAACCAGGAGGGGCAGCGAGCAAGCTTGCCGAACGGGGGAAGGATCCATGGAAGGAAAAGAACAATCCCAAACGATAAAGAAGGATTGTCTGGGAGG
ATGTCAGATTGGCTGGGAAGAAGGAAGAACCCAGATCACTCACAACAGTAACCAACCTCAGGCGGAAAAGGAAAACCAAACAAGCAAGAGGGAGAGTACAGAAGACACAG
AAGAAGAAGCAATGGAAGGAATAACAGAAAACGAAGATATGTCCAGGCTTCTGGAGAAGTTAAAGTTGGAAGAAGATAACAGAGTTGTGGAGGTTGAAGATGATGACATA
GAAGAAGCCAACAGAGATTTTGAGAATGTAATAGCGTGCAAGATACTTACATCCAGCTATATCAACCCTGAAGTCTTCTCTGGAATAATGCCAAAAATTTGGGGCCTGGA
AGGAGCCGTGAGAGTAGAAAAAGCGGGAACTAACGTGTATCTCTGCAAGTTCAAGAGGACTAAAGACAAAGCCAGAATCTGTCGAGGGGGGCCATGGATTTATGATGATG
CTATAATAGTGTTCGATGAGCCAAAAGCAAATTGTTGCGTGGAAGCTCTAGATTTCCAATATGTTTCTTTTTGGGTTCACTTTCATAAATTACCCCGTGTTTGTTTTTGC
AGGAAATATGCGGTGGCGTTGGGAAATTCGATAGGTGAATTCGAAGCAGCGGAAGTTGGCGCTAATGAAAAAATAGAGGGAGAAACTTTAAGGGTAAAGGTAAAGATCAA
TATTAAAGAGTCGCTGAAGAGAGGAACTAATGTGAAAACAGGGTCCATGGCTGAGAAAAAGTGGATTCCAATCACATATGAGAAACTCCCAGATTTTTGCTACTACTGTG
GGAGATTGGGTCATGTAGTTCATGAGTGTCATGAGGAAGGTGTGGAAAGAAGCAAAGAAAGAAACTTTGGAGTAGACCTGAGAGAGACCAATGGAAGCAAAGGATTCTAC
AAAAGCTGGAGGAGCGAAACCAAACAGTTCAGAGGAAGGGGTTTCAGAGGCAGAGGGAGAGGGCTCTTCGGCAGGGGTGCGAGAGAAGATTTCAAAGAAACTGGTAAATC
AGAATCTGACGACGAACCCATTGACAACCAAAGTCAGGTCTCCAAAGGACCAGATACATCGATAGAGGAAGAAGGAAACCGGCGAGAGGTGAACAGAAATGAAGACGCTG
GCCGGAAAAGGGATGAAGCCGGAGGAGAAGAAGGTTCAGCGGCGGAAGTGTCAGACCGGGTTGTCGGGTCAAATCAAACTGAAACAGACAGGAATGTGCTAGACCAGATG
TCAGAGTTACCAAGAGAATACGACAAAATAGGAAAAAAGAAAGAGCCGTTGGAGGAATCCTATGAAAAGTGTACAGATATTGACATGGGCCAAAGCGGTGATCCAAGTCT
CGAGGGGCCCACGGAAATAAAAATCTTGAGGAAAATAGAAAAAGTAAAAGAAAAGATTTTGGGAGGAAAATCGAAGGCAGTCACGGAAGGAGATGGCAAGAAAGAAAAAC
AAAAGGCAGAATGTAAAGTTGATCTCAGGAAAGGGAAAGACAGCCCAAAAAAGGCAAAAGGAGGAAATGTTAAAACTTGGAAAAGGATGGCTAGAAACGAGCAAGAAATC
AAATACTGGGACCAAAACGACATAGTGAATTACAGAGAAAAAAGGAAAGCTGAAGATGAAAAACTTGAACCAGGTTTTCAAGAAGATCTCGGTCAGACTGTTGCCCAAAC
ACAGATCAGACCATAA
Protein sequenceShow/hide protein sequence
MDMGETRRGSEQACRTGEGSMEGKEQSQTIKKDCLGGCQIGWEEGRTQITHNSNQPQAEKENQTSKRESTEDTEEEAMEGITENEDMSRLLEKLKLEEDNRVVEVEDDDI
EEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFC
RKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFCYYCGRLGHVVHECHEEGVERSKERNFGVDLRETNGSKGFY
KSWRSETKQFRGRGFRGRGRGLFGRGAREDFKETGKSESDDEPIDNQSQVSKGPDTSIEEEGNRREVNRNEDAGRKRDEAGGEEGSAAEVSDRVVGSNQTETDRNVLDQM
SELPREYDKIGKKKEPLEESYEKCTDIDMGQSGDPSLEGPTEIKILRKIEKVKEKILGGKSKAVTEGDGKKEKQKAECKVDLRKGKDSPKKAKGGNVKTWKRMARNEQEI
KYWDQNDIVNYREKRKAEDEKLEPGFQEDLGQTVAQTQIRP