; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031897 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031897
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold11:40225254..40227795
RNA-Seq ExpressionSpg031897
SyntenySpg031897
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense]6.8e-3332.15Show/hide
Query:  DMSRLLEKLKLE-EDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDA
        ++++L E L LE ED  V E+ +D I++ + D +  +  K+LT   +N E F G++ +IW   G V VE  G N ++  F   + + ++   GPW++  +
Subjt:  DMSRLLEKLKLE-EDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDA

Query:  IIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYE
        +IV ++PK    +  L F    FWV  H +P +C  ++    L   IGE       + E   G+ +RVKV+++I + LKR   +K G   E   + + YE
Subjt:  IIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYE

Query:  KLPDFCYYCGRLGHVVHECHEEGVERS----KERNFGVDLRETNGSKGFYKSWRSET--KQFRGRGFRGRGRGLFGRGAREDFKETGKSESDDEPIDNQS
        +LPDFC+ CGR+GH V EC +E  +R+    ++  FG  +R T   K   KS    T     RGR   G  R L G G+      +G         D+ S
Subjt:  KLPDFCYYCGRLGHVVHECHEEGVERS----KERNFGVDLRETNGSKGFYKSWRSET--KQFRGRGFRGRGRGLFGRGAREDFKETGKSESDDEPIDNQS

Query:  QVSKGPDTSIE
        QV+    T+ E
Subjt:  QVSKGPDTSIE

TXG57565.1 hypothetical protein EZV62_015394 [Acer yangbiense]2.8e-3138.38Show/hide
Query:  IACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFC
        +A K L+   +N E F  ++ +IW L   V VE   + V+   F+  +D+ ++  GGPW +DDA+IV +EP     +  L F  V FWVH   LP +C  
Subjt:  IACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFC

Query:  RKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFCYYCGRLGHVVHECHE
        ++ A  LG  IGE    + G      G+ LRV+V I+I + L+R   V      E+  +PI +E+LP+FC+YCG LGHVV  C E
Subjt:  RKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFCYYCGRLGHVVHECHE

TXG64850.1 hypothetical protein EZV62_011844 [Acer yangbiense]2.8e-3131.34Show/hide
Query:  EDMSRLLEKLKLEEDNRVVEVEDDDI-EEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDD
        E++++L + L + E        +  + +   +     I  K+LTS  +  EVF  +M KIW + G V +E    N++   F   +D+ R+ RGGPW +D 
Subjt:  EDMSRLLEKLKLEEDNRVVEVEDDDI-EEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDD

Query:  AIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITY
        AII+FDEP     +  L F YV FWV  H LP +C   +  + LG+ IGE    + G      G  +RV+V I + + L+R   V      +   + + Y
Subjt:  AIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITY

Query:  EKLPDFCYYCGRLGHVVHECHEE----GVERSKERNFGVDLRETNGSKGFYKSWRSETKQFRGRGFRG
        E+L D+C+ CGRLGH++ EC E+     +     R  GV LR  +  K  +  +    K+   R   G
Subjt:  EKLPDFCYYCGRLGHVVHECHEE----GVERSKERNFGVDLRETNGSKGFYKSWRSETKQFRGRGFRG

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]6.1e-3437.34Show/hide
Query:  SRLLE-----KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLE-GAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY
        S LLE     KL  EED   V+++   +E   +  E  + CK+L+   I+  V    +   W L+  A  V+  G N++L  F R+ D+ RI R GPW +
Subjt:  SRLLE-----KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLE-GAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY

Query:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI
        D A+I+ D P +      +DF+ VS WVHF  L   C  +  A  LGN+IG FE  E  AN    G  LRV+V+ ++ + L RG  +         WIPI
Subjt:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI

Query:  TYEKLPDFCYYCGRLGHVVHECHEEGVER-SKERNFGVDLR
         YE+LPDF Y+CGRL H++ +C +  V+  SK   +G  LR
Subjt:  TYEKLPDFCYYCGRLGHVVHECHEEGVER-SKERNFGVDLR

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]6.8e-3336.62Show/hide
Query:  KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPK
        KL  EED   ++V+ D ++ A +     +  K+L    I+ +V S ++   W +E  + VE  G N++L  F R  D  R+ + GPW +D A+IV  +P 
Subjt:  KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPK

Query:  ANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEK--IEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFC
        ++  +  L+F  V+FW+H   LP     +  A+ LGN+IG F   +V  NEK    G +LR++V I+I + L+RG  +         WIPI YE+LPDFC
Subjt:  ANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEK--IEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFC

Query:  YYCGRLGHVVHEC
        Y+CG +GH  H+C
Subjt:  YYCGRLGHVVHEC

TrEMBL top hitse value%identityAlignment
A0A5C7GU64 CCHC-type domain-containing protein3.3e-3332.15Show/hide
Query:  DMSRLLEKLKLE-EDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDA
        ++++L E L LE ED  V E+ +D I++ + D +  +  K+LT   +N E F G++ +IW   G V VE  G N ++  F   + + ++   GPW++  +
Subjt:  DMSRLLEKLKLE-EDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDA

Query:  IIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYE
        +IV ++PK    +  L F    FWV  H +P +C  ++    L   IGE       + E   G+ +RVKV+++I + LKR   +K G   E   + + YE
Subjt:  IIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYE

Query:  KLPDFCYYCGRLGHVVHECHEEGVERS----KERNFGVDLRETNGSKGFYKSWRSET--KQFRGRGFRGRGRGLFGRGAREDFKETGKSESDDEPIDNQS
        +LPDFC+ CGR+GH V EC +E  +R+    ++  FG  +R T   K   KS    T     RGR   G  R L G G+      +G         D+ S
Subjt:  KLPDFCYYCGRLGHVVHECHEEGVERS----KERNFGVDLRETNGSKGFYKSWRSET--KQFRGRGFRGRGRGLFGRGAREDFKETGKSESDDEPIDNQS

Query:  QVSKGPDTSIE
        QV+    T+ E
Subjt:  QVSKGPDTSIE

A0A5C7H9Y2 CCHC-type domain-containing protein3.4e-3025.95Show/hide
Query:  ENEDMSRLLEKLKLEEDNRVVE-VEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY
        E++D+SR  EKL L++DN  +E ++    E   +     +  K +T+  IN E F   +  IW  +  V +E  G N++  +F+   D+ RI  GGPW++
Subjt:  ENEDMSRLLEKLKLEEDNRVVE-VEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY

Query:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI
        D  ++V  E   +  V  L F+YV FW+  H LP  C  R+  + LG  +G+ +  + G + +  G+ +R++V I++   LKRG  V  G   +   + I
Subjt:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI

Query:  TYEKLPDFCYYCGRLGHVVHEC--HEEGVERSKERNFGVDLRETN--GSKGFYKSWRSETKQFRG------RGFRGRGRGLFGRGAREDFKETGKSESDD
         YE+LP+FCYYCG++GH+V +C  + + +  S    FG  +R  +   SKG  +   S      G         R +G   +  G     K++     D 
Subjt:  TYEKLPDFCYYCGRLGHVVHEC--HEEGVERSKERNFGVDLRETN--GSKGFYKSWRSETKQFRG------RGFRGRGRGLFGRGAREDFKETGKSESDD

Query:  EPIDNQSQVSKGPDTSIEEEGNRREVNRNED-----AGRKRDEVGGEEGSAAEVSDRVVGSNQTETDRNVLDQMSELPREYD--KIGKKKEPLEESYEKC
        E +D  +++  G     +   +R  V   ++         ++++  +    ++  +  V         NV +Q   +  E +  KI  +K     + EKC
Subjt:  EPIDNQSQVSKGPDTSIEEEGNRREVNRNED-----AGRKRDEVGGEEGSAAEVSDRVVGSNQTETDRNVLDQMSELPREYD--KIGKKKEPLEESYEKC

Query:  TDIDMGQSGDPSLEGPTEIK
          ++ G       +G  +I+
Subjt:  TDIDMGQSGDPSLEGPTEIK

A0A5C7I6E5 CCHC-type domain-containing protein1.4e-3131.34Show/hide
Query:  EDMSRLLEKLKLEEDNRVVEVEDDDI-EEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDD
        E++++L + L + E        +  + +   +     I  K+LTS  +  EVF  +M KIW + G V +E    N++   F   +D+ R+ RGGPW +D 
Subjt:  EDMSRLLEKLKLEEDNRVVEVEDDDI-EEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDD

Query:  AIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITY
        AII+FDEP     +  L F YV FWV  H LP +C   +  + LG+ IGE    + G      G  +RV+V I + + L+R   V      +   + + Y
Subjt:  AIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITY

Query:  EKLPDFCYYCGRLGHVVHECHEE----GVERSKERNFGVDLRETNGSKGFYKSWRSETKQFRGRGFRG
        E+L D+C+ CGRLGH++ EC E+     +     R  GV LR  +  K  +  +    K+   R   G
Subjt:  EKLPDFCYYCGRLGHVVHECHEE----GVERSKERNFGVDLRETNGSKGFYKSWRSETKQFRGRGFRG

A0A6J1BSZ1 uncharacterized protein LOC1110054813.0e-3437.34Show/hide
Query:  SRLLE-----KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLE-GAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY
        S LLE     KL  EED   V+++   +E   +  E  + CK+L+   I+  V    +   W L+  A  V+  G N++L  F R+ D+ RI R GPW +
Subjt:  SRLLE-----KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLE-GAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIY

Query:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI
        D A+I+ D P +      +DF+ VS WVHF  L   C  +  A  LGN+IG FE  E  AN    G  LRV+V+ ++ + L RG  +         WIPI
Subjt:  DDAIIVFDEPKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPI

Query:  TYEKLPDFCYYCGRLGHVVHECHEEGVER-SKERNFGVDLR
         YE+LPDF Y+CGRL H++ +C +  V+  SK   +G  LR
Subjt:  TYEKLPDFCYYCGRLGHVVHECHEEGVER-SKERNFGVDLR

A0A6J1DU55 uncharacterized protein LOC1110231353.3e-3336.62Show/hide
Query:  KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPK
        KL  EED   ++V+ D ++ A +     +  K+L    I+ +V S ++   W +E  + VE  G N++L  F R  D  R+ + GPW +D A+IV  +P 
Subjt:  KLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPK

Query:  ANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEK--IEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFC
        ++  +  L+F  V+FW+H   LP     +  A+ LGN+IG F   +V  NEK    G +LR++V I+I + L+RG  +         WIPI YE+LPDFC
Subjt:  ANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEK--IEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFC

Query:  YYCGRLGHVVHEC
        Y+CG +GH  H+C
Subjt:  YYCGRLGHVVHEC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding4.7e-0820.77Show/hide
Query:  EEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPKANCC
        E++  V+ + ++ +E  N  ++  +  K+L  S I   V +  + ++W   G + V       ++ +F+  ++      GGPW      ++  +  +   
Subjt:  EEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPKANCC

Query:  VEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFCYYCGRL
            D      WV    +P   + R   + +   +G     ++      +G   RV +++N+ + LK GT +  G         + YE L   C  CG  
Subjt:  VEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFCYYCGRL

Query:  GHVVHEC
        GH+VH C
Subjt:  GHVVHEC

AT3G42140.1 zinc ion binding;nucleic acid binding3.2e-0421.38Show/hide
Query:  FKRTKDKARICRGGPWIYDDAIIVFDE-PKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESL
        F+  +    I R GPW ++D + V     K +   E   F+ + FW+    +P      +   ++G  +G F    +G +  +                 
Subjt:  FKRTKDKARICRGGPWIYDDAIIVFDE-PKANCCVEALDFQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESL

Query:  KRGTNVKTGSMAEKKWIPITYEKLPDFCYYCGRLGHVVHECHEEG
                        +   YEKL +FC  CG L H   EC   G
Subjt:  KRGTNVKTGSMAEKKWIPITYEKLPDFCYYCGRLGHVVHECHEEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCACTGGATGCAAGGGTCCTAGCCGTTGCTTTCTCTCTCAAGAGAGAAAGGCTATGGACATGGGGGAAACCAGGAGGGGCAGCGAGCAAGCTTGCCGAACGGGGGA
AGGATCCATGGAAGGAAAAGAACAATCCCAAACGATAAAGAAGGATTGTCTGGGAGGATGTCAGATTGGCTGGGAAGAAGGAAGAACCCAGATCACTCACAACAGTAACC
AACCTCAGGCGGAAAAGGAAAACCAAACAAGCAAGAGGGAGAGTACAGAAGACACAGAAGAAGAAGCAATGGAAGGAATAACAGAAAACGAAGATATGTCCAGGCTTCTG
GAGAAGTTAAAGTTGGAAGAAGATAACAGAGTTGTGGAGGTTGAAGATGATGACATAGAAGAAGCCAACAGAGATTTTGAGAATGTAATAGCGTGCAAGATACTTACATC
CAGCTATATCAACCCTGAAGTCTTCTCTGGAATAATGCCAAAAATTTGGGGCCTGGAAGGAGCCGTGAGAGTAGAAAAAGCGGGAACTAACGTGTATCTCTGCAAGTTCA
AGAGGACTAAAGACAAAGCCAGAATCTGTCGAGGGGGGCCATGGATTTATGATGATGCTATAATAGTGTTCGATGAGCCAAAAGCAAATTGTTGCGTGGAAGCTCTAGAT
TTCCAATATGTTTCTTTTTGGGTTCACTTTCATAAATTACCCCGTGTTTGTTTTTGCAGGAAATATGCGGTGGCGTTGGGAAATTCGATAGGTGAATTCGAAGCAGCGGA
AGTTGGCGCTAATGAAAAAATAGAGGGAGAAACTTTAAGGGTAAAGGTAAAGATCAATATTAAAGAGTCGCTGAAGAGAGGAACTAATGTGAAAACAGGGTCCATGGCTG
AGAAAAAGTGGATTCCAATCACATATGAGAAACTCCCAGATTTTTGCTACTACTGTGGGAGATTGGGTCATGTAGTTCATGAGTGTCATGAGGAAGGTGTGGAAAGAAGC
AAAGAAAGAAACTTTGGAGTAGACCTGAGAGAGACCAATGGAAGCAAAGGATTCTACAAAAGCTGGAGGAGCGAAACCAAACAGTTCAGAGGAAGGGGTTTCAGAGGCAG
AGGGAGAGGGCTCTTCGGCAGGGGTGCGAGAGAAGATTTCAAAGAAACTGGTAAATCAGAATCTGACGACGAACCCATTGACAACCAAAGTCAGGTCTCCAAAGGACCAG
ATACATCGATAGAGGAAGAAGGAAACCGGCGAGAGGTGAACAGAAATGAAGACGCTGGCCGGAAAAGGGATGAAGTCGGAGGAGAAGAAGGTTCAGCGGCGGAAGTGTCA
GACCGGGTTGTCGGGTCAAATCAAACTGAAACAGACAGGAATGTGCTAGACCAGATGTCAGAGTTACCAAGAGAATACGACAAAATAGGAAAAAAGAAAGAGCCGTTGGA
GGAATCCTATGAAAAGTGTACAGATATTGACATGGGCCAAAGCGGTGATCCAAGTCTCGAGGGGCCCACGGAAATAAAAATCTTGAGGAAAATAGAAAAAGTAAAAGAAA
AGATTTTGGGAGGAAAATCGAAGGCAGTCACGGAAGGAGATGGCAAGAAAGAAAAACAAAAGGCAGAATGTAAAGTTGATCTCAGGAAAGGGAAAGACAGCCCAAAAAAG
GCAAAAGGAGGAAATGTTAAAACTTGGAAAAGGATGGCTAGAAACGAGCAAGAAATCAAATACTGGGACCAAAACGACATAGTGAATTACAGAGAAAAAAGGAAAGCTGA
AGATGAAAAACTTGAACCAGGTTTTCAAGAAGATCTCGGTCAGACTGTTGCCCAAACACAGATCAGACCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTCACTGGATGCAAGGGTCCTAGCCGTTGCTTTCTCTCTCAAGAGAGAAAGGCTATGGACATGGGGGAAACCAGGAGGGGCAGCGAGCAAGCTTGCCGAACGGGGGA
AGGATCCATGGAAGGAAAAGAACAATCCCAAACGATAAAGAAGGATTGTCTGGGAGGATGTCAGATTGGCTGGGAAGAAGGAAGAACCCAGATCACTCACAACAGTAACC
AACCTCAGGCGGAAAAGGAAAACCAAACAAGCAAGAGGGAGAGTACAGAAGACACAGAAGAAGAAGCAATGGAAGGAATAACAGAAAACGAAGATATGTCCAGGCTTCTG
GAGAAGTTAAAGTTGGAAGAAGATAACAGAGTTGTGGAGGTTGAAGATGATGACATAGAAGAAGCCAACAGAGATTTTGAGAATGTAATAGCGTGCAAGATACTTACATC
CAGCTATATCAACCCTGAAGTCTTCTCTGGAATAATGCCAAAAATTTGGGGCCTGGAAGGAGCCGTGAGAGTAGAAAAAGCGGGAACTAACGTGTATCTCTGCAAGTTCA
AGAGGACTAAAGACAAAGCCAGAATCTGTCGAGGGGGGCCATGGATTTATGATGATGCTATAATAGTGTTCGATGAGCCAAAAGCAAATTGTTGCGTGGAAGCTCTAGAT
TTCCAATATGTTTCTTTTTGGGTTCACTTTCATAAATTACCCCGTGTTTGTTTTTGCAGGAAATATGCGGTGGCGTTGGGAAATTCGATAGGTGAATTCGAAGCAGCGGA
AGTTGGCGCTAATGAAAAAATAGAGGGAGAAACTTTAAGGGTAAAGGTAAAGATCAATATTAAAGAGTCGCTGAAGAGAGGAACTAATGTGAAAACAGGGTCCATGGCTG
AGAAAAAGTGGATTCCAATCACATATGAGAAACTCCCAGATTTTTGCTACTACTGTGGGAGATTGGGTCATGTAGTTCATGAGTGTCATGAGGAAGGTGTGGAAAGAAGC
AAAGAAAGAAACTTTGGAGTAGACCTGAGAGAGACCAATGGAAGCAAAGGATTCTACAAAAGCTGGAGGAGCGAAACCAAACAGTTCAGAGGAAGGGGTTTCAGAGGCAG
AGGGAGAGGGCTCTTCGGCAGGGGTGCGAGAGAAGATTTCAAAGAAACTGGTAAATCAGAATCTGACGACGAACCCATTGACAACCAAAGTCAGGTCTCCAAAGGACCAG
ATACATCGATAGAGGAAGAAGGAAACCGGCGAGAGGTGAACAGAAATGAAGACGCTGGCCGGAAAAGGGATGAAGTCGGAGGAGAAGAAGGTTCAGCGGCGGAAGTGTCA
GACCGGGTTGTCGGGTCAAATCAAACTGAAACAGACAGGAATGTGCTAGACCAGATGTCAGAGTTACCAAGAGAATACGACAAAATAGGAAAAAAGAAAGAGCCGTTGGA
GGAATCCTATGAAAAGTGTACAGATATTGACATGGGCCAAAGCGGTGATCCAAGTCTCGAGGGGCCCACGGAAATAAAAATCTTGAGGAAAATAGAAAAAGTAAAAGAAA
AGATTTTGGGAGGAAAATCGAAGGCAGTCACGGAAGGAGATGGCAAGAAAGAAAAACAAAAGGCAGAATGTAAAGTTGATCTCAGGAAAGGGAAAGACAGCCCAAAAAAG
GCAAAAGGAGGAAATGTTAAAACTTGGAAAAGGATGGCTAGAAACGAGCAAGAAATCAAATACTGGGACCAAAACGACATAGTGAATTACAGAGAAAAAAGGAAAGCTGA
AGATGAAAAACTTGAACCAGGTTTTCAAGAAGATCTCGGTCAGACTGTTGCCCAAACACAGATCAGACCATAA
Protein sequenceShow/hide protein sequence
MVTGCKGPSRCFLSQERKAMDMGETRRGSEQACRTGEGSMEGKEQSQTIKKDCLGGCQIGWEEGRTQITHNSNQPQAEKENQTSKRESTEDTEEEAMEGITENEDMSRLL
EKLKLEEDNRVVEVEDDDIEEANRDFENVIACKILTSSYINPEVFSGIMPKIWGLEGAVRVEKAGTNVYLCKFKRTKDKARICRGGPWIYDDAIIVFDEPKANCCVEALD
FQYVSFWVHFHKLPRVCFCRKYAVALGNSIGEFEAAEVGANEKIEGETLRVKVKINIKESLKRGTNVKTGSMAEKKWIPITYEKLPDFCYYCGRLGHVVHECHEEGVERS
KERNFGVDLRETNGSKGFYKSWRSETKQFRGRGFRGRGRGLFGRGAREDFKETGKSESDDEPIDNQSQVSKGPDTSIEEEGNRREVNRNEDAGRKRDEVGGEEGSAAEVS
DRVVGSNQTETDRNVLDQMSELPREYDKIGKKKEPLEESYEKCTDIDMGQSGDPSLEGPTEIKILRKIEKVKEKILGGKSKAVTEGDGKKEKQKAECKVDLRKGKDSPKK
AKGGNVKTWKRMARNEQEIKYWDQNDIVNYREKRKAEDEKLEPGFQEDLGQTVAQTQIRP