; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g009040 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g009040
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionCCHC-type domain-containing protein
Genome locationChr06:9126706..9132527
RNA-Seq ExpressionLcy06g009040
SyntenyLcy06g009040
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense]1.4e-3532.17Show/hide
Query:  DVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFYDRAMLLLEEPKGDCC
        D  ++V  + ED I   ++ +   ++ K+ T KK++ E FK ++ +IWNQ   + ++  G N F+  F N   +  + + GPW + +++++LE+PKG   
Subjt:  DVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFYDRAMLLLEEPKGDCC

Query:  GDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKMIAVTYEKLPDFCYGCGR
          +L F    FWV  H +P+ C ++     +   +G+V  V++  ++       +R+KVQ+D+T+PLKR + +K G+     M+A+ YE+LPDFC+ CGR
Subjt:  GDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKMIAVTYEKLPDFCYGCGR

Query:  LGHIIKEC-DED---TGTKEEELPYGPWLR
        +GH ++EC DED        ++  +G W+R
Subjt:  LGHIIKEC-DED---TGTKEEELPYGPWLR

XP_015380691.1 uncharacterized protein LOC107174364 [Citrus sinensis]3.8e-3636.36Show/hide
Query:  KKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFYDRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKL
        K LA  ++ K+   + ++ E FKS + ++W     + I+  G N F+ KF     K  ++  GPW +DRA+L+L EPKG     + +F + +FW+    +
Subjt:  KKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFYDRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKL

Query:  PLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLK-SGRSGVEKMIAVTYEKLPDFCYGCGRLGHIIKECDEDTGTKEE
        P+AC  +    E+G ++G VE+++ +++ E   + + RI+V I++T PLK+ +FLK  G S ++  + V YE+LPDFCY CG +GH  KEC +  G ++E
Subjt:  PLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLK-SGRSGVEKMIAVTYEKLPDFCYGCGRLGHIIKECDEDTGTKEE

Query:  ELPYGPWLR
        +LPYG W++
Subjt:  ELPYGPWLR

XP_015382889.1 uncharacterized protein LOC102626150 [Citrus sinensis]2.2e-3637.14Show/hide
Query:  EKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFYDRAMLLLEEPKGDCCGDELAFRYVSFWVHFHK
        EK +A  ++ K+   +K+S E  K  M ++W     + I+  G N+F+ KF +   K  I+  GPW +DRA+++L EP G     + +F  VS WV  H 
Subjt:  EKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFYDRAMLLLEEPKGDCCGDELAFRYVSFWVHFHK

Query:  LPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKM-IAVTYEKLPDFCYGCGRLGHIIKECDEDTGTKE
        +P+ C S+  A ++G+++GKVE+VD  D  E      LR+++ +D+++PLK+ + L+      E + + V YE+LPDFC+ CGR+GH  +EC       +
Subjt:  LPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKM-IAVTYEKLPDFCYGCGRLGHIIKECDEDTGTKE

Query:  EELPYGPWLR
        +EL YGPWLR
Subjt:  EELPYGPWLR

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]3.2e-3534.84Show/hide
Query:  AEGIIQKLAAMKVTADVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI--IDCRGFNLFLCKFKNARIKGHIVDSGPWF
        A  ++++    K+T++       +   A++ T K L  +++CK+ +K+ IS  V K+ +   W  +     +D  GFN+FL  F  +  +  I+  GPW 
Subjt:  AEGIIQKLAAMKVTADVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI--IDCRGFNLFLCKFKNARIKGHIVDSGPWF

Query:  YDRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKC-DSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKM
        +DRA+++++ P       ++ FR VS WVHF  L LAC +++ A  +G+ +G  E  D+E +A   C  S LR++V+ DV +PL RG+ L          
Subjt:  YDRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKC-DSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKM

Query:  IAVTYEKLPDFCYGCGRLGHIIKEC-DEDTGTKEEELPYGPWLR
        I + YE+LPDF Y CGRL HI+K+C D    +  + L YGPWLR
Subjt:  IAVTYEKLPDFCYGCGRLGHIIKEC-DEDTGTKEEELPYGPWLR

XP_024033132.1 uncharacterized protein LOC112095437 [Citrus clementina]1.0e-3636.84Show/hide
Query:  KKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFYDRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKL
        K +A  ++ KI   + ++ E  K+ + + W   H   ++  G N+F+ KF +   K  + + GPW +DRA+++L+EPKG     + +F +VSFW+  H +
Subjt:  KKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFYDRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKL

Query:  PLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKC-DSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKMIAVTYEKLPDFCYGCGRLGHIIKECDEDTGTKEE
        PL C   S   E+GS +GKVE + +  DA+ +C    +RI+V +++T+PLK+ + LK      +  + V YE+LPDFC+ CG +GH  +EC E  G ++E
Subjt:  PLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKC-DSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKMIAVTYEKLPDFCYGCGRLGHIIKECDEDTGTKEE

Query:  ELPYGPWLR
         LP+GPWL+
Subjt:  ELPYGPWLR

TrEMBL top hitse value%identityAlignment
A0A1S8AC25 CCHC-type domain-containing protein (Fragment)3.1e-3634.44Show/hide
Query:  EGIIQKLAAMKVTADVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFYD
        E +I++  A++++ +    V       I+  EK LA  ++ K+   +++S E  K  M ++W     + I+  G N+F+ KF +   K  I+  GPW +D
Subjt:  EGIIQKLAAMKVTADVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFYD

Query:  RAMLLLEEPKGDCCGDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKC-DSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKM-I
        RA++ L EP G     +  F +VSFWV  H +P+ C S+  AAE+G ++GKVE+V  E DA  +C    LR+++ +D+T+PLK+ + L+      + + +
Subjt:  RAMLLLEEPKGDCCGDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKC-DSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKM-I

Query:  AVTYEKLPDFCYGCGRLGHIIKECDEDTGTKEEELPYGPWLREPYKLKDMESVNTIGRMQNYMGGRGRGR
         V YE+LPDFC+ CGR+GH  +EC       ++EL YGPWL+           NT+   +    GRGR R
Subjt:  AVTYEKLPDFCYGCGRLGHIIKECDEDTGTKEEELPYGPWLREPYKLKDMESVNTIGRMQNYMGGRGRGR

A0A5C7GU64 CCHC-type domain-containing protein7.0e-3632.17Show/hide
Query:  DVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFYDRAMLLLEEPKGDCC
        D  ++V  + ED I   ++ +   ++ K+ T KK++ E FK ++ +IWNQ   + ++  G N F+  F N   +  + + GPW + +++++LE+PKG   
Subjt:  DVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFYDRAMLLLEEPKGDCC

Query:  GDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKMIAVTYEKLPDFCYGCGR
          +L F    FWV  H +P+ C ++     +   +G+V  V++  ++       +R+KVQ+D+T+PLKR + +K G+     M+A+ YE+LPDFC+ CGR
Subjt:  GDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKMIAVTYEKLPDFCYGCGR

Query:  LGHIIKEC-DED---TGTKEEELPYGPWLR
        +GH ++EC DED        ++  +G W+R
Subjt:  LGHIIKEC-DED---TGTKEEELPYGPWLR

A0A5C7H466 CCHC-type domain-containing protein3.8e-3431.79Show/hide
Query:  AEGIIQKLAAMKVTADVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFY
        +E  I +L A    AD    V  +    I   E  +   ++ KI + KK++ + F  ++ ++W+    + I+    N+FL KF N   +  I   GPW++
Subjt:  AEGIIQKLAAMKVTADVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI-IDCRGFNLFLCKFKNARIKGHIVDSGPWFY

Query:  DRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKMIA
        D+++++L +P+G     +L F  V  WV  H +P+ C +R  A  +   +G+V  VD+  ++       L++KVQID+T+PLKR + L+  +S    M++
Subjt:  DRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKMIA

Query:  VTYEKLPDFCYGCGRLGHIIKECDEDTGTKEEEL-----PYGPWLREP----YKLKDMESVNTIGRMQNYMGGRGRGRGR
        + YE+LP+FCY CGR+GH  K+C ED   K E L      +G W+R P     K   M+ V    ++Q    GR  G+G+
Subjt:  VTYEKLPDFCYGCGRLGHIIKECDEDTGTKEEEL-----PYGPWLREP----YKLKDMESVNTIGRMQNYMGGRGRGRGR

A0A6J1BSZ1 uncharacterized protein LOC1110054811.6e-3534.84Show/hide
Query:  AEGIIQKLAAMKVTADVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI--IDCRGFNLFLCKFKNARIKGHIVDSGPWF
        A  ++++    K+T++       +   A++ T K L  +++CK+ +K+ IS  V K+ +   W  +     +D  GFN+FL  F  +  +  I+  GPW 
Subjt:  AEGIIQKLAAMKVTADVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTI--IDCRGFNLFLCKFKNARIKGHIVDSGPWF

Query:  YDRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKC-DSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKM
        +DRA+++++ P       ++ FR VS WVHF  L LAC +++ A  +G+ +G  E  D+E +A   C  S LR++V+ DV +PL RG+ L          
Subjt:  YDRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKC-DSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKM

Query:  IAVTYEKLPDFCYGCGRLGHIIKEC-DEDTGTKEEELPYGPWLR
        I + YE+LPDF Y CGRL HI+K+C D    +  + L YGPWLR
Subjt:  IAVTYEKLPDFCYGCGRLGHIIKEC-DEDTGTKEEELPYGPWLR

A0A6J1D765 uncharacterized protein LOC1110179024.5e-3532.71Show/hide
Query:  ITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWN-QEHTIIDCRGFNLFLCKFKNARIKGHIVDSGPWFYDRAMLLLEEPKGDCCGDELAFRYVSFWVHF
        +T   +   V+ K+ T K+IS E  +S+M  +W     T  +  G N+++  FK+   K  ++ SGPW +++++L+L  P       ++ F + +FW+  
Subjt:  ITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWN-QEHTIIDCRGFNLFLCKFKNARIKGHIVDSGPWFYDRAMLLLEEPKGDCCGDELAFRYVSFWVHF

Query:  HKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKMIAVTYEKLPDFCYGCGRLGHIIKECDEDTGTK
        H +P  C S   A  +G+ LG VE+++  D A+      +R++V+IDV++PL+RG+ LK+   G +    + YEKLPDFCY CG++GH  +EC++ +   
Subjt:  HKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKMIAVTYEKLPDFCYGCGRLGHIIKECDEDTGTK

Query:  EEELP--YGPWLREPYKLKDMESVNTIGRMQNYMGGRGR----GRGRFGDWEDR-RSWRNKEEGES
            P  YG WLR     K +         +    GRG     GRG  GDW  R  +WR+ +  ES
Subjt:  EEELP--YGPWLREPYKLKDMESVNTIGRMQNYMGGRGR----GRGRFGDWEDR-RSWRNKEEGES

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-0429.49Show/hide
Query:  WIPPPTGVCKLNCDASWSSRLRRGGIGWILRDWRGCPLRGGVKWVKQSWKI--SWLEALS-VCEAVRHFPSDSLIFQLELDALQVVQLLTNESEDATELG
        W  PP    K N DA+W     R GIGWILR+  G  L  G + + ++  +  + LEAL      +  F    +IF  E DA  +V LL N  +    L 
Subjt:  WIPPPTGVCKLNCDASWSSRLRRGGIGWILRDWRGCPLRGGVKWVKQSWKI--SWLEALS-VCEAVRHFPSDSLIFQLELDALQVVQLLTNESEDATELG

Query:  GFIMEAQDLMKSLQVQTIQHVSRNNNGLAHHMAHMACELQMSN---SWSSVFPSWL
          + + Q L+   +    +   R  N +A  +A  +  +  SN      S+ P WL
Subjt:  GFIMEAQDLMKSLQVQTIQHVSRNNNGLAHHMAHMACELQMSN---SWSSVFPSWL

AT3G42140.1 zinc ion binding;nucleic acid binding5.3e-0421.97Show/hide
Query:  IVDSGPWFYDRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSG
        I+  GPW ++  M +++  +      +  F+ + FW+    +PL   +      IG  +G   + +L  D  +       +K Q                
Subjt:  IVDSGPWFYDRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSG

Query:  RSGVEKMIAVTYEKLPDFCYGCGRLGHIIKEC
                   YEKL +FC  CG L H   EC
Subjt:  RSGVEKMIAVTYEKLPDFCYGCGRLGHIIKEC

AT4G29090.1 Ribonuclease H-like superfamily protein6.1e-0824.91Show/hide
Query:  YLMKIWTKEGRDSIDQQRVATSLI--LCWQIWTHRNLVTQNNQKMDMQMLEAKIKVFLTEFLHQEESREELGTCSTQQNRGLEAPIGNANSARLQTIHGD
        Y+   W     +   Q   A+ L+  L W++W +RN +    ++ + Q +  + +  L E+      R E  +C T+       P  N +S         
Subjt:  YLMKIWTKEGRDSIDQQRVATSLI--LCWQIWTHRNLVTQNNQKMDMQMLEAKIKVFLTEFLHQEESREELGTCSTQQNRGLEAPIGNANSARLQTIHGD

Query:  SSLRSSLWIPPPTGVCKLNCDASWSSRLRRGGIGWILRDWRGCPLRGGVKWV-------KQSWKISWLEALS-VCEAVRHFPSDSLIFQLELDALQVVQL
               W PPP    K N DA+W+    R GIGW+LR+      +G VKW+        +S   + LEA+     ++  F  + +IF  E D+  ++++
Subjt:  SSLRSSLWIPPPTGVCKLNCDASWSSRLRRGGIGWILRDWRGCPLRGGVKWV-------KQSWKISWLEALS-VCEAVRHFPSDSLIFQLELDALQVVQL

Query:  LTNESEDATELGGFIMEAQDLMKSLQVQTIQHVSRNNNGLAHHMAHMACE-LQMSNSWSSVFPSW
        L N+ E    L   I + Q L+          + R  N LA  +A  +   L       S+ PSW
Subjt:  LTNESEDATELGGFIMEAQDLMKSLQVQTIQHVSRNNNGLAHHMAHMACE-LQMSNSWSSVFPSW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAAGTAGTAATGCAGAGGGAATTATCCAGAAACTAGCAGCGATGAAAGTGACGGCGGATGTTTCATCAAGTGTGTTCCATTTACAAGAAGATGCAATT
GATATTACAGAGAAGAAACTGGCCAATGCAGTCTTATGCAAGATTTTTACAAAAAAGAAAATCAGTCCTGAAGTCTTCAAATCAATGATGCCAAAGATATGGAAC
CAAGAGCATACAATTATTGATTGCCGGGGCTTCAACTTATTCCTTTGCAAATTCAAAAATGCAAGGATTAAAGGGCATATCGTAGATTCGGGGCCTTGGTTTTAT
GATAGAGCGATGCTTCTGTTGGAAGAGCCAAAAGGAGACTGCTGTGGAGATGAATTGGCGTTCAGGTATGTCTCATTTTGGGTTCATTTTCATAAACTCCCTCTG
GCTTGTTTTTCTAGGTCTGCAGCAGCGGAAATTGGAAGCTTATTGGGCAAAGTGGAGCAAGTGGATTTAGAAGATGATGCAGAAATAAAGTGTGATAGTTCATTA
CGTATCAAGGTTCAAATCGATGTGACTCGCCCGCTGAAACGTGGTGTTTTTCTCAAATCTGGAAGATCTGGGGTGGAAAAAATGATCGCAGTGACGTATGAGAAA
TTACCTGATTTTTGCTATGGATGTGGGAGGTTGGGCCATATAATCAAGGAGTGTGATGAAGATACTGGTACAAAAGAAGAGGAGCTACCGTATGGTCCTTGGCTT
CGTGAACCATACAAACTAAAAGACATGGAGTCAGTTAATACCATAGGAAGGATGCAAAATTACATGGGAGGAAGAGGTAGAGGTCGAGGGCGGTTTGGGGATTGG
GAGGATCGTAGATCATGGAGGAATAAAGAAGAGGGGGAAAGCGAAGATGTGCTACCAAAGAAAGGAGACGACGGTGGCATGGTTGACGGCGGCAGAGCTGGCGGA
GGCACGGCTAGCGGTGGAGATAAGGCAGCAAGGGAGAAGGACCGAATGGCTCCGGCAGAGGCGGCAGATCCGACAGAGAGAGAGCCGGAGATTTCGGATCCATTA
ATTACCCAAACGGTAATAACACCATTAAAGGAGAAAGAGAAAATATTCGGGGAATCAAACATTCATGATAGAAATTCTACAGCTGTGATGGAGGTTGACATGTTG
ACTATGGTGTCTGAGGTAAAAAGGGACGAAGAAAAGGATTCAGTTATGAAAGAGGTTGGATATACCATTGGGGACGAGAGAAAAGAAAAGGCTAAAAAATGGAAA
AGACTGTCACGTGCTCGGGTGGAAGGTGAATTGCATGATCAACTAGGGCAGTACATTCACACTACTACCACGGGGAAACATAAGATTGAGATTGAGGATGCAAGT
TCTCATAAGAAAAGAATCGTTATGGTGGATGATAATTCAATTGCAAAATCGGTGGAGGCTGCGAGAAATCCCCAGAATCTGCTTCCCACCTATTTTGGGAATGTA
AGGAGGGGTGGATTATCTCCGGCGGACTATCTGATGAAAATCTGGACAAAGGAGGGGAGAGATTCAATTGACCAGCAGAGAGTTGCAACAAGCCTAATTTTATGC
TGGCAGATCTGGACTCATAGGAATTTGGTGACTCAAAACAACCAAAAGATGGATATGCAGATGCTGGAAGCTAAAATTAAAGTGTTTCTAACAGAATTTCTTCAT
CAAGAAGAGTCAAGAGAGGAGCTGGGTACCTGTTCTACTCAACAAAATCGAGGTCTGGAGGCCCCGATTGGTAACGCGAATTCGGCGAGGTTGCAAACGATTCAT
GGTGATTCTTCTCTGCGTTCGAGCTTGTGGATTCCTCCTCCGACGGGGGTTTGCAAATTGAACTGTGATGCTTCTTGGAGCTCCAGACTTCGGCGCGGTGGAATT
GGATGGATTCTCAGGGACTGGCGTGGCTGTCCGTTAAGAGGAGGAGTTAAATGGGTTAAGCAAAGCTGGAAAATTTCATGGTTAGAGGCTTTATCAGTTTGCGAA
GCCGTGAGACATTTCCCTTCTGATTCCCTCATTTTTCAACTTGAACTTGATGCACTCCAAGTGGTGCAGCTGCTGACCAACGAAAGCGAGGATGCCACTGAGTTG
GGGGGCTTCATAATGGAAGCTCAAGACCTAATGAAGTCTCTCCAAGTTCAAACTATTCAGCATGTGTCAAGGAATAATAATGGGCTGGCCCATCATATGGCCCAT
ATGGCATGTGAACTACAAATGTCCAATTCTTGGTCCTCTGTTTTTCCCTCATGGCTGTTAGATTATAATTATCTAGATACTGGGTATGAATCTTATACTTGTGGG
GGTCCCTGTCCCACAGGCAGTACCATTTTGGGAGCTGTGACTAGCTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCAAGTAGTAATGCAGAGGGAATTATCCAGAAACTAGCAGCGATGAAAGTGACGGCGGATGTTTCATCAAGTGTGTTCCATTTACAAGAAGATGCAATT
GATATTACAGAGAAGAAACTGGCCAATGCAGTCTTATGCAAGATTTTTACAAAAAAGAAAATCAGTCCTGAAGTCTTCAAATCAATGATGCCAAAGATATGGAAC
CAAGAGCATACAATTATTGATTGCCGGGGCTTCAACTTATTCCTTTGCAAATTCAAAAATGCAAGGATTAAAGGGCATATCGTAGATTCGGGGCCTTGGTTTTAT
GATAGAGCGATGCTTCTGTTGGAAGAGCCAAAAGGAGACTGCTGTGGAGATGAATTGGCGTTCAGGTATGTCTCATTTTGGGTTCATTTTCATAAACTCCCTCTG
GCTTGTTTTTCTAGGTCTGCAGCAGCGGAAATTGGAAGCTTATTGGGCAAAGTGGAGCAAGTGGATTTAGAAGATGATGCAGAAATAAAGTGTGATAGTTCATTA
CGTATCAAGGTTCAAATCGATGTGACTCGCCCGCTGAAACGTGGTGTTTTTCTCAAATCTGGAAGATCTGGGGTGGAAAAAATGATCGCAGTGACGTATGAGAAA
TTACCTGATTTTTGCTATGGATGTGGGAGGTTGGGCCATATAATCAAGGAGTGTGATGAAGATACTGGTACAAAAGAAGAGGAGCTACCGTATGGTCCTTGGCTT
CGTGAACCATACAAACTAAAAGACATGGAGTCAGTTAATACCATAGGAAGGATGCAAAATTACATGGGAGGAAGAGGTAGAGGTCGAGGGCGGTTTGGGGATTGG
GAGGATCGTAGATCATGGAGGAATAAAGAAGAGGGGGAAAGCGAAGATGTGCTACCAAAGAAAGGAGACGACGGTGGCATGGTTGACGGCGGCAGAGCTGGCGGA
GGCACGGCTAGCGGTGGAGATAAGGCAGCAAGGGAGAAGGACCGAATGGCTCCGGCAGAGGCGGCAGATCCGACAGAGAGAGAGCCGGAGATTTCGGATCCATTA
ATTACCCAAACGGTAATAACACCATTAAAGGAGAAAGAGAAAATATTCGGGGAATCAAACATTCATGATAGAAATTCTACAGCTGTGATGGAGGTTGACATGTTG
ACTATGGTGTCTGAGGTAAAAAGGGACGAAGAAAAGGATTCAGTTATGAAAGAGGTTGGATATACCATTGGGGACGAGAGAAAAGAAAAGGCTAAAAAATGGAAA
AGACTGTCACGTGCTCGGGTGGAAGGTGAATTGCATGATCAACTAGGGCAGTACATTCACACTACTACCACGGGGAAACATAAGATTGAGATTGAGGATGCAAGT
TCTCATAAGAAAAGAATCGTTATGGTGGATGATAATTCAATTGCAAAATCGGTGGAGGCTGCGAGAAATCCCCAGAATCTGCTTCCCACCTATTTTGGGAATGTA
AGGAGGGGTGGATTATCTCCGGCGGACTATCTGATGAAAATCTGGACAAAGGAGGGGAGAGATTCAATTGACCAGCAGAGAGTTGCAACAAGCCTAATTTTATGC
TGGCAGATCTGGACTCATAGGAATTTGGTGACTCAAAACAACCAAAAGATGGATATGCAGATGCTGGAAGCTAAAATTAAAGTGTTTCTAACAGAATTTCTTCAT
CAAGAAGAGTCAAGAGAGGAGCTGGGTACCTGTTCTACTCAACAAAATCGAGGTCTGGAGGCCCCGATTGGTAACGCGAATTCGGCGAGGTTGCAAACGATTCAT
GGTGATTCTTCTCTGCGTTCGAGCTTGTGGATTCCTCCTCCGACGGGGGTTTGCAAATTGAACTGTGATGCTTCTTGGAGCTCCAGACTTCGGCGCGGTGGAATT
GGATGGATTCTCAGGGACTGGCGTGGCTGTCCGTTAAGAGGAGGAGTTAAATGGGTTAAGCAAAGCTGGAAAATTTCATGGTTAGAGGCTTTATCAGTTTGCGAA
GCCGTGAGACATTTCCCTTCTGATTCCCTCATTTTTCAACTTGAACTTGATGCACTCCAAGTGGTGCAGCTGCTGACCAACGAAAGCGAGGATGCCACTGAGTTG
GGGGGCTTCATAATGGAAGCTCAAGACCTAATGAAGTCTCTCCAAGTTCAAACTATTCAGCATGTGTCAAGGAATAATAATGGGCTGGCCCATCATATGGCCCAT
ATGGCATGTGAACTACAAATGTCCAATTCTTGGTCCTCTGTTTTTCCCTCATGGCTGTTAGATTATAATTATCTAGATACTGGGTATGAATCTTATACTTGTGGG
GGTCCCTGTCCCACAGGCAGTACCATTTTGGGAGCTGTGACTAGCTCTTGA
Protein sequenceShow/hide protein sequence
MASSSNAEGIIQKLAAMKVTADVSSSVFHLQEDAIDITEKKLANAVLCKIFTKKKISPEVFKSMMPKIWNQEHTIIDCRGFNLFLCKFKNARIKGHIVDSGPWFY
DRAMLLLEEPKGDCCGDELAFRYVSFWVHFHKLPLACFSRSAAAEIGSLLGKVEQVDLEDDAEIKCDSSLRIKVQIDVTRPLKRGVFLKSGRSGVEKMIAVTYEK
LPDFCYGCGRLGHIIKECDEDTGTKEEELPYGPWLREPYKLKDMESVNTIGRMQNYMGGRGRGRGRFGDWEDRRSWRNKEEGESEDVLPKKGDDGGMVDGGRAGG
GTASGGDKAAREKDRMAPAEAADPTEREPEISDPLITQTVITPLKEKEKIFGESNIHDRNSTAVMEVDMLTMVSEVKRDEEKDSVMKEVGYTIGDERKEKAKKWK
RLSRARVEGELHDQLGQYIHTTTTGKHKIEIEDASSHKKRIVMVDDNSIAKSVEAARNPQNLLPTYFGNVRRGGLSPADYLMKIWTKEGRDSIDQQRVATSLILC
WQIWTHRNLVTQNNQKMDMQMLEAKIKVFLTEFLHQEESREELGTCSTQQNRGLEAPIGNANSARLQTIHGDSSLRSSLWIPPPTGVCKLNCDASWSSRLRRGGI
GWILRDWRGCPLRGGVKWVKQSWKISWLEALSVCEAVRHFPSDSLIFQLELDALQVVQLLTNESEDATELGGFIMEAQDLMKSLQVQTIQHVSRNNNGLAHHMAH
MACELQMSNSWSSVFPSWLLDYNYLDTGYESYTCGGPCPTGSTILGAVTSS