; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy01g009190 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy01g009190
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr01:25864874..25866271
RNA-Seq ExpressionLcy01g009190
SyntenyLcy01g009190
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG63812.1 hypothetical protein EZV62_010806 [Acer yangbiense]1.4e-3427.8Show/hide
Query:  MLKVGSKAEESWSRVTYEKLPEFCYCCGRIGHVAKDCEEDEVNK----AEEYQYGPWMREDNIIWGKSK-------GTKEEGRKSPKANLNVRKG-----
        ++++  ++ E W R    K   F   CGR+GH   +C + E  K        +YG W++   +   K K       G+  + R S      V  G     
Subjt:  MLKVGSKAEESWSRVTYEKLPEFCYCCGRIGHVAKDCEEDEVNK----AEEYQYGPWMREDNIIWGKSK-------GTKEEGRKSPKANLNVRKG-----

Query:  ------RNGGEDSK---------------SEEEEEARVEELADTRVSFPATAPPRTVEEQG-------SGGTKRKKE--------SLEDNSNRSPRKGID
              RNGG +S                S  EE A  E L    V  P       VE++        SGG  +  E         + +    SPR   D
Subjt:  ------RNGGEDSK---------------SEEEEEARVEELADTRVSFPATAPPRTVEEQG-------SGGTKRKKE--------SLEDNSNRSPRKGID

Query:  MSGEVSNHDEQFSDGSHRGMEIEKKEAQSDKKWEEILEVAVRRYHGPIKAPSSQR----AMGSGSQPVRNNHEEETVLANPNQDDQKYK-----------
            V   D+     +  G+++++      KKW+       R+    + A   Q+    ++ S   P+RN++       +P    +K K           
Subjt:  MSGEVSNHDEQFSDGSHRGMEIEKKEAQSDKKWEEILEVAVRRYHGPIKAPSSQR----AMGSGSQPVRNNHEEETVLANPNQDDQKYK-----------

Query:  -------DTKGEERDIG--------RGCGTTPPDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLS
                 + E+R  G        + CG  PP AM  LCWN+RG GNP  V  L+ +V+++ P+L+FLSETK++   +  ++  L +   F V ++G S
Subjt:  -------DTKGEERDIG--------RGCGTTPPDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLS

Query:  GGLMLLWKKEVHVTVRSFSKGHIDAIIELNEDR-WRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK
        GGL+LLW ++ +V+V+SFS GHID  I++ +   W+F+G YG  + + R +  +LI RL+ +D LPW+  GDFNE+L  +EK
Subjt:  GGLMLLWKKEVHVTVRSFSKGHIDAIIELNEDR-WRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]2.6e-3352.05Show/hide
Query:  MKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAIIELNEDRWR
        MK+LCWN+ G GNP   R LR +VR++ P LVFLSETK         K +L FD C  V++ G SGGLMLLW  + +V ++S S GHID+II      WR
Subjt:  MKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAIIELNEDRWR

Query:  FTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK
        FTGFYG+     RS S  L+ RL  M DLPW++GGDFNEI+   EK
Subjt:  FTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK

XP_028073297.1 uncharacterized protein LOC114275455 [Camellia sinensis]8.3e-3246.62Show/hide
Query:  MKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAIIE--LNEDR
        M  +CWN RG GNPR+VR L+ ++++  P LVFL ETK+       ++ KL F   F V+ VGL+GGL LLW + + V+V+S+S GHID  ++  L E  
Subjt:  MKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAIIE--LNEDR

Query:  WRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK
        WRFTGFYG  +   R  S +L+ +L     +PW+  GDFNEILF  EK
Subjt:  WRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK

XP_028075737.1 uncharacterized protein LOC114277953 [Camellia sinensis]2.1e-3550.68Show/hide
Query:  MKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAII--ELNEDR
        MK LCWN RG GNPR VR L+L++++  P +VFL ETK+       ++ KL    CF V  VGLSGGL LLW  E+ + ++SFS+GH+D+II  E     
Subjt:  MKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAII--ELNEDR

Query:  WRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK
        W FTGFYG+     RSDS +L+ RLQ    LPW+  GDFNEIL+ +EK
Subjt:  WRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK

XP_042950313.1 uncharacterized protein LOC122282426 [Carya illinoinensis]1.9e-3147.62Show/hide
Query:  MKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAIIELNEDR-W
        MKT+CWN RG GNP  +R LR ++ +  P+L+FL ETK+     ++LK KL F NCF V + G SGGL LLW  ++ V +RSFS+ HID  I++++   W
Subjt:  MKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAIIELNEDR-W

Query:  RFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK
        RFTG YG  D   R+ + +LI  L +++ LPW++GGD NE+L  +EK
Subjt:  RFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK

TrEMBL top hitse value%identityAlignment
A0A2N9EEH0 Uncharacterized protein1.9e-3430Show/hide
Query:  AEESWSRVTYEKLPEFCYCCGRIGHVAKDCEEDEVN----KAEEYQYGPWMREDNIIWGKSKGTKEEGRKSPKANLNVRKGRNGGEDSKSEEEEEARVEE
        + E W    YE+LP F Y CG++ H  KDC     N    K+ + QYGPW+R       +      EG+ S       R    GG  S++E+E +  V+ 
Subjt:  AEESWSRVTYEKLPEFCYCCGRIGHVAKDCEEDEVN----KAEEYQYGPWMREDNIIWGKSKGTKEEGRKSPKANLNVRKGRNGGEDSKSEEEEEARVEE

Query:  LADTRVSFPATAPPRTVEEQGSGGTKRKKESLEDNSNRSPRKGID--MSGEVSNHDEQFSDGS----HRGMEIEKKEAQSDKKWEE------ILEVAVRR
            + +        T+      G K    +L+D+S R  +   D   + E+   DE+   GS    H    IEK  A + ++ E+      I E  V  
Subjt:  LADTRVSFPATAPPRTVEEQGSGGTKRKKESLEDNSNRSPRKGID--MSGEVSNHDEQFSDGS----HRGMEIEKKEAQSDKKWEE------ILEVAVRR

Query:  YHGPIKAPS-----------------------------------------SQRAMGSGSQPVRNNHEEETVLANPNQDDQKYKDTKGEERDIGRGCGTTP
             +A +                                          +   G+G+   + N  E   L+  +++D + KD+ G           TP
Subjt:  YHGPIKAPS-----------------------------------------SQRAMGSGSQPVRNNHEEETVLANPNQDDQKYKDTKGEERDIGRGCGTTP

Query:  PDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDA-IIELNE
        P +M  L  N RG GNP+ VR L  +V+   P +VFL ET+++      L++KL       V   G  GGL LLWKKEV  T+ S S GHIDA ++    
Subjt:  PDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDA-IIELNE

Query:  DRWRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK
          W FTGFYG+ +   R DS  L+ RLQ  DD+PW++ GDFNEIL  +EK
Subjt:  DRWRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK

A0A2N9I239 RNase H domain-containing protein2.6e-3931.81Show/hide
Query:  KVG-SKAEESWSRVTYEKLPEFCYCCGRIGHVAKDCE----EDEVNKAEEYQYGPWMREDNIIWGKSKGTKEEGRKSPKANLNVRK--GRNGGEDSKSEE
        KVG   ++++W  + YE+LP FCY CG I H  +DCE          +E  QYGPWMR + I   +  G    G   P  ++   +  G  GG++ ++ +
Subjt:  KVG-SKAEESWSRVTYEKLPEFCYCCGRIGHVAKDCE----EDEVNKAEEYQYGPWMREDNIIWGKSKGTKEEGRKSPKANLNVRK--GRNGGEDSKSEE

Query:  EEEARVEELADTRVSFPATAPPRTVEEQ----------GSGGTKRKKESLEDNSNRSPRKGIDMSGEVSNHDEQFSDGSHRGMEIEKKEAQSDKKWEEIL
        +         D   S  A  P R  E+Q           SG +     S+++++  S R+  D+ G+V       +  +  G+ +  KE           
Subjt:  EEEARVEELADTRVSFPATAPPRTVEEQ----------GSGGTKRKKESLEDNSNRSPRKGIDMSGEVSNHDEQFSDGSHRGMEIEKKEAQSDKKWEEIL

Query:  EVAVRRYHGPIKAPSSQRAMGSGSQPVRNNHEEETVLANPNQDDQKYKDTKGEERDIGRGCGTTPPDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLV
                 PI + SS+   G+  +          +L  PN       +++    DIG GC  TPP+AM  L WN RG GNP  V+ L ++VRQ  P  +
Subjt:  EVAVRRYHGPIKAPSSQRAMGSGSQPVRNNHEEETVLANPNQDDQKYKDTKGEERDIGRGCGTTPPDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLV

Query:  FLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAIIELNEDR-WRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPW
        F+SETK+       L+    F     V + G SGGL+L W++ V VTV S+S+ HIDA++E ++ + WR TGFYGS     +  + D++  L     LPW
Subjt:  FLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAIIELNEDR-WRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPW

Query:  VLGGDFNEILFGNEK
        + GGDFNE+L G EK
Subjt:  VLGGDFNEILFGNEK

A0A2N9IFR8 RNase H domain-containing protein1.9e-3732.27Show/hide
Query:  SKAEESWSRVTYEKLPEFCYCCGRIGHVAKDCE----EDEVNKAEEYQYGPWMREDNIIWGKSKGTKEEGRKSPKANLNVRKGRNGGEDSKSEEEEEARV
        S ++++W  + YE+LP FCY CG I H  +DCE          +E  QYGPWMR + I   +  G  ++ R  P  N ++   R GG  +K  +   A  
Subjt:  SKAEESWSRVTYEKLPEFCYCCGRIGHVAKDCE----EDEVNKAEEYQYGPWMREDNIIWGKSKGTKEEGRKSPKANLNVRKGRNGGEDSKSEEEEEARV

Query:  EELADTRVSFPATAPPRTVEEQ----------GSGGTKRKKESLEDNSNRSPRKGIDMSGEVSNHDEQFSDGSHRGMEIEKKEAQSDKKWEEILEVAVRR
        ++ AD         P R  E+Q           SG +     S+++++  S R+  D+ G+V       +  +  G+ +  KE                 
Subjt:  EELADTRVSFPATAPPRTVEEQ----------GSGGTKRKKESLEDNSNRSPRKGIDMSGEVSNHDEQFSDGSHRGMEIEKKEAQSDKKWEEILEVAVRR

Query:  YHGPIKAPSSQRAMGSGSQPVRNNHEEETVLANPNQDDQKYKDTKGEERDIGRGCGTTPPDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETK
           PI + SS+   G+  +          +L  PN       +++    DIG GC  TPP+AM  L WN RG GNP  V+ L ++VRQ  P  +F+SETK
Subjt:  YHGPIKAPSSQRAMGSGSQPVRNNHEEETVLANPNQDDQKYKDTKGEERDIGRGCGTTPPDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETK

Query:  IKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAIIELNEDR-WRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDF
        +       L+    F     V + G SGGL+L W++ V VTV S+S  HIDA++E ++ + WR TGFYGS     +  + D++  L     LPW+ GGDF
Subjt:  IKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAIIELNEDR-WRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDF

Query:  NEILFGNEK
        NE+L G EK
Subjt:  NEILFGNEK

A0A5C7I5K8 DUF4283 domain-containing protein6.7e-3527.8Show/hide
Query:  MLKVGSKAEESWSRVTYEKLPEFCYCCGRIGHVAKDCEEDEVNK----AEEYQYGPWMREDNIIWGKSK-------GTKEEGRKSPKANLNVRKG-----
        ++++  ++ E W R    K   F   CGR+GH   +C + E  K        +YG W++   +   K K       G+  + R S      V  G     
Subjt:  MLKVGSKAEESWSRVTYEKLPEFCYCCGRIGHVAKDCEEDEVNK----AEEYQYGPWMREDNIIWGKSK-------GTKEEGRKSPKANLNVRKG-----

Query:  ------RNGGEDSK---------------SEEEEEARVEELADTRVSFPATAPPRTVEEQG-------SGGTKRKKE--------SLEDNSNRSPRKGID
              RNGG +S                S  EE A  E L    V  P       VE++        SGG  +  E         + +    SPR   D
Subjt:  ------RNGGEDSK---------------SEEEEEARVEELADTRVSFPATAPPRTVEEQG-------SGGTKRKKE--------SLEDNSNRSPRKGID

Query:  MSGEVSNHDEQFSDGSHRGMEIEKKEAQSDKKWEEILEVAVRRYHGPIKAPSSQR----AMGSGSQPVRNNHEEETVLANPNQDDQKYK-----------
            V   D+     +  G+++++      KKW+       R+    + A   Q+    ++ S   P+RN++       +P    +K K           
Subjt:  MSGEVSNHDEQFSDGSHRGMEIEKKEAQSDKKWEEILEVAVRRYHGPIKAPSSQR----AMGSGSQPVRNNHEEETVLANPNQDDQKYK-----------

Query:  -------DTKGEERDIG--------RGCGTTPPDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLS
                 + E+R  G        + CG  PP AM  LCWN+RG GNP  V  L+ +V+++ P+L+FLSETK++   +  ++  L +   F V ++G S
Subjt:  -------DTKGEERDIG--------RGCGTTPPDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLS

Query:  GGLMLLWKKEVHVTVRSFSKGHIDAIIELNEDR-WRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK
        GGL+LLW ++ +V+V+SFS GHID  I++ +   W+F+G YG  + + R +  +LI RL+ +D LPW+  GDFNE+L  +EK
Subjt:  GGLMLLWKKEVHVTVRSFSKGHIDAIIELNEDR-WRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK

A0A7N2LUL7 Uncharacterized protein1.5e-3430.58Show/hide
Query:  SKAEESWSRVTYEKLPEFCYCCGRIGHVAKDCE---EDEVNKAEEYQ-YGPWMREDNIIWGKSKGTKEEGRKSPKANLNVRKGRNGGEDSKSEEEEEARV
        SK E+SW    YE+LP  CY CG + HV  DC+   E E    +E Q YG W+R    + G+S   K  G  + K               K+++++    
Subjt:  SKAEESWSRVTYEKLPEFCYCCGRIGHVAKDCE---EDEVNKAEEYQ-YGPWMREDNIIWGKSKGTKEEGRKSPKANLNVRKGRNGGEDSKSEEEEEARV

Query:  EELADTRVSFPATAPPRTVEEQGSGGTKRKKESLEDNSNRSPRKGIDMSGEVSNHDEQFSDGSHRGMEIEKKEAQSDKKWEEILEVAVRRYHGPIKAPSS
        E +A   V       P  V+EQ         E        S    +   GE   HD    +        E++ A+ DK                      
Subjt:  EELADTRVSFPATAPPRTVEEQGSGGTKRKKESLEDNSNRSPRKGIDMSGEVSNHDEQFSDGSHRGMEIEKKEAQSDKKWEEILEVAVRRYHGPIKAPSS

Query:  QRAMGSGSQPVRNNHEEETVLANPNQDDQKYKDTKGEERDIGRGCGTTPPDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLK
                                   D    D  G +   G GCG  PP AM  LCWN RG G+P+    L  ++  + P++VF++ET +K      L 
Subjt:  QRAMGSGSQPVRNNHEEETVLANPNQDDQKYKDTKGEERDIGRGCGTTPPDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLK

Query:  LKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAIIEL-NEDRWRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK
         KL FD   E S  G  GG+++ WKKEV  +V ++S  HIDAII    E  WRFTGFYG S+      S   + RL+A   LPW+  GDFNEI+  +EK
Subjt:  LKLDFDNCFEVSNVGLSGGLMLLWKKEVHVTVRSFSKGHIDAIIEL-NEDRWRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAAAGGTAGGATCTAAAGCTGAGGAAAGTTGGAGCAGAGTCACTTATGAAAAGCTCCCAGAATTCTGCTACTGCTGTGGTCGGATTGGTCACGTTGCCAAGGACTG
TGAAGAGGACGAAGTAAACAAAGCAGAGGAATATCAGTATGGCCCGTGGATGAGGGAAGATAATATTATCTGGGGGAAAAGCAAAGGTACGAAAGAGGAGGGAAGGAAAA
GCCCGAAAGCTAATCTGAATGTCCGGAAAGGCAGAAACGGTGGGGAAGACTCTAAAAGCGAGGAAGAAGAAGAAGCGAGAGTGGAAGAGTTAGCTGATACAAGAGTGAGT
TTCCCGGCGACAGCGCCTCCAAGAACGGTGGAGGAGCAGGGAAGCGGTGGGACGAAAAGGAAAAAAGAGTCGTTAGAGGACAACTCAAACAGAAGCCCAAGGAAAGGAAT
TGACATGTCAGGAGAGGTCAGTAATCATGATGAACAGTTTTCAGATGGGTCCCACAGAGGTATGGAAATAGAAAAAAAGGAGGCACAATCTGACAAAAAATGGGAGGAAA
TTTTGGAAGTGGCCGTTAGAAGATATCATGGGCCTATAAAGGCCCCATCTAGCCAAAGGGCCATGGGCTCTGGTAGTCAACCTGTCAGAAACAACCATGAAGAAGAAACG
GTGCTGGCTAATCCTAATCAAGATGATCAAAAATACAAGGACACTAAGGGTGAAGAGAGGGATATCGGCAGAGGCTGTGGGACAACCCCGCCGGACGCCATGAAAACATT
ATGTTGGAACATTCGAGGTGCGGGGAACCCTCGAGCGGTTCGTTTGCTGCGTTTGGTGGTGCGACAAAATTTCCCTAATTTAGTCTTTTTGTCTGAAACCAAGATTAAGG
GTCTTTGCTCAAATAGTCTTAAGCTGAAGCTGGATTTTGATAATTGTTTTGAGGTTTCAAATGTTGGGCTTAGTGGTGGGTTGATGTTGCTTTGGAAGAAGGAGGTGCAC
GTTACTGTTAGGTCCTTCTCTAAGGGCCATATTGATGCCATTATCGAGTTGAATGAGGATAGGTGGAGATTTACTGGCTTTTATGGGAGTTCGGATAAGGACTGTAGAAG
TGACTCGTTGGATCTTATTATACGCCTCCAAGCCATGGATGATCTTCCTTGGGTCTTGGGAGGAGATTTCAATGAGATCCTTTTTGGGAATGAGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTAAAGGTAGGATCTAAAGCTGAGGAAAGTTGGAGCAGAGTCACTTATGAAAAGCTCCCAGAATTCTGCTACTGCTGTGGTCGGATTGGTCACGTTGCCAAGGACTG
TGAAGAGGACGAAGTAAACAAAGCAGAGGAATATCAGTATGGCCCGTGGATGAGGGAAGATAATATTATCTGGGGGAAAAGCAAAGGTACGAAAGAGGAGGGAAGGAAAA
GCCCGAAAGCTAATCTGAATGTCCGGAAAGGCAGAAACGGTGGGGAAGACTCTAAAAGCGAGGAAGAAGAAGAAGCGAGAGTGGAAGAGTTAGCTGATACAAGAGTGAGT
TTCCCGGCGACAGCGCCTCCAAGAACGGTGGAGGAGCAGGGAAGCGGTGGGACGAAAAGGAAAAAAGAGTCGTTAGAGGACAACTCAAACAGAAGCCCAAGGAAAGGAAT
TGACATGTCAGGAGAGGTCAGTAATCATGATGAACAGTTTTCAGATGGGTCCCACAGAGGTATGGAAATAGAAAAAAAGGAGGCACAATCTGACAAAAAATGGGAGGAAA
TTTTGGAAGTGGCCGTTAGAAGATATCATGGGCCTATAAAGGCCCCATCTAGCCAAAGGGCCATGGGCTCTGGTAGTCAACCTGTCAGAAACAACCATGAAGAAGAAACG
GTGCTGGCTAATCCTAATCAAGATGATCAAAAATACAAGGACACTAAGGGTGAAGAGAGGGATATCGGCAGAGGCTGTGGGACAACCCCGCCGGACGCCATGAAAACATT
ATGTTGGAACATTCGAGGTGCGGGGAACCCTCGAGCGGTTCGTTTGCTGCGTTTGGTGGTGCGACAAAATTTCCCTAATTTAGTCTTTTTGTCTGAAACCAAGATTAAGG
GTCTTTGCTCAAATAGTCTTAAGCTGAAGCTGGATTTTGATAATTGTTTTGAGGTTTCAAATGTTGGGCTTAGTGGTGGGTTGATGTTGCTTTGGAAGAAGGAGGTGCAC
GTTACTGTTAGGTCCTTCTCTAAGGGCCATATTGATGCCATTATCGAGTTGAATGAGGATAGGTGGAGATTTACTGGCTTTTATGGGAGTTCGGATAAGGACTGTAGAAG
TGACTCGTTGGATCTTATTATACGCCTCCAAGCCATGGATGATCTTCCTTGGGTCTTGGGAGGAGATTTCAATGAGATCCTTTTTGGGAATGAGAAGTAG
Protein sequenceShow/hide protein sequence
MLKVGSKAEESWSRVTYEKLPEFCYCCGRIGHVAKDCEEDEVNKAEEYQYGPWMREDNIIWGKSKGTKEEGRKSPKANLNVRKGRNGGEDSKSEEEEEARVEELADTRVS
FPATAPPRTVEEQGSGGTKRKKESLEDNSNRSPRKGIDMSGEVSNHDEQFSDGSHRGMEIEKKEAQSDKKWEEILEVAVRRYHGPIKAPSSQRAMGSGSQPVRNNHEEET
VLANPNQDDQKYKDTKGEERDIGRGCGTTPPDAMKTLCWNIRGAGNPRAVRLLRLVVRQNFPNLVFLSETKIKGLCSNSLKLKLDFDNCFEVSNVGLSGGLMLLWKKEVH
VTVRSFSKGHIDAIIELNEDRWRFTGFYGSSDKDCRSDSLDLIIRLQAMDDLPWVLGGDFNEILFGNEK