; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021118 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021118
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionzf-RVT domain-containing protein
Genome locationscaffold9:590056..593211
RNA-Seq ExpressionSpg021118
SyntenySpg021118
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR020847 - AP endonuclease 1, binding site
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBN69274.1 TatD related DNase [Prunus dulcis]6.4e-4233.79Show/hide
Query:  LSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNHQ---TW
        L  R++ S YG ++ GW+T  + K      W +I + +  F +  +F V  G+KIRFWED W     L+  FP LY +S+RKN  I+  + NNH+    W
Subjt:  LSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNHQ---TW

Query:  DLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRK
        D   RR L   E      L++IL  V+L GS  D+  W +E  G ++ KS F+  +  ++ V  P    IW+   P K++ F+W      +N  + +QR+
Subjt:  DLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRK

Query:  FKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTF
             LSPS C  C    ENIDH+ IHC+++ K W  + +  G  + +PK   + L+  L      KKA ++  C   +  W +W ERN R F
Subjt:  FKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTF

RVW15141.1 putative ribonuclease H protein [Vitis vinifera]5.4e-4131.16Show/hide
Query:  LSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNHQ-TWDL
        L  +V+ SIY + T GW+   + +      W  I + F  F KF +F V  G +IRFWED W     L + FP L  +   KN  IS    +    +W+ 
Subjt:  LSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNHQ-TWDL

Query:  GLRRGLFYRESGSWEALVEILNEVQLGSGV-DKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRKFK
          RR L   E    E+L++ L+ + L   V DK  W+L  SG +T KS F      SS  ++   +L+W    P K+K F+W V ++ +N ++ LQ +  
Subjt:  GLRRGLFYRESGSWEALVEILNEVQLGSGV-DKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRKFK

Query:  SWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFD
          +LSP +C +C  + E  DH+ +HC+     W  +  L  I +  P+ + D L+     +   K+  ++   A  + +W++W+ERNAR F+
Subjt:  SWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFD

RVW27524.1 putative ribonuclease H protein [Vitis vinifera]3.2e-4130.75Show/hide
Query:  KVVWSSRNVGWLHLEAVGSVAVYCLCGRRSVWR------LLSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWED
        +VV  S+  G L L  + S+    L G + +WR       L  +V+ SIYG+ + GW+   + +      W  I + F  F KF +F V  G +IRFWED
Subjt:  KVVWSSRNVGWLHLEAVGSVAVYCLCGRRSVWR------LLSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWED

Query:  TWCASCPLQMAFPDLYVISQRKNASISECWNNNHQ-TWDLGLRRGLFYRESGSWEALVEILNEVQLGSGV-DKLIWTLEGSGCYTTKSMFQRCVGRSSKV
         W     L + FP L  +   KN  IS    +    +W+   RR L   E    E+L++ L+ + L   + DK  W+L  SG +T KS F   +  S   
Subjt:  TWCASCPLQMAFPDLYVISQRKNASISECWNNNHQ-TWDLGLRRGLFYRESGSWEALVEILNEVQLGSGV-DKLIWTLEGSGCYTTKSMFQRCVGRSSKV

Query:  NMPIV---ELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRKFKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKG
         +P+V   +L+W    P K+K F+W V ++ +N  + LQ +    +LSP +C +C    E +DH+ +HC+     W  +  L  I +  P+ V D ++  
Subjt:  NMPIV---ELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRKFKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKG

Query:  LIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFD
           +   K+  V+   A  + +W++W+ERNAR F+
Subjt:  LIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFD

TQD85214.1 hypothetical protein C1H46_029232 [Malus baccata]7.0e-4132.65Show/hide
Query:  LSQRVVASIYGTETMGWNTGILPKKRGSR-FWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECW--NNNHQTW
        L  +V+ S YG E  GWN    P++  SR  W DI     QF +  KF+V  G+++RFWED W A  PL+  FP L+++S++ N +IS     +++  +W
Subjt:  LSQRVVASIYGTETMGWNTGILPKKRGSR-FWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECW--NNNHQTW

Query:  DLGLRRGLFYRESGSWEALVEILNEVQLG-SGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRK
        +   RR L   E     +L++ +  ++L  S +D   W LE SG +T KS              P    IW+   P KVK+ +W V   SLN  +++Q++
Subjt:  DLGLRRGLFYRESGSWEALVEILNEVQLG-SGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRK

Query:  FKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFD
             LSP  C +C  +EE+++HI +HC+++ + W  +     +S+ +PK   + L+    A    KK+K + GC   +  W +W ERN R F+
Subjt:  FKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFD

VVA39726.1 Hypothetical predicted protein, partial [Prunus dulcis]2.4e-4132.88Show/hide
Query:  LSQRVVASIYGTETMGWNTGILPKKRGS--RFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNN--HQT
        L  +V+ SIYG +T GW+    P  RGS    W DI   +  F +   F V CG ++RFWED W     L+  FP L+ +S+++N +IS   +++    +
Subjt:  LSQRVVASIYGTETMGWNTGILPKKRGS--RFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNN--HQT

Query:  WDLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQR
        WD G RR L   E      L+++L  V+L  S +DK IW L+ SG +T  S+             P    IW+   P KVK+F+W  +   LN  + LQR
Subjt:  WDLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQR

Query:  KFKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFD
        +     +SP  C +C    +++DH+L+HC F+ K W+ +       + +P+   +  +  + A    KKAK++ G   ++ +W LW ERN R F+
Subjt:  KFKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFD

TrEMBL top hitse value%identityAlignment
A0A5E4GJ11 Reverse transcriptase domain-containing protein (Fragment)1.2e-4132.88Show/hide
Query:  LSQRVVASIYGTETMGWNTGILPKKRGS--RFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNN--HQT
        L  +V+ SIYG +T GW+    P  RGS    W DI   +  F +   F V CG ++RFWED W     L+  FP L+ +S+++N +IS   +++    +
Subjt:  LSQRVVASIYGTETMGWNTGILPKKRGS--RFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNN--HQT

Query:  WDLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQR
        WD G RR L   E      L+++L  V+L  S +DK IW L+ SG +T  S+             P    IW+   P KVK+F+W  +   LN  + LQR
Subjt:  WDLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQR

Query:  KFKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFD
        +     +SP  C +C    +++DH+L+HC F+ K W+ +       + +P+   +  +  + A    KKAK++ G   ++ +W LW ERN R F+
Subjt:  KFKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFD

A0A5H2XQW2 TatD related DNase3.1e-4233.79Show/hide
Query:  LSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNHQ---TW
        L  R++ S YG ++ GW+T  + K      W +I + +  F +  +F V  G+KIRFWED W     L+  FP LY +S+RKN  I+  + NNH+    W
Subjt:  LSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNHQ---TW

Query:  DLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRK
        D   RR L   E      L++IL  V+L GS  D+  W +E  G ++ KS F+  +  ++ V  P    IW+   P K++ F+W      +N  + +QR+
Subjt:  DLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRK

Query:  FKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTF
             LSPS C  C    ENIDH+ IHC+++ K W  + +  G  + +PK   + L+  L      KKA ++  C   +  W +W ERN R F
Subjt:  FKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTF

M5VH03 zf-RVT domain-containing protein (Fragment)8.9e-4233.11Show/hide
Query:  LSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNH---QTW
        L  R++ S YG ++ GW+T  + K      W +I + +  F +  +F V  G+KIRFWED W     L+  FP L  +S+RKN SI+ C+ NNH     W
Subjt:  LSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNH---QTW

Query:  DLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRK
        D   RR L   E      L++IL  V+L GS  D+  W +E  G ++ KS F+  +  +++   P    IW+   P K++ F+W      +N  + +QR+
Subjt:  DLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRK

Query:  FKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTF
             LSPS C +C    ENIDH+ IHC+++ K W  +    G+ + +PK   + L+  L      K+A ++  C   +  W +W ERN + F
Subjt:  FKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTF

M5X4S0 Reverse transcriptase domain-containing protein (Fragment)1.2e-4132.76Show/hide
Query:  LSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNH---QTW
        L  R++ S YG ++ GW+T  + K      W +I + +  F +  +F V  G+KIRFWED W     L+  FP L  +S+RKN SI+ C+ NNH     W
Subjt:  LSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNH---QTW

Query:  DLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRK
        D   RR L   E      L++IL  V+L GS  D+  W +E  G ++ KS F+  +  +++   P    IW+   P K++ F+W      +N  + +QR+
Subjt:  DLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRK

Query:  FKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTF
             LSPS C +C    ENIDH+ +HC+++ + W  +    G+ + +PK   + L+  L      K+A ++  C   +  W +W ERN R F
Subjt:  FKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTF

M5XV38 zf-RVT domain-containing protein6.9e-4233.11Show/hide
Query:  LSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNH---QTW
        L  R++ S YG ++ GW+T  + K      W +I + +  F +  +F V  G+KIRFWED W     L+  FP L  +S+RKN SI+ C+ NNH     W
Subjt:  LSQRVVASIYGTETMGWNTGILPKKRGSRFWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNH---QTW

Query:  DLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRK
        D   RR L   E      L++IL  V+L GS  D+  W +E  G ++ KS F+  +  +++   P    IW+   P K++ F+W      +N  + +QR+
Subjt:  DLGLRRGLFYRESGSWEALVEILNEVQL-GSGVDKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRK

Query:  FKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTF
             LSPS C +C    ENIDH+ IHC+++ + W  +    G+ + +PK   + L+  L      K+A ++  C   +  W +W ERN R F
Subjt:  FKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45063.1 copper ion binding;electron carriers3.9e-0526.88Show/hide
Query:  KFKSWSLS-PSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNAR
        +F SW +  PS+C +C   +E   H+   C F+ + W F  +   ++   P R+     + L      KK   I   A++++++ +W+ERN R
Subjt:  KFKSWSLS-PSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNAR

AT3G09510.1 Ribonuclease H-like superfamily protein2.2e-0826.2Show/hide
Query:  DKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVE----------LIWEHNCPNKVKVFLWSVIYRSLNADEQLQRKFKSWSLSPSVCRMCFNEEENIDH
        DK+IW    +G YT +S +   +      N+P +            IW      K+K FLW  + ++L   E+L    +   + PS C  C  E E+I+H
Subjt:  DKLIWTLEGSGCYTTKSMFQRCVGRSSKVNMPIVE----------LIWEHNCPNKVKVFLWSVIYRSLNADEQLQRKFKSWSLSPSVCRMCFNEEENIDH

Query:  ILIHCNFAKKAW---------------DFIANLQGI-SFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFD
         L  C FA  AW               DF  N+  I +F     + D+ +K L  W                 +W +WK RN   F+
Subjt:  ILIHCNFAKKAW---------------DFIANLQGI-SFCLPKRVDDWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATCATCTCATGGAACGTTAGAGGCTTGGGAGCCTGGTCGAAAAGGGCTCTGATCAAATATCTCCTCTCGAAAGAGAACCCTGATATTGTGATTCTCCAAGAATC
AAAGTTGGGCTTTGTGGATAGGAAAGTAGTTAAAGTTGTGTGGAGCTCGAGAAATGTAGGTTGGCTGCATCTAGAGGCTGTGGGCTCTGTGGCAGTATATTGCTTATGTG
GAAGGAGATCTGTTTGGAGGTTGTTGAGTCAGAGAGTTGTGGCTAGCATCTACGGCACTGAAACTATGGGTTGGAACACAGGGATTCTGCCTAAGAAAAGAGGTAGCAGA
TTTTGGCTTGATATTGAAAGAAACTTTCCCCAATTCCACAAATTTGCTAAGTTTCAGGTGAGTTGTGGGCAGAAAATCAGATTTTGGGAAGATACATGGTGTGCTTCTTG
CCCCCTTCAAATGGCTTTCCCAGATTTATATGTGATCTCTCAAAGGAAAAATGCCTCCATCTCAGAGTGTTGGAATAATAACCATCAGACTTGGGATCTTGGCCTCAGAA
GAGGCCTTTTTTATAGAGAATCTGGCAGCTGGGAAGCTTTGGTGGAGATCCTGAATGAGGTCCAGTTGGGGAGCGGTGTTGACAAATTGATATGGACTTTAGAGGGCTCG
GGCTGCTATACCACAAAATCTATGTTCCAAAGATGTGTAGGGAGATCCTCTAAGGTTAATATGCCTATTGTCGAGCTGATATGGGAGCACAATTGCCCTAATAAGGTGAA
GGTTTTTTTGTGGTCGGTGATTTATCGTAGCTTGAATGCGGATGAGCAATTGCAAAGGAAATTTAAGAGCTGGTCCTTATCCCCCTCAGTTTGCAGAATGTGTTTCAATG
AGGAAGAGAACATAGACCACATATTAATCCACTGCAATTTTGCTAAGAAAGCATGGGATTTCATAGCTAATCTGCAGGGTATCTCCTTTTGCTTGCCTAAAAGGGTAGAT
GATTGGCTCAACAAGGGCTTAATAGCTTGGAATTTGAAGAAGAAGGCTAAAGTCATCGCCGGATGTGCTTTTAGATCGACTATGTGGCTTTTGTGGAAGGAAAGGAACGC
GAGGACTTTCGATTCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATCATCTCATGGAACGTTAGAGGCTTGGGAGCCTGGTCGAAAAGGGCTCTGATCAAATATCTCCTCTCGAAAGAGAACCCTGATATTGTGATTCTCCAAGAATC
AAAGTTGGGCTTTGTGGATAGGAAAGTAGTTAAAGTTGTGTGGAGCTCGAGAAATGTAGGTTGGCTGCATCTAGAGGCTGTGGGCTCTGTGGCAGTATATTGCTTATGTG
GAAGGAGATCTGTTTGGAGGTTGTTGAGTCAGAGAGTTGTGGCTAGCATCTACGGCACTGAAACTATGGGTTGGAACACAGGGATTCTGCCTAAGAAAAGAGGTAGCAGA
TTTTGGCTTGATATTGAAAGAAACTTTCCCCAATTCCACAAATTTGCTAAGTTTCAGGTGAGTTGTGGGCAGAAAATCAGATTTTGGGAAGATACATGGTGTGCTTCTTG
CCCCCTTCAAATGGCTTTCCCAGATTTATATGTGATCTCTCAAAGGAAAAATGCCTCCATCTCAGAGTGTTGGAATAATAACCATCAGACTTGGGATCTTGGCCTCAGAA
GAGGCCTTTTTTATAGAGAATCTGGCAGCTGGGAAGCTTTGGTGGAGATCCTGAATGAGGTCCAGTTGGGGAGCGGTGTTGACAAATTGATATGGACTTTAGAGGGCTCG
GGCTGCTATACCACAAAATCTATGTTCCAAAGATGTGTAGGGAGATCCTCTAAGGTTAATATGCCTATTGTCGAGCTGATATGGGAGCACAATTGCCCTAATAAGGTGAA
GGTTTTTTTGTGGTCGGTGATTTATCGTAGCTTGAATGCGGATGAGCAATTGCAAAGGAAATTTAAGAGCTGGTCCTTATCCCCCTCAGTTTGCAGAATGTGTTTCAATG
AGGAAGAGAACATAGACCACATATTAATCCACTGCAATTTTGCTAAGAAAGCATGGGATTTCATAGCTAATCTGCAGGGTATCTCCTTTTGCTTGCCTAAAAGGGTAGAT
GATTGGCTCAACAAGGGCTTAATAGCTTGGAATTTGAAGAAGAAGGCTAAAGTCATCGCCGGATGTGCTTTTAGATCGACTATGTGGCTTTTGTGGAAGGAAAGGAACGC
GAGGACTTTCGATTCGTAA
Protein sequenceShow/hide protein sequence
MKIISWNVRGLGAWSKRALIKYLLSKENPDIVILQESKLGFVDRKVVKVVWSSRNVGWLHLEAVGSVAVYCLCGRRSVWRLLSQRVVASIYGTETMGWNTGILPKKRGSR
FWLDIERNFPQFHKFAKFQVSCGQKIRFWEDTWCASCPLQMAFPDLYVISQRKNASISECWNNNHQTWDLGLRRGLFYRESGSWEALVEILNEVQLGSGVDKLIWTLEGS
GCYTTKSMFQRCVGRSSKVNMPIVELIWEHNCPNKVKVFLWSVIYRSLNADEQLQRKFKSWSLSPSVCRMCFNEEENIDHILIHCNFAKKAWDFIANLQGISFCLPKRVD
DWLNKGLIAWNLKKKAKVIAGCAFRSTMWLLWKERNARTFDS