CuGenDBv2

Lag0028712 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028712
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr8:29140275..29141086
RNA-Seq Expression: Lag0028712
Synteny: Lag0028712
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
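
The genome location field uses a chromosome:start..end notation. As a minimal sketch, the string can be split into its parts and the genomic span computed as below. The function names parse_location and span_length are hypothetical helpers, not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr8:29140275..29141086")
print(chrom, start, end, span_length(start, end))
# chr8 29140275 29141086 812
```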


Homology
GenBank top hits (e value, % identity, alignment)
KAA3472112.1 reverse transcriptase [Gossypium australe] (e value: 7.3e-18; % identity: 27.24)
Query:  PESTDHILFQCQIARDLWNITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEANTKG--
        PE ++H+L+ C++ R +WN+    ++ ++D + S  ++  R+    T E+  L+ ++ WAIW  RNK VH      +D    +I RY+ E++ + T    
Subjt:  PESTDHILFQCQIARDLWNITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEANTKG--

Query:  SRRSRPLLSLPLLPRII-----------FGSRPPLKRGRNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAINFLNKPSE
        S         P LP II             S      GRN  G ++GA    I+ +     AE RA    +  A+D+G  ++++E D L  I  +    +
Subjt:  SRRSRPLLSLPLLPRII-----------FGSRPPLKRGRNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAINFLNKPSE

Query:  AWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFPD
          S +  +I ++ ++ + F +ISF + PR+ N VA  LA           WV+  P+
Subjt:  AWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFPD

ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica] (e value: 8.6e-19; % identity: 29.81)
Query:  CLNFPESTDHILFQCQIARDLW-NITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEAN
        C    ES  H ++ C+ A+++W N  +  V +    N SF + W  +    + EE GL A  CW +W  RN  +            S +++  +E S+AN
Subjt:  CLNFPESTDHILFQCQIARDLW-NITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEAN

Query:  ----TKGSRRSRPLLSLPLLPRIIFGSRPP--LKRG----------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAI
            T   R+S P   L        G RPP  +K G          RN++G  + A VR I  S+   + EL A +EG+R A+D+G +  I+E D    +
Subjt:  ----TKGSRRSRPLLSLPLLPRIIFGSRPP--LKRG----------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAI

Query:  NFLNKPSEAWSYLDG-LIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP
        N +   +E ++ +DG L+E +  +  NF  +   ++PR  N VA  LA++A      VTW++  P
Subjt:  NFLNKPSEAWSYLDG-LIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] (e value: 6.6e-19; % identity: 29.15)
Query:  CLNFPESTDHILFQCQIARDLW-NITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEAN
        C    ES  H ++ C+ A+++W N  +  V +E   N SF + W  +    + EE GL A  CW +W  RN  +              +++  +E S AN
Subjt:  CLNFPESTDHILFQCQIARDLW-NITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEAN

Query:  TKGSRRSRPLLSLPLLPRIIFGSRPP------------LKRG----------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIES
           S       S P  P  + G RPP            +K G          RN++G  + A VR I  S+   + EL A +EG+R A+D+G +  ++E 
Subjt:  TKGSRRSRPLLSLPLLPRIIFGSRPP------------LKRG----------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIES

Query:  DYLMAINFLNKPSEAWSYLDG-LIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP
        D    IN +    E  + +DG LIE +  +  NF  +   ++PR  N VA  LA++A      VTW++  P
Subjt:  DYLMAINFLNKPSEAWSYLDG-LIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 3.9e-19; % identity: 28.8)
Query:  MCLNFPESTDHILFQCQIARDLWNITFN-RVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEA
        +C +  ES  H  F C+ AR +W   F        + N SF++ WS +      ++L L A+T W IW DRN  +HG  + PV+ +  W++ +L   S+A
Subjt:  MCLNFPESTDHILFQCQIARDLWNITFN-RVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEA

Query:  -------NTKGSRRS-----RPLLSLPLLPRIIFGSRPPLKRG----RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMA
                T+ + R      RP  S+ L        R          R+S  S+V A    +     PL AE+R ILEG++ A     + + +ESD L+A
Subjt:  -------NTKGSRRS-----RPLLSLPLLPRIIFGSRPPLKRG----RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMA

Query:  INFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAKY
        I  +             +  +  +   F+ ISF +S R+ N  A  LAK+
Subjt:  INFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAKY

XP_023874626.1 uncharacterized protein LOC111987155 [Quercus suber] (e value: 4.3e-18; % identity: 29.18)
Query:  ESTDHILFQCQIARDLWNITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEANTKGSRR
        E++ HIL+ C  A  +W  T  ++      +  FVD    I       +  + AVT W++W +RN    G               Y++EI          
Subjt:  ESTDHILFQCQIARDLWNITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEANTKGSRR

Query:  SRPLL-SLPLLP-----------RIIFGSRPPLKRG---RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAINFLNKPS
        S P L + P LP             +F        G   RN  G ++GA  + ID+    LE E RA+ EGVRLA DLGL ++++ESD L  +N +  PS
Subjt:  SRPLL-SLPLLP-----------RIIFGSRPPLKRG---RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAINFLNKPS

Query:  EAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP
         + S +  ++E   +    F E    ++ R  N  A  +AKYA+ V + + WV+  P
Subjt:  EAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP

TrEMBL top hits (e value, % identity, alignment)
A0A251NPF0 Reverse transcriptase domain-containing protein (e value: 4.2e-19; % identity: 29.81)
Query:  CLNFPESTDHILFQCQIARDLW-NITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEAN
        C    ES  H ++ C+ A+++W N  +  V +    N SF + W  +    + EE GL A  CW +W  RN  +            S +++  +E S+AN
Subjt:  CLNFPESTDHILFQCQIARDLW-NITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEAN

Query:  ----TKGSRRSRPLLSLPLLPRIIFGSRPP--LKRG----------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAI
            T   R+S P   L        G RPP  +K G          RN++G  + A VR I  S+   + EL A +EG+R A+D+G +  I+E D    +
Subjt:  ----TKGSRRSRPLLSLPLLPRIIFGSRPP--LKRG----------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAI

Query:  NFLNKPSEAWSYLDG-LIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP
        N +   +E ++ +DG L+E +  +  NF  +   ++PR  N VA  LA++A      VTW++  P
Subjt:  NFLNKPSEAWSYLDG-LIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP

A0A5E4FZN9 PREDICTED: retrotransposon (e value: 3.2e-19; % identity: 29.15)
Query:  CLNFPESTDHILFQCQIARDLW-NITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEAN
        C    ES  H ++ C+ A+++W N  +  V +E   N SF + W  +    + EE GL A  CW +W  RN  +              +++  +E S AN
Subjt:  CLNFPESTDHILFQCQIARDLW-NITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEAN

Query:  TKGSRRSRPLLSLPLLPRIIFGSRPP------------LKRG----------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIES
           S       S P  P  + G RPP            +K G          RN++G  + A VR I  S+   + EL A +EG+R A+D+G +  ++E 
Subjt:  TKGSRRSRPLLSLPLLPRIIFGSRPP------------LKRG----------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIES

Query:  DYLMAINFLNKPSEAWSYLDG-LIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP
        D    IN +    E  + +DG LIE +  +  NF  +   ++PR  N VA  LA++A      VTW++  P
Subjt:  DYLMAINFLNKPSEAWSYLDG-LIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP

A0A6J1DX30 uncharacterized protein LOC111024874 (e value: 1.9e-19; % identity: 28.8)
Query:  MCLNFPESTDHILFQCQIARDLWNITFN-RVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEA
        +C +  ES  H  F C+ AR +W   F        + N SF++ WS +      ++L L A+T W IW DRN  +HG  + PV+ +  W++ +L   S+A
Subjt:  MCLNFPESTDHILFQCQIARDLWNITFN-RVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEA

Query:  -------NTKGSRRS-----RPLLSLPLLPRIIFGSRPPLKRG----RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMA
                T+ + R      RP  S+ L        R          R+S  S+V A    +     PL AE+R ILEG++ A     + + +ESD L+A
Subjt:  -------NTKGSRRS-----RPLLSLPLLPRIIFGSRPPLKRG----RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMA

Query:  INFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAKY
        I  +             +  +  +   F+ ISF +S R+ N  A  LAK+
Subjt:  INFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAKY

A0A803PWX1 Uncharacterized protein (e value: 1.6e-18; % identity: 27.06)
Query:  ESTDHILFQCQIARDLWNITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEANTK---G
        E+  H L+ C++ R++WN        +       +    R++S  T++E     +  W +W  RN   HG   P       W S++L E  E N     G
Subjt:  ESTDHILFQCQIARDLWNITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEANTK---G

Query:  SRRSRPLLSLPLLPRIIFGSRPPLKRG----------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAINFLNKPSEA
         RR     + P            +K G          R+ +G ++ A  RF++    PL+ EL+AIL G++  +   L    +ESD L A+N + K  E 
Subjt:  SRRSRPLLSLPLLPRIIFGSRPPLKRG----------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAINFLNKPSEA

Query:  WSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP
           +DGLI  +  + Q+       +  RE N VA  LA  A + K    WV   P
Subjt:  WSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP

M5W5F3 Reverse transcriptase domain-containing protein (Fragment) (e value: 4.2e-19; % identity: 29.81)
Query:  CLNFPESTDHILFQCQIARDLW-NITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEAN
        C    ES  H ++ C+ A+++W N  +  V +    N SF + W  +    + EE GL A  CW +W  RN  +            S +++  +E S+AN
Subjt:  CLNFPESTDHILFQCQIARDLW-NITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEAN

Query:  ----TKGSRRSRPLLSLPLLPRIIFGSRPP--LKRG----------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAI
            T   R+S P   L        G RPP  +K G          RN++G  + A VR I  S+   + EL A +EG+R A+D+G +  I+E D    +
Subjt:  ----TKGSRRSRPLLSLPLLPRIIFGSRPP--LKRG----------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAI

Query:  NFLNKPSEAWSYLDG-LIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP
        N +   +E ++ +DG L+E +  +  NF  +   ++PR  N VA  LA++A      VTW++  P
Subjt:  NFLNKPSEAWSYLDG-LIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDVTWVDCFP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G10000.1 Ribonuclease H-like superfamily protein (e value: 2.1e-07; % identity: 21.4)
Query:  PESTDHILFQCQIARDLWNITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVT-----CWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEAN
        PE++ H+LF C  A  +WN+   ++   +      ++  + +        +G+   T     CW IW  RN+ +          ++S  S     + E  
Subjt:  PESTDHILFQCQIARDLWNITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVT-----CWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEAN

Query:  TKGSRRSRPLLSLPL-LPRIIFGSRPP-------------LKRGRNSDGSMVGAGVRFIDLSF----------------YPLEAELRAILEGVRLAVDLG
        TK  + +    S  L LP++   +  P             +        S+ G+G  F   S                  PL AE  AI   +  A+ L 
Subjt:  TKGSRRSRPLLSLPL-LPRIIFGSRPP-------------LKRGRNSDGSMVGAGVRFIDLSF----------------YPLEAELRAILEGVRLAVDLG

Query:  LSRVIIESDYLMAINFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDV
         S +++ SD    ++ LN  + + + + GL+  +  +   F  ISF + PR  N +AD  AK +  +  ++
Subjt:  LSRVIIESDYLMAINFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAKYARLVKNDV

AT2G13980.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.8e-06; % identity: 32.35)
Query:  RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAINFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKL
        RN DG     G   +D +  PLEAE +A+L  ++     G  RVI+E D     N ++  S + ++L  L++ + +  + FS + F +  R  N VA +L
Subjt:  RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAINFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKL

Query:  AK
        AK
Subjt:  AK

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 9.8e-13; % identity: 24.62)
Query:  CLNFPESTDHILFQCQIARDLWNITFNRVFQEVDFNGS-FVDRWSRINSCCTQEELG----LVAVTCWAIWMDRNKKV-HGDPLPPVDIRSSWISRYLKE
        C +  E+ +H+LF+C  AR +W I+    + E ++  S + + +  +N      +LG    LV    W +W  RN+ +  G      ++    + R +++
Subjt:  CLNFPESTDHILFQCQIARDLWNITFNRVFQEVDFNGS-FVDRWSRINSCCTQEELG----LVAVTCWAIWMDRNKKV-HGDPLPPVDIRSSWISRYLKE

Query:  ISEANTKGSRRSRPLLSLPLLPR--IIFGSRPPLKRG---------------------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLS
          E +T+  R      S P + R   +    PP +                       RN  G ++  G R +  +   LEAEL A+   V         
Subjt:  ISEANTKGSRRSRPLLSLPLLPR--IIFGSRPPLKRG---------------------RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLS

Query:  RVIIESDYLMAINFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAK
        R+I ESD    +N LN   + W  L   +E +  +  +F E+ F ++PR  N VAD++A+
Subjt:  RVIIESDYLMAINFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAK

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 9.9e-05; % identity: 30.49)
Query:  PLEAELRAILEGVRLAVDLGLSRVIIESDYLMAINFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAK
        PL AE  A+   ++ A  +G++++ + SD    I  +   S +  +  G+I  +  +   F+++SF + PR  N VAD+LAK
Subjt:  PLEAELRAILEGVRLAVDLGLSRVIIESDYLMAINFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAK

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.7e-09; % identity: 27.31)
Query:  CLNFPESTDHILFQCQIARDLWNITFNRVFQEVDFNGS-FVDRWSRIN----SCCTQEELGLVAVTCWAIWMDRNKKVH--------------GDPLPPV
        C +  E+ +H+LF+C  AR  W I+   +    ++  S +V+ +   N    +   ++   LV    W +W +RN+ V                D L   
Subjt:  CLNFPESTDHILFQCQIARDLWNITFNRVFQEVDFNGS-FVDRWSRIN----SCCTQEELGLVAVTCWAIWMDRNKKVH--------------GDPLPPV

Query:  DIRSSWISRYLKEISEANTKGSRRSRPLLSLPLLPRIIFGSRPPLKRG-----RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSR----
         IR+   S   K     ++ G  R  P   +       + +R   + G     RN  G +   G R +      L++ L A LE +R AV L LSR    
Subjt:  DIRSSWISRYLKEISEANTKGSRRSRPLLSLPLLPRIIFGSRPPLKRG-----RNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSR----

Query:  -VIIESDYLMAINFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAK
         VI ESD  + I  LN   E W  L   I+ L  +   F+E+ FV+ PRE N +A+++A+
Subjt:  -VIIESDYLMAINFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSPRERNIVADKLAK


Sequences
CDS sequence
ATGTGTTTGAACTTCCCAGAATCCACAGACCATATACTGTTTCAGTGTCAAATAGCTAGGGATTTATGGAATATTACCTTTAATCGTGTGTTTCAAGAGGTGGACTTCAA
CGGGAGCTTCGTGGACAGATGGTCGCGTATTAATTCGTGTTGTACCCAGGAGGAGCTTGGTCTGGTTGCTGTGACATGCTGGGCCATCTGGATGGACAGAAATAAGAAAG
TTCATGGCGATCCCTTACCCCCAGTGGATATTAGAAGTTCTTGGATATCGAGATACCTAAAAGAGATTTCGGAGGCAAACACGAAAGGATCGCGAAGAAGTCGTCCCTTG
TTGAGTCTTCCTCTCCTTCCTCGAATAATATTTGGATCTCGCCCCCCTCTGAAACGTGGAAGGAATTCCGATGGGAGTATGGTCGGAGCTGGAGTTCGCTTCATCGATTT
ATCGTTTTATCCCCTTGAAGCAGAGTTGAGAGCTATCTTAGAGGGTGTTCGTCTGGCGGTGGATTTAGGGCTTTCTCGAGTGATTATCGAATCTGATTATTTAATGGCTA
TTAATTTCCTAAACAAACCATCGGAAGCTTGGAGTTACTTGGATGGCCTTATTGAGAGTTTATGGGTCATGGGCCAAAACTTCAGTGAGATAAGCTTTGTGTATAGCCCA
AGAGAGAGGAATATAGTTGCTGATAAATTGGCTAAATATGCCAGATTAGTGAAAAACGATGTAACCTGGGTAGATTGTTTCCCAGACTAG
mRNA sequence
ATGTGTTTGAACTTCCCAGAATCCACAGACCATATACTGTTTCAGTGTCAAATAGCTAGGGATTTATGGAATATTACCTTTAATCGTGTGTTTCAAGAGGTGGACTTCAA
CGGGAGCTTCGTGGACAGATGGTCGCGTATTAATTCGTGTTGTACCCAGGAGGAGCTTGGTCTGGTTGCTGTGACATGCTGGGCCATCTGGATGGACAGAAATAAGAAAG
TTCATGGCGATCCCTTACCCCCAGTGGATATTAGAAGTTCTTGGATATCGAGATACCTAAAAGAGATTTCGGAGGCAAACACGAAAGGATCGCGAAGAAGTCGTCCCTTG
TTGAGTCTTCCTCTCCTTCCTCGAATAATATTTGGATCTCGCCCCCCTCTGAAACGTGGAAGGAATTCCGATGGGAGTATGGTCGGAGCTGGAGTTCGCTTCATCGATTT
ATCGTTTTATCCCCTTGAAGCAGAGTTGAGAGCTATCTTAGAGGGTGTTCGTCTGGCGGTGGATTTAGGGCTTTCTCGAGTGATTATCGAATCTGATTATTTAATGGCTA
TTAATTTCCTAAACAAACCATCGGAAGCTTGGAGTTACTTGGATGGCCTTATTGAGAGTTTATGGGTCATGGGCCAAAACTTCAGTGAGATAAGCTTTGTGTATAGCCCA
AGAGAGAGGAATATAGTTGCTGATAAATTGGCTAAATATGCCAGATTAGTGAAAAACGATGTAACCTGGGTAGATTGTTTCCCAGACTAG
Protein sequence
MCLNFPESTDHILFQCQIARDLWNITFNRVFQEVDFNGSFVDRWSRINSCCTQEELGLVAVTCWAIWMDRNKKVHGDPLPPVDIRSSWISRYLKEISEANTKGSRRSRPL
LSLPLLPRIIFGSRPPLKRGRNSDGSMVGAGVRFIDLSFYPLEAELRAILEGVRLAVDLGLSRVIIESDYLMAINFLNKPSEAWSYLDGLIESLWVMGQNFSEISFVYSP
RERNIVADKLAKYARLVKNDVTWVDCFPD
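
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on two short fragments copied from the CDS: its first 24 nucleotides and its last 21. It assumes the standard genetic code; the record does not say which translation table or tool CuGenDB used, and the translate function is a hypothetical helper written for this illustration.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

head = "ATGTGTTTGAACTTCCCAGAATCC"   # first 24 nt of the CDS
tail = "GTAGATTGTTTCCCAGACTAG"      # last 21 nt of the CDS, including the stop codon

print(translate(head))  # MCLNFPES
print(translate(tail))  # VDCFPD*
```

The two outputs match the start (MCLNFPES) and the end (VDCFPD) of the listed protein sequence, with the trailing asterisk standing for the TAG stop codon.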