CuGenDBv2

Lag0007526 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0007526
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr9:698930..705155
RNA-Seq Expression: Lag0007526
Synteny: Lag0007526
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
                  IPR043502 - DNA/RNA polymerase superfamily
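
The genome location field uses a seqid:start..end form. A minimal Python sketch for splitting such a string into its parts follows. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the names GenomeLocation and parse_location are illustrative only.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed location (hypothetical helper type, not part of the database)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'seqid:start..end' string into its three parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["seqid"], start, end)


loc = parse_location("chr9:698930..705155")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the annotated span of this gene on chr9 is 6226 bp.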


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0036866.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]; e value: 2.7e-44; % identity: 38.48
Query:  HSQANQNKLSSSEEARYPQDSKKYDPQSSLIRQSVDSSAVISQNLPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW-
        H +     L+S  + R    S   +P   ++R   D        LP  REV F+IELEP T PI +A YRMA A+L+ELKVQ+QELL K  I L VSPW 
Subjt:  HSQANQNKLSSSEEARYPQDSKKYDPQSSLIRQSVDSSAVISQNLPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW-

Query:  --------------------------IKDEDIPKTAFRSRYDHYEFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRV
                                  IKD D+PKTAFRSRY HYEF VM FGL  APAVFMDLM                           E+  H+R V
Subjt:  --------------------------IKDEDIPKTAFRSRYDHYEFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRV

Query:  LST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVEIVSEWPCLITVNEARS----SSVVYSDASKKSLGYVLTQDEKVVVYVFHQ------
        L T ++   Y   S  EFW +QVS LGH VV + G+ VD   +E V+ W    TV+E RS    +     DASKK LGYVL Q  KVV YV HQ      
Subjt:  LST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVEIVSEWPCLITVNEARS----SSVVYSDASKKSLGYVLTQDEKVVVYVFHQ------

Query:  ---------------------------------SQEPKVFLHPKELNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAP
                                          +  K F   KELN+R RR L LVKDY  EILYH  ++ V  +D L  KV   A  +   AP
Subjt:  ---------------------------------SQEPKVFLHPKELNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAP

KAA0046956.1 pol protein [Cucumis melo var. makuwa]; e value: 1.2e-41; % identity: 37.78
Query:  LPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW--------------------------------IKDEDIPKTAFRS
        LP  RE+ F+IELE  T PI +A  +MA  +L+ELKVQ+QELL K  I   VSPW                                I+D +IPKTAFRS
Subjt:  LPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW--------------------------------IKDEDIPKTAFRS

Query:  RYDHYEFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVD
        RY HYEF VM FGL  AP VFMDLM                           E+  H+ +VL T +    Y   S  EFW ++VS LGH VV   G+ VD
Subjt:  RYDHYEFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVD

Query:  STMVEIVSEWPCLITVNEARS-------------------------------------SSVVYSDASKKSLGYVLTQDEKVVVYVFHQSQEPKVFLHPKE
           +E+V+ WP   TV++ RS                                     S V+YSDASKK +GYVL Q  K+ ++  H+S   K F   KE
Subjt:  STMVEIVSEWPCLITVNEARS-------------------------------------SSVVYSDASKKSLGYVLTQDEKVVVYVFHQSQEPKVFLHPKE

Query:  LNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAPRL
        LN+R RR L LVKDY  EILYH   + V  +D L  K+   A  +   AP L
Subjt:  LNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAPRL

KAA0054231.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]; e value: 1.2e-41; % identity: 37.9
Query:  LPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW---------------------------IKDEDIPKTAFRSRYDHY
        LP  REV F+IELEP T PI +A YRMA A+L+ELKVQ+QELL K  I   VSPW                           IKD D+PKT FRSRY HY
Subjt:  LPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW---------------------------IKDEDIPKTAFRSRYDHY

Query:  EFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVE
        EF VM FGL  APAVFMDLM                           E+  H+R VL T  +   Y   S  EFW +QVS LGH VV + G+ VD   +E
Subjt:  EFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVE

Query:  IVSEWPCLITVNEARS-------------------------SSVVYSDASKKSLGYVLTQDEKVVVY---------------------------------
         V+ W    TV+E+++                         S V+YSDASKK LG VL Q  KVV Y                                 
Subjt:  IVSEWPCLITVNEARS-------------------------SSVVYSDASKKSLGYVLTQDEKVVVY---------------------------------

Query:  ------VFHQSQEPKVFLHPKELNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAP
              +F   +  K F   KELN+R RR L LVKDY  EILYH  ++ V  +D L  KV   A  +   AP
Subjt:  ------VFHQSQEPKVFLHPKELNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAP

TYK04826.1 pol protein [Cucumis melo var. makuwa]; e value: 1.2e-41; % identity: 37.78
Query:  LPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW--------------------------------IKDEDIPKTAFRS
        LP  RE+ F+IELE  T PI +A  +MA  +L+ELKVQ+QELL K  I   VSPW                                I+D +IPKTAFRS
Subjt:  LPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW--------------------------------IKDEDIPKTAFRS

Query:  RYDHYEFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVD
        RY HYEF VM FGL  AP VFMDLM                           E+  H+ +VL T +    Y   S  EFW ++VS LGH VV   G+ VD
Subjt:  RYDHYEFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVD

Query:  STMVEIVSEWPCLITVNEARS-------------------------------------SSVVYSDASKKSLGYVLTQDEKVVVYVFHQSQEPKVFLHPKE
           +E+V+ WP   TV++ RS                                     S V+YSDASKK +GYVL Q  K+ ++  H+S   K F   KE
Subjt:  STMVEIVSEWPCLITVNEARS-------------------------------------SSVVYSDASKKSLGYVLTQDEKVVVYVFHQSQEPKVFLHPKE

Query:  LNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAPRL
        LN+R RR L LVKDY  EILYH   + V  +D L  K+   A  +   AP L
Subjt:  LNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAPRL

TYK09653.1 pol protein [Cucumis melo var. makuwa]; e value: 2.8e-41; % identity: 35.78
Query:  LSSSEEARYPQDSKKYDPQSSLIRQSVDSSAVISQNLPLV---REVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW------
        L+S  + R P+ S   +P   ++R+  D   V S  LP +   RE+ F+IELEPDT PI +A YR A A+L+E+KVQ+QELL K  I    SPW      
Subjt:  LSSSEEARYPQDSKKYDPQSSLIRQSVDSSAVISQNLPLV---REVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW------

Query:  ---------------------------------------------------------IKDEDIPKTAFRSRYDHYEFTVMFFGLAEAPAVFMDLM-----
                                                                 I+D DIPKTAFRSRY HYEF VM FGL  APAVFMDLM     
Subjt:  ---------------------------------------------------------IKDEDIPKTAFRSRYDHYEFTVMFFGLAEAPAVFMDLM-----

Query:  ---------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVEIVSEWPCLITVNEARS---------
                              E+  H+R+VL T +    Y   S  EFW ++VS LGH VV   G+ VD   +E V+ WP   TVNE RS         
Subjt:  ---------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVEIVSEWPCLITVNEARS---------

Query:  -------------SSVVYSDASKKSLGYVLTQD---------------EKVVVYVFHQSQEPKVFLHPKELNVRHRR-LSLVKDYVVEILYHSRRSCVPR
                     S V+YSD SKK LG VL Q                EK+ ++  H+S   K F   KELN+R RR L LVKDY  EILYH  ++ V  
Subjt:  -------------SSVVYSDASKKSLGYVLTQD---------------EKVVVYVFHQSQEPKVFLHPKELNVRHRR-LSLVKDYVVEILYHSRRSCVPR

Query:  SDELKHKVLTEACNLFIDAPRL
        +D L  KV   A  +   AP L
Subjt:  SDELKHKVLTEACNLFIDAPRL

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7T407 Reverse transcriptase; e value: 1.3e-44; % identity: 38.48
Query:  HSQANQNKLSSSEEARYPQDSKKYDPQSSLIRQSVDSSAVISQNLPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW-
        H +     L+S  + R    S   +P   ++R   D        LP  REV F+IELEP T PI +A YRMA A+L+ELKVQ+QELL K  I L VSPW 
Subjt:  HSQANQNKLSSSEEARYPQDSKKYDPQSSLIRQSVDSSAVISQNLPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW-

Query:  --------------------------IKDEDIPKTAFRSRYDHYEFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRV
                                  IKD D+PKTAFRSRY HYEF VM FGL  APAVFMDLM                           E+  H+R V
Subjt:  --------------------------IKDEDIPKTAFRSRYDHYEFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRV

Query:  LST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVEIVSEWPCLITVNEARS----SSVVYSDASKKSLGYVLTQDEKVVVYVFHQ------
        L T ++   Y   S  EFW +QVS LGH VV + G+ VD   +E V+ W    TV+E RS    +     DASKK LGYVL Q  KVV YV HQ      
Subjt:  LST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVEIVSEWPCLITVNEARS----SSVVYSDASKKSLGYVLTQDEKVVVYVFHQ------

Query:  ---------------------------------SQEPKVFLHPKELNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAP
                                          +  K F   KELN+R RR L LVKDY  EILYH  ++ V  +D L  KV   A  +   AP
Subjt:  ---------------------------------SQEPKVFLHPKELNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAP

A0A5A7TYA3 Pol protein; e value: 6.0e-42; % identity: 37.78
Query:  LPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW--------------------------------IKDEDIPKTAFRS
        LP  RE+ F+IELE  T PI +A  +MA  +L+ELKVQ+QELL K  I   VSPW                                I+D +IPKTAFRS
Subjt:  LPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW--------------------------------IKDEDIPKTAFRS

Query:  RYDHYEFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVD
        RY HYEF VM FGL  AP VFMDLM                           E+  H+ +VL T +    Y   S  EFW ++VS LGH VV   G+ VD
Subjt:  RYDHYEFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVD

Query:  STMVEIVSEWPCLITVNEARS-------------------------------------SSVVYSDASKKSLGYVLTQDEKVVVYVFHQSQEPKVFLHPKE
           +E+V+ WP   TV++ RS                                     S V+YSDASKK +GYVL Q  K+ ++  H+S   K F   KE
Subjt:  STMVEIVSEWPCLITVNEARS-------------------------------------SSVVYSDASKKSLGYVLTQDEKVVVYVFHQSQEPKVFLHPKE

Query:  LNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAPRL
        LN+R RR L LVKDY  EILYH   + V  +D L  K+   A  +   AP L
Subjt:  LNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAPRL

A0A5A7UL17 Reverse transcriptase; e value: 6.0e-42; % identity: 37.9
Query:  LPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW---------------------------IKDEDIPKTAFRSRYDHY
        LP  REV F+IELEP T PI +A YRMA A+L+ELKVQ+QELL K  I   VSPW                           IKD D+PKT FRSRY HY
Subjt:  LPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW---------------------------IKDEDIPKTAFRSRYDHY

Query:  EFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVE
        EF VM FGL  APAVFMDLM                           E+  H+R VL T  +   Y   S  EFW +QVS LGH VV + G+ VD   +E
Subjt:  EFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVE

Query:  IVSEWPCLITVNEARS-------------------------SSVVYSDASKKSLGYVLTQDEKVVVY---------------------------------
         V+ W    TV+E+++                         S V+YSDASKK LG VL Q  KVV Y                                 
Subjt:  IVSEWPCLITVNEARS-------------------------SSVVYSDASKKSLGYVLTQDEKVVVY---------------------------------

Query:  ------VFHQSQEPKVFLHPKELNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAP
              +F   +  K F   KELN+R RR L LVKDY  EILYH  ++ V  +D L  KV   A  +   AP
Subjt:  ------VFHQSQEPKVFLHPKELNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAP

A0A5D3C310 Pol protein; e value: 6.0e-42; % identity: 37.78
Query:  LPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW--------------------------------IKDEDIPKTAFRS
        LP  RE+ F+IELE  T PI +A  +MA  +L+ELKVQ+QELL K  I   VSPW                                I+D +IPKTAFRS
Subjt:  LPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW--------------------------------IKDEDIPKTAFRS

Query:  RYDHYEFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVD
        RY HYEF VM FGL  AP VFMDLM                           E+  H+ +VL T +    Y   S  EFW ++VS LGH VV   G+ VD
Subjt:  RYDHYEFTVMFFGLAEAPAVFMDLM--------------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVD

Query:  STMVEIVSEWPCLITVNEARS-------------------------------------SSVVYSDASKKSLGYVLTQDEKVVVYVFHQSQEPKVFLHPKE
           +E+V+ WP   TV++ RS                                     S V+YSDASKK +GYVL Q  K+ ++  H+S   K F   KE
Subjt:  STMVEIVSEWPCLITVNEARS-------------------------------------SSVVYSDASKKSLGYVLTQDEKVVVYVFHQSQEPKVFLHPKE

Query:  LNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAPRL
        LN+R RR L LVKDY  EILYH   + V  +D L  K+   A  +   AP L
Subjt:  LNVRHRR-LSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAPRL

A0A5D3CCS5 Pol protein; e value: 1.3e-41; % identity: 35.78
Query:  LSSSEEARYPQDSKKYDPQSSLIRQSVDSSAVISQNLPLV---REVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW------
        L+S  + R P+ S   +P   ++R+  D   V S  LP +   RE+ F+IELEPDT PI +A YR A A+L+E+KVQ+QELL K  I    SPW      
Subjt:  LSSSEEARYPQDSKKYDPQSSLIRQSVDSSAVISQNLPLV---REVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPW------

Query:  ---------------------------------------------------------IKDEDIPKTAFRSRYDHYEFTVMFFGLAEAPAVFMDLM-----
                                                                 I+D DIPKTAFRSRY HYEF VM FGL  APAVFMDLM     
Subjt:  ---------------------------------------------------------IKDEDIPKTAFRSRYDHYEFTVMFFGLAEAPAVFMDLM-----

Query:  ---------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVEIVSEWPCLITVNEARS---------
                              E+  H+R+VL T +    Y   S  EFW ++VS LGH VV   G+ VD   +E V+ WP   TVNE RS         
Subjt:  ---------------------TENTRHVRRVLST-QEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVEIVSEWPCLITVNEARS---------

Query:  -------------SSVVYSDASKKSLGYVLTQD---------------EKVVVYVFHQSQEPKVFLHPKELNVRHRR-LSLVKDYVVEILYHSRRSCVPR
                     S V+YSD SKK LG VL Q                EK+ ++  H+S   K F   KELN+R RR L LVKDY  EILYH  ++ V  
Subjt:  -------------SSVVYSDASKKSLGYVLTQD---------------EKVVVYVFHQSQEPKVFLHPKELNVRHRR-LSLVKDYVVEILYHSRRSCVPR

Query:  SDELKHKVLTEACNLFIDAPRL
        +D L  KV   A  +   AP L
Subjt:  SDELKHKVLTEACNLFIDAPRL

SwissProt top hits (columns: e value, % identity, alignment)
Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus; e value: 1.3e-04; % identity: 28.74
Query:  PDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPWIKDEDIPKTAFRSRYDHYEFTVMFFGLAEAPAVFM----DLMTENT-------------
        PDT PIP       LA L   K      L      +     +K+ DIPKTAF +    YEF  + FGL  APA+F     D++ E+              
Subjt:  PDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPWIKDEDIPKTAFRSRYDHYEFTVMFFGLAEAPAVFM----DLMTENT-------------

Query:  ---------RHVRRVLSTQEETS-YLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVEIVSEWPCLITVNEAR
                 +++R VL++  + +  +    S F   QV  LG+ +V  +GI+ D   V  +SE P   +V E +
Subjt:  ---------RHVRRVLSTQEETS-YLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVEIVSEWPCLITVNEAR
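
Each alignment block above lists a Query line, a match line, and a Subjt line. As a rough sketch, and assuming the usual BLAST-style convention that a letter on the match line marks an identical residue and "+" marks a conservative substitution, the counts behind a percent-identity figure could be tallied as below. The helper name match_line_stats is hypothetical, the example uses only the first ten columns of the Q8I7P9 alignment, and the exact denominator the database uses for its % identity column is not stated on this page.

```python
def match_line_stats(match_line: str, aligned_length: int) -> dict:
    """Count identities and positives on a BLAST-style match line.

    Assumption: letters mark identical residues, '+' marks conservative
    substitutions, and spaces mark mismatches or gaps.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(ch.isalpha() for ch in match_line)
    positives = identical + match_line.count("+")
    return {
        "identical": identical,
        "positives": positives,
        "percent_identity": round(100.0 * identical / aligned_length, 2),
    }


# First ten alignment columns of the Q8I7P9 hit (a fragment, so the result
# is not expected to equal the full-alignment value reported for that hit).
query_fragment = "PDTTPIPKAL"
match_fragment = "PDT PIP   "

stats = match_line_stats(match_fragment, len(query_fragment))
print(stats)
```

The match line is used here rather than a direct Query/Subjt comparison because it records identities explicitly for every column.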

Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATCAAGATCTTCCTCCACCTTCCGACAAAGAATGCAACAAAAAGAGCCAACCACGGGGGCCTTCACTCTCGAAAGCCTGTCAAAAGTATTAACACGGCCCAAGATGA
CTTGTCAAATGAAGAACAAAACTTTCTTTGGAACTTTAGGGCCTGTTTGTCGCTGAAGCCTCTAGGACCATTCACCTTCTTTTCGTTTCAGCCGGCGACGTCTTCCTTCC
CTCCGGCAACCTACATTCACAGCGGCGGCAGCAGTAGAGGTTCGGCGGTGAGCAACATTTTTCCGACGATTGTTCTATTAGACGCGGACAGCAGCATTCAGGGGCATTCT
TCTGCATTTCCCGATGTTTTTAGTTTGGATGAATACCCATTGGGTTTGGCTAATGTTTTCAAGGTACCCAAAGCGTTTGGAGGTTTAACCGCCGATTTGGAGCGAAAGTG
GAACCCGAACAGCAAGTTTAAAGTTCTGTTTTGCTTGATTAAAACCTTGGAGTTAAGGAATTTGTCGAAGGTGGTACTTGATGACATCAGGCGAGTGGCGAGAGCAGTGG
CTTGGCTAGGCTATGGGATGCAGCTGCAAATCACTTGGTATCAGAGCCCAGTTTTTAGGTTCTGTAGACTCGCTTACATCGTAAGCATCAGATTTTCCCATAGCCAAGCA
CAAGCAAATATGCCTCGTCATAGCCAGGCAAATCAAAATAAGTTAAGCTCCTCTGAGGAAGCAAGATACCCCCAGGACTCTAAGAAATATGATCCTCAGTCATCATTGAT
CCGACAGTCAGTAGACTCCAGTGCAGTGATCTCCCAGAATTTGCCTTTGGTTCGAGAAGTAGGCTTCAGCATTGAGCTCGAGCCAGACACAACCCCTATTCCTAAGGCGC
TCTACAGAATGGCTCTAGCAAAGTTAAGGGAGCTCAAGGTACAAATACAAGAACTCTTGGGCAAAAGTTTGATATGCCTCAAGGTTTCACCTTGGATAAAAGATGAAGAT
ATTCCCAAGACAGCATTTAGATCCAGATACGACCATTATGAGTTCACAGTGATGTTTTTTGGGCTAGCAGAGGCTCCTGCAGTGTTCATGGATTTGATGACCGAGAACAC
GAGACACGTCAGGAGAGTTCTGTCAACTCAAGAGGAAACTAGTTATTTGCCAAGTTCTCTAAGTGAGTTTTGGTTTCAGCAGGTGTCCTTGCTAGGGCACGACGTGGTAC
CGAGAAATGGAATTCGTGTTGATTCAACAATGGTTGAAATCGTGTCTGAGTGGCCTTGCTTGATTACTGTCAATGAAGCACGAAGTTCTTCCGTTGTTTACAGTGATGCT
TCCAAGAAAAGTTTAGGTTATGTCCTTACGCAGGACGAGAAGGTTGTTGTCTATGTTTTCCATCAATCACAAGAACCTAAAGTATTTCTTCACCCGAAGGAACTAAATGT
GAGGCATAGACGGCTCAGCTTGGTTAAGGATTATGTTGTAGAGATACTTTACCACTCAAGGCGATCGTGCGTACCTAGGAGTGATGAGCTTAAGCACAAGGTTCTGACTG
AAGCTTGCAATCTTTTTATCGATGCACCAAGGTTGTGTGAGGTTCCAGTAAAGATTGCTTCAGGGAGAGACCTTTGTGTTACCTTTGACTTTTGGACTAGTTTCAGAGAG
CTTTGGGCTCGCAATTGA
mRNA sequence
ATGATCAAGATCTTCCTCCACCTTCCGACAAAGAATGCAACAAAAAGAGCCAACCACGGGGGCCTTCACTCTCGAAAGCCTGTCAAAAGTATTAACACGGCCCAAGATGA
CTTGTCAAATGAAGAACAAAACTTTCTTTGGAACTTTAGGGCCTGTTTGTCGCTGAAGCCTCTAGGACCATTCACCTTCTTTTCGTTTCAGCCGGCGACGTCTTCCTTCC
CTCCGGCAACCTACATTCACAGCGGCGGCAGCAGTAGAGGTTCGGCGGTGAGCAACATTTTTCCGACGATTGTTCTATTAGACGCGGACAGCAGCATTCAGGGGCATTCT
TCTGCATTTCCCGATGTTTTTAGTTTGGATGAATACCCATTGGGTTTGGCTAATGTTTTCAAGGTACCCAAAGCGTTTGGAGGTTTAACCGCCGATTTGGAGCGAAAGTG
GAACCCGAACAGCAAGTTTAAAGTTCTGTTTTGCTTGATTAAAACCTTGGAGTTAAGGAATTTGTCGAAGGTGGTACTTGATGACATCAGGCGAGTGGCGAGAGCAGTGG
CTTGGCTAGGCTATGGGATGCAGCTGCAAATCACTTGGTATCAGAGCCCAGTTTTTAGGTTCTGTAGACTCGCTTACATCGTAAGCATCAGATTTTCCCATAGCCAAGCA
CAAGCAAATATGCCTCGTCATAGCCAGGCAAATCAAAATAAGTTAAGCTCCTCTGAGGAAGCAAGATACCCCCAGGACTCTAAGAAATATGATCCTCAGTCATCATTGAT
CCGACAGTCAGTAGACTCCAGTGCAGTGATCTCCCAGAATTTGCCTTTGGTTCGAGAAGTAGGCTTCAGCATTGAGCTCGAGCCAGACACAACCCCTATTCCTAAGGCGC
TCTACAGAATGGCTCTAGCAAAGTTAAGGGAGCTCAAGGTACAAATACAAGAACTCTTGGGCAAAAGTTTGATATGCCTCAAGGTTTCACCTTGGATAAAAGATGAAGAT
ATTCCCAAGACAGCATTTAGATCCAGATACGACCATTATGAGTTCACAGTGATGTTTTTTGGGCTAGCAGAGGCTCCTGCAGTGTTCATGGATTTGATGACCGAGAACAC
GAGACACGTCAGGAGAGTTCTGTCAACTCAAGAGGAAACTAGTTATTTGCCAAGTTCTCTAAGTGAGTTTTGGTTTCAGCAGGTGTCCTTGCTAGGGCACGACGTGGTAC
CGAGAAATGGAATTCGTGTTGATTCAACAATGGTTGAAATCGTGTCTGAGTGGCCTTGCTTGATTACTGTCAATGAAGCACGAAGTTCTTCCGTTGTTTACAGTGATGCT
TCCAAGAAAAGTTTAGGTTATGTCCTTACGCAGGACGAGAAGGTTGTTGTCTATGTTTTCCATCAATCACAAGAACCTAAAGTATTTCTTCACCCGAAGGAACTAAATGT
GAGGCATAGACGGCTCAGCTTGGTTAAGGATTATGTTGTAGAGATACTTTACCACTCAAGGCGATCGTGCGTACCTAGGAGTGATGAGCTTAAGCACAAGGTTCTGACTG
AAGCTTGCAATCTTTTTATCGATGCACCAAGGTTGTGTGAGGTTCCAGTAAAGATTGCTTCAGGGAGAGACCTTTGTGTTACCTTTGACTTTTGGACTAGTTTCAGAGAG
CTTTGGGCTCGCAATTGA
Protein sequence
MIKIFLHLPTKNATKRANHGGLHSRKPVKSINTAQDDLSNEEQNFLWNFRACLSLKPLGPFTFFSFQPATSSFPPATYIHSGGSSRGSAVSNIFPTIVLLDADSSIQGHS
SAFPDVFSLDEYPLGLANVFKVPKAFGGLTADLERKWNPNSKFKVLFCLIKTLELRNLSKVVLDDIRRVARAVAWLGYGMQLQITWYQSPVFRFCRLAYIVSIRFSHSQA
QANMPRHSQANQNKLSSSEEARYPQDSKKYDPQSSLIRQSVDSSAVISQNLPLVREVGFSIELEPDTTPIPKALYRMALAKLRELKVQIQELLGKSLICLKVSPWIKDED
IPKTAFRSRYDHYEFTVMFFGLAEAPAVFMDLMTENTRHVRRVLSTQEETSYLPSSLSEFWFQQVSLLGHDVVPRNGIRVDSTMVEIVSEWPCLITVNEARSSSVVYSDA
SKKSLGYVLTQDEKVVVYVFHQSQEPKVFLHPKELNVRHRRLSLVKDYVVEILYHSRRSCVPRSDELKHKVLTEACNLFIDAPRLCEVPVKIASGRDLCVTFDFWTSFRE
LWARN
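
One way to check that the CDS and protein sequences listed above agree is to translate the CDS and compare the result with the protein. The sketch below is a minimal example that assumes the standard nuclear genetic code applies; the function name translate is illustrative, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code is the right one for this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE.get(codon)
        if aa is None:
            raise ValueError(f"unrecognised codon: {codon}")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGATCAAGATCTTCCTCCACCTTCCGACA"  # first 30 nt of the CDS
cds_end = "CTTTGGGCTCGCAATTGA"                # last 18 nt of the CDS

print(translate(cds_start))  # MIKIFLHLPT
print(translate(cds_end))    # LWARN
```

Both fragments match the corresponding ends of the protein sequence, which begins MIKIFLHLPT and ends LWARN.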