CuGenDBv2

Lag0024953 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024953
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr10:7236265..7237158
RNA-Seq Expression: Lag0024953
Synteny: Lag0024953
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
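The genome location field above uses a `chromosome:start..end` layout. A minimal sketch of reading it programmatically follows. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("chr10:7236265..7237158")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption, the gene spans 894 bases on chr10.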


Homology
GenBank top hits | e value | %identity
CAB4277969.1 unnamed protein product [Prunus armeniaca] | 5.2e-22 | 30.29
Query:  TNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNLVA--ETTDHNLVGCMKAREVWD-LTFAHDLLKTKLNHSFADKWF-AMN
        ++S  K W +LW LKVP K+ H LWR   D++P+   L+RR I    +C    A  ETT H LVGC    +VW  L F  D L   L  +    W  A+ 
Subjt:  TNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNLVA--ETTDHNLVGCMKAREVWD-LTFAHDLLKTKLNHSFADKWF-AMN

Query:  SFISIEDLQKVAITCWSNWADRN-KLVHNKPIPLPLIRSQLIKKYDSEYLQMNGPLSSNSSDPYTGIE---------------------------IVSRN
        S I  +     A T W  W +RN  L  ++P P  ++  Q  K YD+E+ + +     + S P   I+                            + R+
Subjt:  SFISIEDLQKVAITCWSNWADRN-KLVHNKPIPLPLIRSQLIKKYDSEYLQMNGPLSSNSSDPYTGIE---------------------------IVSRN

Query:  SIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKR
        S GKL+GA ++           EL A+  G+ FA  +    +++ESD LQA+S++N + E     G L++ ++R
Subjt:  SIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKR

CAB4277970.1 unnamed protein product [Prunus armeniaca] | 5.2e-22 | 30.29
Query:  TNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNLVA--ETTDHNLVGCMKAREVWD-LTFAHDLLKTKLNHSFADKWF-AMN
        ++S  K W +LW LKVP K+ H LWR   D++P+   L+RR I    +C    A  ETT H LVGC    +VW  L F  D L   L  +    W  A+ 
Subjt:  TNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNLVA--ETTDHNLVGCMKAREVWD-LTFAHDLLKTKLNHSFADKWF-AMN

Query:  SFISIEDLQKVAITCWSNWADRN-KLVHNKPIPLPLIRSQLIKKYDSEYLQMNGPLSSNSSDPYTGIE---------------------------IVSRN
        S I  +     A T W  W +RN  L  ++P P  ++  Q  K YD+E+ + +     + S P   I+                            + R+
Subjt:  SFISIEDLQKVAITCWSNWADRN-KLVHNKPIPLPLIRSQLIKKYDSEYLQMNGPLSSNSSDPYTGIE---------------------------IVSRN

Query:  SIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKR
        S GKL+GA ++           EL A+  G+ FA  +    +++ESD LQA+S++N + E     G L++ ++R
Subjt:  SIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKR

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | 1.1e-24 | 28.07
Query:  SSTNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCP--NLVAETTDHNLVGCMKAREVWDLTFAH-DLLKTKLNHSFADKWFAM
        +STN     W+++W L VP K+K F+WR+ ++ IPT  NL  RGI     C       E+  H    C +AR++W   F     L  + N SF + W ++
Subjt:  SSTNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCP--NLVAETTDHNLVGCMKAREVWDLTFAH-DLLKTKLNHSFADKWFAM

Query:  NSFISIEDLQKVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKKYDSEYLQMN----GPLSSNSSDPY------------------------TGIEIVSR
           +  +DL   AIT W  W DRN L+H K +     + + +  +   + Q       P + ++  P                         T    + R
Subjt:  NSFISIEDLQKVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKKYDSEYLQMN----GPLSSNSSDPY------------------------TGIEIVSR

Query:  NSIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFSGILF
        +S   L+ A+S+ +     P +AE+  I+EGLKFA       +++ESD L AI LI  +   R +    + +I+  T  F+ I F
Subjt:  NSIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFSGILF

XP_030926547.1 uncharacterized protein LOC115953156 [Quercus lobata] | 1.2e-21 | 28.57
Query:  VWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCP--NLVAETTDHNLVGCMKAREVWDLTFAHDLLKTKLNHSFADKWFAMNSFISIEDL
        +W  LW L +P K+K F WRA  + +PT   +YRRGI  ++ CP     AE+ DH L+ C  +  VWD    + L      +SF D    + S  +++DL
Subjt:  VWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCP--NLVAETTDHNLVGCMKAREVWDLTFAHDLLKTKLNHSFADKWFAMNSFISIEDL

Query:  QKVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKKYDSEY-------LQMNGPLSSNSSDPYTG------------------IEIVSRNSIGKLIGASSV
        +    T W+ W++RN ++H      PL      +    E+         +  P SS+ S P  G                  + +V R+S G+++ A  +
Subjt:  QKVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKKYDSEY-------LQMNGPLSSNSSDPYTG------------------IEIVSRNSIGKLIGASSV

Query:  HIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFSGILF
         +   F   ++E+ A+ +G+ FA++L   RI +ESD L  I  +N D    +  G LI+ I +    F    F
Subjt:  HIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFSGILF

XP_042973114.1 uncharacterized protein LOC122304919 [Carya illinoinensis] | 6.2e-23 | 27.72
Query:  SSTNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCP--NLVAETTDHNLVGCMKAREVWDLTFAHDLLKTKLN-HSFADKWFAM
        S  + ++ VW  LW L     V+ F+W+A N+ +PT  NL RR I T    P   L  ETT+H L  C  AR+VW       + K  LN  SF + W  +
Subjt:  SSTNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCP--NLVAETTDHNLVGCMKAREVWDLTFAHDLLKTKLN-HSFADKWFAM

Query:  NSFISIEDLQKVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKK-------YDSE-------------------------YLQMNGPLSSNSSDPYTGIE
        +  +  E+L +VA+ C S W  RN  +H K    P   +Q++ K       Y S                          Y ++N  ++ N++    G+ 
Subjt:  NSFISIEDLQKVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKK-------YDSE-------------------------YLQMNGPLSSNSSDPYTGIE

Query:  IVSRNSIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFS
         + R+ +G+ IG      ++N  P   E  A++  + F + +G  R+ +E D +Q I L++      +  G ++ED KR+   F+
Subjt:  IVSRNSIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFS

TrEMBL top hits | e value | %identity
A0A6J1DX30 uncharacterized protein LOC111024874 | 5.4e-25 | 28.07
Query:  SSTNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCP--NLVAETTDHNLVGCMKAREVWDLTFAH-DLLKTKLNHSFADKWFAM
        +STN     W+++W L VP K+K F+WR+ ++ IPT  NL  RGI     C       E+  H    C +AR++W   F     L  + N SF + W ++
Subjt:  SSTNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCP--NLVAETTDHNLVGCMKAREVWDLTFAH-DLLKTKLNHSFADKWFAM

Query:  NSFISIEDLQKVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKKYDSEYLQMN----GPLSSNSSDPY------------------------TGIEIVSR
           +  +DL   AIT W  W DRN L+H K +     + + +  +   + Q       P + ++  P                         T    + R
Subjt:  NSFISIEDLQKVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKKYDSEYLQMN----GPLSSNSSDPY------------------------TGIEIVSR

Query:  NSIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFSGILF
        +S   L+ A+S+ +     P +AE+  I+EGLKFA       +++ESD L AI LI  +   R +    + +I+  T  F+ I F
Subjt:  NSIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFSGILF

A0A6J5UN24 Uncharacterized protein | 2.5e-22 | 30.29
Query:  TNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNLVA--ETTDHNLVGCMKAREVWD-LTFAHDLLKTKLNHSFADKWF-AMN
        ++S  K W +LW LKVP K+ H LWR   D++P+   L+RR I    +C    A  ETT H LVGC    +VW  L F  D L   L  +    W  A+ 
Subjt:  TNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNLVA--ETTDHNLVGCMKAREVWD-LTFAHDLLKTKLNHSFADKWF-AMN

Query:  SFISIEDLQKVAITCWSNWADRN-KLVHNKPIPLPLIRSQLIKKYDSEYLQMNGPLSSNSSDPYTGIE---------------------------IVSRN
        S I  +     A T W  W +RN  L  ++P P  ++  Q  K YD+E+ + +     + S P   I+                            + R+
Subjt:  SFISIEDLQKVAITCWSNWADRN-KLVHNKPIPLPLIRSQLIKKYDSEYLQMNGPLSSNSSDPYTGIE---------------------------IVSRN

Query:  SIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKR
        S GKL+GA ++           EL A+  G+ FA  +    +++ESD LQA+S++N + E     G L++ ++R
Subjt:  SIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKR

A0A6J5UN51 Reverse transcriptase domain-containing protein | 2.5e-22 | 30.29
Query:  TNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNLVA--ETTDHNLVGCMKAREVWD-LTFAHDLLKTKLNHSFADKWF-AMN
        ++S  K W +LW LKVP K+ H LWR   D++P+   L+RR I    +C    A  ETT H LVGC    +VW  L F  D L   L  +    W  A+ 
Subjt:  TNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNLVA--ETTDHNLVGCMKAREVWD-LTFAHDLLKTKLNHSFADKWF-AMN

Query:  SFISIEDLQKVAITCWSNWADRN-KLVHNKPIPLPLIRSQLIKKYDSEYLQMNGPLSSNSSDPYTGIE---------------------------IVSRN
        S I  +     A T W  W +RN  L  ++P P  ++  Q  K YD+E+ + +     + S P   I+                            + R+
Subjt:  SFISIEDLQKVAITCWSNWADRN-KLVHNKPIPLPLIRSQLIKKYDSEYLQMNGPLSSNSSDPYTGIE---------------------------IVSRN

Query:  SIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKR
        S GKL+GA ++           EL A+  G+ FA  +    +++ESD LQA+S++N + E     G L++ ++R
Subjt:  SIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKR

A0A803PLS0 Uncharacterized protein | 9.6e-22 | 28.06
Query:  WSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNL---VAETTDHNLVGCMKAREVWDLTFAHDLLKTKLNHSFADKWFAMNSFISIEDL
        W  LW LK+P KVKHF W+  ++ +P N NL +RGI + V+C      V E+  H L  C  ++  W ++  +D LK  +          + S    E L
Subjt:  WSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNL---VAETTDHNLVGCMKAREVWDLTFAHDLLKTKLNHSFADKWFAMNSFISIEDL

Query:  QKVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKKYDSEYLQMNGPLSSNSSDPYT----------------------------GIEIVSRNSIGKLIGA
        +   +  W+ W  RN +VH    P P    ++I+   S   +  G   S SS   T                            G+  V R+S G ++ A
Subjt:  QKVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKKYDSEYLQMNGPLSSNSSDPYT----------------------------GIEIVSRNSIGKLIGA

Query:  SSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFS--GILF
        ++  +    PP   ELMAI  G++   Q    R  +E+DCLQA+ LI         +  L++ I+   + +S  GI F
Subjt:  SSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFS--GILF

A0A803QQT2 Uncharacterized protein | 3.3e-22 | 28.67
Query:  RSSTNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNL---VAETTDHNLVGCMKAREVWDLTFAHDLLKTKLNHSFADKWFA
        +S+ +S+ + W  LW LK+P KVKHF+W+  ++ +P NVNL +RGI +SV+C      V E+  H L  C  ++  W ++  +D LK  L          
Subjt:  RSSTNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNL---VAETTDHNLVGCMKAREVWDLTFAHDLLKTKLNHSFADKWFA

Query:  MNSFISIEDLQKVAITCWSNWADRNKLVHN--KPIPLPLI-----------------RSQLIKKYDSEY-------LQMNGPLSSNSSDPYTGIEIVSRN
        + +    E L+   +  W+ W  RN +VH    P P  +I                 RSQ   + DS +       + +N           +G+  V R+
Subjt:  MNSFISIEDLQKVAITCWSNWADRNKLVHN--KPIPLPLI-----------------RSQLIKKYDSEY-------LQMNGPLSSNSSDPYTGIEIVSRN

Query:  SIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTV--FSGILF
        + G ++ A++  +    PP   ELMAI +G++   Q    R  +E+DCLQA+ LI        +I  L+  I+   +   F GI F
Subjt:  SIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTV--FSGILF

SwissProt top hits | e value | %identity
No hits found
Arabidopsis top hits | e value | %identity
AT2G02650.1 Ribonuclease H-like superfamily protein | 3.7e-10 | 22.18
Query:  LWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPN--LVAETTDHNLVGCMKAREVW-------------------DLTFAHDLLKTKLNHSFA
        +W L V  K+KHFLWR +   + TN  L  R I    +C    +  ET  H +  C   + VW                   +L     L KT+  +S  
Subjt:  LWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPN--LVAETTDHNLVGCMKAREVW-------------------DLTFAHDLLKTKLNHSFA

Query:  DK----------WFAMNSFISIEDLQ------KVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKKYDSE-------YLQMNGPLSSNSSDPYTGIEIVS
        D+          W + N F+  +  Q      +  I   + W + N+   N  + +     Q  ++  S+       +++ N         PYT      
Subjt:  DK----------WFAMNSFISIEDLQ------KVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKKYDSE-------YLQMNGPLSSNSSDPYTGIEIVS

Query:  RNSIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIK
        R   G ++   +  +  +     AE +  +  L+     G   +  ESD    ++LIN + E  + +GTLI DI+
Subjt:  RNSIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIK

AT3G09510.1 Ribonuclease H-like superfamily protein | 1.2e-13 | 24.47
Query:  SNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCP--NLVAETTDHNLVGCMKAREVWDLTFAHDLLKTKLNHSFADKWFAMNSFI---SIED
        + +W+L +  K+KHFLWRAL+  + T   L  RG++    CP  +   E+ +H L  C  A   W L+ +  +    +++ F +    + +F+   ++ D
Subjt:  SNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCP--NLVAETTDHNLVGCMKAREVWDLTFAHDLLKTKLNHSFADKWFAMNSFI---SIED

Query:  LQKV--AITCWSNWADRNKLVHNKPIPLP--------------LIRSQLIKKYDSEYLQM--------NGPLSSNSSDPYTGIEI---------VSRNSI
          K+      W  W  RN +V NK    P              L  +Q  KK  S   Q+        N P +    +   G ++         + RN  
Subjt:  LQKV--AITCWSNWADRNKLVHNKPIPLP--------------LIRSQLIKKYDSEYLQM--------NGPLSSNSSDPYTGIEI---------VSRNSI

Query:  GKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFSGILF
        G  I   S+ +     P  AE  A++  L+     G+ ++ +E DC   I+LIN      + +   +EDI      F+ I F
Subjt:  GKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRIKLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFSGILF

AT3G25270.1 Ribonuclease H-like superfamily protein | 5.4e-09 | 30.65
Query:  AKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNLVA--ETTDHNLVGCMKAREVWDLT-FAHDLLKTKLNHSFADKWFAMNSFISI
        A++ + +W LK   K+KHFLW+ L+  + T  NL RR I+    C       ET+ H    C  A++VW  +   H  L+T            ++S ++ 
Subjt:  AKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNLVA--ETTDHNLVGCMKAREVWDLT-FAHDLLKTKLNHSFADKWFAMNSFISI

Query:  EDLQ--KVAI-TCWSNWADRNKLV
           Q   +AI   W  W  RN+LV
Subjt:  EDLQ--KVAI-TCWSNWADRNKLV

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.1e-04 | 37.14
Query:  MAKVW-SNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNL-VAETTDHNLVGCMKA-REV
        M   W  ++WSLK+  K+K  +W+ALN+ +P    L  R I     C      ET  H L  C  A REV
Subjt:  MAKVW-SNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNL-VAETTDHNLVGCMKA-REV
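The %identity values in the hit tables can be checked against the match line printed between each Query and Subjt row. The sketch below assumes the usual BLAST-style convention: a letter on the match line marks an identical residue, `+` marks a conservative substitution, and the denominator is the alignment length including gaps. The function name `percent_identity` is hypothetical, and the page does not document how it computes the column, so treat this as a plausibility check only.

```python
def percent_identity(query: str, match_line: str) -> float:
    """Identical positions (letters on the match line) over alignment length."""
    identical = sum(ch.isalpha() for ch in match_line)
    return 100.0 * identical / len(query)


# Query row and match line copied from the AT3G26855.1 alignment.
query = "MAKVW-SNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNL-VAETTDHNLVGCMKA-REV"
match = "M   W  ++WSLK+  K+K  +W+ALN+ +P    L  R I     C      ET  H L  C  A REV"

print(f"{percent_identity(query, match):.2f}")
```

For this hit the match line carries 26 letters over 70 alignment columns, which reproduces the listed 37.14.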


Sequences
CDS sequence
ATGAAGAACAGAATCAAAGCAAGGTCAAGTACTAACTCTATGGCGAAAGTCTGGTCTAACCTTTGGAGTTTGAAGGTTCCTGCCAAAGTGAAACACTTTTTATGGAGAGC
TCTTAATGATGTTATCCCCACTAATGTGAATTTATACCGAAGAGGTATTCAAACCAGTGTTATGTGCCCTAACTTGGTAGCTGAAACTACTGATCACAACTTAGTTGGAT
GTATGAAGGCACGGGAAGTTTGGGATCTTACCTTTGCCCATGATCTTTTAAAAACCAAGCTCAATCACAGTTTTGCTGACAAATGGTTTGCGATGAATTCTTTCATTTCT
ATCGAGGATCTCCAGAAGGTTGCCATTACCTGTTGGTCAAATTGGGCAGACAGAAATAAGTTAGTCCATAATAAGCCAATTCCCCTTCCTTTGATTCGAAGCCAATTGAT
TAAAAAATACGATTCAGAATACCTTCAAATGAATGGGCCTCTATCATCCAATTCGAGCGATCCCTATACTGGTATAGAGATTGTTAGCAGAAATTCTATCGGCAAACTTA
TTGGCGCCTCATCAGTTCATATAGATATGAATTTTCCTCCTCCCATGGCTGAGCTTATGGCAATTGTGGAAGGGTTGAAATTTGCTGAGCAATTAGGGCATGATCGAATT
AAGCTGGAATCAGACTGTCTCCAAGCTATCAGTCTTATAAACCGGGATGCTGAAGTAAGAAATGAGATTGGGACTTTGATTGAGGATATCAAGAGACGAACAACTGTTTT
CTCGGGGATTCTTTTTTTCTCATGTGCCTAG
mRNA sequence
ATGAAGAACAGAATCAAAGCAAGGTCAAGTACTAACTCTATGGCGAAAGTCTGGTCTAACCTTTGGAGTTTGAAGGTTCCTGCCAAAGTGAAACACTTTTTATGGAGAGC
TCTTAATGATGTTATCCCCACTAATGTGAATTTATACCGAAGAGGTATTCAAACCAGTGTTATGTGCCCTAACTTGGTAGCTGAAACTACTGATCACAACTTAGTTGGAT
GTATGAAGGCACGGGAAGTTTGGGATCTTACCTTTGCCCATGATCTTTTAAAAACCAAGCTCAATCACAGTTTTGCTGACAAATGGTTTGCGATGAATTCTTTCATTTCT
ATCGAGGATCTCCAGAAGGTTGCCATTACCTGTTGGTCAAATTGGGCAGACAGAAATAAGTTAGTCCATAATAAGCCAATTCCCCTTCCTTTGATTCGAAGCCAATTGAT
TAAAAAATACGATTCAGAATACCTTCAAATGAATGGGCCTCTATCATCCAATTCGAGCGATCCCTATACTGGTATAGAGATTGTTAGCAGAAATTCTATCGGCAAACTTA
TTGGCGCCTCATCAGTTCATATAGATATGAATTTTCCTCCTCCCATGGCTGAGCTTATGGCAATTGTGGAAGGGTTGAAATTTGCTGAGCAATTAGGGCATGATCGAATT
AAGCTGGAATCAGACTGTCTCCAAGCTATCAGTCTTATAAACCGGGATGCTGAAGTAAGAAATGAGATTGGGACTTTGATTGAGGATATCAAGAGACGAACAACTGTTTT
CTCGGGGATTCTTTTTTTCTCATGTGCCTAG
Protein sequence
MKNRIKARSSTNSMAKVWSNLWSLKVPAKVKHFLWRALNDVIPTNVNLYRRGIQTSVMCPNLVAETTDHNLVGCMKAREVWDLTFAHDLLKTKLNHSFADKWFAMNSFIS
IEDLQKVAITCWSNWADRNKLVHNKPIPLPLIRSQLIKKYDSEYLQMNGPLSSNSSDPYTGIEIVSRNSIGKLIGASSVHIDMNFPPPMAELMAIVEGLKFAEQLGHDRI
KLESDCLQAISLINRDAEVRNEIGTLIEDIKRRTTVFSGILFFSCA
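The protein sequence above should be the translation of the CDS. A minimal sketch for checking that relationship follows, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling. The wrapped sequence lines would need to be joined into one string before translating the full CDS.

```python
import itertools

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 18 codons and last 4 codons of the CDS listed above.
cds_start = "ATGAAGAACAGAATCAAAGCAAGGTCAAGTACTAACTCTATGGCGAAAGTCTGG"
cds_end = "TCATGTGCCTAG"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MKNRIKARSSTNSMAKVW` and `SCA`, which agree with the first 18 and last 3 residues of the listed protein.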