; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001392 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001392
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:30684162..30685724
RNA-Seq ExpressionLag0001392
SyntenyLag0001392
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015383853.1 uncharacterized protein LOC107176237 [Citrus sinensis]3.5e-2125.07Show/hide
Query:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSII-ERNKTPKLTKDDLKG--CTVSSLIDETNNWIEGIVRA
        GG+  R++  FN+A++AK  WR+++ P  L+ R LK RY+K+  F++    +NPS  WRSI+  R    K  +  +     +V+ LI + N W +  VR 
Subjt:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSII-ERNKTPKLTKDDLKG--CTVSSLIDETNNWIEGIVRA

Query:  NFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSA---YHLALSYSSFMEAEDWSSLGFWNWLIENLKWKELEIVVILIWSIWTLRNKILNDPINQ
         F   ++ +I  IPL      DE      +   +  K+A   +  A   + F +A +   L     + + L   ++E++V + W+IW  +NK L +    
Subjt:  NFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSA---YHLALSYSSFMEAEDWSSLGFWNWLIENLKWKELEIVVILIWSIWTLRNKILNDPINQ

Query:  KP-----------EMHQIINQISRSIQ---------------------------EIEGKRGGLGWIVRDSRGSLICAGSKFVSIEWPIKVLEAMAPWEGL
        +P           E ++ + Q+   ++                             E ++ GLG ++RDS   +  A  K       +K  EA A   GL
Subjt:  KP-----------EMHQIINQISRSIQ---------------------------EIEGKRGGLGWIVRDSRGSLICAGSKFVSIEWPIKVLEAMAPWEGL

Query:  NFVLNFPKKPPSLEAESNCSDLINVLNEVEDDRTEVMSCVEAIRDVARNLGVLIFKHYPKSMNEAANFLTR
          V+        +  ES+C +++ ++N  E  RTEVM  +  I+ ++++       H P+S N  A+ L +
Subjt:  NFVLNFPKKPPSLEAESNCSDLINVLNEVEDDRTEVMSCVEAIRDVARNLGVLIFKHYPKSMNEAANFLTR

XP_015387170.1 uncharacterized protein LOC107177620 [Citrus sinensis]4.5e-2126.7Show/hide
Query:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSIIER--------------------NKTPKLTKDD------
        GG+  + + LFN AML K  WR++ +P+ L+ R  K RYF    F E  L +N S  WRSI+                         P L   D      
Subjt:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSIIER--------------------NKTPKLTKDD------

Query:  -----LKGCTVSSL-IDETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSYSSFMEAEDWSSLGFWNWLI----EN
             +   +V SL I     W   +V   FN  + N I  IPLG+    D   W PDSKG  +V+S Y L  S  S   +  W  L  W   I    +N
Subjt:  -----LKGCTVSSL-IDETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSYSSFMEAEDWSSLGFWNWLI----EN

Query:  LKWKELEIVVILIWSIWTLRNKILNDPINQKPEMHQIINQISRSIQEIEGKRG--GLGWIVRDSRGSLICAGSKFVSIEWPIKVLEAMAPWEGLNFVLNF
          W+ +        ++    + +L   + Q  +              I   RG   LG ++R  +G  I A S  +   +  +  EA+   E L+++  F
Subjt:  LKWKELEIVVILIWSIWTLRNKILNDPINQKPEMHQIINQISRSIQEIEGKRG--GLGWIVRDSRGSLICAGSKFVSIEWPIKVLEAMAPWEGLNFVLNF

Query:  PKKPPSLEAESNCSDLINVLNEVEDDRTEVMSCVEAIRDVARNLGVLIFKHYPKSMNEAANFLTREG---AGFHRNLESVGV
              LE +S    + N L++         S ++  R +AR+LG + F    +S+N AA+ + + G   +G  R  E++GV
Subjt:  PKKPPSLEAESNCSDLINVLNEVEDDRTEVMSCVEAIRDVARNLGVLIFKHYPKSMNEAANFLTREG---AGFHRNLESVGV

XP_015388020.1 uncharacterized protein LOC107177951 [Citrus sinensis]2.0e-2428.24Show/hide
Query:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSII-----------ERNKTPKLTKDDLKG------------
        G +  RD+  FNQA++AK SWRII+ P +LLAR +K +YF+   F+E  + +NPS  WRSI+            R +  KL K    G            
Subjt:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSII-----------ERNKTPKLTKDDLKG------------

Query:  -------CTVSSLIDETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSY-------SSFMEAEDWSSLGFWNWLIE
                TVS LIDE + W +  +R +F+  ++  I  IPL    + D+++W  D KG  SVKS Y LAL         SS      WSS+ +   + E
Subjt:  -------CTVSSLIDETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSY-------SSFMEAEDWSSLGFWNWLIE

Query:  NLKWKELEIVVILIWSIWTLRNKILNDPINQKPEMHQIINQISRSIQ------------------EIEGKRGGLGWIVRDSRGSLICAGSKFVSIEWPIK
          K ++ ++ V    +I     +I      Q P++  I +Q   S +                  +I+ +R GLG ++R+S G LI A  K      P K
Subjt:  NLKWKELEIVVILIWSIWTLRNKILNDPINQKPEMHQIINQISRSIQ------------------EIEGKRGGLGWIVRDSRGSLICAGSKFVSIEWPIK

Query:  VLEAM--APWEGLNFVLNFPKKPPSLE--AESNCSDLINVLNEVEDDRTEVMSCVEAIRDVARNLGVLIFKHYPKSMNEAANFLTR
         L+ M  A  E   F L   +    L    E++  ++ N+++  +  + E+   V  +++    L  +  +H P++ N  A+ L +
Subjt:  VLEAM--APWEGLNFVLNFPKKPPSLE--AESNCSDLINVLNEVEDDRTEVMSCVEAIRDVARNLGVLIFKHYPKSMNEAANFLTR

XP_023872147.1 uncharacterized protein LOC111984763 [Quercus suber]4.8e-2326.2Show/hide
Query:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSIIER------------------------------------
        GGM  R++  FN+AMLAK  WRI+++P + +AR LK RYF     +   L N+PS +WRSI                                       
Subjt:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSIIER------------------------------------

Query:  --NKTPKLTKDDLKGCTVSSLIDE-TNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSYSSFMEAEDWSS-------
          N TP+          VSSLID  T  W   +VRA F P  ++ I  IPL      D+I W  + +G  +VKSAYH+AL+  + M    WS        
Subjt:  --NKTPKLTKDDLKGCTVSSLIDE-TNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSYSSFMEAEDWSS-------

Query:  -----LGFWNWLIENLKWKELEIVVILIWSIWTLRNKILND--------PINQ-----------KPEMHQIINQISRSIQEIEGKRGGLGWIVRDSRGSL
             L    +++ +   ++LE+   + W++W  RN+++++          NQ            P +H+I    + S  +       +G ++RDS G +
Subjt:  -----LGFWNWLIENLKWKELEIVVILIWSIWTLRNKILND--------PINQ-----------KPEMHQIINQISRSIQEIEGKRGGLGWIVRDSRGSL

Query:  ICAGSKFVSIEWPIKVLEAMAPWEGLNFVLNFPKKPPSLEAESNCSDLINVLNEVEDDRTEVMSCVEAIRDVARNLGVLIFKHYPKSMNEAANFLTR
        + A  K +   +P ++ E MA  +G+  +L    + P +  ES+ S++I  + E +   +     ++ I     +     FKH  ++ N  A+ L +
Subjt:  ICAGSKFVSIEWPIKVLEAMAPWEGLNFVLNFPKKPPSLEAESNCSDLINVLNEVEDDRTEVMSCVEAIRDVARNLGVLIFKHYPKSMNEAANFLTR

XP_030495270.1 uncharacterized protein LOC115711072 [Cannabis sativa]3.1e-2229.11Show/hide
Query:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSIIERNKTPKLTKDDLK-----GCTVSSLIDETNNWIEGIV
        GGM  R  + FNQA+LAK +WRI   P  LL+R LK RYF +  F+E ++ ++PSLTW+SI       +L    L+     G  V S  D    WI    
Subjt:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSIIERNKTPKLTKDDLK-----GCTVSSLIDETNNWIEGIV

Query:  RANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSYSSFMEAEDWSSLGFWNWLIENLKWKELEIVVILIWSIWTLRNKILNDPINQK
          NF P++ + I SIPL   + PD ++W     G  SVK+ +HLA +       ED +S    N        K+         S W             +
Subjt:  RANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSYSSFMEAEDWSSLGFWNWLIENLKWKELEIVVILIWSIWTLRNKILNDPINQK

Query:  PEMHQIINQISRSIQEIEGKRGGLGWIVRDSRGSLICAGSKFVSIEWPIKVLEAMAPWEGLNFVLNFPKKPPSLEAESNCSDLINVLNEVEDDRTEVMSC
        P      N    +   +E K+ G+G I+RD  G+++ A SK V   +    +EA A +  +N+V     + P    E++ S + N LN +  D +     
Subjt:  PEMHQIINQISRSIQEIEGKRGGLGWIVRDSRGSLICAGSKFVSIEWPIKVLEAMAPWEGLNFVLNFPKKPPSLEAESNCSDLINVLNEVEDDRTEVMSC

Query:  VEAIRDVARNLGVLIFKHYPKSMNEAANFLTREGAGFHRNLESVGVV
        +  IR +      ++  H  ++ N+AA+ L +   G   +   VG +
Subjt:  VEAIRDVARNLGVLIFKHYPKSMNEAANFLTREGAGFHRNLESVGVV

TrEMBL top hitse value%identityAlignment
A0A1S8ACU2 Ribonuclease H-like superfamily protein6.4e-2134.02Show/hide
Query:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSII-----------------ERNK-----------------
        GG+  RDI  FNQA++AK SWRIIK P +L+A+ L+ +YFK   F++  L + PS  WRSII                 +R K                 
Subjt:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSII-----------------ERNK-----------------

Query:  TPKLTKDDLKGCTVSSLIDETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSYS-SFMEAEDWSSLGFWN--W---
         P L  D    CTV+ LIDE   W E +++ +FN  ++  I  I L     PD+I+W  D KG  SVKS Y +A+        +   S+LG WN  W   
Subjt:  TPKLTKDDLKGCTVSSLIDETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSYS-SFMEAEDWSSLGFWN--W---

Query:  LIENLK---WKELEIVVILIWSIWTLRNKILNDPINQKPEM
        L E +K   WK     +    ++W  R K++ +PI  + +M
Subjt:  LIENLK---WKELEIVVILIWSIWTLRNKILNDPINQKPEM

A0A2N9FM05 Reverse transcriptase domain-containing protein3.1e-2327.03Show/hide
Query:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSIIERNKTPKL---------------------TKDDLK---
        GGM  RD+ LFN A+LA+  WRIIK+P +LL R LK +YF +  F++  + +N S  WRSI+   +   L                     T+   K   
Subjt:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSIIERNKTPKL---------------------TKDDLK---

Query:  -------GCTVSSLI-DETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSYSSFMEAEDWSSLGFWNWLIENLKWK
                 TV +LI  +T  W   ++ A F    ++ I SIPL   S+ D IIWS    G  SV+SAYHL +      E+ + S         ++L   
Subjt:  -------GCTVSSLI-DETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSYSSFMEAEDWSSLGFWNWLIENLKWK

Query:  ELEIVVILIWSIWTLRN-------------------KILNDPINQKPEMHQIINQI-------SRSIQEIE--------GKRGGLGWIVRDSRGSLICAG
        ++E++  + W +W  RN                    +++D + Q+   H  I ++       S S+ +I            GG G ++RDS GS+I A 
Subjt:  ELEIVVILIWSIWTLRN-------------------KILNDPINQKPEMHQIINQI-------SRSIQEIE--------GKRGGLGWIVRDSRGSLICAG

Query:  SKFVSIEWPIKVLEAMAPWEGLNFVLNFPKKPPSLEAESNCSDLINVLNEVEDDRTEVMSCVEAIRDVARNLGVLIFKHYPKSMNEAANFLTREGAGFHR
            S       L A A  + L           SL  E +CS L+  L           + V+ I++++      IF   P+S N  +  L +E   F  
Subjt:  SKFVSIEWPIKVLEAMAPWEGLNFVLNFPKKPPSLEAESNCSDLINVLNEVEDDRTEVMSCVEAIRDVARNLGVLIFKHYPKSMNEAANFLTREGAGFHR

Query:  NLESVGV
            +GV
Subjt:  NLESVGV

A0A6J1DAR4 uncharacterized protein LOC1110189549.2e-2035.78Show/hide
Query:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSI----------------------------IERNKTPKLTK
        GGM  RD+ LFN+A+LAK  WRI+  P+++L+R LKGRYFKD  F+E  +  NPS  WRSI                            +    T K+  
Subjt:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSI----------------------------IERNKTPKLTK

Query:  DDLKGCT--VSSLID-ETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLAL---------SYSSFMEAEDWSSLGFWNW
                 VSSL+D E   W   +VR  F P  +  I SIP+G  +  D +IW+ +  G  SV+S Y +AL         S SS  E   W + GFW  
Subjt:  DDLKGCT--VSSLID-ETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLAL---------SYSSFMEAEDWSSLGFWNW

Query:  LIEN
         I N
Subjt:  LIEN

A0A6J1DRA0 uncharacterized protein LOC1110224231.2e-1936.63Show/hide
Query:  MGGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSII------------------------------ERNKTPK
        +GGM  RDI +FNQAMLAK SWRI++ P +LLA+ L+G+YFK   F++  L   PS  WRSI+                              + N +P 
Subjt:  MGGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSII------------------------------ERNKTPK

Query:  LTKDDLKGCTVSSLIDETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLAL
         T   ++  +V+ L++    W E  VR +F    ++ I   PL S    DEIIW  D  G  SV+SAYHL +
Subjt:  LTKDDLKGCTVSSLIDETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLAL

A0A803PI64 Uncharacterized protein4.6e-1923.21Show/hide
Query:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSIIERNK--------------TPKLTKDDL-----------
        GG+  RD+ +FNQA+LAK  WR I+ P  L +R LK  YF  K F+E     N S  WRS++   K              + ++ +D             
Subjt:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSIIERNK--------------TPKLTKDDL-----------

Query:  -----KGCTVSSLIDETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALS-YSSFMEAEDWSSLGFWN----------
             +   V+ L      W E  +R+ FN +++  I +IP       D+I+      G+ +VKS Y +A S  +   ++ D S + +W           
Subjt:  -----KGCTVSSLIDETNNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAYHLALS-YSSFMEAEDWSSLGFWN----------

Query:  -----WLI-------------ENL--------KW--KELEIVVILIWSIWTLRNKILNDPINQKPEMHQIINQISRSIQEIEGKRG--------------
             W++             +NL        +W  ++LE  +++ W++W +RN +++   + KPE   +I    R + E  G+ G              
Subjt:  -----WLI-------------ENL--------KW--KELEIVVILIWSIWTLRNKILNDPINQKPEMHQIINQISRSIQEIEGKRG--------------

Query:  ----------------------GLGWIVRDSRGSLICAGSKFVSIEWPIKVLEAMAPWEGLNFVLNFPKKPPSLEAESNCSDLINVLNEVEDDRTEVMSC
                              GLG ++RDS G+++CA +  +  E P   L  MA  +GL   L   ++      E++C   + ++   E+   +V   
Subjt:  ----------------------GLGWIVRDSRGSLICAGSKFVSIEWPIKVLEAMAPWEGLNFVLNFPKKPPSLEAESNCSDLINVLNEVEDDRTEVMSC

Query:  VEAIR
        ++ IR
Subjt:  VEAIR

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003103.2e-0951.61Show/hide
Query:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSII
        GG+  RD+  FNQA+LAK S+RII  P  LL+R L+ RYF     +E S+   PS  WRSII
Subjt:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSII

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein1.8e-0726.14Show/hide
Query:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRS----------------------IIERNK------------
        GG+  +DI  FN A+L K  WR++  P +L+A+  K RYF     +   L + PS  W+S                      II R+K            
Subjt:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRS----------------------IIERNK------------

Query:  ----TPKLTKDDLKGCTVSSLIDET-NNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAY
             P+          VS LIDE+   W + ++   F  +    I  +  G     D   W   S G  +VKS Y
Subjt:  ----TPKLTKDDLKGCTVSSLIDET-NNWIEGIVRANFNPMNSNDIFSIPLGSPSTPDEIIWSPDSKGKISVKSAY

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.2e-1051.61Show/hide
Query:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSII
        GG+  RD+  FNQA+LAK S+RII  P  LL+R L+ RYF     +E S+   PS  WRSII
Subjt:  GGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSII


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGAATGAGGTTAAGAGATATCCTTTTGTTTAACCAAGCCATGCTAGCTAAGCTGAGTTGGAGAATCATTAAAGATCCTTCCAACCTCCTAGCAAGAAAGCTTAA
AGGGAGATATTTTAAGGATAAGCCGTTTGTGGAGGTCTCCTTGAGAAACAATCCTTCCCTCACTTGGAGAAGCATCATAGAGCGTAACAAAACTCCTAAGCTCACCAAAG
ATGATCTAAAGGGGTGTACGGTTAGCAGCCTAATTGATGAAACAAACAATTGGATCGAAGGGATAGTCAGAGCTAACTTCAACCCTATGAACTCAAATGATATCTTTAGC
ATCCCCCTTGGTAGCCCTAGCACCCCAGATGAGATCATTTGGAGCCCCGACAGTAAAGGAAAAATTTCAGTGAAGAGTGCATACCACTTGGCCTTGTCCTACTCTTCATT
TATGGAGGCGGAGGATTGGTCGAGTTTAGGCTTTTGGAATTGGCTGATCGAGAACCTGAAATGGAAGGAGCTTGAAATTGTTGTGATCCTAATTTGGAGCATTTGGACAC
TCAGGAACAAAATTCTCAATGATCCAATCAATCAAAAGCCAGAGATGCACCAGATTATCAACCAAATCAGCAGAAGTATCCAAGAGATTGAAGGAAAGAGGGGTGGCCTA
GGTTGGATCGTGCGTGACTCGAGAGGATCTCTTATCTGTGCGGGATCGAAATTTGTCTCAATTGAATGGCCTATAAAGGTGTTGGAAGCTATGGCTCCTTGGGAAGGTCT
GAATTTTGTCCTTAATTTTCCCAAAAAGCCTCCGTCGCTTGAAGCCGAGTCGAATTGCTCTGACCTTATTAATGTTCTGAATGAGGTTGAAGATGATCGAACGGAAGTGA
TGTCTTGCGTTGAGGCCATTCGTGATGTGGCCAGAAATCTGGGAGTGCTCATCTTCAAGCACTACCCCAAATCGATGAATGAAGCGGCGAATTTCCTTACTCGTGAAGGT
GCAGGTTTTCACCGCAATTTGGAATCTGTTGGTGTTGTTTTTTTATCAGGGGGCTCCTTCCACGTTGGAAGAGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGAATGAGGTTAAGAGATATCCTTTTGTTTAACCAAGCCATGCTAGCTAAGCTGAGTTGGAGAATCATTAAAGATCCTTCCAACCTCCTAGCAAGAAAGCTTAA
AGGGAGATATTTTAAGGATAAGCCGTTTGTGGAGGTCTCCTTGAGAAACAATCCTTCCCTCACTTGGAGAAGCATCATAGAGCGTAACAAAACTCCTAAGCTCACCAAAG
ATGATCTAAAGGGGTGTACGGTTAGCAGCCTAATTGATGAAACAAACAATTGGATCGAAGGGATAGTCAGAGCTAACTTCAACCCTATGAACTCAAATGATATCTTTAGC
ATCCCCCTTGGTAGCCCTAGCACCCCAGATGAGATCATTTGGAGCCCCGACAGTAAAGGAAAAATTTCAGTGAAGAGTGCATACCACTTGGCCTTGTCCTACTCTTCATT
TATGGAGGCGGAGGATTGGTCGAGTTTAGGCTTTTGGAATTGGCTGATCGAGAACCTGAAATGGAAGGAGCTTGAAATTGTTGTGATCCTAATTTGGAGCATTTGGACAC
TCAGGAACAAAATTCTCAATGATCCAATCAATCAAAAGCCAGAGATGCACCAGATTATCAACCAAATCAGCAGAAGTATCCAAGAGATTGAAGGAAAGAGGGGTGGCCTA
GGTTGGATCGTGCGTGACTCGAGAGGATCTCTTATCTGTGCGGGATCGAAATTTGTCTCAATTGAATGGCCTATAAAGGTGTTGGAAGCTATGGCTCCTTGGGAAGGTCT
GAATTTTGTCCTTAATTTTCCCAAAAAGCCTCCGTCGCTTGAAGCCGAGTCGAATTGCTCTGACCTTATTAATGTTCTGAATGAGGTTGAAGATGATCGAACGGAAGTGA
TGTCTTGCGTTGAGGCCATTCGTGATGTGGCCAGAAATCTGGGAGTGCTCATCTTCAAGCACTACCCCAAATCGATGAATGAAGCGGCGAATTTCCTTACTCGTGAAGGT
GCAGGTTTTCACCGCAATTTGGAATCTGTTGGTGTTGTTTTTTTATCAGGGGGCTCCTTCCACGTTGGAAGAGTTTAA
Protein sequenceShow/hide protein sequence
MGGMRLRDILLFNQAMLAKLSWRIIKDPSNLLARKLKGRYFKDKPFVEVSLRNNPSLTWRSIIERNKTPKLTKDDLKGCTVSSLIDETNNWIEGIVRANFNPMNSNDIFS
IPLGSPSTPDEIIWSPDSKGKISVKSAYHLALSYSSFMEAEDWSSLGFWNWLIENLKWKELEIVVILIWSIWTLRNKILNDPINQKPEMHQIINQISRSIQEIEGKRGGL
GWIVRDSRGSLICAGSKFVSIEWPIKVLEAMAPWEGLNFVLNFPKKPPSLEAESNCSDLINVLNEVEDDRTEVMSCVEAIRDVARNLGVLIFKHYPKSMNEAANFLTREG
AGFHRNLESVGVVFLSGGSFHVGRV