; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022013 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022013
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:15914889..15918179
RNA-Seq ExpressionLag0022013
SyntenyLag0022013
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4361707.1 hypothetical protein F8388_026397 [Cannabis sativa]5.7e-2325Show/hide
Query:  PSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNLLR
        PSSS+ +++  WWK  W+  IP KI  F+++L    LPT +NL  R      +C  C    ES  H  + C+ +K+      F  ++   +  +IF++L 
Subjt:  PSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNLLR

Query:  EVRDNVGWGKFELFMVVLWNVWNFRNQRKFKG-IGSMEGLVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYK------------------
         ++ N    +F LF+ +LW  WN RN   F+  +   E + + A+ Y++ +Q A             R+ + W PP  G+ K                  
Subjt:  EVRDNVGWGKFELFMVVLWNVWNFRNQRKFKG-IGSMEGLVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYK------------------

Query:  ---QHVRSL--------------EMAEGWAAVKGMRLALEMGLFPLVLETDSSRVASVFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREGNGVA
            H   +              + AEGWA ++ ++   + G+    +E D   + +  Q  S ++ +  G ++  +R+ L    +V    TRR+GN  A
Subjt:  ---QHVRSL--------------EMAEGWAAVKGMRLALEMGLFPLVLETDSSRVASVFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREGNGVA

Query:  HHLARWAL
        H+LA+ ++
Subjt:  HHLARWAL

XP_022143319.1 uncharacterized protein LOC111013220 [Momordica charantia]2.3e-2437.11Show/hide
Query:  SEYRLGQSIWIAQLPSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGIL
        S+Y+L          S+S  + +  WWK  W+  +P+KI +F W+ CLDRLPT  NL+ RG+DV +    CG+ GE +LH+FW CK  K      +F  L
Subjt:  SEYRLGQSIWIAQLPSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGIL

Query:  VNMVQAGSIFNLLREVRDNVGWGKFELFMVVLWNVWNFRNQRKF-KGIGSMEGLVEWAR
           V+  S+ +LLR+    + W  FE  +V LW +W+ RN + F  G   +  L  W R
Subjt:  VNMVQAGSIFNLLREVRDNVGWGKFELFMVVLWNVWNFRNQRKF-KGIGSMEGLVEWAR

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]3.6e-4136.86Show/hide
Query:  QLPSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNL
        Q PSSSS + +  WW G WK  IPNKI +FLW+LCLDRLPT  NL  RG+++ N C  CGR+GE S+H+FW CKF + + ++ +FG L       S F +
Subjt:  QLPSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNL

Query:  LREVRDNVGWGKFELFMVVLWNVWNFRNQRKFKG-------IGSMEGLVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYK----------
        LRE  +++    FE   VV+W +WN RN R F         IG ME LVEWA  Y   F++A  S+ + G R     E+ W PPD G YK          
Subjt:  LREVRDNVGWGKFELFMVVLWNVWNFRNQRKFKG-------IGSMEGLVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYK----------

Query:  -QHV---------RSLEMAEGWAAVKGMRLALEMGLFPLVLETDSSRVAS-VFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREGNGVAHHLARW
         QH          R   MA   AA K +     + +   +   +  ++AS +  + +++D S+ G +V   +    +S        +REGN  AH LAR 
Subjt:  -QHV---------RSLEMAEGWAAVKGMRLALEMGLFPLVLETDSSRVAS-VFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREGNGVAHHLARW

Query:  ALIEGESMMAIE
        AL+  E  + +E
Subjt:  ALIEGESMMAIE

XP_023874626.1 uncharacterized protein LOC111987155 [Quercus suber]3.7e-2228.8Show/hide
Query:  SSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNLLRE
        S S +  +   WK  W+   P+KI  FLW+ C D LPT + L+ RG+   + C LCG   E+S H+ W C + + V  D +  + V    +    +++ E
Subjt:  SSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNLLRE

Query:  V---RDNVGWGKFELFMVVLWNVWNFRNQRKFKGIGSMEG-LVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYK----------------
        +   R ++ W    +F V  W++WN RN   F G    +G L++ AR Y+   +  ++     G    +     W PP  GWYK                
Subjt:  V---RDNVGWGKFELFMVVLWNVWNFRNQRKFKGIGSMEG-LVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYK----------------

Query:  ----QHVRSLEM---------------AEGWAAVKGMRLALEMGLFPLVLETDSSRVASVFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREGNG
            ++ R L M                E  A  +G+RLA ++GL  +VLE+DS  V +  +  S+   S L  + G  R EL          TRR GN 
Subjt:  ----QHVRSLEM---------------AEGWAAVKGMRLALEMGLFPLVLETDSSRVASVFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREGNG

Query:  VAHHLARWA
         AH +A++A
Subjt:  VAHHLARWA

XP_023915006.1 uncharacterized protein LOC112026546 [Quercus suber]2.8e-2225.24Show/hide
Query:  SSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNLLREV
        SSS + + G WK  W + IP K+ IF W+LC++ LPT+ NL +RG+     C LC +  E++ H   HC   K     +  G  V         ++  ++
Subjt:  SSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNLLREV

Query:  RDNVGWGKFELFMVVLWNVWNFRNQRKFKGIGSMEGLV-EWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYKQHVRS---------------
          N      ELF+ V W++W  RNQ   +G G+    + + A   +  +++A   S L     +    + W PP  G+ K +V                 
Subjt:  RDNVGWGKFELFMVVLWNVWNFRNQRKFKGIGSMEGLV-EWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYKQHVRS---------------

Query:  --------------------LEMAEGWAAVKGMRLALEMGLFPLVLETDSSRVASVFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREGNGVAHH
                             +++E  A   G+ LALE+GL  ++ E+D+  +       ++    ++G ++ +++      S       +REGN  AH 
Subjt:  --------------------LEMAEGWAAVKGMRLALEMGLFPLVLETDSSRVASVFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREGNGVAHH

Query:  LARWALIEGESMM
        LAR A + G S +
Subjt:  LARWALIEGESMM

TrEMBL top hitse value%identityAlignment
A0A2N9GUW4 Non-specific serine/threonine protein kinase3.3e-2427.51Show/hide
Query:  LPSSSSDDSIMG-WWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNL
        L + SSD   M  +WK  W   +P KI  F+W+LCL+ LPT DNLV   +     C +CG   E+ +H++  C F ++V      G+ +++     I + 
Subjt:  LPSSSSDDSIMG-WWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNL

Query:  LREVRDNVGWGKFELFMVVLWNVWNFRNQRKFKGIGSM-EGLVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYKQHVRSLEM--------
        +  +  + G G F   +VVLW+VWN RN   F+ +  +   + + A  +I  +    L +G      ++RE VRW  P  GWYK ++  +          
Subjt:  LREVRDNVGWGKFELFMVVLWNVWNFRNQRKFKGIGSM-EGLVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYKQHVRSLEM--------

Query:  ---------------------------AEGWAAVKGMRLALEMGLFPLVLETDSSRVASVFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREGNG
                                   AE  AA++ +   +++  F LV E D  +V          D+S +G +    R +L  +     +   REGN 
Subjt:  ---------------------------AEGWAAVKGMRLALEMGLFPLVLETDSSRVASVFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREGNG

Query:  VAHHLARWA
        VAH LA++A
Subjt:  VAHHLARWA

A0A2N9H1N4 RNase H domain-containing protein2.1e-2626.44Show/hide
Query:  MNCFRLPKKLIIETSRAIAQFWWRGEKVDRG----IHWVVLYLEEFTMG---------ERALGERIALADMEFEKCPGVWG--IGFLMMGVSGSEYRLGQ
        M+CFRLP  LI E    I +FWW G+  ++G    + W +LY+  +  G         E      I L+      C  VWG     +    SG    L  
Subjt:  MNCFRLPKKLIIETSRAIAQFWWRGEKVDRG----IHWVVLYLEEFTMG---------ERALGERIALADMEFEKCPGVWG--IGFLMMGVSGSEYRLGQ

Query:  SIWIAQLPSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAG
        S      PS  S  S +  WK  W   +P K   FLW+ C + LPT  NL +R +     C +C +  ES++H  W CK ++ V     +G  +      
Subjt:  SIWIAQLPSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAG

Query:  SIFNLLREVRDNVGWGKFELFMVVLWNVWNFRNQ-RKFKGIGSMEGLVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYKQHV--------
           +LL +    +   + +LF ++ W++W  RN+ R  + + +   LV+ AR  +S FQ+A         +  +   V+W+PP  G YK +         
Subjt:  SIFNLLREVRDNVGWGKFELFMVVLWNVWNFRNQ-RKFKGIGSMEGLVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYKQHV--------

Query:  ---------------------------RSLEMAEGWAAVKGMRLALEMGLFPLVLETDSSRV--ASVFQNESMDDYSDLGALVGDLRKELPRSSSVGCML
                                    S+E  E  AA   ++ A ++G   + LE DS  V  A + +      Y+    ++ D ++      SV  + 
Subjt:  ---------------------------RSLEMAEGWAAVKGMRLALEMGLFPLVLETDSSRV--ASVFQNESMDDYSDLGALVGDLRKELPRSSSVGCML

Query:  TRREGNGVAHHLARWA
        T REGN +AH LA+ A
Subjt:  TRREGNGVAHHLARWA

A0A6J1CNZ5 uncharacterized protein LOC1110132201.1e-2437.11Show/hide
Query:  SEYRLGQSIWIAQLPSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGIL
        S+Y+L          S+S  + +  WWK  W+  +P+KI +F W+ CLDRLPT  NL+ RG+DV +    CG+ GE +LH+FW CK  K      +F  L
Subjt:  SEYRLGQSIWIAQLPSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGIL

Query:  VNMVQAGSIFNLLREVRDNVGWGKFELFMVVLWNVWNFRNQRKF-KGIGSMEGLVEWAR
           V+  S+ +LLR+    + W  FE  +V LW +W+ RN + F  G   +  L  W R
Subjt:  VNMVQAGSIFNLLREVRDNVGWGKFELFMVVLWNVWNFRNQRKF-KGIGSMEGLVEWAR

A0A6J1DAR4 uncharacterized protein LOC1110189541.7e-4136.86Show/hide
Query:  QLPSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNL
        Q PSSSS + +  WW G WK  IPNKI +FLW+LCLDRLPT  NL  RG+++ N C  CGR+GE S+H+FW CKF + + ++ +FG L       S F +
Subjt:  QLPSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNL

Query:  LREVRDNVGWGKFELFMVVLWNVWNFRNQRKFKG-------IGSMEGLVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYK----------
        LRE  +++    FE   VV+W +WN RN R F         IG ME LVEWA  Y   F++A  S+ + G R     E+ W PPD G YK          
Subjt:  LREVRDNVGWGKFELFMVVLWNVWNFRNQRKFKG-------IGSMEGLVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYK----------

Query:  -QHV---------RSLEMAEGWAAVKGMRLALEMGLFPLVLETDSSRVAS-VFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREGNGVAHHLARW
         QH          R   MA   AA K +     + +   +   +  ++AS +  + +++D S+ G +V   +    +S        +REGN  AH LAR 
Subjt:  -QHV---------RSLEMAEGWAAVKGMRLALEMGLFPLVLETDSSRVAS-VFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREGNGVAHHLARW

Query:  ALIEGESMMAIE
        AL+  E  + +E
Subjt:  ALIEGESMMAIE

A0A803QE56 Uncharacterized protein2.1e-2327.08Show/hide
Query:  SSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHG-ESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNLLR
        + S++  +M WW+G WK  +P K+  F+WK+    LPT   L  RGMDV   C  C   G E+  H  W C   K V     FGI   + + GS   L  
Subjt:  SSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHG-ESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNLLR

Query:  EVRDNVGWGK--FELFMVVLWNVWNFRNQRKFKGI-GSMEGLVEWARSYISSFQ--QATLSSGLCGHRGITREEVRWSPPDFGWYKQHV-RSLEMAEGWA
         +R +  W K  FELF+VV W +W  RN  K  GI      + EW   Y+  ++    T+++G    RG   + V    P  G +K +V   ++   GW+
Subjt:  EVRDNVGWGK--FELFMVVLWNVWNFRNQRKFKGI-GSMEGLVEWARSYISSFQ--QATLSSGLCGHRGITREEVRWSPPDFGWYKQHV-RSLEMAEGWA

Query:  AVK----------------------------------GMRLALEMGLFPLVLETDSSRVASVFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREG
         V                                   G++  ++ G+    +E+D      + QN+  +   D+  ++  +R  L  ++  G +   RE 
Subjt:  AVK----------------------------------GMRLALEMGLFPLVLETDSSRVASVFQNESMDDYSDLGALVGDLRKELPRSSSVGCMLTRREG

Query:  NGVAHHLARWALIEGESMMAIELGRVMGRNGVVSDV
        N VAH LA +AL+   S M I +        ++ D+
Subjt:  NGVAHHLARWALIEGESMMAIELGRVMGRNGVVSDV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G33710.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-0728.57Show/hide
Query:  WIAQLPSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVF---------WHCKFIKRVLMDFEFGIL
        W A  P S     ++ W K  W +G   K    +W   LDRLPT   L   GM +   CGLC    E   H+F         WH   ++  L  F F + 
Subjt:  WIAQLPSSSSDDSIMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVF---------WHCKFIKRVLMDFEFGIL

Query:  VNMVQAGSIFNLLREVRDNVGWGKFELFMVVLWNVWNFRN
         +++     + L R  R      K  +   VL+ +W  RN
Subjt:  VNMVQAGSIFNLLREVRDNVGWGKFELFMVVLWNVWNFRN

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.1e-0727.22Show/hide
Query:  LPSSSSDDS---------IMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDF--------E
        LPS SS D+          + W K  W +    + ++  W   L+RLPT D L   GM++ +   LC    E+  H+F+ C F   +   F         
Subjt:  LPSSSSDDS---------IMGWWKGCWKRGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDF--------E

Query:  FGILVNMVQAGSIFNLLREVRDNVGWGKFELFMVVLWNVWNFRNQRKFKGIGSMEGLVEWA-----RSYISSFQQATLSS
        FG+      A S + L   +R +       L    +++VW  RN R F  I S    +  A     R+ + SF  A L S
Subjt:  FGILVNMVQAGSIFNLLREVRDNVGWGKFELFMVVLWNVWNFRNQRKFKGIGSMEGLVEWA-----RSYISSFQQATLSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTGCTTCCGACTGCCAAAGAAGTTGATCATTGAGACTAGTAGAGCCATAGCTCAGTTCTGGTGGAGGGGAGAGAAGGTGGATCGAGGAATCCATTGGGTTGTCCT
TTATTTAGAGGAGTTTACTATGGGGGAGAGAGCTCTTGGCGAAAGGATTGCATTGGCGGATATGGAATTTGAAAAGTGTCCCGGTGTATGGGGAATTGGATTCTTGATGA
TGGGGGTCTCAGGGAGCGAGTATCGGTTAGGCCAATCGATTTGGATTGCTCAACTCCCATCTTCATCTTCTGACGATTCAATTATGGGCTGGTGGAAAGGGTGTTGGAAA
AGGGGGATTCCAAATAAGATTAATATCTTCTTATGGAAACTATGTTTAGATCGTCTGCCTACAGTGGATAACCTAGTTTATCGGGGTATGGATGTGGTGAATGTGTGTGG
TCTTTGTGGCCGACATGGTGAATCAAGCTTGCATGTCTTTTGGCATTGTAAGTTTATCAAAAGGGTTCTGATGGATTTTGAGTTTGGGATCTTGGTAAACATGGTGCAGG
CAGGGTCTATTTTCAATCTCCTTAGAGAAGTGAGGGATAATGTGGGATGGGGGAAATTTGAGTTGTTTATGGTGGTGTTGTGGAATGTGTGGAATTTTCGAAACCAGCGA
AAGTTCAAGGGGATTGGGTCTATGGAGGGACTAGTGGAGTGGGCGAGGAGTTACATCTCTTCGTTCCAGCAAGCTACCTTGTCTAGTGGGTTGTGTGGGCATAGGGGGAT
TACAAGGGAAGAGGTAAGATGGAGCCCCCCGGATTTTGGGTGGTATAAGCAGCACGTACGAAGCCTGGAGATGGCTGAAGGTTGGGCGGCTGTGAAGGGCATGAGGCTGG
CCTTGGAGATGGGTTTATTCCCACTGGTGCTAGAGACTGACTCTAGTCGAGTGGCTAGTGTTTTTCAGAATGAGTCAATGGATGACTATTCTGACTTAGGTGCGCTAGTG
GGTGATCTGCGGAAGGAGCTTCCGAGGTCTTCTTCCGTCGGCTGTATGTTAACTCGAAGAGAGGGAAATGGAGTGGCACATCATTTGGCTCGTTGGGCGCTGATAGAGGG
AGAATCTATGATGGCGATTGAACTCGGACGAGTCATGGGCAGGAATGGTGTGGTTTCAGATGTTGGGTTTTGGCAATCATGTCAAGGCCTAAAGAATGAGCTTGAGGTAA
AGATGTGCGGGAACGAGGAGGCGGGAGAAATTGTGGTATCTAGCCTAATTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATTGCTTCCGACTGCCAAAGAAGTTGATCATTGAGACTAGTAGAGCCATAGCTCAGTTCTGGTGGAGGGGAGAGAAGGTGGATCGAGGAATCCATTGGGTTGTCCT
TTATTTAGAGGAGTTTACTATGGGGGAGAGAGCTCTTGGCGAAAGGATTGCATTGGCGGATATGGAATTTGAAAAGTGTCCCGGTGTATGGGGAATTGGATTCTTGATGA
TGGGGGTCTCAGGGAGCGAGTATCGGTTAGGCCAATCGATTTGGATTGCTCAACTCCCATCTTCATCTTCTGACGATTCAATTATGGGCTGGTGGAAAGGGTGTTGGAAA
AGGGGGATTCCAAATAAGATTAATATCTTCTTATGGAAACTATGTTTAGATCGTCTGCCTACAGTGGATAACCTAGTTTATCGGGGTATGGATGTGGTGAATGTGTGTGG
TCTTTGTGGCCGACATGGTGAATCAAGCTTGCATGTCTTTTGGCATTGTAAGTTTATCAAAAGGGTTCTGATGGATTTTGAGTTTGGGATCTTGGTAAACATGGTGCAGG
CAGGGTCTATTTTCAATCTCCTTAGAGAAGTGAGGGATAATGTGGGATGGGGGAAATTTGAGTTGTTTATGGTGGTGTTGTGGAATGTGTGGAATTTTCGAAACCAGCGA
AAGTTCAAGGGGATTGGGTCTATGGAGGGACTAGTGGAGTGGGCGAGGAGTTACATCTCTTCGTTCCAGCAAGCTACCTTGTCTAGTGGGTTGTGTGGGCATAGGGGGAT
TACAAGGGAAGAGGTAAGATGGAGCCCCCCGGATTTTGGGTGGTATAAGCAGCACGTACGAAGCCTGGAGATGGCTGAAGGTTGGGCGGCTGTGAAGGGCATGAGGCTGG
CCTTGGAGATGGGTTTATTCCCACTGGTGCTAGAGACTGACTCTAGTCGAGTGGCTAGTGTTTTTCAGAATGAGTCAATGGATGACTATTCTGACTTAGGTGCGCTAGTG
GGTGATCTGCGGAAGGAGCTTCCGAGGTCTTCTTCCGTCGGCTGTATGTTAACTCGAAGAGAGGGAAATGGAGTGGCACATCATTTGGCTCGTTGGGCGCTGATAGAGGG
AGAATCTATGATGGCGATTGAACTCGGACGAGTCATGGGCAGGAATGGTGTGGTTTCAGATGTTGGGTTTTGGCAATCATGTCAAGGCCTAAAGAATGAGCTTGAGGTAA
AGATGTGCGGGAACGAGGAGGCGGGAGAAATTGTGGTATCTAGCCTAATTTTTTGA
Protein sequenceShow/hide protein sequence
MNCFRLPKKLIIETSRAIAQFWWRGEKVDRGIHWVVLYLEEFTMGERALGERIALADMEFEKCPGVWGIGFLMMGVSGSEYRLGQSIWIAQLPSSSSDDSIMGWWKGCWK
RGIPNKINIFLWKLCLDRLPTVDNLVYRGMDVVNVCGLCGRHGESSLHVFWHCKFIKRVLMDFEFGILVNMVQAGSIFNLLREVRDNVGWGKFELFMVVLWNVWNFRNQR
KFKGIGSMEGLVEWARSYISSFQQATLSSGLCGHRGITREEVRWSPPDFGWYKQHVRSLEMAEGWAAVKGMRLALEMGLFPLVLETDSSRVASVFQNESMDDYSDLGALV
GDLRKELPRSSSVGCMLTRREGNGVAHHLARWALIEGESMMAIELGRVMGRNGVVSDVGFWQSCQGLKNELEVKMCGNEEAGEIVVSSLIF