; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027179 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027179
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:45626929..45628373
RNA-Seq ExpressionLag0027179
SyntenyLag0027179
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015384077.1 uncharacterized protein LOC107176301 [Citrus sinensis]4.0e-2524.2Show/hide
Query:  QDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASP-LWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSL
        +D ++W+ D KG +TVKS Y +A   L+    + ++ + +SP  W   W +    K K+  WR   + +PT  N+ K+    + +C  C +  E+  H+L
Subjt:  QDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASP-LWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSL

Query:  WNCKIAKKLWIHFIPLTLDLFRMDRLWSHRNTI-SHSSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRSKIQMSQARWIPPAHGSWSLKVDASRDDS
          CK  KK+W H +   +      R  S  + I   +  +  +D  ++A +++          +     + +  ++ Q +W PP    + + VDA+ +  
Subjt:  WNCKIAKKLWIHFIPLTLDLFRMDRLWSHRNTI-SHSSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRSKIQMSQARWIPPAHGSWSLKVDASRDDS

Query:  LNRGGVGWVLHDSSGSPICSGFKRIAVQ---------------------------VASDAISVINLLNNRDSDQSEIWFLAKEIERMRADFVNISFVHIP
            G+G V+ DS  + + +G  +  ++                           + SD + V+ L+NN  S+++ +W++ +EI+  +  F N+   HIP
Subjt:  LNRGGVGWVLHDSSGSPICSGFKRIAVQ---------------------------VASDAISVINLLNNRDSDQSEIWFLAKEIERMRADFVNISFVHIP

Query:  RAQNDEAHFLARFA
        R  N  AH LA+FA
Subjt:  RAQNDEAHFLARFA

XP_023915006.1 uncharacterized protein LOC112026546 [Quercus suber]8.9e-2527.65Show/hide
Query:  QDDIIWNADPKGIFTVKSAYHLASSSLRNSKD-SPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSL
        +D ++W  + KG FTVKSAY++ASS L +++D   S+ NE + LWKR W  +  PK K+ AWR+  + +PT+ N+  +G+  +  C  C K  E++ H+L
Subjt:  QDDIIWNADPKGIFTVKSAYHLASSSLRNSKD-SPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSL

Query:  WNCKIAKKLWIHF--------------IPLTLDLFRMD-------------RLWSHRNTISH--SSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRS
         +C  AK  W H+              + + LD+ + D              +W +RN   H  S A P+    I  +  +I        S +   LV S
Subjt:  WNCKIAKKLWIHF--------------IPLTLDLFRMD-------------RLWSHRNTISH--SSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRS

Query:  KIQMSQARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSGSPICSGFKRIA---------------------------VQVASDAISVINLLNNRDS
         I      W PP  G   + VD +  D       G ++ DS GS I +  + ++                           V   SDA+S+I  +N  + 
Subjt:  KIQMSQARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSGSPICSGFKRIA---------------------------VQVASDAISVINLLNNRDS

Query:  DQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFAAFGSLECTSRVLYPSF
           EI  + + I+ + + F   +F H  R  N  AH LAR A    +    + + PSF
Subjt:  DQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFAAFGSLECTSRVLYPSF

XP_030483481.1 uncharacterized protein LOC115700065 [Cannabis sativa]8.9e-2526.09Show/hide
Query:  SFQDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHS
        S  D +IW+    GI+ V S YH  +S      D  S  N ++  WK FWK++   K K+ AWR+ HD++P   ++ ++ I ++  C  CR+  ES  H+
Subjt:  SFQDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHS

Query:  LWNCKIAKKLW----IHF-----------------------IPLTLDLFRMDRLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRSK
        L+ CK AK +W     HF                       + +   +  +  +W+ RN I H     +   L +   + +SN  +      P +   ++
Subjt:  LWNCKIAKKLW----IHF-----------------------IPLTLDLFRMDRLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRSK

Query:  ----IQMSQARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSGSPICSGFKRI------------AV---------------QVASDAISVINLLNN
              +  + W PP  GS  + VDA+ + S N+ G+G V+ +  G+ I +  K +            AV               Q+ +DA+ V N L  
Subjt:  ----IQMSQARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSGSPICSGFKRI------------AV---------------QVASDAISVINLLNN

Query:  RDSDQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFA
        R +  S    L  ++  + + F  +S VH  R  N  AH+LAR+A
Subjt:  RDSDQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFA

XP_042974832.1 uncharacterized protein LOC122306468 [Carya illinoinensis]2.0e-2425.5Show/hide
Query:  GSFQDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIH
        GS  D +IW     G+F+VKS YHL      N +  PS   + S LWK  WK++     K+  WR  ++ +PT+AN++++ I +   C  C + EE++ H
Subjt:  GSFQDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIH

Query:  SLWNCKIAKKLW------IHFIPLTLDLFR---------------------MDRLWSHRNTISHSSA--NPNLDFLITAVESKISNGDNYLNSDTPKQLV
        +LW+C  A+ +W      I  + L +  F+                     M  +W+ RN + H  A  +PN   L+ A +      D Y      K  +
Subjt:  SLWNCKIAKKLW------IHFIPLTLDLFR---------------------MDRLWSHRNTISHSSA--NPNLDFLITAVESKISNGDNYLNSDTPKQLV

Query:  RSKIQMSQA-------RWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSG--------------SPIC---------------SGFKRIAVQVASDAI
        + ++    A       +WI P  G + L  DA+ +  LN+ G+G ++ D +G              SP                 +GF+ I  +   D++
Subjt:  RSKIQMSQA-------RWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSG--------------SPIC---------------SGFKRIAVQVASDAI

Query:  SVINLLNNRDSDQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFA
         V+  +  +  D S+   + ++ + +  D V   F H  R  N  AH LA+ A
Subjt:  SVINLLNNRDSDQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFA

XP_042980185.1 uncharacterized protein LOC122310356 [Carya illinoinensis]4.7e-2626.4Show/hide
Query:  QDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSLW
        QD + W     G F+VKSAYHL SS     K  PSN N  S +W R W I      K   WR   +S+PT  N+ K+ + ++ LC  C   EE+  H+LW
Subjt:  QDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSLW

Query:  NCKIAKKLW------IHFIPLTLDLFR---------------------MDRLWSHRNTISHS---SANPNLDFLITAVESKISNGDNYLNSDTPKQLVRS
        +CK A+ +W      +  +  T + F+                     +  LW+ RN++      S+  NL   I A  S I +G    +   P+Q+   
Subjt:  NCKIAKKLW------IHFIPLTLDLFR---------------------MDRLWSHRNTISHS---SANPNLDFLITAVESKISNGDNYLNSDTPKQLVRS

Query:  KIQMSQARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSG---------------------------SPICSGFKRIAVQVASDAISVINLLNNRDS
           +S   W PP  G      DA+ D   +R G+G V+ DS G                           + +C+      + +  D++ V+  +   + 
Subjt:  KIQMSQARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSG---------------------------SPICSGFKRIAVQVASDAISVINLLNNRDS

Query:  DQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFAAFGSLECTSRVLYP
          S +  + ++I  + ++F + S  H+PR  N  A+ LA+ A     EC     YP
Subjt:  DQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFAAFGSLECTSRVLYP

TrEMBL top hitse value%identityAlignment
A0A803P119 Uncharacterized protein7.3e-2528.95Show/hide
Query:  IIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSLWNCK
        +IW   P G + VKS YHLA++     ++SPS+ N  +  W  FW +    K K+ AWR+IH+ +P  AN+ K+ I  +  C  C    ES  H+ + CK
Subjt:  IIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSLWNCK

Query:  IAKKLWIHFIPLTLDL---------------------FRMD-------RLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLN---SDTPKQ--LVRS
         AK++W   +P  +DL                     F+M+        LW  RN  S      NL  +I      + N  ++LN   S T +Q  L  +
Subjt:  IAKKLWIHFIPLTLDL---------------------FRMD-------RLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLN---SDTPKQ--LVRS

Query:  KIQMSQARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSGSPI---------CSGFKRIAVQ------------------VASDAISVINLLNNRDS
        K  M+++ W PP HG   L VDA+ D +    G+G ++ DS+G+ +         C   K I                     + +DA  V   +NN   
Subjt:  KIQMSQARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSGSPI---------CSGFKRIAVQ------------------VASDAISVINLLNNRDS

Query:  DQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFA
          S    L  ++    + F  +S  H+ R+ N+ AH LARFA
Subjt:  DQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFA

A0A803PJK4 Uncharacterized protein4.7e-2424.93Show/hide
Query:  SFQDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHS
        S  D IIW+ +  GI+TVKS Y LAS      +++PS  + ++ LWK FWK+    K ++  W+ + + +P  A ++K  I  + +C  CR   ES +H+
Subjt:  SFQDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHS

Query:  LWNCKIAKKLW----------IHFIPLTLDLF-----------------RMDRLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRSK
        L+ CK AKK+W          IH      DLF                  +  +W+ RN   H +     + L+ +  S +  G+   +       +R++
Subjt:  LWNCKIAKKLW----------IHFIPLTLDLF-----------------RMDRLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRSK

Query:  IQMSQ-----ARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSGSPICSGFK----------------RIAVQ-----------VASDAISVINLLN
        +  +       +W+ P  G   L  DA+ D+     G G +L DS G  + +  K                R ++Q           + +D++ V+  L 
Subjt:  IQMSQ-----ARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSGSPICSGFK----------------RIAVQ-----------VASDAISVINLLN

Query:  NRDSDQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFAAFGSLEC
         R  + S+   +  +I  + ++F  +   H+ R+ N  AH LA++A     EC
Subjt:  NRDSDQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFAAFGSLEC

A0A803QEG9 Uncharacterized protein2.7e-2727.47Show/hide
Query:  QDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSLW
        +D +IW+    G FTV+SAYHLA+S    ++D  S+   A   WK FW ++   K K+ AWR+IHD++P   ++ ++ I ++  C  C++  ES+ H+L+
Subjt:  QDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSLW

Query:  NCKIAKKLW--------------------IHFIPLTLDLFRMDRL-------WSHRNTISHSS----ANPNLDFLI-------------TAVESKISNGD
         CK AK +W                    + ++    +   M+ L       W  RN + H      AN    F                A  +  +  D
Subjt:  NCKIAKKLW--------------------IHFIPLTLDLFRMDRL-------WSHRNTISHSS----ANPNLDFLI-------------TAVESKISNGD

Query:  NYLNSDTPKQLVRSKIQMSQARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSGS-------PICSGFKRIAVQ--------------------VAS
            S  P +   +   +S   W PPA   + L  DA+ ++S N  GVG VL D+SGS       PI   FK   ++                    + +
Subjt:  NYLNSDTPKQLVRSKIQMSQARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSGS-------PICSGFKRIAVQ--------------------VAS

Query:  DAISVINLLNNRDSDQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFAAFGSLECT
        DA+ V+N L    +  SE   L  ++  + + F N+S  H+ R  N+ AH LA+FA      CT
Subjt:  DAISVINLLNNRDSDQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFAAFGSLECT

M5VU98 Reverse transcriptase domain-containing protein2.1e-2427.01Show/hide
Query:  DDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVN-EASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSLW
        D I+WN D  G+FTVKSAY +A       +D  S+ N +   LW+  W      K K+ AWR+ HD +PT AN+ KKG+    +C FC    ES++H L 
Subjt:  DDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVN-EASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSLW

Query:  NCKIAKKLWIHFIPLTLDLFRMDRLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRSKIQMSQARWIPPAHGSWSLKVDASRDDSLN
         C  A   W            +  L  H +     S +  + F    V   I+       +DTP + V  +++    RW  P  G      D + D +  
Subjt:  NCKIAKKLWIHFIPLTLDLFRMDRLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRSKIQMSQARWIPPAHGSWSLKVDASRDDSLN

Query:  RGGVGWVLHDSSGSPICSGFKRIAVQVAS---------------------------DAISVINLLNNRDSDQSEIWFLAKEIERMRADFVNISFVHIPRA
        RG VG V  D+ G  + +  K +   +++                           D+  V++ +     D S I  + ++++ ++  F +  F   PR 
Subjt:  RGGVGWVLHDSSGSPICSGFKRIAVQVAS---------------------------DAISVINLLNNRDSDQSEIWFLAKEIERMRADFVNISFVHIPRA

Query:  QNDEAHFLARF
         N  AH LARF
Subjt:  QNDEAHFLARF

M5XK32 Reverse transcriptase domain-containing protein (Fragment)1.2e-2426.75Show/hide
Query:  DDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVN-EASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSLW
        D I+WN D  G+FTVKSAY +A       +D  S+ N + S LW+  W      K K+ AWR+ HD +PT AN+ KKG+    +C FC    ES++H L 
Subjt:  DDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVN-EASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEESSIHSLW

Query:  NCKIAKKLWIHFIPLTLDLFRMDRLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRSKIQMSQARWIPPAHGSWSLKVDASRDDSLN
         C  A   W            +  L  H +     S +  + F    V   I+       +DTP + V  +++    RW  P+ G      D + D +  
Subjt:  NCKIAKKLWIHFIPLTLDLFRMDRLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRSKIQMSQARWIPPAHGSWSLKVDASRDDSLN

Query:  RGGVGWVLHDSSGSPICSGFKRIAVQVAS---------------------------DAISVINLLNNRDSDQSEIWFLAKEIERMRADFVNISFVHIPRA
        RG VG V  D+ G  + +  K +   +++                           D+  V++ +     D S I  + ++++ ++  F +  F   PR 
Subjt:  RGGVGWVLHDSSGSPICSGFKRIAVQVAS---------------------------DAISVINLLNNRDSDQSEIWFLAKEIERMRADFVNISFVHIPRA

Query:  QNDEAHFLARFAAF
         N   H LARF  +
Subjt:  QNDEAHFLARFAAF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein8.3e-2124.72Show/hide
Query:  GGGSFQDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEAS--PLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEE
        GG    D   W+    G +TVKS Y + +  + N + SP  V+E S  P++++ WK +T PK +   W+ + +S+P    +  + +     C  C   +E
Subjt:  GGGSFQDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEAS--PLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQLCFFCRKFEE

Query:  SSIHSLWNCKIAKKLW-IHFIPLTLD-------------LFRMD------------------RLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLNS
        +  H L+ C  A+  W I  IP+ L              +F +                   RLW +RN +       N   ++   E  +        +
Subjt:  SSIHSLWNCKIAKKLW-IHFIPLTLD-------------LFRMD------------------RLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLNS

Query:  D---TPKQLVRSKIQMSQARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSGSPICSG---------------------------FKRIAVQVASDA
        +   T  Q+ RS    S  RW PP H       DA+ +    R G+GWVL +  G     G                           F+   V   SD+
Subjt:  D---TPKQLVRSKIQMSQARWIPPAHGSWSLKVDASRDDSLNRGGVGWVLHDSSGSPICSG---------------------------FKRIAVQVASDA

Query:  ISVINLLNNRDSDQSEIWFLAK----EIERMRADFVNISFVHIPRAQNDEAHFLAR
          +I +LNN      EIW   K    +++R+ + F  + FV IPR  N  A  +AR
Subjt:  ISVINLLNNRDSDQSEIWFLAK----EIERMRADFVNISFVHIPRAQNDEAHFLAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACCCAACCGTGCCTCTCCCACGGCCCTTCAGACCTGGAAGATATTTCAAATACTATAGTGGGGGAGGGTCTTTCCAAGATGATATAATTTGGAATGCTGATCCTAA
AGGCATTTTTACTGTGAAAAGTGCTTACCACTTAGCCTCTTCATCTTTGAGAAATTCTAAAGATTCTCCTTCTAATGTGAATGAGGCTAGCCCTTTGTGGAAGAGGTTTT
GGAAAATTCGTACCTTTCCTAAGGCTAAGCTGTGTGCTTGGAGAATTATCCATGATTCCATCCCCACGGTGGCTAACATTCGTAAGAAAGGAATTTGGTCTAATCAGTTG
TGTTTCTTTTGCAGGAAATTCGAAGAATCTTCCATTCATTCTCTATGGAACTGCAAGATAGCAAAAAAGTTGTGGATCCATTTCATTCCCCTGACTTTAGATCTTTTTCG
TATGGACAGGTTATGGTCTCACAGAAACACCATAAGTCACAGTTCAGCCAATCCAAACCTGGATTTTCTAATTACAGCAGTAGAATCGAAAATTTCTAATGGGGATAATT
ACCTCAATTCTGACACCCCCAAGCAGCTAGTAAGATCGAAGATCCAGATGAGTCAGGCTCGATGGATTCCTCCAGCTCATGGATCGTGGTCTCTCAAAGTGGACGCCTCT
CGTGACGATTCTCTCAACCGAGGAGGAGTTGGTTGGGTTCTGCATGACTCGTCCGGTTCTCCCATCTGTTCTGGCTTCAAACGGATTGCAGTTCAGGTGGCGTCCGATGC
CATCAGCGTCATAAACCTTCTGAACAATCGGGACTCCGACCAATCGGAGATTTGGTTCCTGGCCAAAGAAATCGAGCGCATGCGTGCTGATTTCGTAAACATCTCTTTCG
TCCATATCCCTCGTGCGCAAAACGATGAAGCGCACTTTCTGGCTCGTTTTGCCGCGTTCGGGTCTCTCGAGTGTACCAGTCGAGTTTTGTACCCTTCTTTCAATTCGGAA
GAAGGAATTTTTTTGTTGGGCCAGTTTTTGCCCGATATTTTAAGCCCTCTTCTTAGGGAGGCAGATGTAGCGTTTTTCTCTCGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCACCCAACCGTGCCTCTCCCACGGCCCTTCAGACCTGGAAGATATTTCAAATACTATAGTGGGGGAGGGTCTTTCCAAGATGATATAATTTGGAATGCTGATCCTAA
AGGCATTTTTACTGTGAAAAGTGCTTACCACTTAGCCTCTTCATCTTTGAGAAATTCTAAAGATTCTCCTTCTAATGTGAATGAGGCTAGCCCTTTGTGGAAGAGGTTTT
GGAAAATTCGTACCTTTCCTAAGGCTAAGCTGTGTGCTTGGAGAATTATCCATGATTCCATCCCCACGGTGGCTAACATTCGTAAGAAAGGAATTTGGTCTAATCAGTTG
TGTTTCTTTTGCAGGAAATTCGAAGAATCTTCCATTCATTCTCTATGGAACTGCAAGATAGCAAAAAAGTTGTGGATCCATTTCATTCCCCTGACTTTAGATCTTTTTCG
TATGGACAGGTTATGGTCTCACAGAAACACCATAAGTCACAGTTCAGCCAATCCAAACCTGGATTTTCTAATTACAGCAGTAGAATCGAAAATTTCTAATGGGGATAATT
ACCTCAATTCTGACACCCCCAAGCAGCTAGTAAGATCGAAGATCCAGATGAGTCAGGCTCGATGGATTCCTCCAGCTCATGGATCGTGGTCTCTCAAAGTGGACGCCTCT
CGTGACGATTCTCTCAACCGAGGAGGAGTTGGTTGGGTTCTGCATGACTCGTCCGGTTCTCCCATCTGTTCTGGCTTCAAACGGATTGCAGTTCAGGTGGCGTCCGATGC
CATCAGCGTCATAAACCTTCTGAACAATCGGGACTCCGACCAATCGGAGATTTGGTTCCTGGCCAAAGAAATCGAGCGCATGCGTGCTGATTTCGTAAACATCTCTTTCG
TCCATATCCCTCGTGCGCAAAACGATGAAGCGCACTTTCTGGCTCGTTTTGCCGCGTTCGGGTCTCTCGAGTGTACCAGTCGAGTTTTGTACCCTTCTTTCAATTCGGAA
GAAGGAATTTTTTTGTTGGGCCAGTTTTTGCCCGATATTTTAAGCCCTCTTCTTAGGGAGGCAGATGTAGCGTTTTTCTCTCGTTAA
Protein sequenceShow/hide protein sequence
MHPTVPLPRPFRPGRYFKYYSGGGSFQDDIIWNADPKGIFTVKSAYHLASSSLRNSKDSPSNVNEASPLWKRFWKIRTFPKAKLCAWRIIHDSIPTVANIRKKGIWSNQL
CFFCRKFEESSIHSLWNCKIAKKLWIHFIPLTLDLFRMDRLWSHRNTISHSSANPNLDFLITAVESKISNGDNYLNSDTPKQLVRSKIQMSQARWIPPAHGSWSLKVDAS
RDDSLNRGGVGWVLHDSSGSPICSGFKRIAVQVASDAISVINLLNNRDSDQSEIWFLAKEIERMRADFVNISFVHIPRAQNDEAHFLARFAAFGSLECTSRVLYPSFNSE
EGIFLLGQFLPDILSPLLREADVAFFSR