CuGenDBv2

Lag0022057 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022057
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr7:17178991..17181531
RNA-Seq Expression: Lag0022057
Synteny: Lag0022057
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR005135 - Endonuclease/exonuclease/phosphatase
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
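As a quick check on the genome location field, the span of the gene model can be computed from the coordinate string. The sketch below assumes the coordinates are 1-based and inclusive, which is common for `chr:start..end` notation but is not stated on this page. `Location` and `parse_location` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(chrom, start, end)


loc = parse_location("chr7:17178991..17181531")
print(loc.chrom, loc.length)  # chr7 2541
```

Under that assumption the gene model spans 2541 bp on chr7.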


Homology
GenBank top hits | e value | % identity
KAF5449841.1 hypothetical protein F2P56_030246 [Juglans regia] | 5.0e-62 | 47.17
Query:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWV-DWDSKNW
        M +L WN+ GLG+PQ  R L +L+    P +VFL ETKM    M + K + GF NCF VDC GRSGGLSL W  EI  S+ SFS+ HID  + + D   W
Subjt:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWV-DWDSKNW

Query:  RFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRIC
        RFTG YG+P    RYL+W L++ LC +TS PWLVGGDFN +L+ HEK+GGR    A+L +FRA ++DC L D+GF G  FTW N R G + + ERLDR  
Subjt:  RFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRIC

Query:  VTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGL---RGARRGRIHRFEEAWNRLPECSEIV
          +   ++FP   + H     SDH PIL      TGL   RG +  +  RFE  W    +C++I+
Subjt:  VTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGL---RGARRGRIHRFEEAWNRLPECSEIV

KAF7824053.1 hypothetical protein G2W53_022197 [Senna tora] | 1.7e-57 | 37.91
Query:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCG----RSGGLSLFWSAEIGFSLLSFSKNHIDGWVDWDS
        MS + WN RGLG+P+A R L  L Q  +P ++FL ETK   S M  +K QLGF   F+VDC G    R+GGL+LFW   +  +L SFS NHID  V    
Subjt:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCG----RSGGLSLFWSAEIGFSLLSFSKNHIDGWVDWDS

Query:  KN--WRFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKER
         N  WR TG +G P+ + ++ +W LL+ L  N+  PWL  GDFN I+F  EKQGG  KS+  +  FR + + CG  D+GF G  FTW N R  + N++ER
Subjt:  KN--WRFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKER

Query:  LDRICVTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIVIC----------QPKPVIVLLSF----------
        LDR+  TE W   FP   ++H+    SDH  + ++          RR R+ RFEEAW     C +++            +   V   LSF          
Subjt:  LDRICVTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIVIC----------QPKPVIVLLSF----------

Query:  --GAVLCGGKELLVKGLRWRIGDGQSASIYSSNWL
             +   + +L  G  WR+G+G   +I+  NW+
Subjt:  --GAVLCGGKELLVKGLRWRIGDGQSASIYSSNWL

XP_012851712.1 PREDICTED: uncharacterized protein LOC105971405 [Erythranthe guttata] | 2.2e-57 | 42.97
Query:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDWDSKN--
        MS +FWN +GLG+P     L ++++ H+P +VFL+ET+     ++ +K +    N  SVD  G SGGL+L W  +I   L+S+S NHID  V   + N  
Subjt:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDWDSKN--

Query:  WRFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRI
        WR TGFYG P+ + +YLSW L++NL  + + PWL+GGDFN IL   EK GG  +  A +  FR +L+DC L D+GF G  FTW+N R   + V+ RLDR 
Subjt:  WRFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRI

Query:  CVTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIV
        C   GW  LFP  S+ HL ++ SDH PI   L     ++  R+ R  RFE  W R  +C +I+
Subjt:  CVTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIV

XP_012857846.1 PREDICTED: uncharacterized protein LOC105977118 [Erythranthe guttata] | 1.7e-57 | 42.97
Query:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDWDSKN--
        MS +FWN +GLG+P     L ++++ H+P +VFL+ET+     ++ +K +    N  SVD  G SGGL+L W  +I   L+S+S NHID  V   + N  
Subjt:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDWDSKN--

Query:  WRFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRI
        WR TGFYG P+ + +YLSW L++NL  + + PWL+GGDFN IL   EK GG  +  A +  FR +L+DC L D+GF G  FTW+N R   + V+ RLDR 
Subjt:  WRFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRI

Query:  CVTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIV
        C   GW  LFP  S+ HL ++ SDH PI   L     ++  R+ R  RFE  W R  +C +I+
Subjt:  CVTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIV

XP_024036939.1 uncharacterized protein LOC112096938 [Citrus clementina] | 1.7e-57 | 43.02
Query:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDWDSKN-W
        M++L WN RGLG+P+AF+ L  ++Q H  Q+VFL ETK+   +MN I  +L F+NC +V+C GR GG+++ W ++IG  + S+S++HID      + N  
Subjt:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDWDSKN-W

Query:  RFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRIC
        R TG YG+P+T  +  +W LL+ L   +S+PWL  GDFN IL   EK GG D++   +NDFR  L DCGL DVG+ G  FTW N R G+  ++ERLDR  
Subjt:  RFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRIC

Query:  VTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTG---LRGARRGRIHRFEEAWNRLPECSEIV
          + W D F DC   +L    SDH P+L+ +   +G    RG    +IH +E+ W+    C EI+
Subjt:  VTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTG---LRGARRGRIHRFEEAWNRLPECSEIV

TrEMBL top hits | e value | % identity
A0A2N9EV35 Uncharacterized protein | 1.2e-56 | 43.46
Query:  LLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDW-DSKNWRF
        LL WN RGLG+P A R L+ +V++  P ++FL ETK+    M  ++++LG+ N F+V   GRSGGL+L W+ +I  ++ +F+ NHID  V+  D K WR 
Subjt:  LLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDW-DSKNWRF

Query:  TGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRICVT
        T F G P+ + ++ SWALL +L    + PWL  GDFN I+ Q+EK+G   +S A++  FR   + C L+D+GF+G  FTW N R G  NV+ER+DR   +
Subjt:  TGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRICVT

Query:  EGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIV
          W + FP+  + HLP   SDH PIL+ + +       RR + HRFEE W   PEC EI+
Subjt:  EGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIV

A0A2N9EWI9 Reverse transcriptase domain-containing protein | 3.6e-58 | 38.66
Query:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDWDS-KNW
        M+L+ WN RGLG+ +  R L +LV++  P ++FL ETK+    M  I++ LG+ N F V C GR GGL+LFW  +IG  + S+S++HID  V   S   W
Subjt:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDWDS-KNW

Query:  RFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRIC
        R TGFYG+P+   R  SWALL +L      PW   GDFN ILFQ EKQG  D+ E ++  FR  L  C L D+GF G  FTW N R G  NV+ RLDR  
Subjt:  RFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRIC

Query:  VTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIVICQPKPVIVLLSFGAVLCGGKELLVKGLRWRIGDGQSA
         T  W   F   ++ HLP +RSDH  +LL  V   G    RR R+HRFE+ W   PEC  +V    K  +   S   ++C      +K +R  + +   A
Subjt:  VTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIVICQPKPVIVLLSFGAVLCGGKELLVKGLRWRIGDGQSA

Query:  SIYSSNWLPRDFSLCVSSAVTLPLDTWVKSGYRVGQQVRLADRASSSSSMQSRVWWK
         +Y      R  ++ +     L    +     + G QVR A+R      +   V+W+
Subjt:  SIYSSNWLPRDFSLCVSSAVTLPLDTWVKSGYRVGQQVRLADRASSSSSMQSRVWWK

A0A2N9G8I6 Reverse transcriptase domain-containing protein | 4.7e-58 | 42.37
Query:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVD-WDSKNW
        M LL WN +GLG+P+A R L+++V+   P+++FL ETK+   RM  I+++LGF N F+V   GRSGGL+L W A+    + ++S++HID  VD   +K W
Subjt:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVD-WDSKNW

Query:  RFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRIC
        R TGFYG P+   R  SWALLK+L      PW   GDFN IL  +EK GGR++S  ++ +F+ +++ C  +D+GF G  +TWTN R    N++ RLDR  
Subjt:  RFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRIC

Query:  VTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIV
         T  W DLFP  S+ H+P + SDH  +++++V  T     ++  + RFEE W   P+C +++
Subjt:  VTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIV

A0A2N9IBI9 Reverse transcriptase domain-containing protein | 4.7e-58 | 42.37
Query:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVD-WDSKNW
        M LL WN +GLG+P+A R L+++V+   P+++FL ETK+   RM  I+++LGF N F+V   GRSGGL+L W A+    + ++S++HID  VD   +K W
Subjt:  MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVD-WDSKNW

Query:  RFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRIC
        R TGFYG P+   R  SWALLK+L      PW   GDFN IL  +EK GGR++S  ++ +F+ +++ C  +D+GF G  +TWTN R    N++ RLDR  
Subjt:  RFTGFYGNPQTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRIC

Query:  VTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIV
         T  W DLFP  S+ H+P + SDH  +++++V  T     ++  + RFEE W   P+C +++
Subjt:  VTEGWKDLFPDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIV

A0A5C7IIT4 Uncharacterized protein | 2.3e-57 | 31.71
Query:  GLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDW-DSKNWRFTGFYGNP
        GLG  +AFR L++L+Q + P +VFL E       M S++I+LGF     VD  G SGGL L W AEI  +LLS+S+ HID  V   + K WR TGFYG+P
Subjt:  GLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDW-DSKNWRFTGFYGNP

Query:  QTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRICVTEGWKDLF
            R   W LL+ L G +  PW VGGDFN I+   EK GG  + E  + +F+ +L+DCGL D+GF G  FTW+N R  E  ++ERLDR     GW DLF
Subjt:  QTELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRICVTEGWKDLF

Query:  PDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRF--EEAW-----------NRLPECSEIVICQPKPVIVLLSFGAVLCGGKELLVKGLRWRIGD
           SI HL F +SDHRPILL +       G R    HRF  +  W           ++LP    ++ C+ K  I+   F  V+            WR+  
Subjt:  PDCSIDHLPFNRSDHRPILLTLVRFTGLRGARRGRIHRF--EEAW-----------NRLPECSEIVICQPKPVIVLLSFGAVLCGGKELLVKGLRWRIGD

Query:  GQSASIY-SSNWLPRDFSLCVSSAVTLPLDTWVKSGYRVGQQVRLADRASSSSSMQSRV--WWKGYGICSSLGKSRYSCGAFVADEERMLSIFFGSDLKE
         ++  +Y  S+    DF +         LD W  S     Q  + A    + S ++ RV   WK     S  G  + +  A +    ++  I        
Subjt:  GQSASIY-SSNWLPRDFSLCVSSAVTLPLDTWVKSGYRVGQQVRLADRASSSSSMQSRV--WWKGYGICSSLGKSRYSCGAFVADEERMLSIFFGSDLKE

Query:  GISWDEFAVLATFLWGLWNAQNRLRLQGLVPTSDIGPWALHGSGYCGSGLPWQRSAIDGKVHRGCALSVDLVEGLAATEGVRLCSESGCRPFHLETDSSR
                V+    +G   A       GL                                     L    VE +A   G RL  E+G  P  +E+DS  
Subjt:  GISWDEFAVLATFLWGLWNAQNRLRLQGLVPTSDIGPWALHGSGYCGSGLPWQRSAIDGKVHRGCALSVDLVEGLAATEGVRLCSESGCRPFHLETDSSR

Query:  IFQLLQGDVEDLSEVGMVTSSLRQHLSSVGFPSF-FTLREVNQAADCLAKLALQNRWNQVWVEDYPKELSSFLV
        +  L+       +E+G+V   +    S+  F S  F  R  N  A  LAKL+L      VW+ED P  + S ++
Subjt:  IFQLLQGDVEDLSEVGMVTSSLRQHLSSVGFPSF-FTLREVNQAADCLAKLALQNRWNQVWVEDYPKELSSFLV

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
No hits found
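
The % identity values in the hit tables can be related to the alignments through the middle line of each alignment block. In BLAST-style pairwise output a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for a 15-column excerpt taken from the start of the Juglans regia alignment. `midline_stats` is a hypothetical helper name, and because the excerpt is only a fragment the result is not expected to reproduce the reported 47.17 for the full alignment.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for an alignment midline."""
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    positives = identities + midline.count("+")       # '+' = conservative substitution
    return identities, positives, len(midline)


# First 15 columns of the query line and the middle line of the first hit.
query = "MSLLFWNARGLGSPQ"
midline = "M +L WN+ GLG+PQ"
assert len(query) == len(midline)

ident, pos, cols = midline_stats(midline)
print(f"{ident}/{cols} identical ({100 * ident / cols:.1f}%), {pos}/{cols} positive")
# 9/15 identical (60.0%), 12/15 positive
```

Counting over the middle line only works when the original column spacing is preserved, so the alignment text should be copied without trimming whitespace.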

Sequences
CDS sequence
ATGAGTCTCCTATTTTGGAATGCTCGTGGTTTGGGGTCACCACAAGCATTCCGCAGATTGTACAATCTGGTGCAATCTCATAAACCCCAAATGGTGTTCCTTACAGAAAC
AAAGATGTGTGGTTCCAGGATGAATTCAATCAAAATTCAACTGGGTTTTCAGAATTGTTTTAGTGTTGATTGTTGTGGAAGAAGCGGTGGGTTAAGTCTTTTTTGGTCTG
CTGAGATTGGCTTCTCTCTTCTTTCCTTTTCGAAAAATCATATCGATGGCTGGGTTGATTGGGACTCAAAGAATTGGCGCTTCACCGGCTTCTATGGCAACCCTCAGACA
GAATTGAGATACTTATCTTGGGCCTTGCTTAAGAATCTTTGTGGCAATACTTCTACTCCGTGGCTAGTTGGGGGTGACTTTAATGGGATCCTCTTCCAACATGAGAAACA
AGGAGGTAGAGATAAGTCAGAGGCCGAGTTGAACGATTTTAGAGCATCGCTGGATGATTGTGGTTTGATGGATGTTGGTTTTACGGGCGATATATTCACATGGACCAATG
AACGACCAGGAGAGGAAAATGTTAAGGAACGTTTGGATCGTATTTGTGTCACAGAGGGATGGAAGGACCTTTTTCCTGATTGTTCGATTGATCACCTTCCATTTAACAGA
TCCGACCACAGGCCAATTTTGTTGACGCTCGTTCGTTTTACTGGTCTAAGGGGAGCACGCAGAGGTAGGATTCATCGCTTTGAAGAAGCGTGGAATCGACTTCCCGAATG
TTCTGAGATTGTTATTTGCCAGCCCAAACCGGTTATCGTGCTTCTTTCATTTGGCGCAGTGTTATGTGGGGGAAAAGAGCTTCTGGTAAAGGGTCTTCGATGGCGAATTG
GGGACGGTCAGTCTGCTTCGATTTATTCCTCCAATTGGCTTCCTCGAGATTTCTCTCTTTGTGTGAGTTCGGCGGTGACGCTTCCTTTGGATACATGGGTGAAGAGCGGG
TATAGAGTGGGCCAACAAGTCCGCTTGGCTGACAGGGCTTCTTCGTCTAGCTCGATGCAATCCCGTGTCTGGTGGAAGGGTTATGGAATATGCAGCTCCCTGGGAAAATC
AAGATATTCATGTGGCGCTTTTGTGGCCGACGAGGAGAGGATGCTCTCCATATTTTTTGGCTCTGATTTAAAGGAAGGGATTTCTTGGGATGAGTTTGCGGTTCTTGCGA
CTTTTCTTTGGGGTCTTTGGAATGCACAAAATCGTCTCCGTCTGCAAGGTTTAGTGCCAACTTCTGATATTGGGCCATGGGCACTTCACGGGTCTGGGTATTGTGGTTCG
GGACTGCCATGGCAACGTTCTGCTATCGACGGTAAAGTTCATCGTGGGTGTGCGCTGTCGGTGGATCTTGTAGAAGGCCTTGCTGCTACTGAGGGGGTCCGATTGTGTTC
TGAATCTGGATGCCGACCTTTTCATTTGGAAACTGATTCTTCGCGGATCTTCCAGCTGTTGCAAGGTGACGTCGAAGACCTTTCAGAAGTGGGCATGGTGACTTCCAGTC
TTCGTCAACATCTATCATCTGTTGGGTTCCCTTCTTTCTTCACGTTGCGAGAGGTCAATCAAGCAGCAGATTGTTTGGCAAAGCTAGCATTGCAAAATCGTTGGAACCAA
GTTTGGGTTGAAGACTATCCTAAGGAACTTTCTAGTTTTCTTGTTGTTAATGCTTCTTTTGTTATTTCTTAA
mRNA sequence
ATGAGTCTCCTATTTTGGAATGCTCGTGGTTTGGGGTCACCACAAGCATTCCGCAGATTGTACAATCTGGTGCAATCTCATAAACCCCAAATGGTGTTCCTTACAGAAAC
AAAGATGTGTGGTTCCAGGATGAATTCAATCAAAATTCAACTGGGTTTTCAGAATTGTTTTAGTGTTGATTGTTGTGGAAGAAGCGGTGGGTTAAGTCTTTTTTGGTCTG
CTGAGATTGGCTTCTCTCTTCTTTCCTTTTCGAAAAATCATATCGATGGCTGGGTTGATTGGGACTCAAAGAATTGGCGCTTCACCGGCTTCTATGGCAACCCTCAGACA
GAATTGAGATACTTATCTTGGGCCTTGCTTAAGAATCTTTGTGGCAATACTTCTACTCCGTGGCTAGTTGGGGGTGACTTTAATGGGATCCTCTTCCAACATGAGAAACA
AGGAGGTAGAGATAAGTCAGAGGCCGAGTTGAACGATTTTAGAGCATCGCTGGATGATTGTGGTTTGATGGATGTTGGTTTTACGGGCGATATATTCACATGGACCAATG
AACGACCAGGAGAGGAAAATGTTAAGGAACGTTTGGATCGTATTTGTGTCACAGAGGGATGGAAGGACCTTTTTCCTGATTGTTCGATTGATCACCTTCCATTTAACAGA
TCCGACCACAGGCCAATTTTGTTGACGCTCGTTCGTTTTACTGGTCTAAGGGGAGCACGCAGAGGTAGGATTCATCGCTTTGAAGAAGCGTGGAATCGACTTCCCGAATG
TTCTGAGATTGTTATTTGCCAGCCCAAACCGGTTATCGTGCTTCTTTCATTTGGCGCAGTGTTATGTGGGGGAAAAGAGCTTCTGGTAAAGGGTCTTCGATGGCGAATTG
GGGACGGTCAGTCTGCTTCGATTTATTCCTCCAATTGGCTTCCTCGAGATTTCTCTCTTTGTGTGAGTTCGGCGGTGACGCTTCCTTTGGATACATGGGTGAAGAGCGGG
TATAGAGTGGGCCAACAAGTCCGCTTGGCTGACAGGGCTTCTTCGTCTAGCTCGATGCAATCCCGTGTCTGGTGGAAGGGTTATGGAATATGCAGCTCCCTGGGAAAATC
AAGATATTCATGTGGCGCTTTTGTGGCCGACGAGGAGAGGATGCTCTCCATATTTTTTGGCTCTGATTTAAAGGAAGGGATTTCTTGGGATGAGTTTGCGGTTCTTGCGA
CTTTTCTTTGGGGTCTTTGGAATGCACAAAATCGTCTCCGTCTGCAAGGTTTAGTGCCAACTTCTGATATTGGGCCATGGGCACTTCACGGGTCTGGGTATTGTGGTTCG
GGACTGCCATGGCAACGTTCTGCTATCGACGGTAAAGTTCATCGTGGGTGTGCGCTGTCGGTGGATCTTGTAGAAGGCCTTGCTGCTACTGAGGGGGTCCGATTGTGTTC
TGAATCTGGATGCCGACCTTTTCATTTGGAAACTGATTCTTCGCGGATCTTCCAGCTGTTGCAAGGTGACGTCGAAGACCTTTCAGAAGTGGGCATGGTGACTTCCAGTC
TTCGTCAACATCTATCATCTGTTGGGTTCCCTTCTTTCTTCACGTTGCGAGAGGTCAATCAAGCAGCAGATTGTTTGGCAAAGCTAGCATTGCAAAATCGTTGGAACCAA
GTTTGGGTTGAAGACTATCCTAAGGAACTTTCTAGTTTTCTTGTTGTTAATGCTTCTTTTGTTATTTCTTAA
Protein sequence
MSLLFWNARGLGSPQAFRRLYNLVQSHKPQMVFLTETKMCGSRMNSIKIQLGFQNCFSVDCCGRSGGLSLFWSAEIGFSLLSFSKNHIDGWVDWDSKNWRFTGFYGNPQT
ELRYLSWALLKNLCGNTSTPWLVGGDFNGILFQHEKQGGRDKSEAELNDFRASLDDCGLMDVGFTGDIFTWTNERPGEENVKERLDRICVTEGWKDLFPDCSIDHLPFNR
SDHRPILLTLVRFTGLRGARRGRIHRFEEAWNRLPECSEIVICQPKPVIVLLSFGAVLCGGKELLVKGLRWRIGDGQSASIYSSNWLPRDFSLCVSSAVTLPLDTWVKSG
YRVGQQVRLADRASSSSSMQSRVWWKGYGICSSLGKSRYSCGAFVADEERMLSIFFGSDLKEGISWDEFAVLATFLWGLWNAQNRLRLQGLVPTSDIGPWALHGSGYCGS
GLPWQRSAIDGKVHRGCALSVDLVEGLAATEGVRLCSESGCRPFHLETDSSRIFQLLQGDVEDLSEVGMVTSSLRQHLSSVGFPSFFTLREVNQAADCLAKLALQNRWNQ
VWVEDYPKELSSFLVVNASFVIS
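
The CDS and protein sequences listed above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch that assumes the standard nuclear code applies to this gene. It builds the codon table from the conventional TCAG ordering rather than relying on an external library, and `translate` is a hypothetical helper name. Only the first 36 and the last 72 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


head = "ATGAGTCTCCTATTTTGGAATGCTCGTGGTTTGGGG"  # first 36 nt of the CDS
tail = ("GTTTGGGTTGAAGACTATCCTAAGGAACTTTCTAGTTTTCTT"
        "GTTGTTAATGCTTCTTTTGTTATTTCTTAA")       # last 72 nt of the CDS

print(translate(head))  # MSLLFWNARGLG
print(translate(tail))  # VWVEDYPKELSSFLVVNASFVIS*
```

Both fragments agree with the start and end of the listed protein sequence, with the trailing `*` corresponding to the terminal TAA stop codon.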