; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021189 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021189
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:5405989..5407646
RNA-Seq ExpressionLag0021189
SyntenyLag0021189
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023914298.1 uncharacterized protein LOC112025844 [Quercus suber]5.9e-11040.58Show/hide
Query:  VFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVDGGASP---WRFTGIYGHPQAELKARTWALMKHLRGSSE
        +FL ET    +R   L   L +  C+   S G+ G LAL W   V   ++S S NH+D  V  G +P   WRF+G+YG      KA TWAL++ L     
Subjt:  VFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVDGGASP---WRFTGIYGHPQAELKARTWALMKHLRGSSE

Query:  TPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILL
         PWL  GDFN +L+ HEK G  P+    +L FRE +D C LMDLG+ G  FTW  +R G  +V ER+DR + + AW  +F G +VRHL+   SDH+ I++
Subjt:  TPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILL

Query:  TLSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPKTLASATKRCMSGLSKWG-------RPGWEILG---INAEE-------------
         L      +    +R  +FE+ WL  +G  E + + W  S    +   +A   K+C   L+ W        R   E +G     AEE             
Subjt:  TLSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPKTLASATKRCMSGLSKWG-------RPGWEILG---INAEE-------------

Query:  ----QLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVT
            +L  LL ++ + W+QR+R  +LK GDRNT +FH+ AS R RRN++ GL +    W  D  +++ +   YF  L+ TSQPS  E+   L  V  SVT
Subjt:  ----QLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVT

Query:  DEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKLV
         EMN++LL PF ++EV +AL Q+    APGPDG+  +FY   W+V+G +V    L+ LNN   P+ +N T I LIPK +SP  +S+YRPISLCNV YKLV
Subjt:  DEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKLV

Query:  SKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG
        SKVL NR K +L  VIS +QSAF  GR + DN ++ YE LH +K   +GK+G
Subjt:  SKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG

XP_030924745.1 uncharacterized protein LOC115951731 [Quercus lobata]1.1e-11139.27Show/hide
Query:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVDGGA-SPWRFTGIYGHPQAELKARTWALMKHLRGSSET
        MVFL ETKV+    + +  +L +   F V      GGLALLW       + S+S NH+D  VD G    WRFTG YG P    +  +W+L++ L      
Subjt:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVDGGA-SPWRFTGIYGHPQAELKARTWALMKHLRGSSET

Query:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT
        PWL  GDFN +    EK G   +P  ++  FR+A+D C L DLG++G  FTWCNRR G   VW R+DR +  V W   F    + HLD   SDH+PILL 
Subjt:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT

Query:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPKTLASATKRCMSGLSKWGRPGWEILGINAEEQLESLLV-----------------
                +  G R  RFE  W+     ++ +  +WG +    S     +  +     L  W R  +  +  +  ++L+ L V                 
Subjt:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPKTLASATKRCMSGLSKWGRPGWEILGINAEEQLESLLV-----------------

Query:  ---------EDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVTDE
                 ++E  WKQRSR +WLK GDRNT +FH  A+ R +RN + GLEDE G W    + +  ++E YF++++T+S PS  + ++ L  +   +T E
Subjt:  ---------EDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVTDE

Query:  MNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKLVSK
        M++ L R +Q +EVL ALKQ+ P  APGPDG+S +FY+  W +VG DV+   L+ LN+ +    LN T I LIPK ++P+RV+E+RPISLCNV YKL++K
Subjt:  MNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKLVSK

Query:  VLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG
        V+VNR+K IL  VI  SQSAF+ GR + DN ++ +E LH LK +T+G+ G
Subjt:  VLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG

XP_030925054.1 uncharacterized protein LOC115952115 [Quercus lobata]3.5e-11038.36Show/hide
Query:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVDGGA-SPWRFTGIYGHPQAELKARTWALMKHLRGSSET
        ++FL ETKV+ +    +  ++ Y   F V  +   GGLAL WT +    + S+S NH+D  +D G    WRFTG YG P+   +  +W++++ L      
Subjt:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVDGGA-SPWRFTGIYGHPQAELKARTWALMKHLRGSSET

Query:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT
        PWL  GDFN +L+  EK G   +P  ++ GFR+A+D C L DLG++G  FTWCNRRPG   VW R+DR +  V W   F    + HLD   SDH+PILL+
Subjt:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT

Query:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPKTLASATKRCMSGLSKWGRPGW---------EILGINAEEQ--------------
          +  +  +  G R   FE  WL  +  +E +  +WG+     +     S    C + L  W +  +         ++  + +EE+              
Subjt:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPKTLASATKRCMSGLSKWGRPGW---------EILGINAEEQ--------------

Query:  ---LESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVTDE
           ++ L   +E  WKQRSR+ WLK GD+NTR+FH  A+ R RRN + GLED++G+W +D   +  ++EGYF+ ++T+S P     D  L  + S V  +
Subjt:  ---LESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVTDE

Query:  MNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKLVSK
           +L    Q  EV  AL Q+ P  APG DG+S VFY+  W +VG DV    L  LN+ + P  +N T I LIPK ++P++VS++RPISLCNV YKL++K
Subjt:  MNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKLVSK

Query:  VLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG
        V+ NR+K  L   +  SQSAF+ GR + DN ++ +E LH LK +TRGK G
Subjt:  VLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG

XP_030939698.1 uncharacterized protein LOC115964550 [Quercus lobata]7.2e-11640.55Show/hide
Query:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWV-DGGASPWRFTGIYGHPQAELKARTWALMKHLRGSSET
        ++FL ETK        ++  L Y   F V S  RSGGLALLW  E+   + +++ NH+D  + D  A+ WR TG YG P+ + K  +W L+KHL      
Subjt:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWV-DGGASPWRFTGIYGHPQAELKARTWALMKHLRGSSET

Query:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT
        PWL  GDFN +L   EK GG PKP   +L FREA+  C L+DLGY G +FTW N R  +D+V ER+DR    + WRD FA  +V HL+ S SDH PIL+T
Subjt:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT

Query:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPKT-LASATKRCMSGLSKWGRPGWEILGINAEE-----------------------
          +  H          RFEE W      +  +   W      GSP   L    KRC   L  W R  + +     +E                       
Subjt:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPKT-LASATKRCMSGLSKWGRPGWEILGINAEE-----------------------

Query:  --QLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVTDE
          ++ +++ +DE++W+QRSR  WL  GD+NT++FHN AS RRR+N + G+ D D  W    ++I  + E YF++L++T+ P  + ++  L  V   VT  
Subjt:  --QLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVTDE

Query:  MNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKLVSK
        MN  L RP+  DEV LAL Q+HP+K+PGPDG+S  F+Q  W ++G DV +  L+ L +      +N T IVLIPKK+ P+ +++YRPISL NV  +++SK
Subjt:  MNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKLVSK

Query:  VLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG
        V+ NR+K IL  VIS SQSAF+P R + DN  + YE LH ++ R RGK G
Subjt:  VLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]2.0e-11340Show/hide
Query:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVDG-GASPWRFTGIYGHPQAELKARTWALMKHLRGSSET
        ++FL ETK+ + R +  K+RLG+  CF VDS GRSGGLALLW  ++   +I+YSS+H+   +       W  TG+YGH  +  ++  W L+K L      
Subjt:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVDG-GASPWRFTGIYGHPQAELKARTWALMKHLRGSSET

Query:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT
        PW++ GDFN +L   EK GG  +   ++  FRE +  C L DLGY G  FTW NRR  ED+V ER+DR + N  W DMF    V H   + SDH P  L 
Subjt:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT

Query:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPKTLASATKRCMSGLSKWGRPGWEILGIN---AEEQLESL----------------
        L      V     R+ RFE  W+        +   WG      S   +      C + L +W +  +  +  N   A+ +L+ L                
Subjt:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPKTLASATKRCMSGLSKWGRPGWEILGIN---AEEQLESL----------------

Query:  -------LVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVTDE
               L  DE+ WKQRSR  WL+ GD N+R+FH+ AS+RRR+N +  L+DE G+WQ+  D++  LI  YF+ L+T +     +++  L  V + VT E
Subjt:  -------LVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVTDE

Query:  MNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKLVSK
        MN  LL+P+  +EV +ALKQ+HP+KAPGPDG+  +F+Q  W ++G  +    L+ LN+ + P+ LN T I LIPKK SP +V+++RPISLCNV YK++SK
Subjt:  MNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKLVSK

Query:  VLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG
        V+ NR+K +L ++IS SQSAF+PGR + DN ++ YE LH L+ + +G+ G
Subjt:  VLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG

TrEMBL top hitse value%identityAlignment
A0A2N9EV35 Uncharacterized protein1.1e-11742.13Show/hide
Query:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVD-GGASPWRFTGIYGHPQAELKARTWALMKHLRGSSET
        ++FL ETK+     + L+V+LGY   F V S+GRSGGLALLW  +++ ++ +++SNH+D  V+      WR T   G P+ + K  +WAL+ HL      
Subjt:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVD-GGASPWRFTGIYGHPQAELKARTWALMKHLRGSSET

Query:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT
        PWL  GDFN ++  +EK G  P+   ++  FRE  + C L+D+G+SG  FTW N R G   V ERIDR   +  W + F    V HL    SDH PIL+ 
Subjt:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT

Query:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPK-TLASATKRCMSGLSKWGRPGWEILGINAEEQLESLLVEDEMYWKQRSRDSWLK
        +     ++     +  RFEE W+ +   +E +   W   G VGSP   L    KRC  GL +W +  +                 DE++WKQR R  WLK
Subjt:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPK-TLASATKRCMSGLSKWGRPGWEILGINAEEQLESLLVEDEMYWKQRSRDSWLK

Query:  WGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNSKLLRPFQQDEVLLALKQIHPNK
         GDRNTR+FH  A+ R++ N V GL D    W  DPD +  +   YF++L+TTS PS   ID  LL V   VT EMN +LL P+   E+  AL Q+HP+K
Subjt:  WGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNSKLLRPFQQDEVLLALKQIHPNK

Query:  APGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKLVSKVLVNRMKGILNEVISLSQSAFIPGR
        +PGPDG+S +F+Q  W ++G DVVQ    +L +      +N T I LIPK ++P+++S+YRPISLCNV YK++SK L NR+K  L  +IS +QSAF+PGR
Subjt:  APGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKLVSKVLVNRMKGILNEVISLSQSAFIPGR

Query:  CVVDNAILGYECLHVLKGRTRGKTGGL
         + DN I+ YE L+ LK R  GKTG +
Subjt:  CVVDNAILGYECLHVLKGRTRGKTGGL

A0A2N9HU09 Reverse transcriptase domain-containing protein8.3e-11841.05Show/hide
Query:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVD-GGASPWRFTGIYGHPQAELKARTWALMKHLRGSSET
        ++FL ETK+     + ++V+LG+  CF V   GRSGGLALLW    + ++ ++S NHVD  V     + WRFTG YGHP+   K  +W L+  L     T
Subjt:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVD-GGASPWRFTGIYGHPQAELKARTWALMKHLRGSSET

Query:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT
        PWL  GDFN +L   E+ G        +  F + V+ C L+DLG+ G  FTW NRR GE ++ +R+DR + N AW D F    V H+  S SDH P+LL 
Subjt:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT

Query:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPK-TLASATKRCMSGLSKW----------------------------GRPGWEILG
        +            R ++FEE W L    +  +   W    A+GSP   L    K C   L +W                            G+    I  
Subjt:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPK-TLASATKRCMSGLSKW----------------------------GRPGWEILG

Query:  INAEEQLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSV
        + AE  +  LL+ +E++W+QRSR +WL  GD NT++FH+ A+ RRR N + GL + D VW  D  +I  +   YF+D++ TS P +  ++  L  V+S V
Subjt:  INAEEQLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSV

Query:  TDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKL
        T E N +LL+PF  DEV +AL Q+HP+KAPGPDG+S  F+Q  W++VG DVV   L++LN+      +N T I LIPKK++P R+SEYRPISLCNV YK+
Subjt:  TDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKL

Query:  VSKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG
        +SKVL NR+K IL  +IS SQSAF+PGR + DN  + +E +H +K + RGK G
Subjt:  VSKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG

A0A2N9I475 Reverse transcriptase domain-containing protein8.3e-11841.05Show/hide
Query:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVD-GGASPWRFTGIYGHPQAELKARTWALMKHLRGSSET
        ++FL ETK+     + ++V+LG+  CF V   GRSGGLALLW    + ++ ++S NHVD  V     + WRFTG YGHP+   K  +W L+  L     T
Subjt:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVD-GGASPWRFTGIYGHPQAELKARTWALMKHLRGSSET

Query:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT
        PWL  GDFN +L   E+ G        +  F + V+ C L+DLG+ G  FTW NRR GE ++ +R+DR + N AW D F    V H+  S SDH P+LL 
Subjt:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT

Query:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPK-TLASATKRCMSGLSKW----------------------------GRPGWEILG
        +            R ++FEE W L    +  +   W    A+GSP   L    K C   L +W                            G+    I  
Subjt:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPK-TLASATKRCMSGLSKW----------------------------GRPGWEILG

Query:  INAEEQLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSV
        + AE  +  LL+ +E++W+QRSR +WL  GD NT++FH+ A+ RRR N + GL + D VW  D  +I  +   YF+D++ TS P +  ++  L  V+S V
Subjt:  INAEEQLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSV

Query:  TDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKL
        T E N +LL+PF  DEV +AL Q+HP+KAPGPDG+S  F+Q  W++VG DVV   L++LN+      +N T I LIPKK++P R+SEYRPISLCNV YK+
Subjt:  TDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKL

Query:  VSKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG
        +SKVL NR+K IL  +IS SQSAF+PGR + DN  + +E +H +K + RGK G
Subjt:  VSKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG

A0A2N9ISW4 Reverse transcriptase domain-containing protein8.3e-11841.05Show/hide
Query:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVD-GGASPWRFTGIYGHPQAELKARTWALMKHLRGSSET
        ++FL ETK+     + ++V+LG+  CF V   GRSGGLALLW    + ++ ++S NHVD  V     + WRFTG YGHP+   K  +W L+  L     T
Subjt:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVD-GGASPWRFTGIYGHPQAELKARTWALMKHLRGSSET

Query:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT
        PWL  GDFN +L   E+ G        +  F + V+ C L+DLG+ G  FTW NRR GE ++ +R+DR + N AW D F    V H+  S SDH P+LL 
Subjt:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT

Query:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPK-TLASATKRCMSGLSKW----------------------------GRPGWEILG
        +            R ++FEE W L    +  +   W    A+GSP   L    K C   L +W                            G+    I  
Subjt:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPK-TLASATKRCMSGLSKW----------------------------GRPGWEILG

Query:  INAEEQLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSV
        + AE  +  LL+ +E++W+QRSR +WL  GD NT++FH+ A+ RRR N + GL + D VW  D  +I  +   YF+D++ TS P +  ++  L  V+S V
Subjt:  INAEEQLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSV

Query:  TDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKL
        T E N +LL+PF  DEV +AL Q+HP+KAPGPDG+S  F+Q  W++VG DVV   L++LN+      +N T I LIPKK++P R+SEYRPISLCNV YK+
Subjt:  TDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKL

Query:  VSKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG
        +SKVL NR+K IL  +IS SQSAF+PGR + DN  + +E +H +K + RGK G
Subjt:  VSKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG

A0A2N9IT57 Reverse transcriptase domain-containing protein8.3e-11841.05Show/hide
Query:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVD-GGASPWRFTGIYGHPQAELKARTWALMKHLRGSSET
        ++FL ETK+     + ++V+LG+  CF V   GRSGGLALLW    + ++ ++S NHVD  V     + WRFTG YGHP+   K  +W L+  L     T
Subjt:  MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVD-GGASPWRFTGIYGHPQAELKARTWALMKHLRGSSET

Query:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT
        PWL  GDFN +L   E+ G        +  F + V+ C L+DLG+ G  FTW NRR GE ++ +R+DR + N AW D F    V H+  S SDH P+LL 
Subjt:  PWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT

Query:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPK-TLASATKRCMSGLSKW----------------------------GRPGWEILG
        +            R ++FEE W L    +  +   W    A+GSP   L    K C   L +W                            G+    I  
Subjt:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPK-TLASATKRCMSGLSKW----------------------------GRPGWEILG

Query:  INAEEQLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSV
        + AE  +  LL+ +E++W+QRSR +WL  GD NT++FH+ A+ RRR N + GL + D VW  D  +I  +   YF+D++ TS P +  ++  L  V+S V
Subjt:  INAEEQLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSV

Query:  TDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKL
        T E N +LL+PF  DEV +AL Q+HP+KAPGPDG+S  F+Q  W++VG DVV   L++LN+      +N T I LIPKK++P R+SEYRPISLCNV YK+
Subjt:  TDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVTYKL

Query:  VSKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG
        +SKVL NR+K IL  +IS SQSAF+PGR + DN  + +E +H +K + RGK G
Subjt:  VSKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.1e-1729.03Show/hide
Query:  RRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQAL-LHVSSSVTDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQH
        +R +N++  ++++ G    DP  I   I  Y++ LY     + EE+D  L  +    +  E    L RP    E++  +  +   K+PGPDG +  FYQ 
Subjt:  RRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQAL-LHVSSSVTDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQH

Query:  SWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKK-RSPRRVSEYRPISLCNVTYKLVSKVLVNRMKGILNEVISLSQSAFIPG
            +   +++   +I    + P    E  I+LIPK  R   +   +RPISL N+  K+++K+L NR++  + ++I   Q  FIPG
Subjt:  SWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKK-RSPRRVSEYRPISLCNVTYKLVSKVLVNRMKGILNEVISLSQSAFIPG

P08548 LINE-1 reverse transcriptase homolog1.0e-1629.69Show/hide
Query:  NHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQAL--LHVSSSVTDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLS
        N    +R ++ +  + + +     DP  I  ++  Y++ LY+    + +EIDQ L   H+      E+   L RP    E+   ++ +   K+PGPDG +
Subjt:  NHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQAL--LHVSSSVTDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLS

Query:  GVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKK-RSPRRVSEYRPISLCNVTYKLVSKVLVNRMKGILNEVISLSQSAFIPG
          FYQ     +   ++    NI    + P    E  I LIPK  + P R   YRPISL N+  K+++K+L NR++  + ++I   Q  FIPG
Subjt:  GVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKK-RSPRRVSEYRPISLCNVTYKLVSKVLVNRMKGILNEVISLSQSAFIPG

P11369 LINE-1 retrotransposable element ORF2 protein9.7e-1528.8Show/hide
Query:  VKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQAL-----LHVSSSVTDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSW
        +  + +E G    DP+ I   I  +++ LY+T   + +E+D+ L       ++    D +NS    P    E+   +  +   K+PGPDG S  FYQ   
Subjt:  VKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQPSHEEIDQAL-----LHVSSSVTDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSW

Query:  SVVGADVVQCCLNILNNRVSPAPLNETMIVLIPK-KRSPRRVSEYRPISLCNVTYKLVSKVLVNRMKGILNEVISLSQSAFIPG
          +   + +    I      P    E  I LIPK ++ P ++  +RPISL N+  K+++K+L NR++  +  +I   Q  FIPG
Subjt:  SVVGADVVQCCLNILNNRVSPAPLNETMIVLIPK-KRSPRRVSEYRPISLCNVTYKLVSKVLVNRMKGILNEVISLSQSAFIPG

P14381 Transposon TX1 uncharacterized 149 kDa protein6.5e-2726.14Show/hide
Query:  SETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDL----GYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSD
        S+   ++GGDFN  L   +++  + + + E +  RE +    L+D+          FT+   R G  +   RIDR   +           +R   F  SD
Subjt:  SETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDL----GYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSD

Query:  HRPILLTLSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGA-------------VGSP--KTLASATKRCMSGLSKWGRPGWEILGINAE---
        H  + L +S+ A S+  A +    F  + L  +GF ++V   W    A             VG    K L     + +S     G+   EI  +N E   
Subjt:  HRPILLTLSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGA-------------VGSP--KTLASATKRCMSGLSKWGRPGWEILGINAE---

Query:  ---------------EQLESLLVEDEMYWKQ------RSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQ
                       E LE       M  +Q      RSR   L   DR +R+F+     +  R ++  L  EDG   +DP+ I      ++++L+ +  
Subjt:  ---------------EQLESLLVEDEMYWKQ------RSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQ

Query:  PSHEEIDQALLHVSSSVTDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPR
        P   +  + L      V++    +L  P   DE+  AL+ +  NK+PG DGL+  F+Q  W  +G D  +           P      ++ L+PKK   R
Subjt:  PSHEEIDQALLHVSSSVTDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPR

Query:  RVSEYRPISLCNVTYKLVSKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLH
         +  +RP+SL +  YK+V+K +  R+K +L EVI   QS  +PGR + DN  L  + LH
Subjt:  RVSEYRPISLCNVTYKLVSKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLH

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.3e-2223.7Show/hide
Query:  LMGGDFN--AVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT
        ++ GDF+  A   DH        P   L  F+  +   +L+D+   G  +TW N +    I+  ++DR + N  W   F            SDH P ++ 
Subjt:  LMGGDFN--AVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLT

Query:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPK----TLASATKRCMSGLSKWGRPGWEILGINAEEQLESL-------------LV
        L     ++     +  R+         F  +++  W     VGS          A K+C   L++ G    +     A + LES+              V
Subjt:  LSVCAHSVHNAGHRIQRFEETWLLSQGFKEAVSANWGLSGAVGSPK----TLASATKRCMSGLSKWGRPGWEILGINAEEQLESL-------------LV

Query:  ED-------------EMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQP--SHEEIDQALLHVS
        E              E +++Q+SR  WL+ GD NTR+FH    + + +N +K L  +D V  ++  ++  +I  Y+  L  +     + + + +      
Subjt:  ED-------------EMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVWQQDPDRILGLIEGYFEDLYTTSQP--SHEEIDQALLHVS

Query:  SSVTDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVT
            D + S+L       E+  A+  +  NKAPGPD  +  F+  SW VV    +                N T I LIPK     ++S +RP+S C V 
Subjt:  SSVTDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNETMIVLIPKKRSPRRVSEYRPISLCNVT

Query:  YKLVS
        YK+++
Subjt:  YKLVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTTTCTGACAGAAACGAAAGTGCAGACGGCGAGGTTTGATGTGCTTAAAGTTAGGCTGGGGTATCCGGGTTGCTTCTGTGTGGATAGTAATGGGAGAAGTGGAGG
TCTGGCTTTGCTTTGGACTACGGAGGTTAAGTTCAGTCTTATTTCATATTCCTCTAATCATGTAGACGGATGGGTGGATGGAGGGGCGAGTCCATGGAGATTCACAGGGA
TATACGGGCATCCTCAAGCGGAGCTAAAAGCTAGAACGTGGGCATTGATGAAGCATCTCCGAGGGAGTAGTGAGACTCCGTGGCTTATGGGGGGTGACTTTAATGCAGTG
TTGTTTGATCATGAGAAGGATGGGGGAAGGCCTAAGCCGGCTGGGGAGCTGCTTGGTTTCCGGGAGGCGGTGGATGTGTGCGAGCTCATGGATCTTGGTTACAGTGGGCC
TGTATTTACCTGGTGCAATAGGAGACCTGGGGAGGATATTGTTTGGGAGAGAATTGATAGGTGTATGGGTAATGTGGCATGGCGAGATATGTTCGCTGGTTATGAGGTGA
GACACCTGGACTTTAGTCGGTCGGATCATAGACCTATTCTCCTGACGTTGTCGGTGTGTGCTCATTCGGTTCATAATGCAGGTCATAGAATCCAGAGGTTTGAGGAAACG
TGGCTTCTATCACAGGGGTTTAAGGAAGCAGTGTCCGCTAATTGGGGTTTGAGTGGTGCAGTAGGATCGCCGAAGACATTGGCCTCGGCGACGAAACGGTGCATGAGCGG
CCTTAGCAAATGGGGTAGGCCGGGATGGGAAATTTTAGGCATCAATGCAGAAGAACAGTTGGAGTCTTTGCTGGTTGAGGATGAGATGTACTGGAAACAAAGATCCAGAG
ATAGCTGGCTGAAATGGGGGGACAGGAACACTCGATGGTTTCATAATCATGCCTCGAGTAGGAGAAGGAGAAATGAGGTGAAGGGTTTGGAGGATGAGGATGGGGTCTGG
CAGCAAGATCCGGACAGAATCTTGGGGCTTATTGAGGGATATTTTGAGGACTTGTATACGACATCGCAACCTTCACATGAGGAAATAGATCAGGCTCTGTTACATGTGTC
TTCCTCGGTTACGGATGAGATGAACAGTAAGCTCCTGCGCCCGTTCCAGCAGGATGAAGTTCTACTTGCTTTGAAGCAGATTCACCCAAATAAAGCTCCGGGACCGGATG
GATTGTCAGGGGTGTTTTATCAGCACTCGTGGTCTGTCGTTGGGGCAGATGTGGTCCAGTGCTGTCTGAATATCCTGAATAACAGGGTCTCCCCAGCTCCCCTCAACGAA
ACGATGATTGTGTTGATACCAAAGAAGAGGAGCCCCAGACGTGTTTCTGAATACAGGCCCATCTCACTCTGTAACGTGACGTATAAGTTAGTTTCGAAGGTGTTAGTGAA
CCGCATGAAAGGGATTCTAAATGAGGTGATCTCTCTTAGTCAGAGTGCATTTATTCCTGGGCGGTGTGTGGTGGACAATGCCATTTTGGGTTATGAATGTTTGCACGTTT
TGAAAGGGAGAACAAGGGGCAAAACGGGTGGGCTTCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGTTTCTGACAGAAACGAAAGTGCAGACGGCGAGGTTTGATGTGCTTAAAGTTAGGCTGGGGTATCCGGGTTGCTTCTGTGTGGATAGTAATGGGAGAAGTGGAGG
TCTGGCTTTGCTTTGGACTACGGAGGTTAAGTTCAGTCTTATTTCATATTCCTCTAATCATGTAGACGGATGGGTGGATGGAGGGGCGAGTCCATGGAGATTCACAGGGA
TATACGGGCATCCTCAAGCGGAGCTAAAAGCTAGAACGTGGGCATTGATGAAGCATCTCCGAGGGAGTAGTGAGACTCCGTGGCTTATGGGGGGTGACTTTAATGCAGTG
TTGTTTGATCATGAGAAGGATGGGGGAAGGCCTAAGCCGGCTGGGGAGCTGCTTGGTTTCCGGGAGGCGGTGGATGTGTGCGAGCTCATGGATCTTGGTTACAGTGGGCC
TGTATTTACCTGGTGCAATAGGAGACCTGGGGAGGATATTGTTTGGGAGAGAATTGATAGGTGTATGGGTAATGTGGCATGGCGAGATATGTTCGCTGGTTATGAGGTGA
GACACCTGGACTTTAGTCGGTCGGATCATAGACCTATTCTCCTGACGTTGTCGGTGTGTGCTCATTCGGTTCATAATGCAGGTCATAGAATCCAGAGGTTTGAGGAAACG
TGGCTTCTATCACAGGGGTTTAAGGAAGCAGTGTCCGCTAATTGGGGTTTGAGTGGTGCAGTAGGATCGCCGAAGACATTGGCCTCGGCGACGAAACGGTGCATGAGCGG
CCTTAGCAAATGGGGTAGGCCGGGATGGGAAATTTTAGGCATCAATGCAGAAGAACAGTTGGAGTCTTTGCTGGTTGAGGATGAGATGTACTGGAAACAAAGATCCAGAG
ATAGCTGGCTGAAATGGGGGGACAGGAACACTCGATGGTTTCATAATCATGCCTCGAGTAGGAGAAGGAGAAATGAGGTGAAGGGTTTGGAGGATGAGGATGGGGTCTGG
CAGCAAGATCCGGACAGAATCTTGGGGCTTATTGAGGGATATTTTGAGGACTTGTATACGACATCGCAACCTTCACATGAGGAAATAGATCAGGCTCTGTTACATGTGTC
TTCCTCGGTTACGGATGAGATGAACAGTAAGCTCCTGCGCCCGTTCCAGCAGGATGAAGTTCTACTTGCTTTGAAGCAGATTCACCCAAATAAAGCTCCGGGACCGGATG
GATTGTCAGGGGTGTTTTATCAGCACTCGTGGTCTGTCGTTGGGGCAGATGTGGTCCAGTGCTGTCTGAATATCCTGAATAACAGGGTCTCCCCAGCTCCCCTCAACGAA
ACGATGATTGTGTTGATACCAAAGAAGAGGAGCCCCAGACGTGTTTCTGAATACAGGCCCATCTCACTCTGTAACGTGACGTATAAGTTAGTTTCGAAGGTGTTAGTGAA
CCGCATGAAAGGGATTCTAAATGAGGTGATCTCTCTTAGTCAGAGTGCATTTATTCCTGGGCGGTGTGTGGTGGACAATGCCATTTTGGGTTATGAATGTTTGCACGTTT
TGAAAGGGAGAACAAGGGGCAAAACGGGTGGGCTTCACTAA
Protein sequenceShow/hide protein sequence
MVFLTETKVQTARFDVLKVRLGYPGCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHVDGWVDGGASPWRFTGIYGHPQAELKARTWALMKHLRGSSETPWLMGGDFNAV
LFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDIVWERIDRCMGNVAWRDMFAGYEVRHLDFSRSDHRPILLTLSVCAHSVHNAGHRIQRFEET
WLLSQGFKEAVSANWGLSGAVGSPKTLASATKRCMSGLSKWGRPGWEILGINAEEQLESLLVEDEMYWKQRSRDSWLKWGDRNTRWFHNHASSRRRRNEVKGLEDEDGVW
QQDPDRILGLIEGYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNSKLLRPFQQDEVLLALKQIHPNKAPGPDGLSGVFYQHSWSVVGADVVQCCLNILNNRVSPAPLNE
TMIVLIPKKRSPRRVSEYRPISLCNVTYKLVSKVLVNRMKGILNEVISLSQSAFIPGRCVVDNAILGYECLHVLKGRTRGKTGGLH