; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001177 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001177
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:26081747..26084090
RNA-Seq ExpressionLag0001177
SyntenyLag0001177
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]1.9e-8329.15Show/hide
Query:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK
        MNR L R F   E+  A+  + P K+PGPDG++  FF+Q W++V   V +  L  LNE+A    IN T++ LIPKV+ P+ V D+RPISLCNV+YK ++K
Subjt:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK

Query:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLC---
         ++NR+K IL  +++  QSAF+PGR + DN ++ YEC+H L+    G+  + ++KLDM KAYDRVEWIF+E+++ K+GF+  WV+ + +C++SV      
Subjt:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLC---

Query:  -------------------LLPVL-AVCGVAVESASCGRKYRGH-LGFK---SVPT--------------RAEVGEALAIQDILRCYERASGQTMNFDKS
                           L P L  +C     +     + R   LG K   + P+              +A      +IQ+I   Y   SGQ +NF+KS
Subjt:  -------------------LLPVL-AVCGVAVESASCGRKYRGH-LGFK---SVPT--------------RAEVGEALAIQDILRCYERASGQTMNFDKS

Query:  IILFSPYTEEGARRRPGLAAD-------------PRMEG-----------------------ETVLSGRKEDLLKSVVQAIPCYAMNCFRLSKKLIVEIN
        ++ FSP T    R    ++ +             P + G                       +    G KE LLK+V+QA+P Y M+CF++ +    EI 
Subjt:  IILFSPYTEEGARRRPGLAAD-------------PRMEG-----------------------ETVLSGRKEDLLKSVVQAIPCYAMNCFRLSKKLIVEIN

Query:  RLMARFWWCGMEENKRIHWDGQSADFLSCPCSE---------------------------TAVL-PVLEV--------FGGKGGKSSFLYLEESYVGEGV
        +L+AR+WW  +   ++IHW   +   +S P SE                           T++L  VL+            K G+   L       G+ +
Subjt:  RLMARFWWCGMEENKRIHWDGQSADFLSCPCSE---------------------------TAVL-PVLEV--------FGGKGGKSSFLYLEESYVGEGV

Query:  TGKGIRWRIGNGE--------------------RGS---------------------------GMRLLSDIILACRRLAHP---FYSFT-----PVKSGY
          +G+R RIGNG+                    RGS                            ++L+ +I L+  R  H    F+ +       VKSGY
Subjt:  TGKGIRWRIGNGE--------------------RGS---------------------------GMRLLSDIILACRRLAHP---FYSFT-----PVKSGY

Query:  RLGQGPLLAQSPSSSSPELLRRWWKNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSK
        +L     L +  SSSS +++ +WWK  W+ KIP KI IF WR Y + LP    L  R +   + C  CG   +S  H  + C   +E+  +  +  +V +
Subjt:  RLGQGPLLAQSPSSSSPELLRRWWKNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSK

Query:  CEAVDIKFLLRDVNDELEWERFEEWVVLLWAIWFRRNEL-SQKRKVSIANLADWVVGYLNAFRDA
         E +  K +L  + + LE +  +  ++  W IW  RN+L  Q+++ + + +  W+  Y    ++A
Subjt:  CEAVDIKFLLRDVNDELEWERFEEWVVLLWAIWFRRNEL-SQKRKVSIANLADWVVGYLNAFRDA

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]1.1e-9131.3Show/hide
Query:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK
        MN  L++ F +EE+  AL Q+HP+KAPGPDG+S  FF++ W++V  D+V   L +LN   S V IN+T I L+PK+++P ++SD+RPISLCNVVYKL+SK
Subjt:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK

Query:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLLP
         + NR+K IL  ++SENQSAF+ GR + DNV++ +E +H L+ K  G+ G+A++KLDM KAYDRVEW F++Q+M KMGF   W++LV  CI+SV+  +L 
Subjt:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLLP

Query:  -----------------------VLAVCGVAVESA--SCGRKYR-------------GHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKS
                               +  +C     S      RK R              HL F     +  +A   E   + DIL+ YE ASGQ +N DKS
Subjt:  -----------------------VLAVCGVAVESA--SCGRKYR-------------GHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKS

Query:  IILFSPYTEEGAR---------------------------RRPGLAADPRMEGETVLSGRKEDLL---------KSVVQAIPCYAMNCFRLSKKLIVEIN
         + FS  T +  R                            +  + A+ +   E  LSG KE LL         K+V QAIP Y M+CF++ K L  EI 
Subjt:  IILFSPYTEEGAR---------------------------RRPGLAADPRMEGETVLSGRKEDLL---------KSVVQAIPCYAMNCFRLSKKLIVEIN

Query:  RLMARFWWCGMEENKRIHWDG----------------------------QSADFLSCPCSETAVL------PVLEVFGGKGGKSSFLYLEESYVGEGVTG
         +M RFWW    +  +I W                              Q    +S P S  A +      P  +VF  K G S        + G  V  
Subjt:  RLMARFWWCGMEENKRIHWDG----------------------------QSADFLSCPCSETAVL------PVLEVFGGKGGKSSFLYLEESYVGEGVTG

Query:  KGIRWRIGNGER-----------------------------------GSGMRLLSDIILACRRLAHPFYSFT-------------------------PVK
        +G RWR+GNGER                                       R   D++   R L  PF + T                          VK
Subjt:  KGIRWRIGNGER-----------------------------------GSGMRLLSDIILACRRLAHPFYSFT-------------------------PVK

Query:  SGYRLGQGPL----LAQSPSSSSPELLRRWWKNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEI--MWV
        S Y +  G +    + +S S  S  LL   W+  W + IP K++IF W++ ++ LP   NL R+GV+  ++C  CG   ES +H+F  C+  K +   W+
Subjt:  SGYRLGQGPL----LAQSPSSSSPELLRRWWKNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEI--MWV

Query:  AGFGAIVS-KCEAVDIKFLLRDVNDELEWERFEEWVVLLWAIWFRRNELSQKRKVSIANLADWVVG----YLNAFRDAGRREVDLL
             +V+   + VDI   + D     + E F    V+ WAIW  RN++  +   S++ + +++ G    Y+  F++A      +L
Subjt:  AGFGAIVS-KCEAVDIKFLLRDVNDELEWERFEEWVVLLWAIWFRRNELSQKRKVSIANLADWVVG----YLNAFRDAGRREVDLL

XP_030483481.1 uncharacterized protein LOC115700065 [Cannabis sativa]1.2e-8229.71Show/hide
Query:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK
        MN  LL+PF K+++ LAL  + P K+PG DG+S  F++++WD+V   V    LS+LN+ A+P P+N T+I LIPK++  + + D+RPISLCNVV KL++K
Subjt:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK

Query:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLLP
         +V R K +L  ++SE QSAF+P R + DN+++ +E IH +K K  GR G A++KLDM KA+DRVEW F++ +M +MGFA +W  L+  C+++     L 
Subjt:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLLP

Query:  VLAVCG--------------------VAVESASCGRKYRGHLGFKSVPT---------------------RAEVGEALAIQDILRCYERASGQTMNFDKS
           + G                    + +E  SC  ++   LG   V +                      A     LAI+  L  Y +ASGQ +N DKS
Subjt:  VLAVCG--------------------VAVESASCGRKYRGHLGFKSVPT---------------------RAEVGEALAIQDILRCYERASGQTMNFDKS

Query:  IILFSPYTEEGAR----------------RRPGLAADPRMEGETVLS--------------------GRKEDLLKSVVQAIPCYAMNCFRLSKKLIVEIN
        ++ FSP T   A+                +  GL A    + + + S                    G KE LLK+VVQ+IP Y M+CF+L   L  ++ 
Subjt:  IILFSPYTEEGAR----------------RRPGLAADPRMEGETVLS--------------------GRKEDLLKSVVQAIPCYAMNCFRLSKKLIVEIN

Query:  RLMARFWWCGMEENKRIHW-----------DG-----------------QSADFLSCPCS------ETAVLPVLEVFGGKGGKSSFLYLEESYVGEGVTG
         +M+ FWW   E   +IHW           DG                 Q+   L  P S      ++   P         G S  L  +    G  +  
Subjt:  RLMARFWWCGMEENKRIHW-----------DG-----------------QSADFLSCPCS------ETAVLPVLEVFGGKGGKSSFLYLEESYVGEGVTG

Query:  KGIRWRIGNGE---------------------RGSGMRLLSDIILACR----RLAHPFYSFTPV----------------------KSG-YRLGQG----
         G+RW++G G                       G    +++++I   R    +L   F+S   V                       SG Y++  G    
Subjt:  KGIRWRIGNGE---------------------RGSGMRLLSDIILACR----RLAHPFYSFTPV----------------------KSG-YRLGQG----

Query:  PLLAQSPSSSSPELLRRWWKNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGF----GAIVSKC
          L  S  SS+      WWK  W +++P KIKIF WR++ D LP+  +L RR + + + C  C +  ES  H  + CK  K +    GF     A V   
Subjt:  PLLAQSPSSSSPELLRRWWKNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGF----GAIVSKC

Query:  EAVDIKFLLRDVNDELEWERFEEWVVLLWAIWFRRNELSQKRKV-SIANLADWVVGYLNAFRDA
        +  D    L  ++ +LE E     + +LW+IW  RN +   +K  S   LA +   YL+ +  A
Subjt:  EAVDIKFLLRDVNDELEWERFEEWVVLLWAIWFRRNELSQKRKV-SIANLADWVVGYLNAFRDA

XP_030487384.1 uncharacterized protein LOC115704310 [Cannabis sativa]2.2e-8429.32Show/hide
Query:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK
        MN  L+  F  EE++ A+K+++P+KAPG DGL   F+++ W  ++ DV+  CL++LN+ A    +N+TM+ LIPKV  P+R+ ++RPISLCNV+YK+VSK
Subjt:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK

Query:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLC---
         + NRM+  L  +VS++QSAF+ GR + DN I+GYE +H ++          +LKLDM KAYDRVEW FLE +M+K+G++  WV  +  C++SV      
Subjt:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLC---

Query:  -------------------LLPVL-AVCGVAVESASCGRKYRG---------------HLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKS
                           L P L  +C  A  S     + RG               HL F     V   A   E    +++L  Y  ASGQ +NF KS
Subjt:  -------------------LLPVL-AVCGVAVESASCGRKYRG---------------HLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKS

Query:  IILFSPYTEEGARRR-------------------PGLAADPRME-----------------GETVLSGRKEDLLKSVVQAIPCYAMNCFRLSKKLIVEIN
         + F        + +                   P      + +                 G    + RKE L+K++VQAIP Y M+CFRL KK I  I+
Subjt:  IILFSPYTEEGARRR-------------------PGLAADPRME-----------------GETVLSGRKEDLLKSVVQAIPCYAMNCFRLSKKLIVEIN

Query:  RLMARFWWCGMEENKRIHWDGQSADFLSCPCSETAVLPVLEVFGGKGGKSSFLYLEESYVGEGVTGKGIRWRIGNGERGSGMRLLSDIILACRRLAHPFY
         + ARFWW   E++ +IHW         C   + +  P   V   K G  +         G+ +  KG RWRIGN    + +R+L D  L  R +    Y
Subjt:  RLMARFWWCGMEENKRIHWDGQSADFLSCPCSETAVLPVLEVFGGKGGKSSFLYLEESYVGEGVTGKGIRWRIGNGERGSGMRLLSDIILACRRLAHPFY

Query:  SFTP-----------------------------------------------------------VKSGYRLGQGPLLAQSPSSSSPELLRRWWKNCWSMKI
           P                                                           V+SGYR+     L      S  E   +WWK  W +KI
Subjt:  SFTP-----------------------------------------------------------VKSGYRLGQGPLLAQSPSSSSPELLRRWWKNCWSMKI

Query:  PSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCV-CCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSKCEAVDIKFLLRDVNDELEWERFEEWVVLLWA
        P K+K F W++    +P    L  R +     C  C     E+  H  W C+  +E+   AGF   + +    D+   L  ++     E  E ++VL W 
Subjt:  PSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCV-CCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSKCEAVDIKFLLRDVNDELEWERFEEWVVLLWA

Query:  IWFRRNELSQ-KRKVSIANLADWVVGYLNAFRDAGRREVD
        +W+ RN ++    K   + + +W   YL+ FR +   + D
Subjt:  IWFRRNELSQ-KRKVSIANLADWVVGYLNAFRDAGRREVD

XP_030924745.1 uncharacterized protein LOC115951731 [Quercus lobata]1.9e-8330.68Show/hide
Query:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK
        M+  L R +  EE+L ALKQ+ P  APGPDG+S  F++  W +V  DV+   L  LN       +N T I LIPKV++P+RV+++RPISLCNVVYKL++K
Subjt:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK

Query:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLL-
         +VNR+K IL  ++ ++QSAF+ GR + DNV++ +E +H LK K +GR G+ +LKLDM KAYD+VEW+FL ++M  +GF    + L+  C+S+V+  +L 
Subjt:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLL-

Query:  ---PVLA--------VCGVAVESASCGRKYR-GHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKSIILFSPYTEEGARRR----------
           P L         + GV++    C    R  HL F     +  RA V E   I D+L  YE+ SGQ +N +K+ I FS  T    + +          
Subjt:  ---PVLA--------VCGVAVESASCGRKYR-GHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKSIILFSPYTEEGARRR----------

Query:  ---------PGLAADPRMEGETVLSGR-----------------KEDLLKSVVQAIPCYAMNCFRLSKKLIVEINRLMARFWWCGMEENKRIHW------
                 P L    + +  T +  R                 +E L+KSV+QAIP Y M+CF+L K LI E+  L+ +FWW    EN+++HW      
Subjt:  ---------PGLAADPRMEGETVLSGR-----------------KEDLLKSVVQAIPCYAMNCFRLSKKLIVEINRLMARFWWCGMEENKRIHW------

Query:  ------------------DG----------QSADFLSCPCSETAVLPVLEVFGGKGGKSSFLYLEESYVG-EGVTGKGIRWRIGNGE-------------
                          D            + D L     +    P   +   K   S   Y  +S +G   V  +G+ WRIGNG              
Subjt:  ------------------DG----------QSADFLSCPCSETAVLPVLEVFGGKGGKSSFLYLEESYVG-EGVTGKGIRWRIGNGE-------------

Query:  --------------------------RGSGMRLLSDIILACR-----------RLA--HPFYSFTP-----VKSGYRLGQGPLLAQSPSSSSPELLRRWW
                                  RG    L++ I L  +           RL      +S TP      KS Y+L           +SSP   R++W
Subjt:  --------------------------RGSGMRLLSDIILACR-----------RLA--HPFYSFTP-----VKSGYRLGQGPLLAQSPSSSSPELLRRWW

Query:  KNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSKCEAVDIKFLLRDVNDELEWERFEE
        +  W +++P+K+K F WR   D LP + NL RR + +  +C  C    E  LH  W C   + +     +    +       + LL       E  R E 
Subjt:  KNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSKCEAVDIKFLLRDVNDELEWERFEE

Query:  WVVLLWAIWFRRNELSQKRKVSIANLADWVVG-YLNAFRD
        ++ + W +W RRN L     V   N    + G YL  F D
Subjt:  WVVLLWAIWFRRNELSQKRKVSIANLADWVVG-YLNAFRD

TrEMBL top hitse value%identityAlignment
A0A2N9EL92 Reverse transcriptase domain-containing protein8.6e-9033.53Show/hide
Query:  NRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSKA
        N++LL+PF  +E+ +AL Q+HPSKAPGPDG+S  FF++ W++V  DVV   LS+LN       IN T I LIPK ++P R+S+YRPISLCNVVYK++SK 
Subjt:  NRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSKA

Query:  IVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLL--
        + NR+K IL  ++S++QSAF+PGR + DNV + +E IH +K K RG+ G  ++KLDM KAYDRVEW F+E IM K+GFA  W+ L+  CI +V   +L  
Subjt:  IVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLL--

Query:  ---------------------PVLAVCGVAVES--------------ASC-GRKYRGHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKSI
                              V  +C   + +               SC G  +  HL F     +  +A + E   I +IL  YE +SGQ +N +K+ 
Subjt:  ---------------------PVLAVCGVAVES--------------ASC-GRKYRGHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKSI

Query:  ILFSPYTEEGAR----------------RRPGLAADPRMEGETVLSGRKED--------------------LLKSVVQAIPCYAMNCFRLSKKLIVEINR
        I FS  T +  R                +  GL A      +++ +G KE                     L+K+V QAIP YAMNCFRL K    E+N 
Subjt:  ILFSPYTEEGAR----------------RRPGLAADPRMEGETVLSGRKED--------------------LLKSVVQAIPCYAMNCFRLSKKLIVEINR

Query:  LMARFWWCGMEENKRIHW-----------DGQSADFLSCPCSETAVLP-------------VLEVFGGK------------GGKSSFLYLEESYVGEGVT
        L+AR+WW   ++ +++HW           +G    F +     TA+L                 VF  +            G   SFL+      G GV 
Subjt:  LMARFWWCGMEENKRIHW-----------DGQSADFLSCPCSETAVLP-------------VLEVFGGK------------GGKSSFLYLEESYVGEGVT

Query:  GKGIRWRIGNGERGSGMRLLSDIILACRRLAHPFYSFT-----PVKSGYRLGQGPLLAQSPSSSSPELLRRW-WKNCWSMKIPSKIKIFFWRLYLDRLPM
         KG+RW    G+                 L+ P +S T      VKS Y + +           S     RW W+  W + IP KIK F WR Y + LP 
Subjt:  GKGIRWRIGNGERGSGMRLLSDIILACRRLAHPFYSFT-----PVKSGYRLGQGPLLAQSPSSSSPELLRRW-WKNCWSMKIPSKIKIFFWRLYLDRLPM

Query:  MDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSKCEAVDIKF--LLRDVNDELEWERFEEWVVLLWAIWFRRN
           L RR + S ++C  C +  E+  H  W C   +   W    G +  K    D +F   L+ +   L  E  E+W V  W+IW  RN
Subjt:  MDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSKCEAVDIKF--LLRDVNDELEWERFEEWVVLLWAIWFRRN

A0A2N9G8I6 Reverse transcriptase domain-containing protein1.5e-8931.44Show/hide
Query:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK
        MNRQLL  F   E+  A  Q+HPSKAPGPDG+S  FF++ W +V  DVV   LS++N       +N + +VLIPK ++P+ VSDYRPISL NVVYK++SK
Subjt:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK

Query:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLL-
        A+ NR+K +L  ++SE+QSAF+PGR + DN+ + +E +H L+ + +G+    +LKLDM KAYDRVEW FLE +M +MGFA  W+ L+  C+ + +  +L 
Subjt:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLL-

Query:  ---------------------PVL-AVCGVAVESASCGRKYR---------------GHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKS
                             P L  +C   + +  C  +                  HL F     +  +A   E   + ++L  YERASGQ +N +K+
Subjt:  ---------------------PVL-AVCGVAVESASCGRKYR---------------GHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKS

Query:  IILFSPYTEEGAR----------------RRPGLAADPRMEGETV--------------LSGRKEDLL---------KSVVQAIPCYAMNCFRLSKKLIV
         + FS  T E  R                +  GL   P M G++               L G KE LL         K++ QAIP Y M+CF+L K    
Subjt:  IILFSPYTEEGAR----------------RRPGLAADPRMEGETV--------------LSGRKEDLL---------KSVVQAIPCYAMNCFRLSKKLIV

Query:  EINRLMARFWWCGMEENKRIHW-----------DG-----------------QSADFLSCPCSETAVLPVLEVFGGKGGKSSFLYLEESYVGEGVTG---
        +IN L++ +WW    E  +IHW           DG                 Q    +S PCS  A     + F G     + L    SY+   +     
Subjt:  EINRLMARFWWCGMEENKRIHW-----------DG-----------------QSADFLSCPCSETAVLPVLEVFGGKGGKSSFLYLEESYVGEGVTG---

Query:  ---KGIRWRIGNGERG-----------------------------------------------SGMRLLSDIILACRRLAHPFYSFTP-----VKSGYRL
           KG+RW+IGNG++                                                S  R+   ++  CRR  +P +   P     VKS YR+
Subjt:  ---KGIRWRIGNGERG-----------------------------------------------SGMRLLSDIILACRRLAHPFYSFTP-----VKSGYRL

Query:  GQG-PLLAQSPSSSSPELLRRWWKNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVS-K
          G  L  ++  SS     +R WK+ W +KIP+K+K   WR  L+ LP    LCRR +     C  C    E+  H  W C     + W    G +    
Subjt:  GQG-PLLAQSPSSSSPELLRRWWKNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVS-K

Query:  CEAVDIKFLLRDVNDELEWERFEEWVVLLWAIWFRRNE
           VD   L R    ++E E  E W V+ WAIW+ RN+
Subjt:  CEAVDIKFLLRDVNDELEWERFEEWVVLLWAIWFRRNE

A0A2N9GPG8 Reverse transcriptase domain-containing protein5.4e-9234.09Show/hide
Query:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK
        +N++LL+PF  +E+ +AL Q+HPSKAPGPDG+S  FF++ W++V ADVV   LS+LN       IN T I LIPK ++P R+S+YRPISLCNVVYK++SK
Subjt:  MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSK

Query:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLL-
         + NR+K +L  ++S++QSAF+PGR + DNV + +E IH +K K +G+ G  +LKLDM KAYDRVEW F+E IM K+GFA  W+ L+  CI +V   +L 
Subjt:  AIVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLL-

Query:  ----------------------PVLAVCGVAVES--------------ASC-GRKYRGHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKS
                               V  +C   + +               SC G  +  HL F     +  +A + E   I +IL  YE +SGQ +N +K+
Subjt:  ----------------------PVLAVCGVAVES--------------ASC-GRKYRGHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKS

Query:  IILFSPYTEEGAR----------------RRPGLAADPRMEGETVLSGRKED--------------------LLKSVVQAIPCYAMNCFRLSKKLIVEIN
         I FS  T +  R                +  GL A      +++ +G KE                     L+K+V QAIP YAMNCFRL K    E+N
Subjt:  IILFSPYTEEGAR----------------RRPGLAADPRMEGETVLSGRKED--------------------LLKSVVQAIPCYAMNCFRLSKKLIVEIN

Query:  RLMARFWWCGMEENKRIH---WDGQSADFLSCPCSETAVLPVLEVFGGKGGKSSFLYLEESYVGEGVTGKGIRWRIGNGERGSGMRLLSDIILACRRLAH
         L+AR+WW   ++ +++H   WD          C          +   K G+    Y E  Y    V  KG+RW    G+                 L+ 
Subjt:  RLMARFWWCGMEENKRIH---WDGQSADFLSCPCSETAVLPVLEVFGGKGGKSSFLYLEESYVGEGVTGKGIRWRIGNGERGSGMRLLSDIILACRRLAH

Query:  PFYSFT-----PVKSGYRLGQGPLLAQSPSSSSPELLRRW-WKNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHC
        P +S T      VKS Y + +           S     RW W+  W + IP KIK F WR Y + LP    L RR + S ++C  C +  E+  H  W C
Subjt:  PFYSFT-----PVKSGYRLGQGPLLAQSPSSSSPELLRRW-WKNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHC

Query:  KRTKEIMWVAGFGAIVSKCEAVDIKF--LLRDVNDELEWERFEEWVVLLWAIWFRRN
           +   W    G +  K    D +F  L++ +   L  E  E+W +  W+IW  RN
Subjt:  KRTKEIMWVAGFGAIVSKCEAVDIKF--LLRDVNDELEWERFEEWVVLLWAIWFRRN

A0A2N9HRH8 Reverse transcriptase domain-containing protein2.9e-9033.53Show/hide
Query:  NRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSKA
        N++LL+PF  +E+ +AL Q+HPSKAPGPDG+S  FF++ W++V  DVV   LS+LN       IN T I LIPK ++P R+S+YRPISLCNVVYK++SK 
Subjt:  NRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSKA

Query:  IVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLL--
        + NR+K IL  ++S++QSAF+PGR + DNV + +E IH +K K RG+ G  ++KLDM KAYDRVEW F+E IM K+GFA  W+ L+  CI +V   +L  
Subjt:  IVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLL--

Query:  ---------------------PVLAVCGVAVES--------------ASC-GRKYRGHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKSI
                              V  +C   + +               SC G  +  HL F     +  +A + E   I +IL  YE +SGQ +N +K+ 
Subjt:  ---------------------PVLAVCGVAVES--------------ASC-GRKYRGHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKSI

Query:  ILFSPYTEEGAR----------------RRPGLAADPRMEGETVLSGRKED--------------------LLKSVVQAIPCYAMNCFRLSKKLIVEINR
        I FS  T +  R                +  GL A      +++ +G KE                     L+K+V QAIP YAMNCFRL K    E+N 
Subjt:  ILFSPYTEEGAR----------------RRPGLAADPRMEGETVLSGRKED--------------------LLKSVVQAIPCYAMNCFRLSKKLIVEINR

Query:  LMARFWWCGMEENKRIHW-----------DGQSADFLSCPCSETAVLP-------------VLEVFGGK------------GGKSSFLYLEESYVGEGVT
        L+AR+WW   ++ +++HW           +G    F +     TA+L                 VF  +            G   SFL+      G GV 
Subjt:  LMARFWWCGMEENKRIHW-----------DGQSADFLSCPCSETAVLP-------------VLEVFGGK------------GGKSSFLYLEESYVGEGVT

Query:  GKGIRWRIGNGERGSGMRLLSDIILACRRLAHPFYSFT-----PVKSGYRLGQGPLLAQSPSSSSPELLRRW-WKNCWSMKIPSKIKIFFWRLYLDRLPM
         KG+RW    G+                 L+ P +S T      VKS Y + +   +       S     RW W+  W + IP KIK F WR Y + LP 
Subjt:  GKGIRWRIGNGERGSGMRLLSDIILACRRLAHPFYSFT-----PVKSGYRLGQGPLLAQSPSSSSPELLRRW-WKNCWSMKIPSKIKIFFWRLYLDRLPM

Query:  MDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSKCEAVDIKF--LLRDVNDELEWERFEEWVVLLWAIWFRRN
           L RR + S ++C  C +  E+  H  W C   +   W    G +  K    D +F   L+ +   L  E  E+W V  W+IW  RN
Subjt:  MDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSKCEAVDIKF--LLRDVNDELEWERFEEWVVLLWAIWFRRN

A0A2N9I475 Reverse transcriptase domain-containing protein8.6e-9033.53Show/hide
Query:  NRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSKA
        N++LL+PF  +E+ +AL Q+HPSKAPGPDG+S  FF++ W++V  DVV   LS+LN       IN T I LIPK ++P R+S+YRPISLCNVVYK++SK 
Subjt:  NRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSKA

Query:  IVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLL--
        + NR+K IL  ++S++QSAF+PGR + DNV + +E IH +K K RG+ G  ++KLDM KAYDRVEW F+E IM K+GFA  W+ L+  CI +V   +L  
Subjt:  IVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLL--

Query:  ---------------------PVLAVCGVAVES--------------ASC-GRKYRGHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKSI
                              V  +C   + +               SC G  +  HL F     +  +A + E   I +IL  YE +SGQ +N +K+ 
Subjt:  ---------------------PVLAVCGVAVES--------------ASC-GRKYRGHLGFKS---VPTRAEVGEALAIQDILRCYERASGQTMNFDKSI

Query:  ILFSPYTEEGAR----------------RRPGLAADPRMEGETVLSGRKED--------------------LLKSVVQAIPCYAMNCFRLSKKLIVEINR
        I FS  T +  R                +  GL A      +++ +G KE                     L+K+V QAIP YAMNCFRL K    E+N 
Subjt:  ILFSPYTEEGAR----------------RRPGLAADPRMEGETVLSGRKED--------------------LLKSVVQAIPCYAMNCFRLSKKLIVEINR

Query:  LMARFWWCGMEENKRIHW-----------DGQSADFLSCPCSETAVLP-------------VLEVFGGK------------GGKSSFLYLEESYVGEGVT
        L+AR+WW   ++ +++HW           +G    F +     TA+L                 VF  +            G   SFL+      G GV 
Subjt:  LMARFWWCGMEENKRIHW-----------DGQSADFLSCPCSETAVLP-------------VLEVFGGK------------GGKSSFLYLEESYVGEGVT

Query:  GKGIRWRIGNGERGSGMRLLSDIILACRRLAHPFYSFT-----PVKSGYRLGQGPLLAQSPSSSSPELLRRW-WKNCWSMKIPSKIKIFFWRLYLDRLPM
         KG+RW    G+                 L+ P +S T      VKS Y + +           S     RW W+  W + IP KIK F WR Y + LP 
Subjt:  GKGIRWRIGNGERGSGMRLLSDIILACRRLAHPFYSFT-----PVKSGYRLGQGPLLAQSPSSSSPELLRRW-WKNCWSMKIPSKIKIFFWRLYLDRLPM

Query:  MDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSKCEAVDIKF--LLRDVNDELEWERFEEWVVLLWAIWFRRN
           L RR + S ++C  C +  E+  H  W C   +   W    G +  K    D +F   L+ +   L  E  E+W V  W+IW  RN
Subjt:  MDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSKCEAVDIKF--LLRDVNDELEWERFEEWVVLLWAIWFRRN

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.2e-1326.49Show/hide
Query:  LLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKV-RSPRRVSDYRPISLCNVVYKLVSKAIV
        L RP    E++  +  +   K+PGPDG +  F+++  + +   +++   SI  E   P    E  I+LIPK  R   +  ++RPISL N+  K+++K + 
Subjt:  LLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKV-RSPRRVSDYRPISLCNVVYKLVSKAIV

Query:  NRMKGILNGLVSENQSAFIPGRCVVDNVILGYECI-HALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELV
        NR++  +  L+  +Q  FIPG     N+      I H  + K +       + +D  KA+D+++  F+ + + K+G    +++++
Subjt:  NRMKGILNGLVSENQSAFIPGRCVVDNVILGYECI-HALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELV

P08548 LINE-1 reverse transcriptase homolog4.3e-1428.21Show/hide
Query:  LLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKV-RSPRRVSDYRPISLCNVVYKLVSKAIV
        L RP    E+   ++ +   K+PGPDG +  F++   + +   ++    +I  E   P    E  I LIPK  + P R  +YRPISL N+  K+++K + 
Subjt:  LLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKV-RSPRRVSDYRPISLCNVVYKLVSKAIV

Query:  NRMKGILNGLVSENQSAFIPGRCVVDNV---ILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVT
        NR++  +  ++  +Q  FIPG     N+   I   + I+ LK K         L +D  KA+D ++  F+ + + K+G    +++L+    S  T
Subjt:  NRMKGILNGLVSENQSAFIPGRCVVDNV---ILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVT

P11369 LINE-1 retrotransposable element ORF2 protein1.1e-1225.41Show/hide
Query:  PFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPK-VRSPRRVSDYRPISLCNVVYKLVSKAIVNRM
        P   +E+   +  +   K+PGPDG S  F++   + +   + +    I  E   P    E  I LIPK  + P ++ ++RPISL N+  K+++K + NR+
Subjt:  PFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPK-VRSPRRVSDYRPISLCNVVYKLVSKAIVNRM

Query:  KGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELV
        +  +  ++  +Q  FIPG     N+      IH +  K + +     + LD  KA+D+++  F+ +++ + G    ++ ++
Subjt:  KGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELV

P14381 Transposon TX1 uncharacterized 149 kDa protein3.1e-2031.82Show/hide
Query:  QLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSKAIV
        +L  P   +EL  AL+ +  +K+PG DGL+  FF+  WD +  D  R       +   P+     ++ L+PK    R + ++RP+SL +  YK+V+KAI 
Subjt:  QLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSKAIV

Query:  NRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLLPV
         R+K +L  ++  +QS  +PGR + DNV L  + +H  +   R     A L LD  KA+DRV+  +L   +    F P +V  +    +S   CL+ +
Subjt:  NRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLLPV

P93295 Uncharacterized mitochondrial protein AtMg003101.1e-0447.5Show/hide
Query:  AIPCYAMNCFRLSKKLIVEINRLMARFWWCGMEENKRIHW
        A+P YAM+CFRLSK L  ++   M  FWW   E  ++I W
Subjt:  AIPCYAMNCFRLSKKLIVEINRLMARFWWCGMEENKRIHW

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.3e-0935.23Show/hide
Query:  EELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVS
        +E+  A+  +  +KAPGPD  +  FF +SW VV+   +                N T I LIPKV    ++S +RP+S C VVYK+++
Subjt:  EELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases9.0e-1538.64Show/hide
Query:  IVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYR
        +V R+K ++  L+   Q++FIPGR   DN++   E +H+++ K +G  GW  LKLD+ KAYDR+ W +LE  ++  GF   W+  + R
Subjt:  IVNRMKGILNGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.6e-0647.5Show/hide
Query:  AIPCYAMNCFRLSKKLIVEINRLMARFWWCGMEENKRIHW
        A+P YAM+CFRLSK L  ++   M  FWW   E  ++I W
Subjt:  AIPCYAMNCFRLSKKLIVEINRLMARFWWCGMEENKRIHW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAGGCAATTACTCCGACCTTTTCACAAGGAGGAGCTTTTGCTGGCTTTGAAGCAGATTCACCCTAGTAAGGCGCCTGGGCCAGATGGCTTGTCAGGTGCGTTCTT
TAGGCAGTCGTGGGATGTTGTGGAGGCGGATGTAGTGAGATGTTGTTTGAGTATCTTGAACGAGAAGGCTTCACCGGTGCCGATTAATGAAACAATGATTGTGTTGATAC
CAAAGGTTAGAAGTCCCCGCCGTGTGTCTGACTACCGGCCTATCTCCCTGTGTAATGTTGTCTATAAACTGGTGTCGAAGGCAATTGTTAATCGGATGAAGGGAATCCTG
AATGGGCTGGTTTCTGAGAATCAGAGTGCCTTTATCCCGGGGCGGTGTGTTGTGGATAACGTGATTCTGGGCTACGAGTGTATTCACGCCCTGAAGGGTAAGGCTAGGGG
TAGGGCTGGGTGGGCATCCCTGAAGCTGGACATGGGTAAGGCCTATGACCGGGTGGAATGGATTTTCTTGGAACAGATTATGTTGAAAATGGGCTTTGCACCAGACTGGG
TGGAGTTGGTGTATCGATGCATATCTTCAGTGACCCTCTGTCTCTTACCTGTTCTTGCTGTGTGCGGAGTGGCTGTTGAGTCTGCTTCGTGTGGTAGAAAGTACAGGGGC
CATCTCGGGTTTAAGAGTGTCCCGACAAGGGCAGAGGTGGGGGAAGCACTGGCTATCCAGGACATCCTCCGGTGTTATGAACGAGCGTCGGGTCAGACAATGAATTTTGA
TAAATCTATTATCTTGTTCAGCCCTTATACGGAGGAGGGCGCCCGGAGGAGACCGGGTCTGGCAGCAGATCCAAGGATGGAAGGGGAAACTGTTCTCAGTGGGAGGAAGG
AGGACCTTCTTAAGTCTGTTGTGCAAGCCATTCCATGTTATGCCATGAACTGCTTCCGGCTCTCAAAGAAGCTTATTGTTGAGATAAATCGCTTGATGGCAAGGTTTTGG
TGGTGTGGGATGGAGGAGAACAAGAGGATTCACTGGGATGGTCAGTCAGCCGACTTCCTTTCTTGCCCGTGTTCTGAAACAGCGGTACTACCCGTTCTCGAAGTTTTTGG
AGGCAAGGGTGGGAAGTCGTCCTTCCTTTATCTGGAAGAGTCTTATGTGGGGGAAGGAGTTACTGGGAAGGGGATTCGTTGGAGGATTGGGAACGGGGAAAGAGGCAGTG
GAATGAGGCTCTTATCTGACATCATTTTAGCCTGCAGGAGGCTAGCACATCCTTTCTATTCCTTTACGCCAGTGAAGAGTGGGTACCGTTTAGGGCAGGGGCCGCTGTTG
GCCCAAAGTCCTTCATCCTCGTCTCCTGAGTTGTTGCGTAGGTGGTGGAAGAACTGTTGGAGTATGAAGATACCTAGCAAGATCAAAATATTTTTCTGGAGACTGTACTT
AGATCGGCTGCCTATGATGGATAACTTGTGTAGGAGGGGTGTGGACTCACTGAACATGTGTGTGTGTTGTGGCAGGCTAGGGGAGTCGGGTCTCCACTTGTTTTGGCATT
GTAAACGTACGAAGGAGATCATGTGGGTTGCAGGTTTTGGGGCTATTGTATCCAAGTGCGAGGCAGTTGACATTAAGTTCCTTCTTCGAGATGTAAATGATGAGTTGGAG
TGGGAACGGTTTGAGGAGTGGGTGGTGTTATTGTGGGCGATCTGGTTCAGGCGGAATGAATTGAGCCAAAAGAGGAAGGTTTCGATTGCAAATCTGGCAGATTGGGTTGT
TGGGTATCTGAACGCTTTTCGTGATGCGGGAAGGAGGGAGGTGGACCTGTTGGTGGGAGGCCAAACCGGGCGGTCTACAGGTGGGCATCGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACAGGCAATTACTCCGACCTTTTCACAAGGAGGAGCTTTTGCTGGCTTTGAAGCAGATTCACCCTAGTAAGGCGCCTGGGCCAGATGGCTTGTCAGGTGCGTTCTT
TAGGCAGTCGTGGGATGTTGTGGAGGCGGATGTAGTGAGATGTTGTTTGAGTATCTTGAACGAGAAGGCTTCACCGGTGCCGATTAATGAAACAATGATTGTGTTGATAC
CAAAGGTTAGAAGTCCCCGCCGTGTGTCTGACTACCGGCCTATCTCCCTGTGTAATGTTGTCTATAAACTGGTGTCGAAGGCAATTGTTAATCGGATGAAGGGAATCCTG
AATGGGCTGGTTTCTGAGAATCAGAGTGCCTTTATCCCGGGGCGGTGTGTTGTGGATAACGTGATTCTGGGCTACGAGTGTATTCACGCCCTGAAGGGTAAGGCTAGGGG
TAGGGCTGGGTGGGCATCCCTGAAGCTGGACATGGGTAAGGCCTATGACCGGGTGGAATGGATTTTCTTGGAACAGATTATGTTGAAAATGGGCTTTGCACCAGACTGGG
TGGAGTTGGTGTATCGATGCATATCTTCAGTGACCCTCTGTCTCTTACCTGTTCTTGCTGTGTGCGGAGTGGCTGTTGAGTCTGCTTCGTGTGGTAGAAAGTACAGGGGC
CATCTCGGGTTTAAGAGTGTCCCGACAAGGGCAGAGGTGGGGGAAGCACTGGCTATCCAGGACATCCTCCGGTGTTATGAACGAGCGTCGGGTCAGACAATGAATTTTGA
TAAATCTATTATCTTGTTCAGCCCTTATACGGAGGAGGGCGCCCGGAGGAGACCGGGTCTGGCAGCAGATCCAAGGATGGAAGGGGAAACTGTTCTCAGTGGGAGGAAGG
AGGACCTTCTTAAGTCTGTTGTGCAAGCCATTCCATGTTATGCCATGAACTGCTTCCGGCTCTCAAAGAAGCTTATTGTTGAGATAAATCGCTTGATGGCAAGGTTTTGG
TGGTGTGGGATGGAGGAGAACAAGAGGATTCACTGGGATGGTCAGTCAGCCGACTTCCTTTCTTGCCCGTGTTCTGAAACAGCGGTACTACCCGTTCTCGAAGTTTTTGG
AGGCAAGGGTGGGAAGTCGTCCTTCCTTTATCTGGAAGAGTCTTATGTGGGGGAAGGAGTTACTGGGAAGGGGATTCGTTGGAGGATTGGGAACGGGGAAAGAGGCAGTG
GAATGAGGCTCTTATCTGACATCATTTTAGCCTGCAGGAGGCTAGCACATCCTTTCTATTCCTTTACGCCAGTGAAGAGTGGGTACCGTTTAGGGCAGGGGCCGCTGTTG
GCCCAAAGTCCTTCATCCTCGTCTCCTGAGTTGTTGCGTAGGTGGTGGAAGAACTGTTGGAGTATGAAGATACCTAGCAAGATCAAAATATTTTTCTGGAGACTGTACTT
AGATCGGCTGCCTATGATGGATAACTTGTGTAGGAGGGGTGTGGACTCACTGAACATGTGTGTGTGTTGTGGCAGGCTAGGGGAGTCGGGTCTCCACTTGTTTTGGCATT
GTAAACGTACGAAGGAGATCATGTGGGTTGCAGGTTTTGGGGCTATTGTATCCAAGTGCGAGGCAGTTGACATTAAGTTCCTTCTTCGAGATGTAAATGATGAGTTGGAG
TGGGAACGGTTTGAGGAGTGGGTGGTGTTATTGTGGGCGATCTGGTTCAGGCGGAATGAATTGAGCCAAAAGAGGAAGGTTTCGATTGCAAATCTGGCAGATTGGGTTGT
TGGGTATCTGAACGCTTTTCGTGATGCGGGAAGGAGGGAGGTGGACCTGTTGGTGGGAGGCCAAACCGGGCGGTCTACAGGTGGGCATCGCTAG
Protein sequenceShow/hide protein sequence
MNRQLLRPFHKEELLLALKQIHPSKAPGPDGLSGAFFRQSWDVVEADVVRCCLSILNEKASPVPINETMIVLIPKVRSPRRVSDYRPISLCNVVYKLVSKAIVNRMKGIL
NGLVSENQSAFIPGRCVVDNVILGYECIHALKGKARGRAGWASLKLDMGKAYDRVEWIFLEQIMLKMGFAPDWVELVYRCISSVTLCLLPVLAVCGVAVESASCGRKYRG
HLGFKSVPTRAEVGEALAIQDILRCYERASGQTMNFDKSIILFSPYTEEGARRRPGLAADPRMEGETVLSGRKEDLLKSVVQAIPCYAMNCFRLSKKLIVEINRLMARFW
WCGMEENKRIHWDGQSADFLSCPCSETAVLPVLEVFGGKGGKSSFLYLEESYVGEGVTGKGIRWRIGNGERGSGMRLLSDIILACRRLAHPFYSFTPVKSGYRLGQGPLL
AQSPSSSSPELLRRWWKNCWSMKIPSKIKIFFWRLYLDRLPMMDNLCRRGVDSLNMCVCCGRLGESGLHLFWHCKRTKEIMWVAGFGAIVSKCEAVDIKFLLRDVNDELE
WERFEEWVVLLWAIWFRRNELSQKRKVSIANLADWVVGYLNAFRDAGRREVDLLVGGQTGRSTGGHR