; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041130 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041130
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr13:12525573..12528999
RNA-Seq ExpressionLag0041130
SyntenyLag0041130
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBN69746.1 VIRB2-interacting protein 2 [Prunus dulcis]5.2e-6435.02Show/hide
Query:  GDENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSP
        GD NTKFFHR  + R+++N I ++    G  +V+  +IE E I+F++NL++ +    +    ++W  IS  +A  L+  F EEEV +A+   G  KSP P
Subjt:  GDENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSP

Query:  DGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLD----SKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILD
        DGF+   F+  W+ +K+DL  ++ DFFN G+IN   NET+ICLIPK+ +    S + PIS ++  YK++++VL++RL+ VL STI+  Q AFV  RQILD
Subjt:  DGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLD----SKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILD

Query:  ASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFG------------------LLGGNGLGA------------------VFLVGRF
        A+LIANE++++    +K G++ K+DLEKA+D V+W F+D VL  KGFG                  ++ G   G                     ++   
Subjt:  ASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFG------------------LLGGNGLGA------------------VFLVGRF

Query:  SKLVPHPIDTSSF----------SLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVH
        S+++    D   F           ++HLQFADDT+ F     +  NN+  ++++F   SG+ +N +K  L+GI++    + E+A  +GC  G+WP   + 
Subjt:  SKLVPHPIDTSSF----------SLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVH

Query:  RCIGNDQRT-SFWN
          +G + R   FW+
Subjt:  RCIGNDQRT-SFWN

CAN78744.1 hypothetical protein VITISV_014186 [Vitis vinifera]1.5e-6631.06Show/hide
Query:  DENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSPD
        D N+KF+H+    R+ +  I E+ + +G+ L  A  I  + + ++  L+T      +    +DW PISE  AL L+  F+EEE+ +A+  L   K+P PD
Subjt:  DENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSPD

Query:  GFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDSK----YCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILDA
        GFT   F+  W+  K+DL  +  +F  +G+IN   N ++I LIPK+  SK    + PIS I+  YKIIA+VLS RL+GVL  TI  +Q AFV  RQILDA
Subjt:  GFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDSK----YCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILDA

Query:  SLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFG------------------LLGGNGLGAV------------------FLVGRFS
         LIANE++D+   + K GV+ K+D EKA+D V WDFLD VL  KGF                   L+ G+  G V                   +    S
Subjt:  SLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFG------------------LLGGNGLGAV------------------FLVGRFS

Query:  KLVPHP----------IDTSSFSLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWP------
        +++             +  +   ++HLQFADDT+ FS+   + L  + +++ +F   S L VNL KS +  I++  + L  LA    CK   WP      
Subjt:  KLVPHP----------IDTSSFSLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWP------

Query:  ------------TSRVHRCIGNDQRTSFWNDSWLNCGILATTFPRLYRLTTNQNAMVADVWNSSHE-AWDLSLRRQLNELETNEWANL-----SYLLSSF
                       + R   N +R  FW D W     L T +PRL+R+  ++N  ++ V   S   +W+L+ RR L++ E  +   L        LS  
Subjt:  ------------TSRVHRCIGNDQRTSFWNDSWLNCGILATTFPRLYRLTTNQNAMVADVWNSSHE-AWDLSLRRQLNELETNEWANL-----SYLLSSF

Query:  SFCARLWQRVEKMGCRLPK--WCSLFHNSA---RLLGRFHYDDQESFIQRNWRFILGGKRGGNTTVLSPR------STTVCLLSVE
           ARLW  +  +G    K  + +L  +S        +F ++ Q  F  +++ +++  K+     +L  R      S  +C+L ++
Subjt:  SFCARLWQRVEKMGCRLPK--WCSLFHNSA---RLLGRFHYDDQESFIQRNWRFILGGKRGGNTTVLSPR------STTVCLLSVE

VVA21938.1 Hypothetical predicted protein, partial [Prunus dulcis]5.2e-6435.02Show/hide
Query:  GDENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSP
        GD NTKFFHR  + R+++N I ++    G  +V+  +IE E I+F++NL++ +    +    ++W  IS  +A  L+  F EEEV +A+   G  KSP P
Subjt:  GDENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSP

Query:  DGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLD----SKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILD
        DGF+   F+  W+ +K+DL  ++ DFFN G+IN   NET+ICLIPK+ +    S + PIS ++  YK++++VL++RL+ VL STI+  Q AFV  RQILD
Subjt:  DGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLD----SKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILD

Query:  ASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFG------------------LLGGNGLGA------------------VFLVGRF
        A+LIANE++++    +K G++ K+DLEKA+D V+W F+D VL  KGFG                  ++ G   G                     ++   
Subjt:  ASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFG------------------LLGGNGLGA------------------VFLVGRF

Query:  SKLVPHPIDTSSF----------SLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVH
        S+++    D   F           ++HLQFADDT+ F     +  NN+  ++++F   SG+ +N +K  L+GI++    + E+A  +GC  G+WP   + 
Subjt:  SKLVPHPIDTSSF----------SLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVH

Query:  RCIGNDQRT-SFWN
          +G + R   FW+
Subjt:  RCIGNDQRT-SFWN

VVA31869.1 Hypothetical predicted protein, partial [Prunus dulcis]2.7e-6835.36Show/hide
Query:  GDENTKFFHRTLAARKRKNSINEV-LSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPS
        GD NTKFFH+     +++N I+++ +   GV  V A +IERE I F++ LF+ + +  +    ++WCPIS+T+A  LE  F  EEV +A+   G  KSP 
Subjt:  GDENTKFFHRTLAARKRKNSINEV-LSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPS

Query:  PDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDS----KYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQIL
        PDGF+  FF+  W  +K DL  ++QDFF +G++N   NET+ICLIPK+ +S     + PIS ++  YK+I++VL++RL+ VL +TI+++Q AFV  RQIL
Subjt:  PDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDS----KYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQIL

Query:  DASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFGLLGGNGLG-----AVFLVGRFSKLVPHPIDTS--------------SFSLN
        DA L+ANE++++    ++ G++ K+D EKA+D V+W+F+D VL  KGFG     GL      + FL    S ++   I+ +                 ++
Subjt:  DASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFGLLGGNGLG-----AVFLVGRFSKLVPHPIDTS--------------SFSLN

Query:  HLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVHRCIGNDQRT-SFWNDSWLNCGILATT
        HLQFADDT+ F     +   N+  ++K+F   SG+ +N AKS +LGI+  +  L  +A  +GC+ G WP   +   +G + R  +FWN       +L   
Subjt:  HLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVHRCIGNDQRT-SFWNDSWLNCGILATT

Query:  FPRLYR-----LTTNQNAMVADVWNSSHEAWDLSLRRQLNELETNEWANLSYLLSSFSFCARLWQRVEK-MGCRLPKW
          RL R     L+      +     SS  ++ +SL +    +     A +  L+ +F     LW+ VE+   C L +W
Subjt:  FPRLYR-----LTTNQNAMVADVWNSSHEAWDLSLRRQLNELETNEWANLSYLLSSFSFCARLWQRVEK-MGCRLPKW

XP_020420593.1 uncharacterized protein LOC18774736 [Prunus persica]2.4e-6436.14Show/hide
Query:  GDENTKFFHRTLAARKRKNSINEV-LSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPS
        GD NTKFFHR  + R+++N I ++ ++  GV +V+  +IE E I+F++NL++ +    +    ++W  IS  +A  LE  F EEEV +A+   G  KSP 
Subjt:  GDENTKFFHRTLAARKRKNSINEV-LSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPS

Query:  PDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLD----SKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQIL
        PDGF+   F+  W+ +K+DL  ++ DFFN G+IN   NET+ICLIPK+ +    S + PIS ++  YK++++VL++RL+ VL STI+  Q AFV  RQIL
Subjt:  PDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLD----SKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQIL

Query:  DASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFG------------------LLGGNGLGA------------------VFLVGR
        DA+LIANE++++    +K G++ K+DLEKA+D V+W F+D VL  KGFG                  ++ G   G                     ++  
Subjt:  DASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFG------------------LLGGNGLGA------------------VFLVGR

Query:  FSKLVPHPIDTSSF----------SLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRV
         S+++    D   F           ++HLQFADDT+ F     +  NN+  ++++F   SG+ +N +K  L+GI++    L ELA  +GC+ G WP S +
Subjt:  FSKLVPHPIDTSSF----------SLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRV

Query:  HRCIGNDQRT-SFWN
           +G + R   FW+
Subjt:  HRCIGNDQRT-SFWN

TrEMBL top hitse value%identityAlignment
A0A5E4FWN6 Reverse transcriptase domain-containing protein (Fragment)1.3e-6835.36Show/hide
Query:  GDENTKFFHRTLAARKRKNSINEV-LSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPS
        GD NTKFFH+     +++N I+++ +   GV  V A +IERE I F++ LF+ + +  +    ++WCPIS+T+A  LE  F  EEV +A+   G  KSP 
Subjt:  GDENTKFFHRTLAARKRKNSINEV-LSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPS

Query:  PDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDS----KYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQIL
        PDGF+  FF+  W  +K DL  ++QDFF +G++N   NET+ICLIPK+ +S     + PIS ++  YK+I++VL++RL+ VL +TI+++Q AFV  RQIL
Subjt:  PDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDS----KYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQIL

Query:  DASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFGLLGGNGLG-----AVFLVGRFSKLVPHPIDTS--------------SFSLN
        DA L+ANE++++    ++ G++ K+D EKA+D V+W+F+D VL  KGFG     GL      + FL    S ++   I+ +                 ++
Subjt:  DASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFGLLGGNGLG-----AVFLVGRFSKLVPHPIDTS--------------SFSLN

Query:  HLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVHRCIGNDQRT-SFWNDSWLNCGILATT
        HLQFADDT+ F     +   N+  ++K+F   SG+ +N AKS +LGI+  +  L  +A  +GC+ G WP   +   +G + R  +FWN       +L   
Subjt:  HLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVHRCIGNDQRT-SFWNDSWLNCGILATT

Query:  FPRLYR-----LTTNQNAMVADVWNSSHEAWDLSLRRQLNELETNEWANLSYLLSSFSFCARLWQRVEK-MGCRLPKW
          RL R     L+      +     SS  ++ +SL +    +     A +  L+ +F     LW+ VE+   C L +W
Subjt:  FPRLYR-----LTTNQNAMVADVWNSSHEAWDLSLRRQLNELETNEWANLSYLLSSFSFCARLWQRVEK-MGCRLPKW

A0A803P465 Uncharacterized protein1.1e-6433.64Show/hide
Query:  GDENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSP
        GD N++FFH  L ARK +N+I+ +    G  L    +I +E I F+ +L+T +         +DW  I +  A  LE  F E EV +A+ S   SK+P P
Subjt:  GDENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSP

Query:  DGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDS----KYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILD
        DGF+   F+  W  IK+DL  +V+ F   G I  ++NET+ICLIPK+L S     Y PIS I+  YKIIA++LS RL+GVL  TI + Q AFV  RQILD
Subjt:  DGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDS----KYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILD

Query:  ASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFGLL---------------------------GGNGLG------------AVFLV
        + LIANE ++D+    + G++ K+D EKA+D+V+W+F+DVVL  KGFG +                           G  GL                ++
Subjt:  ASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFGLL---------------------------GGNGLG------------AVFLV

Query:  GRFSKLVPHPIDTSSF-------SLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVH
        GR +       + S F        ++HLQFADDT+ F   +  +L+ +  V++ F   SGL +NL+KS+LLGI +    +  LA + GC+ G WP   + 
Subjt:  GRFSKLVPHPIDTSSF-------SLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVH

Query:  RCIGNDQRT-SFWNDSWLNC-----GILATTFPRLYRLTTNQNAMVADVWNSSHEAWDLSLRRQLNELETNEWANLSYLLSSFSFCARLWQRVEKMGC-R
          +G   R  SFW      C     G       +  RLT  Q+ +      SS   + LSL +    +       L  ++  F     LW+  E  G   
Subjt:  RCIGNDQRT-SFWNDSWLNC-----GILATTFPRLYRLTTNQNAMVADVWNSSHEAWDLSLRRQLNELETNEWANLSYLLSSFSFCARLWQRVEKMGC-R

Query:  LPKW---CSLFHNSARLLGRFHYDDQESFIQRNWRFIL
        L  W   C   H     +GR    ++   ++  WRF L
Subjt:  LPKW---CSLFHNSARLLGRFHYDDQESFIQRNWRFIL

A5B7M7 Reverse transcriptase domain-containing protein7.1e-6731.06Show/hide
Query:  DENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSPD
        D N+KF+H+    R+ +  I E+ + +G+ L  A  I  + + ++  L+T      +    +DW PISE  AL L+  F+EEE+ +A+  L   K+P PD
Subjt:  DENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSPD

Query:  GFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDSK----YCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILDA
        GFT   F+  W+  K+DL  +  +F  +G+IN   N ++I LIPK+  SK    + PIS I+  YKIIA+VLS RL+GVL  TI  +Q AFV  RQILDA
Subjt:  GFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDSK----YCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILDA

Query:  SLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFG------------------LLGGNGLGAV------------------FLVGRFS
         LIANE++D+   + K GV+ K+D EKA+D V WDFLD VL  KGF                   L+ G+  G V                   +    S
Subjt:  SLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFG------------------LLGGNGLGAV------------------FLVGRFS

Query:  KLVPHP----------IDTSSFSLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWP------
        +++             +  +   ++HLQFADDT+ FS+   + L  + +++ +F   S L VNL KS +  I++  + L  LA    CK   WP      
Subjt:  KLVPHP----------IDTSSFSLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWP------

Query:  ------------TSRVHRCIGNDQRTSFWNDSWLNCGILATTFPRLYRLTTNQNAMVADVWNSSHE-AWDLSLRRQLNELETNEWANL-----SYLLSSF
                       + R   N +R  FW D W     L T +PRL+R+  ++N  ++ V   S   +W+L+ RR L++ E  +   L        LS  
Subjt:  ------------TSRVHRCIGNDQRTSFWNDSWLNCGILATTFPRLYRLTTNQNAMVADVWNSSHE-AWDLSLRRQLNELETNEWANL-----SYLLSSF

Query:  SFCARLWQRVEKMGCRLPK--WCSLFHNSA---RLLGRFHYDDQESFIQRNWRFILGGKRGGNTTVLSPR------STTVCLLSVE
           ARLW  +  +G    K  + +L  +S        +F ++ Q  F  +++ +++  K+     +L  R      S  +C+L ++
Subjt:  SFCARLWQRVEKMGCRLPK--WCSLFHNSA---RLLGRFHYDDQESFIQRNWRFILGGKRGGNTTVLSPR------STTVCLLSVE

M5WPQ5 Reverse transcriptase domain-containing protein3.0e-6537.79Show/hide
Query:  GDENTKFFHRTLAARKRKNSINEV-LSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPS
        GD NTKFFHR     +++N I ++ +   GV  V A +IERE I F++ L++++ +  +    ++WCPIS+ +A  LE  F  EEV +A+   G  KSP 
Subjt:  GDENTKFFHRTLAARKRKNSINEV-LSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPS

Query:  PDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDS----KYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQIL
        PDGF+  FF+  W  +K DL  ++QDFF +G++N   NET+ICLIPK+ +S     Y PIS ++  YK+I++VL++RL+ VL +TI+++Q AFV  RQIL
Subjt:  PDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDS----KYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQIL

Query:  DASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFGLLGGNGLGAVFLVGRFSKLV----------------PHPIDTSSFSL----
        DA L+ANE++++     + G++ K+D EKA+D V+W+F+D V+  KGFG+     +        FS ++                  P+    F+L    
Subjt:  DASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFGLLGGNGLGAVFLVGRFSKLV----------------PHPIDTSSFSL----

Query:  NHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVHRCIGNDQRT-SFWN
        +HLQFADDT+       +   N+  ++K+F   SG+ +N AKS +LGI+     L  +A  +GC+ G WP   +   +G + R  +FWN
Subjt:  NHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVHRCIGNDQRT-SFWN

M5XUF8 Reverse transcriptase domain-containing protein (Fragment)3.0e-6535.99Show/hide
Query:  GDENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSP
        GD NTKFFHR    R+++N I ++       +V   +IE E I+F++NL++ +    +    ++W  IS  +A  LE  F EEEV +A+   G  KSP P
Subjt:  GDENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSP

Query:  DGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLD----SKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILD
        DGF+   F+  W  +K+DL  ++ DFFN G+IN   NET+ICLIPK+ +    S + PIS ++  YK++++VL++RL+ VL STI+  Q AFV  RQILD
Subjt:  DGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLD----SKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILD

Query:  ASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFG------------------LLGGNGLGAV------------------FLVGRF
        A+LIANE++++    +K G++ K+DLEKA+D V+W F+D VL  KGFG                  ++ G   G +                   ++   
Subjt:  ASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFG------------------LLGGNGLGAV------------------FLVGRF

Query:  SKLVPHPIDTSSF----------SLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVH
        S+++    DT  F           ++HLQFADDT+ F     +  NN+  ++++F   SG+ +N +K  L+GI++    L ELA  +GC+ G WP S + 
Subjt:  SKLVPHPIDTSSF----------SLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVH

Query:  RCIGNDQRT-SFWN
          +G + R   FW+
Subjt:  RCIGNDQRT-SFWN

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein5.2e-1422.71Show/hide
Query:  RTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLF-TKDNHPQFLQINVDWCP---ISETQALGLEVAFSEEEVFQAMNSLGSSKSPSPDGFTA
        R +  ++ KN I+ + + +G      T+I+    ++Y++L+  K  + + +   +D      +++ +   L    +  E+   +NSL + KSP PDGFTA
Subjt:  RTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLF-TKDNHPQFLQINVDWCP---ISETQALGLEVAFSEEEVFQAMNSLGSSKSPSPDGFTA

Query:  EFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPK-----RLDSKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILDASLI
        EF++     +   L  + Q     G++  +  E  I LIPK          + PIS ++   KI+ ++L+NR++  +   I  +Q+ F+   Q       
Subjt:  EFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPK-----RLDSKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILDASLI

Query:  ANELIDDWNIA-SKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFGLLGGNGLGAVF------LVGRFSKLVPHPIDTSS---------------------
        +  +I   N A  K  VI+ +D EKAFDK+   F+   L+  G   +    + A++      ++    KL   P+ T +                     
Subjt:  ANELIDDWNIA-SKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFGLLGGNGLGAVF------LVGRFSKLVPHPIDTSS---------------------

Query:  ----FSLNHLQ----------FADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSE
              +  +Q          FADD +++      +  N+  +I  F   SG  +N+ KS+
Subjt:  ----FSLNHLQ----------FADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKSE

P08548 LINE-1 reverse transcriptase homolog4.8e-1222.56Show/hide
Query:  LAARKRKNSINEVLSHQGVSLVT-ATDIEREFIDFYRNLFT-KDNHPQFLQINVDWC---PISETQALGLEVAFSEEEVFQAMNSLGSSKSPSPDGFTAE
        L  +KR  S+   + +    + T  ++I++   ++Y+ L++ K  + + +   ++ C    +S+ +   L    S  E+   + +L   KSP PDGFT+E
Subjt:  LAARKRKNSINEVLSHQGVSLVT-ATDIEREFIDFYRNLFT-KDNHPQFLQINVDWC---PISETQALGLEVAFSEEEVFQAMNSLGSSKSPSPDGFTAE

Query:  FFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPK-----RLDSKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILDASLIA
        F++     +   L  + Q+    G++     E  I LIPK          Y PIS ++   KI+ ++L+NR++  +   I  +Q+ F+   Q       +
Subjt:  FFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPK-----RLDSKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQILDASLIA

Query:  NELIDDWN-IASKIGVILKLDLEKAFDKVDWDFL--------------------------DVVLHA---KGFGLLGGNGLGA------------VFLVGR
          +I   N + +K  +IL +D EKAFD +   F+                          +++L+    K F L  G   G             V  +  
Subjt:  NELIDDWN-IASKIGVILKLDLEKAFDKVDWDFL--------------------------DVVLHA---KGFGLLGGNGLGA------------VFLVGR

Query:  FSKLVPHPIDTSSFSLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKS
          +     I   S  +    FADD +++     D+   +  VIK +   SG  +N  KS
Subjt:  FSKLVPHPIDTSSFSLNHLQFADDTLLFSSYDSDALNNVFAVIKIFELASGLNVNLAKS

P11369 LINE-1 retrotransposable element ORF2 protein2.3e-1427.56Show/hide
Query:  RTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTK-----DNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSPDGFT
        R     + K  IN++ + +G       +I+     FY+ L++      D   +FL        +++ Q   L    S +E+   +NSL + KSP PDGF+
Subjt:  RTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTK-----DNHPQFLQINVDWCPISETQALGLEVAFSEEEVFQAMNSLGSSKSPSPDGFT

Query:  AEFFKFSWNTIKQDLRTMVQDFFN----TGVINVALNETYICLIPKRLD-----SKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVA-----
        AEF++    T K+DL  ++   F+     G +  +  E  I LIPK          + PIS ++   KI+ ++L+NR++  + + I  +Q+ F+      
Subjt:  AEFFKFSWNTIKQDLRTMVQDFFN----TGVINVALNETYICLIPKRLD-----SKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVA-----

Query:  --TRQILDASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKG
           R+ ++     N+L D      K  +I+ LD EKAFDK+   F+  VL   G
Subjt:  --TRQILDASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKG

P14381 Transposon TX1 uncharacterized 149 kDa protein1.9e-2431.75Show/hide
Query:  DENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDN-HPQFLQINVDWCP-ISETQALGLEVAFSEEEVFQAMNSLGSSKSPS
        D  ++FF+     +  +  I  + +  G  L     I      FY+NLF+ D   P   +   D  P +SE +   LE   + +E+ QA+  +  +KSP 
Subjt:  DENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDN-HPQFLQINVDWCP-ISETQALGLEVAFSEEEVFQAMNSLGSSKSPS

Query:  PDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPK----RLDSKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQIL
         DG T EFF+F W+T+  D   ++ + F  G + ++     + L+PK    RL   + P+S +S  YKI+A+ +S RLK VL   I  +Q   V  R I 
Subjt:  PDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPK----RLDSKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVATRQIL

Query:  DASLIANELIDDWNIASKIGV---ILKLDLEKAFDKVDWDFLDVVLHAKGFG
        D   +  +L+   + A + G+    L LD EKAFD+VD  +L   L A  FG
Subjt:  DASLIANELIDDWNIASKIGV---ILKLDLEKAFDKVDWDFLDVVLHAKGFG

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein7.9e-1833.33Show/hide
Query:  GDENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNH---PQFLQINVDWCPI--SETQALGLEVAFSEEEVFQAMNSLGSS
        GD NT+FFH+ + A + KN I  +     V +   T ++   + +Y +L   D+    P  +Q   D  P   ++T A  L    S++E+  A+ ++  +
Subjt:  GDENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNH---PQFLQINVDWCPI--SETQALGLEVAFSEEEVFQAMNSLGSS

Query:  KSPSPDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLD----SKYCPISRISCAYKII
        K+P PD FTAEFF  SW  +K      V++FF TG +    N T I LIPK       S + P+S  +  YKII
Subjt:  KSPSPDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLD----SKYCPISRISCAYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.2e-0435.53Show/hide
Query:  RLKGVLPSTIAKNQMAFVATRQILDASLIANELIDDWNIASKIGV----ILKLDLEKAFDKVDWDFLDVVLHAKGF
        RLK ++ + I   Q +F+  R   D  +   E +   ++  K GV    +LKLDLEKA+D++ WD+L+  L + GF
Subjt:  RLKGVLPSTIAKNQMAFVATRQILDASLIANELIDDWNIASKIGV----ILKLDLEKAFDKVDWDFLDVVLHAKGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATTCATGGTTATAAATTGACTCTTTTCGTGAGGTATTGGATAATTGGTGGAGACAAAATCCGCTTCAAGGATGGTCGAGCCATGGGTGACGAAAATACTAAATT
CTTCCATCGTACCTTGGCTGCCCGTAAAAGGAAGAATTCAATTAATGAGGTGTTATCCCATCAAGGGGTCAGTTTAGTTACCGCTACTGATATTGAAAGGGAATTCATTG
ATTTCTATCGAAATTTGTTCACCAAAGATAACCATCCCCAGTTTCTCCAAATCAATGTTGATTGGTGCCCTATTAGCGAGACTCAGGCGTTAGGGTTGGAAGTTGCCTTT
TCTGAGGAAGAAGTATTCCAAGCGATGAATTCTCTAGGATCAAGTAAGTCTCCTAGCCCAGATGGTTTTACAGCTGAATTCTTTAAATTCTCTTGGAATACTATTAAACA
GGATCTTAGGACCATGGTTCAAGATTTTTTTAACACAGGTGTTATTAATGTGGCGTTGAATGAAACTTATATTTGTCTGATTCCAAAGCGTTTAGATTCAAAATATTGCC
CTATCAGCCGCATCTCGTGTGCTTATAAGATCATTGCTCGAGTTTTATCTAATCGTTTGAAGGGTGTTTTGCCATCTACCATAGCTAAAAACCAAATGGCTTTTGTTGCT
ACCAGACAAATCTTGGATGCCTCCTTAATTGCTAATGAGCTAATTGATGATTGGAATATAGCTTCAAAAATAGGTGTGATTCTTAAATTGGATTTAGAAAAGGCCTTTGA
TAAAGTTGATTGGGATTTTCTGGATGTCGTCCTTCATGCAAAAGGCTTTGGTTTACTTGGAGGAAATGGATTAGGGGCTGTATTTCTAGTTGGCAGATTCAGTAAACTTG
TGCCTCATCCAATTGACACCTCATCTTTTAGTCTGAACCATTTGCAATTTGCGGATGACACCCTATTGTTTTCTTCTTATGATTCAGATGCTTTGAATAACGTATTTGCA
GTCATCAAAATTTTTGAATTGGCCTCTGGTCTTAATGTCAACCTTGCTAAGAGTGAACTTTTGGGAATACATATTCATGTTTCAGAATTGGAGGAGTTGGCAGCAAAATT
TGGTTGTAAAAGTGGAATTTGGCCTACTAGTCGAGTTCATCGTTGTATTGGTAACGACCAAAGAACGTCATTCTGGAATGACTCTTGGTTAAATTGTGGGATCCTTGCAA
CAACTTTCCCACGCCTCTATCGTTTAACCACCAACCAAAATGCTATGGTGGCCGATGTTTGGAATTCGTCACATGAGGCTTGGGATCTAAGCCTTCGACGTCAACTAAAT
GAGCTTGAAACAAATGAATGGGCTAATCTCTCTTATCTCCTGTCTTCGTTTAGCTTTTGTGCAAGGCTTTGGCAAAGGGTGGAGAAAATGGGTTGCAGGCTGCCTAAATG
GTGTAGCCTATTTCATAATAGCGCACGACTTTTGGGGCGCTTCCACTACGACGACCAAGAAAGTTTCATCCAACGTAATTGGAGATTCATCCTTGGCGGGAAAAGAGGCG
GAAACACCACCGTCCTCTCACCGCGATCAACAACCGTGTGCTTGCTGTCGGTTGAACAAGATACAAAAATCTTGAAAGAAGATGTGGGTAAGATAAAGAAGATCTTGGAG
ATGATTTGTGAAAAAATGGGCTGCAGAACGGATCAACAAGTTTTTGATTCAAGAACACATACGACAGTGGAAAAGAGACAGCAAGATTATCAAGGAGACGATTCAAGGAC
AAGACAATGGCAAGAGAGACAATTTACAGAGCAGAAAATGACACAAGAACCAATTCCAACACCAAGATTCCAGCAAGATTACCATTCTGGAATGCGAGAATTTAAACAGA
ATCCCCTATTTCGAAGACAACCTGAATGGAATGGGGATAGTTCGAGTGAGGATGAATACCAAGAACTTCAAAAGGAAGGTCGGAGGAACAATGGGGATCAACATCATCAA
GAAAGCGACTTTAAGATAAAGATTGATATCCCGACATATAGTGGAAAGATGGAAATAGAGGCTTTCTTGGAATGGATTAGACATGTGGAAATTTTTTTCAATTACATGAA
CACTCCCGAAAACAAGAAAGTAAGATTAGTAGCCTTGAAGCTTAAAGGAGGAGCTCAAGCATGGTGGGACCAACTAGAAATCAACCGACAACGTTATGGGAAAAGGCCAA
TTCAAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATTCATGGTTATAAATTGACTCTTTTCGTGAGGTATTGGATAATTGGTGGAGACAAAATCCGCTTCAAGGATGGTCGAGCCATGGGTGACGAAAATACTAAATT
CTTCCATCGTACCTTGGCTGCCCGTAAAAGGAAGAATTCAATTAATGAGGTGTTATCCCATCAAGGGGTCAGTTTAGTTACCGCTACTGATATTGAAAGGGAATTCATTG
ATTTCTATCGAAATTTGTTCACCAAAGATAACCATCCCCAGTTTCTCCAAATCAATGTTGATTGGTGCCCTATTAGCGAGACTCAGGCGTTAGGGTTGGAAGTTGCCTTT
TCTGAGGAAGAAGTATTCCAAGCGATGAATTCTCTAGGATCAAGTAAGTCTCCTAGCCCAGATGGTTTTACAGCTGAATTCTTTAAATTCTCTTGGAATACTATTAAACA
GGATCTTAGGACCATGGTTCAAGATTTTTTTAACACAGGTGTTATTAATGTGGCGTTGAATGAAACTTATATTTGTCTGATTCCAAAGCGTTTAGATTCAAAATATTGCC
CTATCAGCCGCATCTCGTGTGCTTATAAGATCATTGCTCGAGTTTTATCTAATCGTTTGAAGGGTGTTTTGCCATCTACCATAGCTAAAAACCAAATGGCTTTTGTTGCT
ACCAGACAAATCTTGGATGCCTCCTTAATTGCTAATGAGCTAATTGATGATTGGAATATAGCTTCAAAAATAGGTGTGATTCTTAAATTGGATTTAGAAAAGGCCTTTGA
TAAAGTTGATTGGGATTTTCTGGATGTCGTCCTTCATGCAAAAGGCTTTGGTTTACTTGGAGGAAATGGATTAGGGGCTGTATTTCTAGTTGGCAGATTCAGTAAACTTG
TGCCTCATCCAATTGACACCTCATCTTTTAGTCTGAACCATTTGCAATTTGCGGATGACACCCTATTGTTTTCTTCTTATGATTCAGATGCTTTGAATAACGTATTTGCA
GTCATCAAAATTTTTGAATTGGCCTCTGGTCTTAATGTCAACCTTGCTAAGAGTGAACTTTTGGGAATACATATTCATGTTTCAGAATTGGAGGAGTTGGCAGCAAAATT
TGGTTGTAAAAGTGGAATTTGGCCTACTAGTCGAGTTCATCGTTGTATTGGTAACGACCAAAGAACGTCATTCTGGAATGACTCTTGGTTAAATTGTGGGATCCTTGCAA
CAACTTTCCCACGCCTCTATCGTTTAACCACCAACCAAAATGCTATGGTGGCCGATGTTTGGAATTCGTCACATGAGGCTTGGGATCTAAGCCTTCGACGTCAACTAAAT
GAGCTTGAAACAAATGAATGGGCTAATCTCTCTTATCTCCTGTCTTCGTTTAGCTTTTGTGCAAGGCTTTGGCAAAGGGTGGAGAAAATGGGTTGCAGGCTGCCTAAATG
GTGTAGCCTATTTCATAATAGCGCACGACTTTTGGGGCGCTTCCACTACGACGACCAAGAAAGTTTCATCCAACGTAATTGGAGATTCATCCTTGGCGGGAAAAGAGGCG
GAAACACCACCGTCCTCTCACCGCGATCAACAACCGTGTGCTTGCTGTCGGTTGAACAAGATACAAAAATCTTGAAAGAAGATGTGGGTAAGATAAAGAAGATCTTGGAG
ATGATTTGTGAAAAAATGGGCTGCAGAACGGATCAACAAGTTTTTGATTCAAGAACACATACGACAGTGGAAAAGAGACAGCAAGATTATCAAGGAGACGATTCAAGGAC
AAGACAATGGCAAGAGAGACAATTTACAGAGCAGAAAATGACACAAGAACCAATTCCAACACCAAGATTCCAGCAAGATTACCATTCTGGAATGCGAGAATTTAAACAGA
ATCCCCTATTTCGAAGACAACCTGAATGGAATGGGGATAGTTCGAGTGAGGATGAATACCAAGAACTTCAAAAGGAAGGTCGGAGGAACAATGGGGATCAACATCATCAA
GAAAGCGACTTTAAGATAAAGATTGATATCCCGACATATAGTGGAAAGATGGAAATAGAGGCTTTCTTGGAATGGATTAGACATGTGGAAATTTTTTTCAATTACATGAA
CACTCCCGAAAACAAGAAAGTAAGATTAGTAGCCTTGAAGCTTAAAGGAGGAGCTCAAGCATGGTGGGACCAACTAGAAATCAACCGACAACGTTATGGGAAAAGGCCAA
TTCAAAGATGA
Protein sequenceShow/hide protein sequence
MKIHGYKLTLFVRYWIIGGDKIRFKDGRAMGDENTKFFHRTLAARKRKNSINEVLSHQGVSLVTATDIEREFIDFYRNLFTKDNHPQFLQINVDWCPISETQALGLEVAF
SEEEVFQAMNSLGSSKSPSPDGFTAEFFKFSWNTIKQDLRTMVQDFFNTGVINVALNETYICLIPKRLDSKYCPISRISCAYKIIARVLSNRLKGVLPSTIAKNQMAFVA
TRQILDASLIANELIDDWNIASKIGVILKLDLEKAFDKVDWDFLDVVLHAKGFGLLGGNGLGAVFLVGRFSKLVPHPIDTSSFSLNHLQFADDTLLFSSYDSDALNNVFA
VIKIFELASGLNVNLAKSELLGIHIHVSELEELAAKFGCKSGIWPTSRVHRCIGNDQRTSFWNDSWLNCGILATTFPRLYRLTTNQNAMVADVWNSSHEAWDLSLRRQLN
ELETNEWANLSYLLSSFSFCARLWQRVEKMGCRLPKWCSLFHNSARLLGRFHYDDQESFIQRNWRFILGGKRGGNTTVLSPRSTTVCLLSVEQDTKILKEDVGKIKKILE
MICEKMGCRTDQQVFDSRTHTTVEKRQQDYQGDDSRTRQWQERQFTEQKMTQEPIPTPRFQQDYHSGMREFKQNPLFRRQPEWNGDSSSEDEYQELQKEGRRNNGDQHHQ
ESDFKIKIDIPTYSGKMEIEAFLEWIRHVEIFFNYMNTPENKKVRLVALKLKGGAQAWWDQLEINRQRYGKRPIQR