; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G19540 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G19540
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr4:17320546..17322045
RNA-Seq ExpressionCSPI04G19540
SyntenyCSPI04G19540
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.3e-7034.54Show/hide
Query:  SLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSE
        S LDRFL++  W+ +F    +   ER  SDHF ILLE+   +WGP PF   NS L DK+      N   S    G+ G+     L +L   +K W  +  
Subjt:  SLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSE

Query:  RARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNG
              +++LL+ ++  +  E    MS+  +  ++S+KSDL++I   + +   Q+ +  W  L DEN S+FHR     +R++LI  + D  GT   S + 
Subjt:  RARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNG

Query:  IESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNACVK
        I    I  +++ YTK      L + L W+ +S    + L   F   EIKS +      KAPGPDGYT  F    W   KD+ L +F +F++ G +N  V 
Subjt:  IESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNACVK

Query:  ENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLEGRQ
          FI                                         LI KKE   +  D+RPISLTT++YK++ K LA RLK  +   IA +Q  F++GRQ
Subjt:  ENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLEGRQ

Query:  ILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSV
        I D ILIANEA+  ++ RK KG+++KLDIEKAFD++ W+ ++ ++ KK+F  KW  WI   I N +YS+
Subjt:  ILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSV

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.3e-6934.33Show/hide
Query:  SLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSE
        S LDRFL++  W+ +F    +   ER  SDHF ILLE+   +WGP PF   NS L DK+      N   S    G+ G+     L +L   +K W  +  
Subjt:  SLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSE

Query:  RARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNG
              +++LL+ ++  +  E    MS+  +  ++S+KSDL++I   + +   Q+ +  W  L DEN S+FHR     +R++LI  + D  GT   S + 
Subjt:  RARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNG

Query:  IESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNACVK
        I    I  +++ YTK      L + L W+ +S    + L   F   EIKS +      KAPGPDGYT  F    W   KD+ L +F +F++ G +N  V 
Subjt:  IESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNACVK

Query:  ENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLEGRQ
          FI                                         LI KKE   +  D+RPISLTT++YK++ K LA RLK  +   IA +Q  F++GRQ
Subjt:  ENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLEGRQ

Query:  ILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSV
        I D ILIANE +  ++ RK KG+++KLDIEKAFD++ W+ ++ ++ KK+F  KW  WI   I N +YS+
Subjt:  ILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSV

XP_038884535.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X1 [Benincasa hispida]1.6e-8552.94Show/hide
Query:  LVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSERARKNQ
        LV+ +WDE F D+R SR+ R  SDHF +L EAGAFEWGPSPF FCNSWL +K+CC II+NS      Q WAGF + S+L+ +K S+K WL + E+ +K +
Subjt:  LVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSERARKNQ

Query:  EESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNGIESLII
        EESLL+ ++ ++++ ++    S +  +++S+K+DL+++Y+ EERDLIQK KLNWL L DENTSFFHRFLAAK+R++LI+EL ++QG PT SF  IE++I+
Subjt:  EESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNGIESLII

Query:  EFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRL
        +F+ + YTK     S+P  + WS VSAE N+RL ++FS  EI  A+Q LGKNKAPGPDG+T EF++ FW+  KD +  +F EFY NG++
Subjt:  EFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRL

XP_038884536.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X2 [Benincasa hispida]1.6e-8552.94Show/hide
Query:  LVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSERARKNQ
        LV+ +WDE F D+R SR+ R  SDHF +L EAGAFEWGPSPF FCNSWL +K+CC II+NS      Q WAGF + S+L+ +K S+K WL + E+ +K +
Subjt:  LVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSERARKNQ

Query:  EESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNGIESLII
        EESLL+ ++ ++++ ++    S +  +++S+K+DL+++Y+ EERDLIQK KLNWL L DENTSFFHRFLAAK+R++LI+EL ++QG PT SF  IE++I+
Subjt:  EESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNGIESLII

Query:  EFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRL
        +F+ + YTK     S+P  + WS VSAE N+RL ++FS  EI  A+Q LGKNKAPGPDG+T EF++ FW+  KD +  +F EFY NG++
Subjt:  EFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRL

XP_038884537.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X3 [Benincasa hispida]1.6e-8552.94Show/hide
Query:  LVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSERARKNQ
        LV+ +WDE F D+R SR+ R  SDHF +L EAGAFEWGPSPF FCNSWL +K+CC II+NS      Q WAGF + S+L+ +K S+K WL + E+ +K +
Subjt:  LVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSERARKNQ

Query:  EESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNGIESLII
        EESLL+ ++ ++++ ++    S +  +++S+K+DL+++Y+ EERDLIQK KLNWL L DENTSFFHRFLAAK+R++LI+EL ++QG PT SF  IE++I+
Subjt:  EESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNGIESLII

Query:  EFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRL
        +F+ + YTK     S+P  + WS VSAE N+RL ++FS  EI  A+Q LGKNKAPGPDG+T EF++ FW+  KD +  +F EFY NG++
Subjt:  EFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRL

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein4.4e-6833.26Show/hide
Query:  RRSVSRSLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKS
        R   + S LDRFL T  W+  F    +    R  SDHF I+LE+    WGPSPF F N++L D      I+    + +  G+AG+    +L+ L   +K+
Subjt:  RRSVSRSLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKS

Query:  WLVDSERARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTP
        W  D +   +  +++ ++ ++  +  E   + + +    + ++K+DL  I   E +   QKCK  W+   DEN+SFFH+   A++++ LIS++I++ G  
Subjt:  WLVDSERARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTP

Query:  TVSFNGIESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGR
         ++ + I    I+ ++  YT +         L+W  +S   +  L   F+  EI   L+   KNKAPGPDGY  +FL   W   K N   +F +F+    
Subjt:  TVSFNGIESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGR

Query:  LNACVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSD
                              H                 +N  V E  I LI KKE      DFRPISLTT IYKL+ K LA+RLK+ +   I+ SQ  
Subjt:  LNACVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSD

Query:  FLEGRQILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSV
        F++GRQI + ILIANEA+  +R +K++G++IKLDIEKAFD+++W  ++ V+ KKN+++KW   I   I + +YS+
Subjt:  FLEGRQILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSV

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein2.1e-7034.54Show/hide
Query:  SLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSE
        S LDRFL++  W+ +F    +   ER  SDHF ILLE+   +WGP PF   NS L DK+      N   S    G+ G+     L +L   +K W  +  
Subjt:  SLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSE

Query:  RARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNG
              +++LL+ ++  +  E    MS+  +  ++S+KSDL++I   + +   Q+ +  W  L DEN S+FHR     +R++LI  + D  GT   S + 
Subjt:  RARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNG

Query:  IESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNACVK
        I    I  +++ YTK      L + L W+ +S    + L   F   EIKS +      KAPGPDGYT  F    W   KD+ L +F +F++ G +N  V 
Subjt:  IESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNACVK

Query:  ENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLEGRQ
          FI                                         LI KKE   +  D+RPISLTT++YK++ K LA RLK  +   IA +Q  F++GRQ
Subjt:  ENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLEGRQ

Query:  ILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSV
        I D ILIANEA+  ++ RK KG+++KLDIEKAFD++ W+ ++ ++ KK+F  KW  WI   I N +YS+
Subjt:  ILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSV

A0A5D3BJP3 LINE-1 retrotransposable element ORF2 protein2.0e-6833.05Show/hide
Query:  RRSVSRSLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKS
        R   + S LDRFL +  W+  F    +    R  SDHF I+LE+ +  WGPSPF F N++L D      I+    + +  G+AG+    +L+ L   +K+
Subjt:  RRSVSRSLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKS

Query:  WLVDSERARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTP
        W  + +   +  +++ ++ ++     E   T + +    ++++K+DL  I   E +   QKCK  W+   DEN+SFFH+   A++++ LIS++I+  G  
Subjt:  WLVDSERARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTP

Query:  TVSFNGIESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGR
         ++ + I    I+ ++  YT +       + L+W  +S   +  L   F+  EI   L+   KNKAPGPDG+T +FL   W   K N   +F +F+ N  
Subjt:  TVSFNGIESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGR

Query:  LNACVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSD
        +N  V E  I +                                        I KKE+   V DFRPISLTT IYKL+ KVLA+RLK+ +   I+ SQ  
Subjt:  LNACVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSD

Query:  FLEGRQILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSV
        F++GRQI + ILIANEA+  +R +K++G++IKLDIEKAFD+++W  ++ ++ KKN+++KW   I   I + +YS+
Subjt:  FLEGRQILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSV

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein6.1e-7034.33Show/hide
Query:  SLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSE
        S LDRFL++  W+ +F    +   ER  SDHF ILLE+   +WGP PF   NS L DK+      N   S    G+ G+     L +L   +K W  +  
Subjt:  SLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWLVDSE

Query:  RARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNG
              +++LL+ ++  +  E    MS+  +  ++S+KSDL++I   + +   Q+ +  W  L DEN S+FHR     +R++LI  + D  GT   S + 
Subjt:  RARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNG

Query:  IESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNACVK
        I    I  +++ YTK      L + L W+ +S    + L   F   EIKS +      KAPGPDGYT  F    W   KD+ L +F +F++ G +N  V 
Subjt:  IESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNACVK

Query:  ENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLEGRQ
          FI                                         LI KKE   +  D+RPISLTT++YK++ K LA RLK  +   IA +Q  F++GRQ
Subjt:  ENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLEGRQ

Query:  ILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSV
        I D ILIANE +  ++ RK KG+++KLDIEKAFD++ W+ ++ ++ KK+F  KW  WI   I N +YS+
Subjt:  ILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSV

A0A803P8A0 Uncharacterized protein4.4e-6833.88Show/hide
Query:  SLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSW---LV
        S LDRFL  ++W+  F   R     RL SDH  +++++   +WGP PF F N WL  K      ++        GW G     KL+ L+   K W     
Subjt:  SLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSW---LV

Query:  DSERARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVS
           +A KN  E  L  L+ +E    S   S  D   KL  K +   +   EER +  K K  W K  D N+ FFH  L A+K R+ IS +  D G    S
Subjt:  DSERARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVS

Query:  FNGIESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNA
           I   +I F+   YT   ++G+    +EW  ++     +L   F   E+++ +     +KAPGPDG++               LA+F           
Subjt:  FNGIESLIIEFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNA

Query:  CVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLE
                       + W+  K+  + +F  F+  GR+   + + FI LI K+ ++ +VKDFRPISL T++YK++ K LA RL+ V+   I+ +QS F+E
Subjt:  CVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLE

Query:  GRQILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSVFYQWKAKRKNFSFKG
        GRQILD +L+ANEAV+DYR R KKG+++K+D EKA+DRVDW  L+ V+RKK F E+W  WI G + +  +S+F   + + K    +G
Subjt:  GRQILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSVFYQWKAKRKNFSFKG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.1e-1222.66Show/hide
Query:  RKNQEESLLQALENE--EIKEESQTMSSLDNVLKLS-IKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFN
        ++ QE S +  L ++  E++++ QT S      +++ I+++L  I  ++    I + +  + +  ++      R +  K+ ++ I  + +D+G  T    
Subjt:  RKNQEESLLQALENE--EIKEESQTMSSLDNVLKLS-IKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFN

Query:  GIESLIIEFYKSPYTKS----PQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRL
         I++ I E+YK  Y        ++ +  +      ++ E+   L    +  EI + +  L   K+PGPDG+TAEF   + +      L LF    + G  
Subjt:  GIESLIIEFYKSPYTKS----PQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRL

Query:  NACVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDF
                                     L N FYE          + I + K   D  + ++FRPISL     K++ K+LA R+++ +  +I   Q  F
Subjt:  NACVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDF

Query:  LEGRQILDPILIANEAVKDY-RIRKKKGWIIKLDIEKAFDRVDWALLEKVMRK
        + G Q    I  +   ++   R + K   II +D EKAFD++    + K + K
Subjt:  LEGRQILDPILIANEAVKDY-RIRKKKGWIIKLDIEKAFDRVDWALLEKVMRK

P08548 LINE-1 reverse transcriptase homolog3.5e-1423.85Show/hide
Query:  SDHFAILLEAG--------AFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQ--GWAGFFICSK--LQNLKSSLKSWLVDSERARKNQEESLLQALEN
        SDH  I +E             W  +     ++W++D+    I K  L   N+Q   +   +  +K  L+    +L+++L  +ER   N     L+ LE 
Subjt:  SDHFAILLEAG--------AFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQ--GWAGFFICSK--LQNLKSSLKSWLVDSERARKNQEESLLQALEN

Query:  EEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNGIESLIIEFYKSPYT-K
        EE    +   S    + K  I+++L  I  +     I K K  + +  ++           K+ +SLIS + +     T   + I+ ++ E+YK  Y+ K
Subjt:  EEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNGIESLIIEFYKSPYT-K

Query:  SPQVGSLPNPLE---WSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNACVKENFIGYTAEFLI
           +  +   LE      +S ++   L    S  EI S +Q L K K+PGPDG+T+EF  +F +      L LF    + G                   
Subjt:  SPQVGSLPNPLE---WSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNACVKENFIGYTAEFLI

Query:  SFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLEGRQILDPILIANEAV
                    L N FYE          N   + K  +D  R +++RPISL     K++ K+L  R+++ +  II   Q  F+ G Q    I  +   +
Subjt:  SFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLEGRQILDPILIANEAV

Query:  KDY-RIRKKKGWIIKLDIEKAFDRVDWALLEKVMRK
        +   +++ K   I+ +D EKAFD +    + + ++K
Subjt:  KDY-RIRKKKGWIIKLDIEKAFDRVDWALLEKVMRK

P11369 LINE-1 retrotransposable element ORF2 protein2.1e-1423.53Show/hide
Query:  FICSKLQNLKSSLKSWLVDSERARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKK
        F+  KL  L +S K      E A  +   + L+ALE +E    S   S    ++KL  + ++  +  R     I + +  + +  ++      R     +
Subjt:  FICSKLQNLKSSLKSWLVDSERARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKK

Query:  RRSLISELIDDQGTPTVSFNGIESLIIEFYKSPY-TKSPQVGSLPNPLEWSVV---SAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFW
         + LI+++ +++G  T     I++ I  FYK  Y TK   +  +   L+   V   + +Q   L S  S  EI++ +  L   K+PGPDG++AE    F+
Subjt:  RRSLISELIDDQGTPTVSFNGIESLIIEFYKSPY-TKSPQVGSLPNPLEWSVV---SAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFW

Query:  DHFKDNYLALFNEFYENGRLNACVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKV
          FK++ + + ++ +    +                             L N FYE              + K ++D  ++++FRPISL     K++ K+
Subjt:  DHFKDNYLALFNEFYENGRLNACVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKV

Query:  LAERLKKVMLSIIAPSQSDFLEGRQILDPILIANEAVKDY-RIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNP
        LA R+++ + +II P Q  F+ G Q    I  +   +    +++ K   II LD EKAFD++    + KV+ +      ++  I      P
Subjt:  LAERLKKVMLSIIAPSQSDFLEGRQILDPILIANEAVKDY-RIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNP

P14381 Transposon TX1 uncharacterized 149 kDa protein1.3e-2426.37Show/hide
Query:  VSRSLLDRFLVTDDWDESFADTRA-SRKERLA--SDHFAILLEAGAFEWGPSP--FCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNL----
        VS+S +DR  ++     S   +RA S   RLA  SDH  + L        P    + F NS L D+     ++++      +GW  F       N     
Subjt:  VSRSLLDRFLVTDDWDESFADTRA-SRKERLA--SDHFAILLEAGAFEWGPSP--FCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNL----

Query:  -KSSLKSWLVDSERARKNQEESLLQALENEEIKEESQTMSSLDNVLK---LSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLI
         K  LK    +  ++   Q  + ++AL  E +  E +   S D  L+   L  K  L  + +R+ R    + ++  L   D  + FF+     K  R  I
Subjt:  -KSSLKSWLVDSERARKNQEESLLQALENEEIKEESQTMSSLDNVLK---LSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLI

Query:  SELIDDQGTPTVSFNGIESLIIEFYKSPYTKSPQVGSLPNPLEWS---VVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDN
        + L  + GTP      I      FY++ ++  P        L W    VVS  +  RL +  +L E+  AL+L+  NK+PG DG T EF   FWD     
Subjt:  SELIDDQGTPTVSFNGIESLIIEFYKSPYTKSPQVGSLPNPLEWS---VVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDN

Query:  YLALFNEFYENGRLNACVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLK
           L  +F+                                  +  E ++ G L +  +   + L+ KK D   +K++RP+SL +T YK+V K ++ RLK
Subjt:  YLALFNEFYENGRLNACVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLK

Query:  KVMLSIIAPSQSDFLEGRQILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWI
         V+  +I P QS  + GR I D + +  + +   R        + LD EKAFDRVD   L   ++  +F  +++
Subjt:  KVMLSIIAPSQSDFLEGRQILDPILIANEAVKDYRIRKKKGWIIKLDIEKAFDRVDWALLEKVMRKKNFAEKWI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.0e-1324.19Show/hide
Query:  SHGQERRSVSRSLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGP--SPFCFCNSWLLDKQCCHIIKNSLT-SGNHQGWAGFFICSKLQ
        S+ Q+   + R  LDR +   DW  SF    A  +    SDH   ++     E  P  S  CF     L      ++  SLT +   Q   G  + S  +
Subjt:  SHGQERRSVSRSLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGP--SPFCFCNSWLLDKQCCHIIKNSLT-SGNHQGWAGFFICSKLQ

Query:  NLKSSLKSWLVDSERARKNQEESLLQALENEEIKEESQTMSSLDNVLKLS---------IKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAA
        +LK++ K   + + +   N +    +AL++ E  +     +  D++ ++            + L + YR       QK ++ WL+  D NT FFH+ + A
Subjt:  NLKSSLKSWLVDSERARKNQEESLLQALENEEIKEESQTMSSLDNVLKLS---------IKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAA

Query:  KKRRSLISELIDDQGTPTVSFNGIESLIIEFYKSPYTKSPQVGSLPNPLE--WSVVSAEQNTRLTSRFS-LP---EIKSALQLLGKNKAPGPDGYTAEFL
         + ++LI  L  D      +   ++ +I+ +Y         +   P+ ++    +     N  L SR S LP   EI +A+  + +NKAPGPD +TAEF 
Subjt:  KKRRSLISELIDDQGTPTVSFNGIESLIIEFYKSPYTKSPQVGSLPNPLE--WSVVSAEQNTRLTSRFS-LP---EIKSALQLLGKNKAPGPDGYTAEFL

Query:  ISFWDHFKDNYLALFNEFYENGRLNACVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKL
          FW+                                      W   KD+ +A   EF+  G L        I LI K     ++  FRP+S  T +YK+
Subjt:  ISFWDHFKDNYLALFNEFYENGRLNACVKENFIGYTAEFLISFWDHFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKL

Query:  V
        +
Subjt:  V

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.3e-1142.68Show/hide
Query:  LAERLKKVMLSIIAPSQSDFLEGRQILDPILIANEAVKDYRIRK-KKGW-IIKLDIEKAFDRVDWALLEKVMRKKNFAEKWI
        + ERLK +M ++I P+Q+ F+ GR   D I+   EAV   R +K  KGW ++KLD+EKA+DR+ W  LE  +    F E W+
Subjt:  LAERLKKVMLSIIAPSQSDFLEGRQILDPILIANEAVKDYRIRK-KKGW-IIKLDIEKAFDRVDWALLEKVMRKKNFAEKWI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAAATTCTCATGGTCAAGAGAGAAGGTCAGTTTCACGTTCTCTGCTGGATAGGTTCCTAGTAACTGATGATTGGGACGAATCCTTTGCTGATACCAGAGCCTCTCG
AAAGGAAAGACTTGCTTCAGATCATTTCGCCATCTTATTAGAAGCTGGTGCTTTTGAATGGGGCCCCTCTCCATTTTGCTTTTGCAACAGCTGGCTGCTCGATAAGCAGT
GCTGCCATATTATAAAAAATTCTTTGACCTCAGGAAATCATCAAGGGTGGGCTGGTTTTTTCATTTGTTCAAAACTTCAAAATTTAAAAAGCTCCTTGAAATCTTGGCTT
GTGGACAGTGAAAGAGCCAGAAAAAACCAAGAAGAGTCTTTGCTCCAAGCTCTTGAAAATGAAGAAATTAAGGAAGAATCTCAGACCATGTCATCCTTAGACAATGTTCT
GAAATTGTCTATTAAATCAGACCTCATAGCTATATATAGAAGAGAGGAAAGAGACTTAATTCAAAAATGTAAGTTGAACTGGTTAAAATTGGAAGATGAAAATACTAGTT
TCTTCCATCGTTTCCTTGCTGCAAAAAAAAGACGTAGCTTGATTTCAGAATTGATTGATGACCAAGGTACTCCTACAGTTTCATTTAATGGAATTGAAAGTCTCATCATT
GAATTCTACAAGTCTCCCTATACAAAATCCCCTCAAGTTGGCTCTCTTCCTAATCCCCTTGAATGGTCAGTGGTTTCGGCAGAACAAAATACAAGGCTCACCTCAAGATT
TAGTCTACCCGAAATAAAATCTGCACTTCAACTGCTGGGGAAAAATAAGGCCCCTGGACCAGATGGGTATACAGCTGAATTTCTAATCAGCTTTTGGGATCATTTCAAAG
ATAATTACCTTGCTCTCTTTAATGAGTTTTATGAGAATGGGAGGTTAAATGCATGTGTAAAAGAGAACTTCATTGGGTATACAGCTGAATTTCTAATCAGCTTTTGGGAT
CATTTCAAAGATAATTACCTTGCTCTCTTTAATGAGTTTTATGAGAATGGGAGGTTAAATGTATGTGTAAAAGAGAACTTCATTTATTTAATCAAGAAGAAAGAAGATGC
AGTAAGGGTAAAAGACTTTAGACCAATAAGTCTTACAACAACAATTTATAAGCTGGTCAGAAAGGTTCTCGCTGAAAGATTAAAGAAAGTAATGCTGAGTATTATTGCTC
CATCACAAAGTGACTTTCTGGAAGGGCGCCAAATTTTGGACCCTATTTTAATTGCAAACGAAGCTGTGAAGGATTATAGAATCAGAAAAAAGAAAGGATGGATCATTAAA
TTAGACATTGAAAAGGCTTTCGACAGAGTGGATTGGGCACTTTTAGAAAAAGTAATGCGTAAGAAGAACTTCGCTGAAAAATGGATCTTATGGATAATGGGTGGTATCAA
GAATCCTAAATATTCGGTCTTTTATCAATGGAAGGCCAAGAGGAAGAATTTCAGCTTCAAGGGGTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTAAATTCTCATGGTCAAGAGAGAAGGTCAGTTTCACGTTCTCTGCTGGATAGGTTCCTAGTAACTGATGATTGGGACGAATCCTTTGCTGATACCAGAGCCTCTCG
AAAGGAAAGACTTGCTTCAGATCATTTCGCCATCTTATTAGAAGCTGGTGCTTTTGAATGGGGCCCCTCTCCATTTTGCTTTTGCAACAGCTGGCTGCTCGATAAGCAGT
GCTGCCATATTATAAAAAATTCTTTGACCTCAGGAAATCATCAAGGGTGGGCTGGTTTTTTCATTTGTTCAAAACTTCAAAATTTAAAAAGCTCCTTGAAATCTTGGCTT
GTGGACAGTGAAAGAGCCAGAAAAAACCAAGAAGAGTCTTTGCTCCAAGCTCTTGAAAATGAAGAAATTAAGGAAGAATCTCAGACCATGTCATCCTTAGACAATGTTCT
GAAATTGTCTATTAAATCAGACCTCATAGCTATATATAGAAGAGAGGAAAGAGACTTAATTCAAAAATGTAAGTTGAACTGGTTAAAATTGGAAGATGAAAATACTAGTT
TCTTCCATCGTTTCCTTGCTGCAAAAAAAAGACGTAGCTTGATTTCAGAATTGATTGATGACCAAGGTACTCCTACAGTTTCATTTAATGGAATTGAAAGTCTCATCATT
GAATTCTACAAGTCTCCCTATACAAAATCCCCTCAAGTTGGCTCTCTTCCTAATCCCCTTGAATGGTCAGTGGTTTCGGCAGAACAAAATACAAGGCTCACCTCAAGATT
TAGTCTACCCGAAATAAAATCTGCACTTCAACTGCTGGGGAAAAATAAGGCCCCTGGACCAGATGGGTATACAGCTGAATTTCTAATCAGCTTTTGGGATCATTTCAAAG
ATAATTACCTTGCTCTCTTTAATGAGTTTTATGAGAATGGGAGGTTAAATGCATGTGTAAAAGAGAACTTCATTGGGTATACAGCTGAATTTCTAATCAGCTTTTGGGAT
CATTTCAAAGATAATTACCTTGCTCTCTTTAATGAGTTTTATGAGAATGGGAGGTTAAATGTATGTGTAAAAGAGAACTTCATTTATTTAATCAAGAAGAAAGAAGATGC
AGTAAGGGTAAAAGACTTTAGACCAATAAGTCTTACAACAACAATTTATAAGCTGGTCAGAAAGGTTCTCGCTGAAAGATTAAAGAAAGTAATGCTGAGTATTATTGCTC
CATCACAAAGTGACTTTCTGGAAGGGCGCCAAATTTTGGACCCTATTTTAATTGCAAACGAAGCTGTGAAGGATTATAGAATCAGAAAAAAGAAAGGATGGATCATTAAA
TTAGACATTGAAAAGGCTTTCGACAGAGTGGATTGGGCACTTTTAGAAAAAGTAATGCGTAAGAAGAACTTCGCTGAAAAATGGATCTTATGGATAATGGGTGGTATCAA
GAATCCTAAATATTCGGTCTTTTATCAATGGAAGGCCAAGAGGAAGAATTTCAGCTTCAAGGGGTCTTAG
Protein sequenceShow/hide protein sequence
MVNSHGQERRSVSRSLLDRFLVTDDWDESFADTRASRKERLASDHFAILLEAGAFEWGPSPFCFCNSWLLDKQCCHIIKNSLTSGNHQGWAGFFICSKLQNLKSSLKSWL
VDSERARKNQEESLLQALENEEIKEESQTMSSLDNVLKLSIKSDLIAIYRREERDLIQKCKLNWLKLEDENTSFFHRFLAAKKRRSLISELIDDQGTPTVSFNGIESLII
EFYKSPYTKSPQVGSLPNPLEWSVVSAEQNTRLTSRFSLPEIKSALQLLGKNKAPGPDGYTAEFLISFWDHFKDNYLALFNEFYENGRLNACVKENFIGYTAEFLISFWD
HFKDNYLALFNEFYENGRLNVCVKENFIYLIKKKEDAVRVKDFRPISLTTTIYKLVRKVLAERLKKVMLSIIAPSQSDFLEGRQILDPILIANEAVKDYRIRKKKGWIIK
LDIEKAFDRVDWALLEKVMRKKNFAEKWILWIMGGIKNPKYSVFYQWKAKRKNFSFKGS