; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005724 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005724
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:27204379..27207645
RNA-Seq ExpressionLag0005724
SyntenyLag0005724
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]8.3e-20936.78Show/hide
Query:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIK---LEESWLKFEGSKQTFKSSWE-
        MGF G +FTW+  +       ERLDR + +    S  +N+    L    SDH PI+  +    K  ++   KN       E+ W  +E      +S WE 
Subjt:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIK---LEESWLKFEGSKQTFKSSWE-

Query:  ---DTPEVSTYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCS----NLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNT
           ++ E     F +  +  + ++  W+KE  +G  K    ++ + I RL          ++ E + + E  +  +L +EE YWK RSR DWLK GDKNT
Subjt:  ---DTPEVSTYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCS----NLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNT

Query:  KWFHSKASCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDG
        K+FHSKAS R+RKN+I G+ D++GNW +  E +      +F+ L +S++PS+  + +    +   +  +    L+ PF+ E+I +A+  M P+KAPGPDG
Subjt:  KWFHSKASCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDG

Query:  AHAMIYQRYWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNV
          A  +Q++W I+GE   + CL  LN +  L+ LN TFI+LIPK + P+K+ EFRPISLCNV+Y+I+ K IANR+K IL  IIS +QSAFIP R I+DNV
Subjt:  AHAMIYQRYWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNV

Query:  LVGFECIQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEG
        ++G+EC+  +   K  + G + LKLD+SKAYDRVEW FLE+ M  + FS  W   +M+C+++  +S++ING P    KP RGLRQG PLSPYLF+L AE 
Subjt:  LVGFECIQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEG

Query:  FSALLRREESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIY
        FS LL + E    I GL+  +    +THL FADDS++F KA   + + LK + + Y +ASGQ  N +KS+   S     E     +    +K       Y
Subjt:  FSALLRREESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIY

Query:  LGMPSQVGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVR
        LG+P  +GRNK   F+ +K +V   +  W  +LFS GGKE+LIKAVAQA+P Y MS F+LP  +C +I +  ARFWWGT  DK+  HW  W +M K K R
Subjt:  LGMPSQVGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVR

Query:  GGLGFRQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGRLSPLV
        GGLGFR +  FNQA++AKQ WR+++ P++L+ ++++ +Y+K+  F  A +G+  S  WRSI WG  +  KG RWR+G+   + + KD WI R     P+ 
Subjt:  GGLGFRQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGRLSPLV

Query:  TVDSLKGQRVNRLLDGNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVL
                 V  L+D  N W+ + L Q F  +D+E I+ I        DE++W +DKKG +SVKS Y LA+  N  ++  P ++ ++SRLWK  W   + 
Subjt:  TVDSLKGQRVNRLLDGNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVL

Query:  PRTKICIWKIVKDIIPSKANILKKGVSLNPLC------------------------DLA-------------------------------ITAITMWMIW
         + KI +W+ +K+I+P+  N+ K+     P+C                        DLA                               +  +  W+IW
Subjt:  PRTKICIWKIVKDIIPSKANILKKGVSLNPLC------------------------DLA-------------------------------ITAITMWMIW

Query:  SSRNR-----DQPDSR--ERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTGSPICFGLR
        S+RN+      + DSR      D +++A    +S+  N +       + R + +Q  W  P     KLN DA+ + K  + G+G +V D+ G  +  G++
Subjt:  SSRNR-----DQPDSR--ERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTGSPICFGLR

Query:  Q
        Q
Subjt:  Q

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]1.1e-20838.34Show/hide
Query:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWEDTPE
        +G+ G  +TW+  +  +     RLDR +      +K   +KV HL     DH  +L++    + I+   P       E  W K E  K   ++SW    +
Subjt:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWEDTPE

Query:  VST-YNFSKKIQEGMINMSRWNKERLKGSIKGAIARK-EKKIQRLSKCSNLMNHEALC----QAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHS
        +ST    S+ ++   + +S+W+      +I G I +K + K  RL+  +     E L     +  +++  LL++EETYW  R++  WLK GD+NTK+FH+
Subjt:  VST-YNFSKKIQEGMINMSRWNKERLKGSIKGAIARK-EKKIQRLSKCSNLMNHEALC----QAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHS

Query:  KASCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMI
        +AS R+++N I GI DE+G W + EE + + A +YF ++ SS+HPS+I  ++V++AI   + ++    L   F+KEE+  A+K ++P+KAPGPDG  A+ 
Subjt:  KASCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMI

Query:  YQRYWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFE
        +Q+YW I+G +   + L  LNH   +  LNKT ISLIPK+ +PK+M +FRPISLCNV+YK+I K +ANR+K +L  IIS +QSAF   R I+DNVLV FE
Subjt:  YQRYWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFE

Query:  CIQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALL
         +  L+ +  GK G M +KLDMSKA+DRVEW F+ ++ME+M F N W   VM C++SV+YSI+INGV      PSRGLRQGDPLSP LFLL AEG SAL+
Subjt:  CIQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALL

Query:  RREESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPS
         +      I+G+ +N+ CP +THLFFADDS++F KA  +E   L+ +L +Y++ASGQ IN DKS+   S N   E ++     LG  ++     YLG+PS
Subjt:  RREESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPS

Query:  QVGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGF
         +GR+K + F  +K++V   L GWK +L S+GGKE+LIKAVAQAIPTYTMSCF LP  +C +++R+   FWWG    + +  W+SW  MC +K  GGLGF
Subjt:  QVGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGF

Query:  RQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGR---LSPLVTV
        R +  FN AMLAKQ+WRI+ NP++L+ ++L+ +YF  G+ L A +G++ S +WRSI    ++  +G RWRVGN   I I +D W+        +SP   +
Subjt:  RQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGR---LSPLVTV

Query:  DSLKGQRVNRLLDGNNH-WKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVE-YNSSSQASPSNAENTSRLWKSIWNAKVL
         + +   V+ L+D +   WK E L   F P +VE I+ IP       D++IW  +KKG FSVKSAYH+A    + + +   SN +    LWK +W   + 
Subjt:  DSLKGQRVNRLLDGNNH-WKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVE-YNSSSQASPSNAENTSRLWKSIWNAKVL

Query:  PRTKICIWKIVKDIIPSKANILKKGV---SLNPLCDLA----------------------------------------------------ITAITMWMIW
         + KI  W+   D +P+  NI K+G+   S  P+C L                                                     +  +  W IW
Subjt:  PRTKICIWKIVKDIIPSKANILKKGV---SLNPLCDLA----------------------------------------------------ITAITMWMIW

Query:  SSRNRDQPDSRERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTG
         +RN+   +    +P  +    +N + + K        P R     +Q  W  P LG +K+N D + +++     IG ++ DS G
Subjt:  SSRNRDQPDSRERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTG

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]7.0e-21639.29Show/hide
Query:  RLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWEDTPEV-STYNFSKKIQEGMINMSRWN
        RLDR +     +   ++VKV HL    SDH  +L++    + + Q    +   + E  W + E  K   +  W  + EV S    + +++    N+S WN
Subjt:  RLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWEDTPEV-STYNFSKKIQEGMINMSRWN

Query:  KERLKGSIKGAIARKEKKIQRL--SKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKASCRKRKNEIKGILDERGNWTEKE
        K  + G+I   I  K++ +  L  S  +  +  E +    K++  LL+ EE  W+ RSR  WL  GD+NTK+FH+KAS R+R+N I GI+DE GNW +  
Subjt:  KERLKGSIKGAIARKEKKIQRL--SKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKASCRKRKNEIKGILDERGNWTEKE

Query:  EDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRYWDILGEDTIRVCLGFLNHKEA
        E + K+A +YF+++ SS+ P++IS  +V DAI +T+ ++    L   F++EEIE A+  M+P+KAPGPDG  A+ +Q+YW+I+G D + + L  LN   +
Subjt:  EDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRYWDILGEDTIRVCLGFLNHKEA

Query:  LEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQALNKRKKGKVGHMTLKLDMSKA
        +  +NKT I+L+PK K+P KM +FRPISLCNV+YK+I K +ANR+K IL  IIS +QSAF+ GR I+DNVLV FE +  L  +K+GK G   +KLDMSKA
Subjt:  LEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQALNKRKKGKVGHMTLKLDMSKA

Query:  YDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREESLSNISGLQLNKFCPNLTHLF
        YDRVEW F++++MEKM F   W + VM C++SV+YSI++NG       P+RGLRQGDP+SPY+FLL A+GFS+LL        ISG+ + + CP +THLF
Subjt:  YDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREESLSNISGLQLNKFCPNLTHLF

Query:  FADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCE--RALGIKRSESLGIYLGMPSQVGRNKKEAFRRIKDRVEKILQG
        FADDS++F KA ++E + L  +L+ Y+ ASGQ IN+DKS+   S N    D++ CE  R LG  +      YLG+PS +G++K E F  +K+RVE+ L G
Subjt:  FADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCE--RALGIKRSESLGIYLGMPSQVGRNKKEAFRRIKDRVEKILQG

Query:  WKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMTLFNQAMLAKQSWRIIKNPD
        WKE+L S+GG+E+LIKAVAQAIPTYTMSCF++P  +C EI+ +  RFWWG  G +++  W+SW  +CK K  GG+GFR +  FN AMLAKQ WR+I NP+
Subjt:  WKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMTLFNQAMLAKQSWRIIKNPD

Query:  NLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRT---GRLSPLVTVDSLKGQRVNRLLD-GNNHWKKEL
        +L+ +I + +Y+  G+  +A +G + S TWRSI  G ++  +G RWRVGN   I I +D W+        +SP    D     RV+ L+D     WK ++
Subjt:  NLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRT---GRLSPLVTVDSLKGQRVNRLLD-GNNHWKKEL

Query:  LVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNA-ENTSRLWKSIWNAKVLPRTKICIWKIVKDIIPSKANILK
        +   F P +   I++IP       D+IIW  ++KG FSVKSAY++AV    + +   S++ ++ S LW+ +W+  + P+ +I  WK+  + +P+  N+L+
Subjt:  LVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNA-ENTSRLWKSIWNAKVLPRTKICIWKIVKDIIPSKANILK

Query:  KGVSLNPLC-------------------------------------------------------DLAITAITMWMIWSSRNRDQPDSRERTPDVLIQAIS
        KGV++  +C                                                       DL I  +  W IW +RN+   +S  + P+ +     
Subjt:  KGVSLNPLC-------------------------------------------------------DLAITAITMWMIWSSRNRDQPDSRERTPDVLIQAIS

Query:  NHISERKNFYLK--QSFPKRSRSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTG
         +I E KN      Q  P+      +   W  P  G +K+N D + +E      +G ++ D+TG
Subjt:  NHISERKNFYLK--QSFPKRSRSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTG

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]5.0e-20637.31Show/hide
Query:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSW--EDT
        +GF G  +TW+  +  +  T+ RLDR + N +   + +  +V HL+ H SDH P+LL +   S+ +Q        K EESWL  +      + +W   D 
Subjt:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSW--EDT

Query:  PEVSTYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHEA-LCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKAS
                 +KI+   + +  W    +     GAI   +K++ RL++       +A      K ++ LL+++E YW  RSR +WL+ GD+NTK+FH+KAS
Subjt:  PEVSTYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHEA-LCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKAS

Query:  CRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQR
         R+RKN I+GI + +G W E  E+VG++A+ YF +L  +    +  +++  DA+ + + +D +  L   F+ EE++ A+  M P+KAPGPDG +A+ YQ+
Subjt:  CRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQR

Query:  YWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQ
        +W I+G+  +   L FLN+   L  +N T I LIPK ++P++M EFRPISLCNVIYKII K +ANR+K +L  IISS+QSAF+PGR I+DNVLV +E + 
Subjt:  YWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQ

Query:  ALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRRE
         ++ RKKGK G + LKLD+SKAYDRVEW FL+ IMEKM F   W  RVM+CV++ ++SI++NG P E  +PSRG+RQGDP+SPYLFLL AEG +ALL + 
Subjt:  ALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRRE

Query:  ESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQVG
        E    I+G+ + +  P +T+L FADDS++F +A   E E +  +L+ Y++ASGQ+INL+KS+   S N     +      LG+K  +    YLG+P+ +G
Subjt:  ESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQVG

Query:  RNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQM
        R K   F  +KDRV K LQGWK  L S  GKE+LIKAVAQAIPTYTMS F++P  +CSE++ LCARFWWG VG++ + HW SW  +   K  GG+GFR +
Subjt:  RNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQM

Query:  TLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINR--TGRLSPLVTVDSLK
          FN AMLAKQ WR+++  D+LL++  + +YF   +FLEA      S  WRS+   + +   GY WRVGN   I   KD W+    T ++   V  D  +
Subjt:  TLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINR--TGRLSPLVTVDSLK

Query:  GQRVNRLLDGNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVE-YNSSSQASPSNAENTSRLWKSIWNAKVLPRTKI
              +    N W  E +   F   + E I  IP   +   D I W Y  +G+FSVKSAYH+A      +++   S       +W +IW  ++  + K+
Subjt:  GQRVNRLLDGNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVE-YNSSSQASPSNAENTSRLWKSIWNAKVLPRTKI

Query:  CIWKIVKDIIPSKANILKKGVSLNPLC-------------------------------------------------------DLAITAITMWMIWSSRNR
          W+   +I+P+  N+  + +  +  C                                                       +L +     W++W  RN 
Subjt:  CIWKIVKDIIPSKANILKKGVSLNPLC-------------------------------------------------------DLAITAITMWMIWSSRNR

Query:  DQPDSRERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTG
             R + P  L      +I+E +N   +    +  +S+++  +W  P  G +KLN DA+     G+ G G ++ +  G
Subjt:  DQPDSRERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTG

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata]4.6e-20737.96Show/hide
Query:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWEDTPE
        +GFSG  FTW   + + E   ERLDR + N + +++    +V+HLN + SDHRP+LLS+D   K Q+    +   + E  W+   G K T   +W   P 
Subjt:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWEDTPE

Query:  VST--YNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHE--ALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKA
          T   N + KI+E    + RW+KE   G++K  I   ++K+  +++  ++   +   +   + +L  LLE+EE  W  RSR  WL+ GD+NT++FH  A
Subjt:  VST--YNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHE--ALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKA

Query:  SCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQ
        +CRKR+N IKG+ DE G W  +E+    + + +++ L  S++P  I   +V D +   + +     L  P+S +E+E+AIK M P KAPGPDG   + YQ
Subjt:  SCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQ

Query:  RYWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECI
         YW  +  D  +  L  LN    L+ +N TFI+LIPK K+P+K+ EFRPISLCNV+YKI+ K IANR+K +L SIIS +QSAFI  R I+DNVL+ FE +
Subjt:  RYWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECI

Query:  QALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRR
          +     GK G M LKLDMSKAYDRVEW+FLE+++ K+ F   W   +M C+++VTYSI++NG P+    P+RGLRQGDPLSPYLFL  AEG +A+ ++
Subjt:  QALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRR

Query:  EESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQV
              I G  + +  P LTHLFFADD ++F ++  +E E +K +L  Y++ASGQ +N DK+T   SKN     QE  + +LG+        YLG+PS +
Subjt:  EESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQV

Query:  GRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQ
        GRNKK  F +IK+R+   +QGWKE+L S  GKE++IKAV Q+IPTY+MS F+LP  +C +I+ +  +FWWG  G+  + HW++W+ +C +K  GGLGFR 
Subjt:  GRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQ

Query:  MTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWI-NRTGRLSPLVTVDSLK
        +  FN AMLAKQ WR+    D LLF++   KYF +GN  + PI    S  W+SI   R++   G  WRVGN +LI I    W+   T  L      D   
Subjt:  MTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWI-NRTGRLSPLVTVDSLK

Query:  GQRVNRLLDGNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVLPRTKIC
         +  + LL     W  +L+ + F P +  +I  I   E +  D++IW     GV+SVKSAYH+        +AS S++E    +W  IW  +   R +  
Subjt:  GQRVNRLLDGNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVLPRTKIC

Query:  IWKIVKDIIPSKANILKK---GVSLNPLCD----------------------------------------------------LAITAITMWMIWSSRNRD
        +W+ VKD +P+K N+ ++   G +L PLC+                                                    +A  A   W +W  RN  
Subjt:  IWKIVKDIIPSKANILKK---GVSLNPLCD----------------------------------------------------LAITAITMWMIWSSRNRD

Query:  QPDSRERTPDVLIQAISNHISER-KNFYLKQSFPKRSRSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTG
          + RE  P   +  +S       + F+  Q+  ++   +     W  P    +K N DA+  E+ G  G+G V+ DS G
Subjt:  QPDSRERTPDVLIQAISNHISER-KNFYLKQSFPKRSRSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTG

TrEMBL top hitse value%identityAlignment
A0A2N9GPY1 Reverse transcriptase domain-containing protein4.0e-20936.81Show/hide
Query:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWE-DTP
        +GF+G  FTW  N+   + T ERLDR +     +S   + +V HL+   SDH+PI     W    + V P +   + EE W    G ++T ++SW    P
Subjt:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWE-DTP

Query:  EVSTYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRL-SKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKASC
         V  Y   +KI E    + +W+K    G+IK  I   E +++   +K     +H  + + +  L  LL ++E  W+  SR +WL+ GD+NT++FHSKA+ 
Subjt:  EVSTYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRL-SKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKASC

Query:  RKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRY
        R+R+N +  + D  G WT  ++ V  +   Y+ SL ++ +P+ +  ++V+D I   + D+    L   F+ +E+ KA+K M P K+PGPDG   + YQ+Y
Subjt:  RKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRY

Query:  WDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQA
        W ++GED  +  L  LN  + L+ +N T+I+LIPK K+P++++EFRPISLCNVIYK++ K + NR+K+IL  I+S SQSAF+PGR I+DN+LV FE +  
Subjt:  WDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQA

Query:  LNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREE
        ++ +KKGK G M LKLDMSKAYDRVEW +L+ +ME++ F + W   +M C+++V+YSI++NG P    KPSRGLRQGD LSPYLFL  AEGF +L++R +
Subjt:  LNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREE

Query:  SLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQVGR
            + G+ +++  P LTHLFFADDS++F KA A+++  ++ +L  Y+ ASGQ IN  K+T   SK+    DQ      LG+   +    YLG+PS VGR
Subjt:  SLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQVGR

Query:  NKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMT
         K  +F +IK+RV   L+GWKE+L S  G+EVLIKAVAQAIPTY MSCFRLP  +  EI+ L  RFWWG  GD+ + HW+ W  +CK+K  GGLG R + 
Subjt:  NKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMT

Query:  LFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWI---NRTGRLSPLVTVDSLK
        +FN+A+LAKQ WR++ N  +L +++ + KYF   + LEA      S+ W+SI     L  KG  WRVG+   I I +D W+   +    LSP      + 
Subjt:  LFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWI---NRTGRLSPLVTVDSLK

Query:  GQRVNRLLD-GNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVLPRTKI
           V +L+D  +N WK+ ++   F P + E I+ IP   K   D  +W   K GV+SV+S YH  +  +      PSN    +++W +IW  K+  + + 
Subjt:  GQRVNRLLD-GNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVLPRTKI

Query:  CIWKIVKDIIPSKANILKKGVSLNPLC-------------------------------------------------------DLAITAITMWMIWSSRNR
         +W+   + +P+  N+  + V  +P C                                                       +L + A T W IW  +N 
Subjt:  CIWKIVKDIIPSKANILKKGVSLNPLC-------------------------------------------------------DLAITAITMWMIWSSRNR

Query:  DQPDSRERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLAN---QAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTGS
         + + +    + L+    + +SE   F   Q  P  +   ++   +  W  P  G +K N D ++ E   + GIG ++ +  G+
Subjt:  DQPDSRERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLAN---QAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTGS

A0A2N9HK89 Uncharacterized protein2.0e-20836.16Show/hide
Query:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWEDTPE
        +G+ G  FTW  N+     T  RLDR + +   + K +N ++EHLN   SDH+ + L ++  ++ Q     +   + EE W   +G + T ++ W D   
Subjt:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWEDTPE

Query:  VSTYNFS--KKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKASC
        V T  F    K++     + +W+++      +    ++++ +   +      + + L   + D+  LLE+EE  W+ RSR  WLK GD+NT++FH +AS 
Subjt:  VSTYNFS--KKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKASC

Query:  RKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRY
        R+R+N I G+ +  G W E++E++  I   Y++++ +++ P+ I  ++    +   +       L  P+++ E+E+A+K M P KAPGPDG   + YQRY
Subjt:  RKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRY

Query:  WDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQA
        W ++G+D  R  L  LN  + L  +N TFI+LIPK K+P+++ E+RPISLCNVIYK++ K IANR+KTIL  +IS SQSAF+PGR I+DNVLV FE +  
Subjt:  WDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQA

Query:  LNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREE
        ++  K G+ G M LK DMSKAYDRVEW +LE IM KM F N W   +M C+++V+YS+++NG P    KPSRGLRQGDPLSPYLFLL AEGF +L+ +  
Subjt:  LNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREE

Query:  SLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQVGR
           ++ G+ L +  P +THLFFADDS++F KA  ++   L+ +L  Y++ASGQ +N DK+T   SK    + QE  +  LG+   +    YLG+PS VG+
Subjt:  SLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQVGR

Query:  NKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMT
        N+   F ++K+RV   L GWKE+L S  G+EVLIK+VAQAIPTYTMSCFRLPN +C +++ +  +FWWG   D N+  W+ W+++C  K  GG+GFR + 
Subjt:  NKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMT

Query:  LFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGRLSPLVTVDS-LKGQ
         FN A+L KQ WR++ + ++LL+++ + K+F + + LEA      S  W+SI   RD+   G  WRVGN   I I +  W+        +    + L+  
Subjt:  LFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGRLSPLVTVDS-LKGQ

Query:  RVNRLLDG-NNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVLPRTKICI
         V+ L+    NHW + L+   F P D   I +IP  E  S D+  W   K G ++VKS Y      +  +Q S SN++  +++WK +W+ +V  + ++ +
Subjt:  RVNRLLDG-NNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVLPRTKICI

Query:  WKIVKDIIPSKANILKKGVSLNPLC-------------------------------------------------------DLAITAITMWMIWSSRNR--
        W+ +KD +PSK N+ KK V  +P+C                                                       D+ + + T W IW+ RN+  
Subjt:  WKIVKDIIPSKANILKKGVSLNPLC-------------------------------------------------------DLAITAITMWMIWSSRNR--

Query:  -DQPDSRERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTGSPI
         +QP  +     V ++         +   L  + P+ S        W KP    +K+N D +   +K + GIG +V +  G P+
Subjt:  -DQPDSRERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTGSPI

A0A2N9J809 Uncharacterized protein5.1e-21237.58Show/hide
Query:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSW-EDTP
        +GF+G  FTW  N+S  + T ERLDR +  +  ++   + +V HL+   SDH+PI     W    + V P +   + EE W    G  +T ++SW +   
Subjt:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSW-EDTP

Query:  EVSTYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNL-MNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKASC
         V  Y+  +KIQE    + +W+K+   GSIK  I   E +++     S +  +H  +   +  L  LL ++E  W+ RSR +WL+ GD+NT++FHSKA+ 
Subjt:  EVSTYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNL-MNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKASC

Query:  RKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRY
        R+R+N +  + D  G WT  ++ V  +   Y+ SL ++ +P+ I  +KV+D I   + D+    L   F+ +E+ +A+K M P KAPGPDG   + YQ+Y
Subjt:  RKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRY

Query:  WDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQA
        W ++G D  +  L  LN  + L+ +N T+I+LIPK K+P+ ++EFRPISLCNVIYK+I K + NR+K+IL  ++S SQSAF+PGR I+DN+LV FE +  
Subjt:  WDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQA

Query:  LNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREE
        ++ +K GK G M LKLDMSKAYDRVEW +L+ +M++M F + W   +M C+++V+YSI+ING P    KPSRGLRQGDPLSPYLFL  AEGF AL++R +
Subjt:  LNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREE

Query:  SLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQVGR
            + G+ +++  P LTHLFFADDS++F KA   ++  ++ +L++Y+ ASGQ IN  K+T   SK+    DQ      LG+   +    YLG+PS VGR
Subjt:  SLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQVGR

Query:  NKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMT
         K  +F +IK+RV   L+GWKERL S  G+E+LIKAVAQAIPTY MSCFRLP  +  EI+ L  RFWWG  GD+ + HWM W+ +CK+K  GGLG R + 
Subjt:  NKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMT

Query:  LFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWI---NRTGRLSPLVTVDSLK
        +FN+A+LAKQ WR++ N  +L +++ + KYF   + LEA   +  S+ W+SI     L  KG  WRVG+ + I I +D W+   +    LSP      L 
Subjt:  LFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWI---NRTGRLSPLVTVDSLK

Query:  GQRVNRLLD-GNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVLPRTKI
           V +L+D  +N WK+ ++  SF P + E I+ IP    ++ D  +W   + GV+SV+S YH  +E +      PS+    +++W + W+ KV  + + 
Subjt:  GQRVNRLLD-GNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVLPRTKI

Query:  CIWKIVKDIIPSKANILKKGVSLNPLC-------------------------------------------------------DLAITAITMWMIWSSRNR
         +W+   + +P++ N+  + V  +P C                                                       +L + AIT W IW  RN 
Subjt:  CIWKIVKDIIPSKANILKKGVSLNPLC-------------------------------------------------------DLAITAITMWMIWSSRNR

Query:  DQPDSRERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLA---NQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTG
         + + +      L+      +SE   F   Q  P  +   +   +   W  P  G +K+N D ++ E+  + GIG V+ +  G
Subjt:  DQPDSRERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLA---NQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTG

A0A803QH07 Uncharacterized protein2.6e-20841.17Show/hide
Query:  FSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWEDTPEVS
        F+G+ FTW +N+++ E  +ERLD    N   ++  + +   HL+F+ SDHR I ++I   +  QQ    K   + E+ WLK E +    K +W+      
Subjt:  FSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWEDTPEVS

Query:  TYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNH--EALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKASCRK
          +F   +      + RW+  +  G +K  I++ +K + +L+  ++        L Q+EK L+ LL  EE YW  RSR DWL+ GDKNTK+FH+KAS RK
Subjt:  TYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNH--EALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKASCRK

Query:  RKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRYWD
          N IK + ++ G     +E++ ++  +Y++ L +S      SL+ V +AI STI       L  PFS  E+  A++SM+P K+PG DG  AM YQ YWD
Subjt:  RKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRYWD

Query:  ILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQALN
        I+G    ++ LG LN    +  LN + I+LIPK  +P  M ++RPISLCNVIYK+I KTI  R + +L  +IS +QSAF+  R I+DN+LV FE I  L 
Subjt:  ILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQALN

Query:  KRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREESL
         + +G++G+  LKLDMSKA+DRVEW +LE +M KM F+  W   +M+C+++ ++S  +NG      +P RGLRQGDPLSPYLFL+ +EG S LL+ EE+ 
Subjt:  KRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREESL

Query:  SNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQVGRNK
         ++ GL+L +  P+++HL FADDS++F +A  +   ALKR L  Y +ASGQ +N DKS    S N     Q +    L +  ++    YLG+PS  GR+K
Subjt:  SNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQVGRNK

Query:  KEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMTLF
        +E F  IK++V K+L  W E++FS+GGKEVL+KAV Q+IPTY MSCF+L    C++++ + A FWWGT  +  + HW  W ++CK+K  GG+GFR    F
Subjt:  KEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMTLF

Query:  NQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGRLSPLVTVDSLKGQRVN
        NQA+LAKQ+WRI   PD+LL ++L+ +YF   +FL+A IG++ S TW+SICWGR+L +KG R++VGN + I   KD WI       P V+        V+
Subjt:  NQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGRLSPLVTVDSLKGQRVN

Query:  RLLDGNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVLPRTKICIWKIV
         L++ N  W   LL   F P DVE I++IP    +++D +IW +   G ++VKS +HLA       Q+S S+A      WK  WN  + P+ +I  WK++
Subjt:  RLLDGNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVLPRTKICIWKIV

Query:  KDIIPSKANILKKGVSLNPLCDL
        + I+P  A + K+ V  +  C L
Subjt:  KDIIPSKANILKKGVSLNPLCDL

M5VU98 Reverse transcriptase domain-containing protein1.9e-21439.79Show/hide
Query:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIK---LEESWLKFEGSKQTFKSSWED
        +GF+G +FTW + +      R RLDR +  +   +      V+HL+  +SDH PIL+      +I+     K+  +    E  W      ++T K  WE 
Subjt:  MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIK---LEESWLKFEGSKQTFKSSWED

Query:  TPEVS-TYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCS-NLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSK
           +       KKI++    + RW+K    G IK        K+  L +   +    E     +K L+ LL + E YW  RSRE+WLK+GDKNT +FH K
Subjt:  TPEVS-TYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCS-NLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSK

Query:  ASCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIY
        A+ R+R+N IKG+ D  G W    + +  I   YF  L  S+  S   ++++  A+   +  D +  L   FS +EI+ A+  M PSKAPGPDG   + Y
Subjt:  ASCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIY

Query:  QRYWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFEC
        Q+YW I+G+D +     FL   E L  LN TF++LIPK K+P+ M + RPISLCNV+Y+I  KT+ANRMK ++ S+IS SQSAF+PGR I+DN +V FE 
Subjt:  QRYWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFEC

Query:  IQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLR
           L +R++G+ G + LKLDMSKAYDRVEW FLE++M  M F   W R VM CV++V+YS ++NG P     P+RGLRQGDPLSPYLFLL AEGF+ LL 
Subjt:  IQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLR

Query:  REESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQ
        + E    + G+ + +  P ++HLFFADDS +F KA       LK + E Y+ ASGQ IN  KS    S N+  + Q      LG+ R +S   YLG+P  
Subjt:  REESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQ

Query:  VGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFR
        +GRNK   FR +K+RV K LQGW+E+  S+ GKEVL+K VAQ+IP Y MSCF LP  +C EI+++ ARFWWG  G+  + HWM W  +CK K  GG+GFR
Subjt:  VGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFR

Query:  QMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGRLSPLVT-VDSL
         +  FN AMLAKQ WR++ NP +L  ++L+ KYF   NF EA +G+  S  W+SI   R +   G R+++G+   + I  D W+ R    + + + +D +
Subjt:  QMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGRLSPLVT-VDSL

Query:  KGQRVNRLL--DGNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQ-ASPSNAENTSRLWKSIWNAKVLPR
        +  +V+ L+  +G+  W  + L   F P DV DI+ IP   ++  D I+W YDK G+F+VKSAY +A+   S  +  S S+  +T  LW+ IWNA V  +
Subjt:  KGQRVNRLL--DGNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQ-ASPSNAENTSRLWKSIWNAKVLPR

Query:  TKICIWKIVKDIIPSKANILKKGVSLNPLC--------------DLAITAITMWMIWSSRNRDQPDSRERTPDVLIQAISNHISERKNFYLKQSFP-KRS
         KI  W++  DI+P+KAN++KKGV +  +C               +   A+  W I S   R      +R+P  ++     ++ E   F      P K +
Subjt:  TKICIWKIVKDIIPSKANILKKGVSLNPLC--------------DLAITAITMWMIWSSRNRDQPDSRERTPDVLIQAISNHISERKNFYLKQSFP-KRS

Query:  RSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTG
          + +   WA P  G  K N D +++   G+G +G V  D+ G
Subjt:  RSLANQAMWAKPKLGSWKLNSDASWNEKKGQGGIGWVVHDSTG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein5.5e-4625.94Show/hide
Query:  TRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPT--KNIIKLEESWLKFEGSKQTFKSSWE-----DTPEVSTYNFSKKIQE
        T  ++D  + +  +LSK +  + E +  + SDH  I L +   +  Q    T   N + L + W+  E  K   K  +E     DT   + ++  K +  
Subjt:  TRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPT--KNIIKLEESWLKFEGSKQTFKSSWE-----DTPEVSTYNFSKKIQE

Query:  G---MINMSRWNKERLK-GSIKGAIARKEKKIQRLSKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDW-LKSGDKNTKWFHSKASCRKRKNEIKG
        G    +N  +  +ER K  ++   +   EK+ Q  SK S     + + +   +L+  +E ++T  K+     W  +  +K  +        ++ KN+I  
Subjt:  G---MINMSRWNKERLK-GSIKGAIARKEKKIQRLSKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDW-LKSGDKNTKWFHSKASCRKRKNEIKG

Query:  ILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMR-LDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRYWDILGEDT
        I +++G+ T    ++      Y+K L ++   +   +    D       + +++  L+ P +  EI   I S+   K+PGPDG  A  YQRY + L    
Subjt:  ILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMR-LDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRYWDILGEDT

Query:  IRVCLGFLNHKEALEP--LNKTFISLIPK-SKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQALNKRK
        +++       KE + P    +  I LIPK  +D  K + FRPISL N+  KI+ K +ANR++  +  +I   Q  FIPG Q   N+      IQ +N+ K
Subjt:  IRVCLGFLNHKEALEP--LNKTFISLIPK-SKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQALNKRK

Query:  KGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREESLSNI
             H+ + +D  KA+D+++  F+ + + K+     + + + A     T +I++NG   E F    G RQG PLSP LF +  E  +  +R+E+    I
Subjt:  KGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREESLSNI

Query:  SGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKS-TFMTSKNMKPEDQEWCERALGI--KRSESLGIYL--GMPSQVGR
         G+QL K    L+   FADD +++++      + L +++  + + SG  IN+ KS  F+ + N + E Q   E    I  KR + LGI L   +      
Subjt:  SGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKS-TFMTSKNMKPEDQEWCERALGI--KRSESLGIYL--GMPSQVGR

Query:  NKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIK--AVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQ
        N K   + IK+   K    WK    S  G+  ++K   + + I  +     +LP    +E+++   +F W     +     +S     KNK  GG+    
Subjt:  NKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIK--AVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQ

Query:  MTLFNQAMLAKQSWRIIKNPD
          L+ +A + K +W   +N D
Subjt:  MTLFNQAMLAKQSWRIIKNPD

P08548 LINE-1 reverse transcriptase homolog9.0e-4924.37Show/hide
Query:  SKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPT--KNIIKLEESWLKFEGSKQTFKSSWEDTPEVSTYNFSKKIQE
        S    T  ++D  + +   LSK + +++    F  SDH  I + ++    +     T   N + L+++W+  E  K+  K   ++  + + Y        
Subjt:  SKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPT--KNIIKLEESWLKFEGSKQTFKSSWEDTPEVSTYNFSKKIQE

Query:  GMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSG------DKNTKWFHSKAS---------
           N+    K  L+G    A+    KK +R  + +NLM H    + E+         +   K+R+  + +++       +K+  WF  K +         
Subjt:  GMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSG------DKNTKWFHSKAS---------

Query:  CRKR--KNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMR-LDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMI
         RK+  K+ I  I +     T    ++ KI + Y+K L S  + +   + +  +A        +++  L+ P S  EI   I+++   K+PGPDG  +  
Subjt:  CRKR--KNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMR-LDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMI

Query:  YQRYWDILGEDTIRVCLGFLNHKEALEPLNKTF----ISLIPK-SKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNV
        YQ +     E+ + + L    + E    L  TF    I+LIPK  KDP + + +RPISL N+  KI+ K + NR++  +  II   Q  FIPG Q   N+
Subjt:  YQRYWDILGEDTIRVCLGFLNHKEALEPLNKTF----ISLIPK-SKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNV

Query:  LVGFECIQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEG
              IQ +NK K     HM L +D  KA+D ++  F+   ++K+     + + + A  S  T +I++NGV  + F    G RQG PLSP LF +  E 
Subjt:  LVGFECIQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEG

Query:  FSALLRREESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKS-TFMTSKNMKPED--QEWCERALGIKRSESL
         +  +R E++   I G+ +      +    FADD +++++        L  V++EY   SG  IN  KS  F+ + N + E   ++     +  K+ + L
Subjt:  FSALLRREESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKS-TFMTSKNMKPED--QEWCERALGIKRSESL

Query:  GIYLGMPSQVGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIK--AVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMC
        G+YL     V    KE +  ++  + + +  WK    S  G+  ++K   + +AI  +     + P +   +++++   F W     +     +S     
Subjt:  GIYLGMPSQVGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIK--AVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMC

Query:  KNKVRGGLGFRQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNF----LEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWIN
        KNK  GG+    + L+ ++++ K +W   KN +  ++  +  +      +     + PI N        I WG+D     + W     N + I + L ++
Subjt:  KNKVRGGLGFRQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNF----LEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWIN

Query:  RTGRLSPLVTVDSLKGQRVNRLLDGNNHWKKEL
            LSPL  +DS              HW K+L
Subjt:  RTGRLSPLVTVDSLKGQRVNRLLDGNNHWKKEL

P0C2F6 Putative ribonuclease H protein At1g657504.8e-3427.27Show/hide
Query:  MPSQVGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGG
        MP    R  K+ F  I +RV   + GW+E+  S  G+  L KAV  ++P ++MS   LP +I + +D+L   F WG+  +K + H + W+ +C  K  GG
Subjt:  MPSQVGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGG

Query:  LGFRQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKY----FKDGNFLEAPIGNASSMTWRSICWG-RDLFLKGYRWRVGNENLITIDKDLWINRTGRLS
        LG R     N+A+++K  WR+++  ++L   +L+ KY     +D  +L  P G+ SS TWRSI  G RD+   G  W  G+   I    D W++      
Subjt:  LGFRQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKY----FKDGNFLEAPIGNASSMTWRSICWG-RDLFLKGYRWRVGNENLITIDKDLWINRTGRLS

Query:  PLVTVDSLKGQRVNRLLDGNNHWKKELLV-------QSFSPQDVED--------IMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPS
        PL+ +D+  G+R     D +    K+L +           P    +        ++++ TG   +RD + W++ + G FSV+SAY +             
Subjt:  PLVTVDSLKGQRVNRLLDGNNHWKKELLV-------QSFSPQDVED--------IMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPS

Query:  NAENTSRLWKSIWNAKVLPRTKICIWKIVKDIIPSKANILKKGVSLNPLCDL
           N +  +  +W  +V  R K  +W +    + ++    ++ +S + +C +
Subjt:  NAENTSRLWKSIWNAKVLPRTKICIWKIVKDIIPSKANILKKGVSLNPLCDL

P11369 LINE-1 retrotransposable element ORF2 protein1.0e-4425.24Show/hide
Query:  SKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPT----KNIIKLEESWLKFEGSKQTFKSSWE-DTPEVSTYNFSKK
        S    T  ++D  + +   L++ +N+++  +    SDH    L + + + I    PT     N   L ++ +K EG K+  K   E +  E +TY     
Subjt:  SKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPT----KNIIKLEESWLKFEGSKQTFKSSWE-DTPEVSTYNFSKK

Query:  IQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSG------DKNTKWFHSK--------
              N+    K  L+G +    A K+K+    +  S+L  H    + ++         +   KLR   + +++       ++   WF  K        
Subjt:  IQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSG------DKNTKWFHSK--------

Query:  ---ASCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAI-CSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAH
               + K  I  I +E+G+ T   E++     +++K L S+   +   + K  D      +  DQ   L+ P S +EIE  I S+   K+PGPDG  
Subjt:  ---ASCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAI-CSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAH

Query:  AMIYQRYWDILGEDTIRVCLGFLNHKEALEPLNKTF----ISLIPK-SKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQIS
        A  YQ +     ED I +     +  E    L  +F    I+LIPK  KDP K++ FRPISL N+  KI+ K +ANR++  + +II   Q  FIPG Q  
Subjt:  AMIYQRYWDILGEDTIRVCLGFLNHKEALEPLNKTF----ISLIPK-SKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQIS

Query:  DNVLVGFECIQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLF
         N+      I  +NK K     HM + LD  KA+D+++  F+ +++E+      +   + A  S    +I +NG   E      G RQG PLSPYLF + 
Subjt:  DNVLVGFECIQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLF

Query:  AEGFSALLRREESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKS-TFMTSKNMKPEDQ--EWCERALGIKRS
         E  +  +R+++    I G+Q+ K    ++ L  ADD +++I         L  ++  + +  G  IN +KS  F+ +KN + E +  E    ++     
Subjt:  AEGFSALLRREESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKS-TFMTSKNMKPEDQ--EWCERALGIKRS

Query:  ESLGIYLGMPSQVGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIK--AVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWA
        + LG+ L    +V     + F+ +K  +++ L+ WK+   S  G+  ++K   + +AI  +     ++P    +E++    +F W      N+   ++ +
Subjt:  ESLGIYLGMPSQVGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIK--AVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWA

Query:  NMCKNKVRGGLGFRQMTLFNQAMLAKQSW
         +   +  GG+    + L+ +A++ K +W
Subjt:  NMCKNKVRGGLGFRQMTLFNQAMLAKQSW

P14381 Transposon TX1 uncharacterized 149 kDa protein3.3e-4324.6Show/hide
Query:  FTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSID----------W---GSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSS
        FT+ R +     ++ R+DR  ++S ++S+A++  +    F  SDH  + L +           W    S ++     K++      W  F+    T  + 
Subjt:  FTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSID----------W---GSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSS

Query:  WEDTPEVSTYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHS
        W D  +V      ++  + +        E L G +     R      +  +C  L   EAL   E       + +     +RSR   L   D+ +++F++
Subjt:  WEDTPEVSTYNFSKKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHS

Query:  KASCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMI
            +  + +I  +  E G   E  E +   A +++++L S +  S  + +++ D +   + + +K RL+ P + +E+ +A++ M  +K+PG DG     
Subjt:  KASCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMI

Query:  YQRYWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFE
        +Q +WD LG D  RV        E      +  +SL+PK  D + ++ +RP+SL +  YKI+ K I+ R+K++LA +I   QS  +PGR I DNV   F 
Subjt:  YQRYWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFE

Query:  CIQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALL
            L+  ++  +    L LD  KA+DRV+  +L   ++  +F   +   +    +S    + IN          RG+RQG PLS  L+ L  E F  LL
Subjt:  CIQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSNDWTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALL

Query:  RREESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPE-------DQEWCERALGIKRSESLG
        R+      ++GL L +    +    +ADD +I +  +  ++E  +   E Y  AS   IN  KS+ +   ++K +       D  W  + +     + LG
Subjt:  RREESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASGQTINLDKSTFMTSKNMKPE-------DQEWCERALGIKRSESLG

Query:  IYLGMPS-QVGRNKKEAFRRIKDRVEKILQGWK--ERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMS
        +YL      V +N    F  +++ V   L  WK   ++ S+ G+ ++I  +  +   Y + C        ++I R    F W  +G     HW+S
Subjt:  IYLGMPS-QVGRNKKEAFRRIKDRVEKILQGWK--ERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.8e-1826.3Show/hide
Query:  GDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSS----WEDTPE
        G  +TW+ ++      R +LDR + N    S   +          SDH P ++       I + +P ++  K    +  F  +  TF  S    WE+   
Subjt:  GDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSS----WEDTPE

Query:  VSTYNFS-----KKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRL--SKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFH
        V ++ FS     K  ++    ++R     ++   K A+   E    +L  +   +L   E +  A K         E++++ +SR  WL+ GD NT++FH
Subjt:  VSTYNFS-----KKIQEGMINMSRWNKERLKGSIKGAIARKEKKIQRL--SKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFH

Query:  SKASCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHP--SKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAH
              + KN IK +  +     E    V ++  AY+  LL S+    +  S++++ D       D    RL    S +EI  A+ +M  +KAPGPD   
Subjt:  SKASCRKRKNEIKGILDERGNWTEKEEDVGKIASAYFKSLLSSNHP--SKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAH

Query:  AMIYQRYWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKII
        A  +   W ++ + TI     F      L+  N T I+LIPK     ++  FRP+S C V+YKII
Subjt:  AMIYQRYWDILGEDTIRVCLGFLNHKEALEPLNKTFISLIPKSKDPKKMQEFRPISLCNVIYKII

AT3G09510.1 Ribonuclease H-like superfamily protein3.9e-1524.88Show/hide
Query:  LRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGRLSPLVTVDSLKGQRVNRLLDGNN---HWKKELLVQSFSP
        ++ +YFKD + L+A +    S  W S+  G  L  KG R  +G+   I I  D  ++ +    PL T ++ K   +N L +       W    + Q    
Subjt:  LRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGRLSPLVTVDSLKGQRVNRLLDGNN---HWKKELLVQSFSP

Query:  QDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVLPRTKICIWKIVKDIIPSKANILKKGVSLNPL
         D   I  I   +    D+IIW Y+  G ++V+S Y L     S++  + +    +  L   IWN  ++P+ K  +W+ +   + +   +  +G+ ++P 
Subjt:  QDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLWKSIWNAKVLPRTKICIWKIVKDIIPSKANILKKGVSLNPL

Query:  C
        C
Subjt:  C

AT4G29090.1 Ribonuclease H-like superfamily protein1.3e-3927.89Show/hide
Query:  AIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEA
        A+PTYTM+CF LP  +C +I  + A FWW    +    HW +W ++   K  GG+GF+ +  FN A+L KQ WR++  P++L+ K+ + +YF   + L A
Subjt:  AIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEA

Query:  PIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWIN--------RTGRLSPLVTVDSLKGQRVNRLLD-GNNHWKKELLVQSFSPQDVEDIMN
        P+G+  S  W+SI   +++  +G R  VGN   I I +  W++        R  R+ P          +V+ L+D     W+K+++   F   + + I  
Subjt:  PIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWIN--------RTGRLSPLVTVDSLKGQRVNRLLD-GNNHWKKELLVQSFSPQDVEDIMN

Query:  IPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVE-YNSSSQASPSNAENTSRLWKSIWNAKVLPRTKICIWKIVKDIIPSKANILKKGVSLNPLC
        +  G +   D   W Y   G ++VKS Y +  +  N  S     +  + + +++ IW ++  P+ +  +WK + + +P    +  + +S    C
Subjt:  IPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVE-YNSSSQASPSNAENTSRLWKSIWNAKVLPRTKICIWKIVKDIIPSKANILKKGVSLNPLC

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.7e-3140.79Show/hide
Query:  AIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKV-RGGLGFRQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLE
        A+P Y MSCFRL   +C ++      FWW +  +K +  W++W  +CK+K   GGLGFR +  FNQA+LAKQS+RII  P  LL ++LR +YF   + +E
Subjt:  AIPTYTMSCFRLPNNICSEIDRLCARFWWGTVGDKNRNHWMSWANMCKNKV-RGGLGFRQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLE

Query:  APIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGRLSPL
          +G   S  WRSI  GR+L  +G    +G+     +  D WI     L PL
Subjt:  APIGNASSMTWRSICWGRDLFLKGYRWRVGNENLITIDKDLWINRTGRLSPL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)5.7e-1451.47Show/hide
Query:  MINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREESLSNISGLQLNKFCPNLTHLFFADDS
        +ING PQ    PSRGLRQGDPLSPYLF+L  E  S L RR +    + G++++   P + HL FADD+
Subjt:  MINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREESLSNISGLQLNKFCPNLTHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTTTTCGGGTGACAGGTTCACGTGGGCGAGGAACAAATCCAAAAAGGAAGCTACAAGAGAAAGACTGGACCGGTTCATGATGAACTCGAAGATGCTTTCTAAAGC
CAGAAATGTGAAAGTTGAGCATCTAAACTTCCACCAATCGGATCACAGGCCTATCCTGTTGAGTATCGATTGGGGGAGTAAAATCCAGCAAGTTATTCCGACCAAAAACA
TCATTAAGCTGGAAGAAAGTTGGCTGAAGTTTGAAGGCAGCAAACAGACTTTCAAAAGCTCTTGGGAGGACACTCCAGAAGTTTCCACTTACAATTTCAGCAAAAAAATC
CAAGAAGGGATGATTAATATGAGCAGATGGAATAAAGAAAGGCTAAAGGGTTCTATCAAGGGGGCCATTGCCCGGAAAGAAAAAAAAATTCAAAGGCTTTCCAAGTGCAG
CAACCTTATGAACCATGAAGCCTTATGTCAAGCTGAGAAAGATTTAGAAATGTTGCTGGAAGAAGAGGAAACATACTGGAAATTGAGATCGAGAGAAGACTGGCTTAAAA
GTGGGGATAAGAATACAAAATGGTTTCACTCCAAGGCCTCGTGTAGAAAAAGGAAAAACGAAATCAAGGGCATTCTAGATGAAAGAGGGAATTGGACTGAAAAAGAGGAA
GATGTAGGCAAAATTGCTTCAGCCTATTTCAAAAGCTTACTCAGTTCAAACCATCCCAGCAAGATTAGCTTAAAGAAAGTTTCAGATGCCATTTGCTCTACAATCTTCGA
TGACCAAAAAATGAGGTTGGATATTCCTTTCTCCAAGGAAGAAATTGAAAAAGCTATTAAGAGTATGAATCCTTCTAAAGCCCCCGGGCCAGATGGAGCTCATGCAATGA
TATATCAAAGGTATTGGGACATCCTTGGGGAAGACACGATAAGAGTCTGCCTCGGCTTTCTTAATCACAAGGAGGCTTTGGAACCACTGAACAAAACTTTCATATCTCTT
ATCCCAAAATCCAAAGATCCAAAGAAGATGCAAGAATTTAGGCCTATAAGCTTGTGTAATGTGATATACAAGATTATAGTCAAGACTATTGCTAATAGAATGAAAACCAT
TCTGGCATCGATCATTTCTTCTTCTCAGTCAGCTTTCATCCCCGGAAGACAAATATCAGACAATGTTTTAGTGGGATTTGAATGCATTCAAGCGCTTAATAAGAGGAAAA
AAGGAAAAGTGGGGCATATGACTTTAAAATTGGATATGAGTAAGGCTTACGACAGGGTCGAATGGGCCTTTTTGGAAGAAATCATGGAAAAAATGAATTTCAGCAATGAT
TGGACAAGAAGAGTTATGGCTTGTGTTTCCTCGGTCACTTATTCCATCATGATCAACGGTGTCCCCCAAGAAGAATTCAAGCCAAGTAGGGGTCTGAGACAAGGAGATCC
TCTGTCTCCGTACCTCTTTCTCCTATTTGCTGAAGGGTTCTCAGCTCTTCTTAGAAGAGAGGAAAGCTTATCCAACATCTCAGGGCTTCAACTCAATAAATTTTGTCCTA
ACCTAACTCATCTCTTTTTTGCAGATGATAGCATGATTTTTATCAAAGCAGAAGCTAAGGAAATAGAAGCTCTCAAAAGAGTTCTGGAGGAGTACAAACAAGCTTCAGGC
CAGACCATCAATCTGGACAAATCAACCTTTATGACCAGCAAGAATATGAAGCCTGAAGATCAAGAGTGGTGTGAAAGAGCCCTGGGTATCAAAAGATCGGAGTCTCTAGG
AATTTACCTAGGGATGCCGTCTCAAGTTGGGAGAAACAAAAAAGAAGCTTTTAGAAGGATCAAAGACAGAGTTGAAAAAATCCTCCAAGGATGGAAAGAGAGGCTCTTCT
CTCTTGGAGGAAAAGAAGTCTTGATAAAAGCAGTGGCCCAAGCAATCCCGACCTATACTATGAGTTGTTTTCGCCTCCCTAACAATATTTGTTCTGAGATTGACAGGTTG
TGTGCCAGGTTCTGGTGGGGGACGGTAGGAGACAAAAACAGAAATCATTGGATGAGTTGGGCAAATATGTGCAAGAATAAAGTCCGAGGAGGTTTAGGGTTCAGACAAAT
GACTCTTTTCAACCAAGCCATGCTAGCTAAGCAAAGCTGGAGAATCATAAAGAACCCCGATAACTTGCTATTCAAAATCCTAAGAGGCAAGTATTTTAAGGATGGGAACT
TCCTGGAGGCTCCCATAGGAAATGCTTCGTCTATGACTTGGAGAAGCATATGTTGGGGTAGAGATTTGTTCCTAAAAGGCTACAGATGGCGGGTTGGGAATGAGAATCTT
ATCACGATAGACAAAGACCTCTGGATAAACAGGACTGGCAGGTTAAGTCCCTTGGTAACGGTGGATTCCTTAAAGGGGCAGAGAGTCAATCGACTTCTAGATGGAAACAA
CCACTGGAAAAAAGAGCTGCTAGTTCAATCCTTTTCCCCTCAAGACGTAGAGGATATCATGAATATTCCTACAGGGGAGAAAAGTTCTAGAGATGAGATAATCTGGAGGT
ATGATAAGAAAGGTGTTTTCTCAGTCAAGAGTGCCTATCACTTAGCCGTGGAGTACAATTCGAGCAGCCAAGCCTCTCCGTCCAATGCTGAAAATACTTCTAGACTGTGG
AAGAGCATCTGGAATGCAAAAGTGCTTCCTAGAACTAAAATTTGTATTTGGAAGATTGTCAAAGATATCATTCCTAGCAAAGCGAATATTCTAAAAAAGGGAGTTTCTCT
AAATCCCTTGTGCGATTTAGCTATAACTGCTATCACAATGTGGATGATTTGGTCGTCTAGGAATAGAGATCAACCAGACTCAAGAGAAAGAACACCAGATGTTCTCATCC
AAGCAATCAGCAACCATATTTCTGAAAGGAAGAATTTTTACCTCAAGCAAAGTTTCCCAAAAAGATCGAGGAGCCTTGCGAATCAGGCGATGTGGGCTAAGCCGAAGCTG
GGTAGCTGGAAGCTGAATTCCGACGCGTCTTGGAACGAGAAGAAAGGCCAAGGCGGTATCGGCTGGGTCGTGCATGATTCGACCGGATCCCCAATCTGTTTTGGATTGAG
ACAAATTCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCTTTTCGGGTGACAGGTTCACGTGGGCGAGGAACAAATCCAAAAAGGAAGCTACAAGAGAAAGACTGGACCGGTTCATGATGAACTCGAAGATGCTTTCTAAAGC
CAGAAATGTGAAAGTTGAGCATCTAAACTTCCACCAATCGGATCACAGGCCTATCCTGTTGAGTATCGATTGGGGGAGTAAAATCCAGCAAGTTATTCCGACCAAAAACA
TCATTAAGCTGGAAGAAAGTTGGCTGAAGTTTGAAGGCAGCAAACAGACTTTCAAAAGCTCTTGGGAGGACACTCCAGAAGTTTCCACTTACAATTTCAGCAAAAAAATC
CAAGAAGGGATGATTAATATGAGCAGATGGAATAAAGAAAGGCTAAAGGGTTCTATCAAGGGGGCCATTGCCCGGAAAGAAAAAAAAATTCAAAGGCTTTCCAAGTGCAG
CAACCTTATGAACCATGAAGCCTTATGTCAAGCTGAGAAAGATTTAGAAATGTTGCTGGAAGAAGAGGAAACATACTGGAAATTGAGATCGAGAGAAGACTGGCTTAAAA
GTGGGGATAAGAATACAAAATGGTTTCACTCCAAGGCCTCGTGTAGAAAAAGGAAAAACGAAATCAAGGGCATTCTAGATGAAAGAGGGAATTGGACTGAAAAAGAGGAA
GATGTAGGCAAAATTGCTTCAGCCTATTTCAAAAGCTTACTCAGTTCAAACCATCCCAGCAAGATTAGCTTAAAGAAAGTTTCAGATGCCATTTGCTCTACAATCTTCGA
TGACCAAAAAATGAGGTTGGATATTCCTTTCTCCAAGGAAGAAATTGAAAAAGCTATTAAGAGTATGAATCCTTCTAAAGCCCCCGGGCCAGATGGAGCTCATGCAATGA
TATATCAAAGGTATTGGGACATCCTTGGGGAAGACACGATAAGAGTCTGCCTCGGCTTTCTTAATCACAAGGAGGCTTTGGAACCACTGAACAAAACTTTCATATCTCTT
ATCCCAAAATCCAAAGATCCAAAGAAGATGCAAGAATTTAGGCCTATAAGCTTGTGTAATGTGATATACAAGATTATAGTCAAGACTATTGCTAATAGAATGAAAACCAT
TCTGGCATCGATCATTTCTTCTTCTCAGTCAGCTTTCATCCCCGGAAGACAAATATCAGACAATGTTTTAGTGGGATTTGAATGCATTCAAGCGCTTAATAAGAGGAAAA
AAGGAAAAGTGGGGCATATGACTTTAAAATTGGATATGAGTAAGGCTTACGACAGGGTCGAATGGGCCTTTTTGGAAGAAATCATGGAAAAAATGAATTTCAGCAATGAT
TGGACAAGAAGAGTTATGGCTTGTGTTTCCTCGGTCACTTATTCCATCATGATCAACGGTGTCCCCCAAGAAGAATTCAAGCCAAGTAGGGGTCTGAGACAAGGAGATCC
TCTGTCTCCGTACCTCTTTCTCCTATTTGCTGAAGGGTTCTCAGCTCTTCTTAGAAGAGAGGAAAGCTTATCCAACATCTCAGGGCTTCAACTCAATAAATTTTGTCCTA
ACCTAACTCATCTCTTTTTTGCAGATGATAGCATGATTTTTATCAAAGCAGAAGCTAAGGAAATAGAAGCTCTCAAAAGAGTTCTGGAGGAGTACAAACAAGCTTCAGGC
CAGACCATCAATCTGGACAAATCAACCTTTATGACCAGCAAGAATATGAAGCCTGAAGATCAAGAGTGGTGTGAAAGAGCCCTGGGTATCAAAAGATCGGAGTCTCTAGG
AATTTACCTAGGGATGCCGTCTCAAGTTGGGAGAAACAAAAAAGAAGCTTTTAGAAGGATCAAAGACAGAGTTGAAAAAATCCTCCAAGGATGGAAAGAGAGGCTCTTCT
CTCTTGGAGGAAAAGAAGTCTTGATAAAAGCAGTGGCCCAAGCAATCCCGACCTATACTATGAGTTGTTTTCGCCTCCCTAACAATATTTGTTCTGAGATTGACAGGTTG
TGTGCCAGGTTCTGGTGGGGGACGGTAGGAGACAAAAACAGAAATCATTGGATGAGTTGGGCAAATATGTGCAAGAATAAAGTCCGAGGAGGTTTAGGGTTCAGACAAAT
GACTCTTTTCAACCAAGCCATGCTAGCTAAGCAAAGCTGGAGAATCATAAAGAACCCCGATAACTTGCTATTCAAAATCCTAAGAGGCAAGTATTTTAAGGATGGGAACT
TCCTGGAGGCTCCCATAGGAAATGCTTCGTCTATGACTTGGAGAAGCATATGTTGGGGTAGAGATTTGTTCCTAAAAGGCTACAGATGGCGGGTTGGGAATGAGAATCTT
ATCACGATAGACAAAGACCTCTGGATAAACAGGACTGGCAGGTTAAGTCCCTTGGTAACGGTGGATTCCTTAAAGGGGCAGAGAGTCAATCGACTTCTAGATGGAAACAA
CCACTGGAAAAAAGAGCTGCTAGTTCAATCCTTTTCCCCTCAAGACGTAGAGGATATCATGAATATTCCTACAGGGGAGAAAAGTTCTAGAGATGAGATAATCTGGAGGT
ATGATAAGAAAGGTGTTTTCTCAGTCAAGAGTGCCTATCACTTAGCCGTGGAGTACAATTCGAGCAGCCAAGCCTCTCCGTCCAATGCTGAAAATACTTCTAGACTGTGG
AAGAGCATCTGGAATGCAAAAGTGCTTCCTAGAACTAAAATTTGTATTTGGAAGATTGTCAAAGATATCATTCCTAGCAAAGCGAATATTCTAAAAAAGGGAGTTTCTCT
AAATCCCTTGTGCGATTTAGCTATAACTGCTATCACAATGTGGATGATTTGGTCGTCTAGGAATAGAGATCAACCAGACTCAAGAGAAAGAACACCAGATGTTCTCATCC
AAGCAATCAGCAACCATATTTCTGAAAGGAAGAATTTTTACCTCAAGCAAAGTTTCCCAAAAAGATCGAGGAGCCTTGCGAATCAGGCGATGTGGGCTAAGCCGAAGCTG
GGTAGCTGGAAGCTGAATTCCGACGCGTCTTGGAACGAGAAGAAAGGCCAAGGCGGTATCGGCTGGGTCGTGCATGATTCGACCGGATCCCCAATCTGTTTTGGATTGAG
ACAAATTCATTAA
Protein sequenceShow/hide protein sequence
MGFSGDRFTWARNKSKKEATRERLDRFMMNSKMLSKARNVKVEHLNFHQSDHRPILLSIDWGSKIQQVIPTKNIIKLEESWLKFEGSKQTFKSSWEDTPEVSTYNFSKKI
QEGMINMSRWNKERLKGSIKGAIARKEKKIQRLSKCSNLMNHEALCQAEKDLEMLLEEEETYWKLRSREDWLKSGDKNTKWFHSKASCRKRKNEIKGILDERGNWTEKEE
DVGKIASAYFKSLLSSNHPSKISLKKVSDAICSTIFDDQKMRLDIPFSKEEIEKAIKSMNPSKAPGPDGAHAMIYQRYWDILGEDTIRVCLGFLNHKEALEPLNKTFISL
IPKSKDPKKMQEFRPISLCNVIYKIIVKTIANRMKTILASIISSSQSAFIPGRQISDNVLVGFECIQALNKRKKGKVGHMTLKLDMSKAYDRVEWAFLEEIMEKMNFSND
WTRRVMACVSSVTYSIMINGVPQEEFKPSRGLRQGDPLSPYLFLLFAEGFSALLRREESLSNISGLQLNKFCPNLTHLFFADDSMIFIKAEAKEIEALKRVLEEYKQASG
QTINLDKSTFMTSKNMKPEDQEWCERALGIKRSESLGIYLGMPSQVGRNKKEAFRRIKDRVEKILQGWKERLFSLGGKEVLIKAVAQAIPTYTMSCFRLPNNICSEIDRL
CARFWWGTVGDKNRNHWMSWANMCKNKVRGGLGFRQMTLFNQAMLAKQSWRIIKNPDNLLFKILRGKYFKDGNFLEAPIGNASSMTWRSICWGRDLFLKGYRWRVGNENL
ITIDKDLWINRTGRLSPLVTVDSLKGQRVNRLLDGNNHWKKELLVQSFSPQDVEDIMNIPTGEKSSRDEIIWRYDKKGVFSVKSAYHLAVEYNSSSQASPSNAENTSRLW
KSIWNAKVLPRTKICIWKIVKDIIPSKANILKKGVSLNPLCDLAITAITMWMIWSSRNRDQPDSRERTPDVLIQAISNHISERKNFYLKQSFPKRSRSLANQAMWAKPKL
GSWKLNSDASWNEKKGQGGIGWVVHDSTGSPICFGLRQIH