; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008946 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008946
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:32886998..32889831
RNA-Seq ExpressionLag0008946
SyntenyLag0008946
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]9.4e-19638.56Show/hide
Query:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK
        MN+ L++ FT  E+   L Q+ P+K+PG DG+   F++ +W +VG  V + CLQILN   S    N TLI +IPKV+ P   S+FRPISLC  VYK+++K
Subjt:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK

Query:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL
         I NR+K +L+ +I+  QSAF+P R ++DN +  +E ++ ++G + G+     LKLDM+KAYDRVEW FL  MMLK+GF+  WV  +  CIST  +S   
Subjt:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL

Query:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS
         G+ +G+I P RGLRQG PLSPYLFLIC EG SC+LR AE    L G  +++G P+++H  FADDS+LF KA  ++   L +L   YE  +GQ INY KS
Subjt:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS

Query:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH
        A+S SPN      + I  +  +  V+CH+ YLGLP+   + R+     +KD++W+H+ GWK KL S  G+E LIK+V+QA+P Y+M+CF++PK L  +L+
Subjt:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH

Query:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR
         I+ARFWW  + +   IHW+ W+ +  +                        R+L++P SL+ R+ +A+Y     FL+A++G+ PSFIWRSL WG+ LL 
Subjt:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR

Query:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGYRL
          LRWR+G+G S+ +Y + WL + S  +I SPP+LPL ++VC LF++SGQW+   +  +F   E  AIL IP+      D  IWHYE NG++SV+SGYRL
Subjt:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGYRL

Query:  A--QMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQSI
        A  +   + G PS+       +WK +W L IP+KI+ FLWR   + LP G  L  R I    +C  C R  ESV+H  W C+ A++ W  S +G++ +  
Subjt:  A--QMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQSI

Query:  PWDPSVSFLHILQDIKDLMGWLKFEEVAVF---LWTLWNNRNMRKFQGKEASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPAS----WSPPRPGYF
          +      H LQ           EE  +F    W LWN RN   F+GK + + +   +    L+ + +  +  +    G   SP +    W PP     
Subjt:  PWDPSVSFLHILQDIKDLMGWLKFEEVAVF---LWTLWNNRNMRKFQGKEASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPAS----WSPPRPGYF

Query:  KINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTED-------LSELGVLAA
              +  +G+++   GV++R+  G                   E  A  EGL  A D+GF   ILE D+      + S  E        L E+  L  
Subjt:  KINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTED-------LSELGVLAA

Query:  ALRGSMLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL
          R  + H      +TPR GN  AH LA+ AF     + W+EE PS +  V+ ++ L
Subjt:  ALRGSMLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]3.6e-20339.85Show/hide
Query:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK
        MN  L++ FT  E+   L Q+ P+K+PG DG+   F++ +W +VG  V + CLQILN   S    N TLI +IPKV+ P   S+FRPISLC  VYK+++K
Subjt:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK

Query:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL
         I NR+K +L  +I+ NQSAF+P R ++DN +  +E +H ++G + G+     LKLDM+KAYDRVEW FL +MMLK+GF+  WV  +  CIST  +S   
Subjt:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL

Query:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS
         G+ +G+I P RGLRQG PLSPYLFL+C EG SC+LR AE    L G  +++GGP+++H  FADDS+LF KA  +    L +L   YE  SGQ INY KS
Subjt:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS

Query:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH
        A S SPN      + I  +  +  VQCH+KYLGLP+   + R+     +KD++W+H+ GWK KL S  G+E L+K+V+QA+P Y+M+CF++PK L  +L+
Subjt:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH

Query:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR
         I+ARFWW  + +   IHW+ W+ +  +                        R+L++P SL+ R+ +A+Y     FL+A++G+ PSFIWRSL WG+ LL 
Subjt:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR

Query:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGYRL
          LRWR+GNG S+ +Y + WL + S  +I SPP+LPL + VC LF++SGQW+   +  +F   E  A L IP+      D  IWHYE NG++SV+SGYRL
Subjt:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGYRL

Query:  A--QMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQSI
        A  +   + G PS        +WK +W L IP+KI+ FLWR   + LP G  L  R I    +C +C R  ESV+H  W C+ A++ W  S +G++ +  
Subjt:  A--QMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQSI

Query:  PWDPSVSFLHILQDIKDLMGWLKFEEVAVF---LWTLWNNRNMRKFQGK-EASSPVGSWASDYLLSYQTA-QVSLRIRRPVGCMQSPA-SWSPPRPGYFK
          +      H LQ           EE  +F    W LWN RN   F+GK E ++ +    +     +  A  +S  I       Q+P   W PP  G +K
Subjt:  PWDPSVSFLHILQDIKDLMGWLKFEEVAVF---LWTLWNNRNMRKFQGK-EASSPVGSWASDYLLSYQTA-QVSLRIRRPVGCMQSPA-SWSPPRPGYFK

Query:  INVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSMLH
        INVD +  +G+++   GV++R+  G                   E  A  EGL  A D+GF   +LE D+      + S TE+ + +  L       +LH
Subjt:  INVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSMLH

Query:  --ASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL
           +    +TPRSGN  AH LA+ AF     + W+EE P  +  V+ ++ L
Subjt:  --ASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]2.0e-19839.73Show/hide
Query:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK
        MN SLI+ FT  E+  AL Q+HP+K+PGPDG+S  F++ +W +VG D+    L +LN  +S   IN+T I ++PK+++P K SDFRPISLCNVVYKL+SK
Subjt:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK

Query:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL
         + NR+K IL  IIS NQSAF+ GR + DN ++ +E +H L+ ++ GK G+  +KLDMSKAYDRVEW F+ Q+M KMGF  +W+ ++  CI++V YS  +
Subjt:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL

Query:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS
        NG   G+ITP+RGLRQGDP+SPY+FL+CA+G S +L        +SG +I +G P I+H FFADDSLLF KAN QE   L+ +L LYE ASGQ IN +KS
Subjt:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS

Query:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH
        ++ FS NT  + +  + ++        H+KYLGLPS + +++ +  + +K+RV R L GWK KL SVGGRE LIK+V QA+P YTM+CFQ+PK+L  ++ 
Subjt:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH

Query:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR
         ++ RFWW    +  KI W+SW  +  A                        RL+ +P+SL+ ++ KA+Y+ H    QA+LG+ PS+ WRS+  G  ++R
Subjt:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR

Query:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDS--KVCSLFSAS-GQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSG
           RWR+GNG  + I+ + WL +  + ++ SPP+ P D   +V +L      +W    +  +F P EA  ILSIP+     ED+ IW     G FSV+S 
Subjt:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDS--KVCSLFSAS-GQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSG

Query:  YRLAQMV--DLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLW
        Y +A  V  +L    SSS  S    W+ LW L+IP K+RIF W++C+N LPT  NL+ + +++ ++C  CG   ES +H+F +C+VA++ W        W
Subjt:  YRLAQMV--DLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLW

Query:  QSIPWD---PSVSFLHILQDIKDLMGWLKFEEVAVFLWTLWNNRNMRKFQG-KEASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPASWSPPRPGYF
           P D    ++  + I   I D       E   V  W +W NRN   F+   +    +  +A  Y+L ++ A  +     P    QS   W  P PG F
Subjt:  QSIPWD---PSVSFLHILQDIKDLMGWLKFEEVAVFLWTLWNNRNMRKFQG-KEASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPASWSPPRPGYF

Query:  KINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSML
        KINVD +       ++ GVIIRD  G V      +     SV+  E  A+  GLLLA++      I+E+D+L +   ++S  E    LG +   +   + 
Subjt:  KINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSML

Query:  HASC-SFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSE
           C   +   R  N AAH LA+ A     + VW+   P  + +VV+ +
Subjt:  HASC-SFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSE

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata]1.0e-19440.25Show/hide
Query:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK
        MN  L R +T+ EV  AL+Q++P K+PGPDG+   F+++ W   G  VT   L  LN  +SP   NET IV+IPK+  P+  SD+RPISLCNV YK+ SK
Subjt:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK

Query:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL
        AI NR+K  L +IIS  QSAF+ GR + DN ++ +E +H +  ++ GKVG   +KLDMSKAYDRVEW F+ ++M K+GF      +I  CI+TV Y+  +
Subjt:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL

Query:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS
        NG   G I PSRG+RQGDPLSPYLFL+CAEGLS +++ +     + G  I +GGP +SH FFADDSL+F KA   E   L  +L +YE ASGQ +N  K+
Subjt:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS

Query:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH
        ++ FS NT  ++QE I   F    ++ H+KYLGLPS + +N+R T + IK+++ + L GWK KL S  G+E LIK+V  AVP YTM+CF+LP +L  +L 
Subjt:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH

Query:  RIIARFWWNGSAESHKIHWISWDSMW-----------------LAV------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR
         +I +FWW      ++I W+SWD M                  LA+      RL     SL+ RVLKA+YF    F+ A LG+ PS+ WRS++  + L++
Subjt:  RIIARFWWNGSAESHKIHWISWDSMW-----------------LAV------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR

Query:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPR-LPLDSKVCSLF-SASGQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGY
          L+WR+GNG S+ ++ + WL +  S ++ +P   L  D++V  L  S  G+W +  I +VF P EA +I SIP+   +  DK IW    NG+F+VRS Y
Subjt:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPR-LPLDSKVCSLF-SASGQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGY

Query:  RLAQMVDLRGIPS----SSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVAR--KEWLLSNFG
        +LA  V+L  +P+    S  S M  +W+ +W + +P KIR F+WR C N LPT DNL+ R I   ++C DC  + ESV HV W C+ AR  +E     F 
Subjt:  RLAQMVDLRGIPS----SSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVAR--KEWLLSNFG

Query:  HLWQSIPWDPSVSFLHILQDI--KDLMGWLKFEEVAVFLWTLWNNRNMRKFQG-KEASSPVGSWASDYLLSYQTAQVSLRIRRP-VGCMQSPASWSPPRP
         L  S     S+SF+ ++  +  ++ +G     + A   W +W+NRN  +  G ++    + SWA++YL  Y+ A    ++ RP V   Q    W+PPR 
Subjt:  HLWQSIPWDPSVSFLHILQDI--KDLMGWLKFEEVAVFLWTLWNNRNMRKFQG-KEASSPVGSWASDYLLSYQTAQVSLRIRRP-VGCMQSPASWSPPRP

Query:  GYFKINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRG
        G FKINVD +    +     GV+IRD  G +    +            E  A+  GLL A+D+G    +LE DS  ++  L + +   S +  +   ++ 
Subjt:  GYFKINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRG

Query:  -SMLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVV
              S  +S   R GN  AH LAK A S    + W+EE P  I + +
Subjt:  -SMLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVV

XP_030939975.1 uncharacterized protein LOC115964883 [Quercus lobata]6.1e-19538.66Show/hide
Query:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK
        MN+ L R FT+ EV  AL+Q+ P  +PG DG+S  FY++ W  +G DV    L ILN    P  +N T I +IPK++ P KA+DFRPISLCNV+YK+VSK
Subjt:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK

Query:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL
         I NR+K +L  ++S +QSAF+  R + DN ++ +E +H L+ +  GK G+  +KLDMSKAYDRVEW FL ++M K+GF   W+ ++  CI +V +S  +
Subjt:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL

Query:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS
        NG   GN TP+RGLRQGDPLSPYLFL+CAEGL  +++ AE +  + G ++   GP +SH FFADDSLLF +AN QE++ ++ +L  YE ASGQ IN EK+
Subjt:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS

Query:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH
         + FSPNT+  VQE I  L  +     ++KYLGLPSF+ R ++ +  +I++R+W  +QGWK +L S GGRE LIK+V+QA+P +TM CF++PKSL  D+ 
Subjt:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH

Query:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR
         +I +FWW    E+ KIHW+ W  +  +                        RL+ +  SL  +V KA++F + S L   +    S+ W+S+L  RG++R
Subjt:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR

Query:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPR-LPLDSKVCSLFSASGQ-WDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGY
           +WRIG+G+SV I G+ WL    S ++ SP +  P +++VC+L     + W   +I   F P EA AILS+P+    +ED+ IW   +NG ++ +S Y
Subjt:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPR-LPLDSKVCSLFSASGQ-WDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGY

Query:  R-LAQMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQ-
        R L Q  +     +S+ +    +W+ LW L++P+KIR FLWR   + LPT  NL+ RNI     C  CG   E  +H  W C++ ++ W        W+ 
Subjt:  R-LAQMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQ-

Query:  ----SIPWDPSVSFLHILQDIKDLMGWLKFEEVAVFL-WTLWNNRNMRKFQGKE-ASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPASWSPPRPGY
                +   SF  +LQ I        F E+  F+ W++W++RN R+       +  +   A + L  + + Q   R + PV     P  W PP P  
Subjt:  ----SIPWDPSVSFLHILQDIKDLMGWLKFEEVAVFL-WTLWNNRNMRKFQGKE-ASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPASWSPPRPGY

Query:  FKINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRG-S
        +K+N D +       A  GV+IRD  G+V    +      P+V   E  A R  ++ AR+LG    + E DS  +F+LL++    ++  G +    R  +
Subjt:  FKINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRG-S

Query:  MLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL
         +  S +F+ T R GN  A  LAKLA +     VWLEE   +++E+V ++ +
Subjt:  MLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL

TrEMBL top hitse value%identityAlignment
A0A1R3GNW3 Reverse transcriptase1.1e-19438.16Show/hide
Query:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK
        MN  L+  FT+ E+  AL+QIHP+K+PGPDG+   F++  W +VG DVT  CL   +  +     N+T IV+IPKV  P+  + FRPISLCNV+YK++SK
Subjt:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK

Query:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL
         +VNR+K IL   IS +QSAF+PGR + DN ++ +E +H+L+ R++GK G+  LKLDMSKAYDRVEWDFL  +ML+MGF   WV++I  C+ +V +S  +
Subjt:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL

Query:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS
        NG    N  P  GLRQGDPLSPYLFL+C EGLS +L   +T   LSG ++S+ GP +SH FFADDSLLF KAN  ES  +   L +YE  SGQ IN+EKS
Subjt:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS

Query:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH
         + FS N     ++ +  +FA+      +KYLGLP+F+ RN+R   ++IK+R+ + +  W  +  S GGRE +IKSV+QA+P Y MN F  P++L  D++
Subjt:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH

Query:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR
         +I+RFWW    +   I+WI W  +  A                        +LL  P SL+ RVLKA+YF   +FL+A+ G  PSF WRS+L GR LL+
Subjt:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR

Query:  CCLRWRIGNGNSVNIYGENWLSS-GSSLQINSPPRLPLDSKVCSLFSASG-QWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGY
          LRWRIGNG SV I  + W++     +  +   ++P DS V  L    G  WD   I S+F  +EA AI+ IP+   ++ D  +WH+++ GI+SV+SGY
Subjt:  CCLRWRIGNGNSVNIYGENWLSS-GSSLQINSPPRLPLDSKVCSLFSASG-QWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGY

Query:  RLAQMVDLRGIPSSSTSSMVG---WWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWL-LSNF---
        R+  + +L+     +   ++    ++  +W   +P K+R+F WRL    L   D+L  R++DV   C  C +  ESV H    C  A + W  +SNF   
Subjt:  RLAQMVDLRGIPSSSTSSMVG---WWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWL-LSNF---

Query:  --GHLWQSIPWDPSVSFLHILQDIKDLMGWLKFEEVAVFLWTLWNNRNMRKFQGK-EASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPASWSPPRP
           +L+  I  DP         ++     W +   +    W +WN RN   F+ K   S     +A  Y   +   + +  I+ P     SP  W PP  
Subjt:  --GHLWQSIPWDPSVSFLHILQDIKDLMGWLKFEEVAVFLWTLWNNRNMRKFQGK-EASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPASWSPPRP

Query:  GYFKINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRG
         + K+N D +F +   +   G+I R+  G+V    +    N+     AE FA    L  +R++GF   ++E D+L + R ++S++ D S +G      + 
Subjt:  GYFKINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRG

Query:  -SMLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL
          +L +SC      R+GN+ AH LAK        +VW+E+ P  + + ++++ +
Subjt:  -SMLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL

A0A251NPF0 Reverse transcriptase domain-containing protein4.6e-19638.56Show/hide
Query:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK
        MN+ L++ FT  E+   L Q+ P+K+PG DG+   F++ +W +VG  V + CLQILN   S    N TLI +IPKV+ P   S+FRPISLC  VYK+++K
Subjt:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK

Query:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL
         I NR+K +L+ +I+  QSAF+P R ++DN +  +E ++ ++G + G+     LKLDM+KAYDRVEW FL  MMLK+GF+  WV  +  CIST  +S   
Subjt:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL

Query:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS
         G+ +G+I P RGLRQG PLSPYLFLIC EG SC+LR AE    L G  +++G P+++H  FADDS+LF KA  ++   L +L   YE  +GQ INY KS
Subjt:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS

Query:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH
        A+S SPN      + I  +  +  V+CH+ YLGLP+   + R+     +KD++W+H+ GWK KL S  G+E LIK+V+QA+P Y+M+CF++PK L  +L+
Subjt:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH

Query:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR
         I+ARFWW  + +   IHW+ W+ +  +                        R+L++P SL+ R+ +A+Y     FL+A++G+ PSFIWRSL WG+ LL 
Subjt:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR

Query:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGYRL
          LRWR+G+G S+ +Y + WL + S  +I SPP+LPL ++VC LF++SGQW+   +  +F   E  AIL IP+      D  IWHYE NG++SV+SGYRL
Subjt:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGYRL

Query:  A--QMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQSI
        A  +   + G PS+       +WK +W L IP+KI+ FLWR   + LP G  L  R I    +C  C R  ESV+H  W C+ A++ W  S +G++ +  
Subjt:  A--QMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQSI

Query:  PWDPSVSFLHILQDIKDLMGWLKFEEVAVF---LWTLWNNRNMRKFQGKEASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPAS----WSPPRPGYF
          +      H LQ           EE  +F    W LWN RN   F+GK + + +   +    L+ + +  +  +    G   SP +    W PP     
Subjt:  PWDPSVSFLHILQDIKDLMGWLKFEEVAVF---LWTLWNNRNMRKFQGKEASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPAS----WSPPRPGYF

Query:  KINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTED-------LSELGVLAA
              +  +G+++   GV++R+  G                   E  A  EGL  A D+GF   ILE D+      + S  E        L E+  L  
Subjt:  KINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTED-------LSELGVLAA

Query:  ALRGSMLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL
          R  + H      +TPR GN  AH LA+ AF     + W+EE PS +  V+ ++ L
Subjt:  ALRGSMLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL

A0A5E4FZN9 PREDICTED: retrotransposon1.7e-20339.85Show/hide
Query:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK
        MN  L++ FT  E+   L Q+ P+K+PG DG+   F++ +W +VG  V + CLQILN   S    N TLI +IPKV+ P   S+FRPISLC  VYK+++K
Subjt:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK

Query:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL
         I NR+K +L  +I+ NQSAF+P R ++DN +  +E +H ++G + G+     LKLDM+KAYDRVEW FL +MMLK+GF+  WV  +  CIST  +S   
Subjt:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL

Query:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS
         G+ +G+I P RGLRQG PLSPYLFL+C EG SC+LR AE    L G  +++GGP+++H  FADDS+LF KA  +    L +L   YE  SGQ INY KS
Subjt:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS

Query:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH
        A S SPN      + I  +  +  VQCH+KYLGLP+   + R+     +KD++W+H+ GWK KL S  G+E L+K+V+QA+P Y+M+CF++PK L  +L+
Subjt:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH

Query:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR
         I+ARFWW  + +   IHW+ W+ +  +                        R+L++P SL+ R+ +A+Y     FL+A++G+ PSFIWRSL WG+ LL 
Subjt:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR

Query:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGYRL
          LRWR+GNG S+ +Y + WL + S  +I SPP+LPL + VC LF++SGQW+   +  +F   E  A L IP+      D  IWHYE NG++SV+SGYRL
Subjt:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGYRL

Query:  A--QMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQSI
        A  +   + G PS        +WK +W L IP+KI+ FLWR   + LP G  L  R I    +C +C R  ESV+H  W C+ A++ W  S +G++ +  
Subjt:  A--QMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQSI

Query:  PWDPSVSFLHILQDIKDLMGWLKFEEVAVF---LWTLWNNRNMRKFQGK-EASSPVGSWASDYLLSYQTA-QVSLRIRRPVGCMQSPA-SWSPPRPGYFK
          +      H LQ           EE  +F    W LWN RN   F+GK E ++ +    +     +  A  +S  I       Q+P   W PP  G +K
Subjt:  PWDPSVSFLHILQDIKDLMGWLKFEEVAVF---LWTLWNNRNMRKFQGK-EASSPVGSWASDYLLSYQTA-QVSLRIRRPVGCMQSPA-SWSPPRPGYFK

Query:  INVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSMLH
        INVD +  +G+++   GV++R+  G                   E  A  EGL  A D+GF   +LE D+      + S TE+ + +  L       +LH
Subjt:  INVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSMLH

Query:  --ASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL
           +    +TPRSGN  AH LA+ AF     + W+EE P  +  V+ ++ L
Subjt:  --ASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL

A0A7N2L6Z9 Reverse transcriptase domain-containing protein4.1e-19740.59Show/hide
Query:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK
        MN+ L R FT  E+  AL+QIHP+KSPGPDG+S  F++ +W++VG +V+   L +LN  +S   IN+T IV+IPK  +P++ +DFRPISLCNV+YKL+SK
Subjt:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK

Query:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL
         + NR+K  L  II+ NQSAF   R + DN ++ YE +H L+ ++ GK  +   KLDMSKA+DRVEW F+ ++M KMGF   W+ +I  CIS+V YS  +
Subjt:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL

Query:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS
        NG   GNI P+RGLRQGDPLSPYLFL+CAEGLS +L  A   + L+G ++ +G P I+H FFADDSLLF KAN +E   L  +L  YE+ASGQ +N +KS
Subjt:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS

Query:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH
        +I FSPNT  +++E I  +        H KYLGLPS + R+++   + IK+RV   L GWKGKL S GG+E LIK+V QA+P YTM+CF LPKSL  +L 
Subjt:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH

Query:  RIIARFWWNGSAESHKIHWISWDSMW-----------------------LAVRLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR
        +++  FWW    +  K+ WISW  M                         A R+L +P SL  R+LKA+YF +   L A LGS PS+ WRS+     +L+
Subjt:  RIIARFWWNGSAESHKIHWISWDSMW-----------------------LAVRLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR

Query:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLD-SKVCSLFSASGQWDSI-KIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGY
           RWR+GNG  ++I+ + WL S S+ ++ +PPR+  D   V SL     +W  I  I ++F P +A AIL IP+   + +D+ IW     G FSV+S Y
Subjt:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLD-SKVCSLFSASGQWDSI-KIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGY

Query:  RLA--QMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQ
         +A   +     +  SS    +  WK LW+L +P K++IF WR CLN LPT  N+ +R ++    C  C    E + H    C  A   W       LWQ
Subjt:  RLA--QMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQ

Query:  SIPWDPSVSFLHILQDIKDLMGWLKF------EEVAVFL---WTLWNNRNMRKFQGKEASSPVGSW--ASDYLLSYQTAQVSLRIRRPVGCMQSP-ASWS
          P    +  L   +DIK L   L F        + +F    W +W+NRN+R    ++  SP+ +W  A   +  Y  A     I+     MQS    WS
Subjt:  SIPWDPSVSFLHILQDIKDLMGWLKF------EEVAVFL---WTLWNNRNMRKFQGKEASSPVGSW--ASDYLLSYQTAQVSLRIRRPVGCMQSP-ASWS

Query:  PPRPGYFKINVDASFLAGENLATG-GVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLA
         P PG FK+NVD +       ++G GV+IRDE G+V         +    ++ E FAI +GLLLA+++     +LE+D+L     ++S   +     ++ 
Subjt:  PPRPGYFKINVDASFLAGENLATG-GVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLA

Query:  AALRGSMLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSE
          L+   L + CSFS+  R  N  AH LA+ A S+  + VW    P  +S ++ S+
Subjt:  AALRGSMLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSE

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)4.6e-19638.56Show/hide
Query:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK
        MN+ L++ FT  E+   L Q+ P+K+PG DG+   F++ +W +VG  V + CLQILN   S    N TLI +IPKV+ P   S+FRPISLC  VYK+++K
Subjt:  MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSK

Query:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL
         I NR+K +L+ +I+  QSAF+P R ++DN +  +E ++ ++G + G+     LKLDM+KAYDRVEW FL  MMLK+GF+  WV  +  CIST  +S   
Subjt:  AIVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNL

Query:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS
         G+ +G+I P RGLRQG PLSPYLFLIC EG SC+LR AE    L G  +++G P+++H  FADDS+LF KA  ++   L +L   YE  +GQ INY KS
Subjt:  NGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKS

Query:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH
        A+S SPN      + I  +  +  V+CH+ YLGLP+   + R+     +KD++W+H+ GWK KL S  G+E LIK+V+QA+P Y+M+CF++PK L  +L+
Subjt:  AISFSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLH

Query:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR
         I+ARFWW  + +   IHW+ W+ +  +                        R+L++P SL+ R+ +A+Y     FL+A++G+ PSFIWRSL WG+ LL 
Subjt:  RIIARFWWNGSAESHKIHWISWDSMWLAV-----------------------RLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLR

Query:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGYRL
          LRWR+G+G S+ +Y + WL + S  +I SPP+LPL ++VC LF++SGQW+   +  +F   E  AIL IP+      D  IWHYE NG++SV+SGYRL
Subjt:  CCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQWDSIKIHSVFHPDEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGYRL

Query:  A--QMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQSI
        A  +   + G PS+       +WK +W L IP+KI+ FLWR   + LP G  L  R I    +C  C R  ESV+H  W C+ A++ W  S +G++ +  
Subjt:  A--QMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQSI

Query:  PWDPSVSFLHILQDIKDLMGWLKFEEVAVF---LWTLWNNRNMRKFQGKEASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPAS----WSPPRPGYF
          +      H LQ           EE  +F    W LWN RN   F+GK + + +   +    L+ + +  +  +    G   SP +    W PP     
Subjt:  PWDPSVSFLHILQDIKDLMGWLKFEEVAVF---LWTLWNNRNMRKFQGKEASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPAS----WSPPRPGYF

Query:  KINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTED-------LSELGVLAA
              +  +G+++   GV++R+  G                   E  A  EGL  A D+GF   ILE D+      + S  E        L E+  L  
Subjt:  KINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTED-------LSELGVLAA

Query:  ALRGSMLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL
          R  + H      +TPR GN  AH LA+ AF     + W+EE PS +  V+ ++ L
Subjt:  ALRGSMLHASCSFSFTPRSGNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.7e-3527.71Show/hide
Query:  SLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKV-QHPRKASDFRPISLCNVVYKLVSKAI
        SL RP T  E+   +  +   KSPGPDG +  FY+ + E +   + +    I  + + P    E  I++IPK  +   K  +FRPISL N+  K+++K +
Subjt:  SLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKV-QHPRKASDFRPISLCNVVYKLVSKAI

Query:  VNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNLNG
         NR++  +  +I  +Q  FIPG     N       I  +   R+       + +D  KA+D+++  F+++ + K+G    ++ +IR        +  LNG
Subjt:  VNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNLNG

Query:  SRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKSAI
         +L       G RQG PLSP LF I  E L+  +R     + + G  + K    +S   FADD +++ +     +  LL L+  +   SG  IN +KS  
Subjt:  SRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKSAI

Query:  SFSPNTNVQVQEAISQLFAITHVQCHQKYLGL------PSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNC--FQLPKS
        +F  N N Q +  I      T      KYLG+            N +  L  IK+        WK    S  GR  ++K  +     Y  N    +LP +
Subjt:  SFSPNTNVQVQEAISQLFAITHVQCHQKYLGL------PSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNC--FQLPKS

Query:  LIADLHRIIARFWWN
           +L +   +F WN
Subjt:  LIADLHRIIARFWWN

P08548 LINE-1 reverse transcriptase homolog1.4e-3727.67Show/hide
Query:  LIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKV-QHPRKASDFRPISLCNVVYKLVSKAIV
        L RP +S E+   ++ +   KSPGPDG +  FY+   E +   +      I  + + P    E  I +IPK  + P +  ++RPISL N+  K+++K + 
Subjt:  LIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKV-QHPRKASDFRPISLCNVVYKLVSKAIV

Query:  NRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNLNGS
        NR++  +  II  +Q  FIPG     N       I  +   ++       L +D  KA+D ++  F+I+ + K+G    ++ +I    S    +  LNG 
Subjt:  NRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNLNGS

Query:  RLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKSAIS
        +L +     G RQG PLSP LF I  E L+  +R  +  +      I  G   I    FADD +++ +     +  LL ++  Y + SG  IN  KS ++
Subjt:  RLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKSAIS

Query:  FSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRD----TLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQL--PKSLIA
        F    N Q ++ +      T V    KYLG+  ++ ++ +D        ++  +   +  WK    S  GR  ++K  +     Y  N   +  P S   
Subjt:  FSPNTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRD----TLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQL--PKSLIA

Query:  DLHRIIARFWWN
        DL +II  F WN
Subjt:  DLHRIIARFWWN

P0C2F6 Putative ribonuclease H protein At1g657507.8e-3624.33Show/hide
Query:  LPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLHRIIARFWWNGSAESHKIHWISWDSMW-------
        +P    R  +DT   I +RV   + GW+ K  S  GR TL K+V+ ++P ++M+   LP+S++  L ++   F W  +AE  K H + W  +        
Subjt:  LPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLHRIIARFWWNGSAESHKIHWISWDSMW-------

Query:  ----------------LAVRLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRP----SFIWRSLLWG-RGLLRCCLRWRIGNGNSVNIYGENWLSSGSSLQ
                        +  RLLQ  +SL   VL+ +Y  H   ++      P    S  WRS+  G R ++   + W  G+G  +  + + W+S    L+
Subjt:  ----------------LAVRLLQSPSSLLGRVLKAQYFKHNSFLQAQLGSRP----SFIWRSLLWG-RGLLRCCLRWRIGNGNSVNIYGENWLSSGSSLQ

Query:  INSPPRLPLDSKVC---SLFSASGQWDSIKIHSVFHPDEAHAILSIPVGDWV--REDKQIWHYESNGIFSVRSGYRLAQMVDLRGIPSSSTSSMVGWWKG
        +++  R P D        L+     WD  KI   +  +     L   V D V    D+  W +  +G FSVRS Y   +M+ +  +P     +M  ++  
Subjt:  INSPPRLPLDSKVC---SLFSASGQWDSIKIHSVFHPDEAHAILSIPVGDWV--REDKQIWHYESNGIFSVRSGYRLAQMVDLRGIPSSSTSSMVGWWKG

Query:  LWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQSIPWDPSVSFLHILQDIKDLMGWL---
        LW++ +P +++ FLW +    + T +    R++   N+C  C    ES++HV   C      W+        + +P      F       K L  WL   
Subjt:  LWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVARKEWLLSNFGHLWQSIPWDPSVSFLHILQDIKDLMGWL---

Query:  -----KFEEV------AVFLWTLWNNRNMRKF----QGKEASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPASWSPPRPGYFKINVDASFLAGENL
               E++      AV +W  W  R    F    + ++    V  WA +   ++ +  V + I +P   ++    W  P  G+ K+N D +      L
Subjt:  -----KFEEV------AVFLWTLWNNRNMRKF----QGKEASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPASWSPPRPGYFKINVDASFLAGENL

Query:  ATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSMLHA-SCSFSFTPRSG
        A+ G ++RD  G      +   G   S   AE + +  GL  A +       LE DS  +   L +   D   L  L     G +            R  
Subjt:  ATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSMLHA-SCSFSFTPRSG

Query:  NTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL
        N  A GLA  AFS        +  P A+S ++R + L
Subjt:  NTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEAL

P11369 LINE-1 retrotransposable element ORF2 protein3.2e-3728.61Show/hide
Query:  PFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQ-HPRKASDFRPISLCNVVYKLVSKAIVNRM
        P +  E+   +  +   KSPGPDG S  FY+   E +   + +   +I  +   P    E  I +IPK Q  P K  +FRPISL N+  K+++K + NR+
Subjt:  PFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQ-HPRKASDFRPISLCNVVYKLVSKAIVNRM

Query:  KGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNLNGSRLG
        +  + AII  +Q  FIPG     N       IH +   +        + LD  KA+D+++  F+I+++ + G    +++MI+   S    +  +NG +L 
Subjt:  KGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNLNGSRLG

Query:  NITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKSAISFSP
         I    G RQG PLSPYLF I  E L+  +R     + + G  I K    IS    ADD +++       +  LL+L++ +    G  IN  KS ++F  
Subjt:  NITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKSAISFSP

Query:  NTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRD----TLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNC--FQLPKSLIADLH
          N Q ++ I +    + V  + KYLG+   + +  +D        +K  +   L+ WK    S  GR  ++K  +     Y  N    ++P     +L 
Subjt:  NTNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRD----TLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNC--FQLPKSLIADLH

Query:  RIIARFWWN
          I +F WN
Subjt:  RIIARFWWN

P14381 Transposon TX1 uncharacterized 149 kDa protein7.1e-3728.23Show/hide
Query:  PFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSKAIVNRMK
        P T  E+ +ALR +  +KSPG DGL+  F++  W+ +G D  +   +   +   P      ++ ++PK    R   ++RP+SL +  YK+V+KAI  R+K
Subjt:  PFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSKAIVNRMK

Query:  GILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNLNGSRLGN
         +L  +I  +QS  +PGR + DN  L  + +H    RR+G +    L LD  KA+DRV+  +LI  +    F P++V  ++   ++      +N S    
Subjt:  GILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNLNGSRLGN

Query:  ITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKSAISFSPN
        +   RG+RQG PLS  L+ +  E   C+LR     + L+G  + +    +    +ADD +L  + +  +         +Y +AS   IN+ KS+     +
Subjt:  ITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKSAISFSPN

Query:  TNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFI--KDRVWRHLQGWKG--KLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLHRII
          V       +   I+      KYLG+          + +FI  ++ V   L  WKG  K+ S+ GR  +I  +V +   Y + C    +  IA + R +
Subjt:  TNVQVQEAISQLFAITHVQCHQKYLGLPSFMPRNRRDTLSFI--KDRVWRHLQGWKG--KLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLHRII

Query:  ARFWWNGSAESHKIHWIS
          F W G       HW+S
Subjt:  ARFWWNGSAESHKIHWIS

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein1.4e-3527.2Show/hide
Query:  LKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLRCCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQ---WDSIKIHSVFHP
        +KA+YFK  S L A++  + S+ W SLL G  LL+   R  IG+G ++ I  +N + S     +N+      +  + +LF   G    WD  KI      
Subjt:  LKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLRCCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQ---WDSIKIHSVFHP

Query:  DEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGYRLAQMVDLRGIPS-SSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNL
         +   I  I +    + DK IW+Y + G ++VRSGY L        IP+ +     +     +W L I  K++ FLWR     L T + L  R + +   
Subjt:  DEAHAILSIPVGDWVREDKQIWHYESNGIFSVRSGYRLAQMVDLRGIPS-SSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNL

Query:  CSDCGRSGESVVHVFWRCKVARKEWLLSN---FGHLWQSIPWDPSVS-FLHILQDI------KDLMGWLKFEEVAVFLWTLWNNRN---MRKFQGKEASS
        C  C R  ES+ H  + C  A   W LS+     +   S  ++ ++S  L+ +QD       K L  WL        +W +W  RN     KF+   + +
Subjt:  CSDCGRSGESVVHVFWRCKVARKEWLLSN---FGHLWQSIPWDPSVS-FLHILQDI------KDLMGWLKFEEVAVFLWTLWNNRN---MRKFQGKEASS

Query:  PVGSWAS--DYLLSYQTAQVSLRIRRPVGCMQSPASWSPPRPGYFKINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLL
         + + A   D+L + Q+ + +    R +   ++   W  P   Y K N DA F   +  ATGG IIR+  G      +    +  +   AE  A+   L 
Subjt:  PVGSWAS--DYLLSYQTAQVSLRIRRPVGCMQSPASWSPPRPGYFKINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLL

Query:  LARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSMLHASCSFSFTPRSGNTAAHGLAKLAFS-------SRSNMVWLEEW
             G+    +E D   L  L++ ++   S    L      +   AS  F F  R GN  AH LAK   +       S S  +WL+ +
Subjt:  LARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSMLHASCSFSFTPRSGNTAAHGLAKLAFS-------SRSNMVWLEEW

AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.6e-1439.53Show/hide
Query:  IVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMI
        +V R+K ++  +I   Q++FIPGR   DN +   E +H+++ R+ G  GW  LKLD+ KAYDR+ WD+L   ++  GF   W+  I
Subjt:  IVNRMKGILNAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMI

AT4G29090.1 Ribonuclease H-like superfamily protein2.8e-5228.67Show/hide
Query:  AVPCYTMNCFQLPKSLIADLHRIIARFWWNGSAESHKIHWISWD--------------------------SMWLAVRLLQSPSSLLGRVLKAQYFKHNSF
        A+P YTM CF LPK++   +  ++A FWW    E+  +HW +WD                           MW   R+L  P SL+ +V K++YF  +  
Subjt:  AVPCYTMNCFQLPKSLIADLHRIIARFWWNGSAESHKIHWISWD--------------------------SMWLAVRLLQSPSSLLGRVLKAQYFKHNSF

Query:  LQAQLGSRPSFIWRSLLWGRGLLRCCLRWRIGNGNSVNIYGENWLSS---GSSLQINSPPRLPLDS-----KVCSLFSASG-QWDSIKIHSVFHPDEAHA
        L A LGSRPSF+W+S+   + +LR   R  +GNG  + I+   WL S    ++L++   P     S     KV  L   SG +W    I  +F   E   
Subjt:  LQAQLGSRPSFIWRSLLWGRGLLRCCLRWRIGNGNSVNIYGENWLSS---GSSLQINSPPRLPLDS-----KVCSLFSASG-QWDSIKIHSVFHPDEAHA

Query:  ILSIPVGDWVREDKQIWHYESNGIFSVRSGY-RLAQMVDLRGIPSS-STSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDC
        I  +  G     D   W Y S+G ++V+SGY  L Q+++ R  P   S  S+   ++ +W+     KI+ FLW+   N LP    L  R++   + C  C
Subjt:  ILSIPVGDWVREDKQIWHYESNGIFSVRSGY-RLAQMVDLRGIPSS-STSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDC

Query:  GRSGESVVHVFWRCKVARKEWLLSN----FGHLWQSIPWDPSVSFLHILQDIKDLMGWLKFEE-VAVFLWTLWNNRNMRKFQGKEASSPVGSWASDYLLS
            E+V H+ ++C  AR  W +S+     G  W    +   V+   +         W K  + V   LW LW NRN   F+G+E +      A + L  
Subjt:  GRSGESVVHVFWRCKVARKEWLLSN----FGHLWQSIPWDPSVSFLHILQDIKDLMGWLKFEE-VAVFLWTLWNNRNMRKFQGKEASSPVGSWASDYLLS

Query:  YQTAQVSLRIR--------RPVGCMQSPASWSPPRPGYFKINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLG
         +      RIR        +P     S   W PP   + K N DA++         G ++R+E+G V          L SV  AE  A+R  +L      
Subjt:  YQTAQVSLRIR--------RPVGCMQSPASWSPPRPGYFKINVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLG

Query:  FFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSMLHASCSFSFTPRSGNTAAHGLAKLAFS
        +   I E+DS  L  +L++     S    +    R         F F PR GNT A  +A+ + S
Subjt:  FFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSMLHASCSFSFTPRSGNTAAHGLAKLAFS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.8e-1930.2Show/hide
Query:  AVPCYTMNCFQLPKSLIADLHRIIARFWWNGSAESHKIHWISWDSM----------------WL--------AVRLLQSPSSLLGRVLKAQYFKHNSFLQ
        A+P Y M+CF+L K L   L   +  FWW+      KI W++W  +                W         + R++  P +LL R+L+++YF H+S ++
Subjt:  AVPCYTMNCFQLPKSLIADLHRIIARFWWNGSAESHKIHWISWDSM----------------WL--------AVRLLQSPSSLLGRVLKAQYFKHNSFLQ

Query:  AQLGSRPSFIWRSLLWGRGLLRCCLRWRIGNGNSVNIYGENWLSSGSSL
          +G+RPS+ WRS++ GR LL   L   IG+G    ++ + W+   + L
Subjt:  AQLGSRPSFIWRSLLWGRGLLRCCLRWRIGNGNSVNIYGENWLSSGSSL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.7e-1553.62Show/hide
Query:  FNLNGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDS
        F +NG+  G +TPSRGLRQGDPLSPYLF++C E LS + R A+    L G  +S   P I+H  FADD+
Subjt:  FNLNGSRLGNITPSRGLRQGDPLSPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCCAGTTTAATTAGGCCTTTTACATCGCTTGAAGTTCATGAGGCCCTCCGACAAATTCATCCATCTAAATCTCCTGGCCCTGATGGTCTTTCAGGATCCTTTTA
TCGCAATCATTGGGAGTTGGTTGGTGGTGACGTTACACAATGTTGTTTACAGATTTTGAACCAGAGAGTTTCACCGGGGCCAATAAATGAGACGCTGATTGTTATGATTC
CCAAGGTGCAACATCCCAGGAAGGCTTCTGATTTTAGGCCTATATCTCTCTGTAATGTTGTCTATAAGTTAGTCTCTAAGGCCATTGTCAATCGAATGAAAGGAATTTTA
AATGCTATAATTTCCGCTAATCAAAGTGCTTTTATTCCTGGTCGCTGTGTTGTCGATAACGCTATACTGGGCTATGAATGTATTCATGCTCTCCAGGGTCGTCGTTCTGG
TAAAGTTGGTTGGGGTACTCTGAAGTTGGATATGAGCAAGGCCTACGATAGAGTTGAATGGGACTTTTTAATTCAGATGATGTTGAAGATGGGTTTTGCCCCAGAGTGGG
TCGACATGATTCGCCTTTGTATTTCTACTGTGCGTTACTCATTTAATCTGAATGGTTCTCGCCTGGGTAATATTACTCCGTCACGGGGTCTTCGGCAAGGGGACCCTTTA
TCTCCCTACTTGTTTCTTATCTGTGCAGAGGGATTGTCCTGTATTTTAAGACATGCGGAGACCACACGATATTTATCTGGTTTTACTATTTCTAAGGGTGGTCCCACTAT
ATCCCATTTTTTCTTTGCTGACGATAGCCTTCTCTTTTTTAAGGCTAATTGGCAGGAGAGTGCGTTTTTACTTTCCTTGTTACACTTATATGAGTCTGCATCTGGCCAAT
TAATTAATTATGAGAAGTCTGCGATTTCCTTTAGCCCAAATACGAATGTGCAGGTTCAGGAGGCCATTAGCCAACTTTTCGCCATTACTCATGTTCAATGTCACCAGAAG
TATCTTGGCCTTCCGTCTTTCATGCCCCGGAATCGGCGTGATACTTTGAGTTTTATTAAGGATAGGGTTTGGCGTCATCTGCAGGGTTGGAAGGGCAAACTTTTCTCTGT
TGGTGGTCGTGAAACTTTGATCAAATCAGTAGTCCAGGCCGTGCCGTGCTATACTATGAATTGCTTTCAGTTACCAAAATCGCTGATTGCTGATCTACATCGTATTATTG
CTCGATTTTGGTGGAATGGGTCTGCTGAATCTCATAAAATTCATTGGATCAGTTGGGATTCCATGTGGTTGGCGGTGCGGTTGCTCCAGTCTCCCTCCTCTTTGTTAGGG
AGAGTTTTGAAAGCTCAATATTTTAAACACAACAGTTTCCTTCAGGCCCAATTAGGATCACGTCCTTCTTTTATCTGGAGGAGTTTGTTGTGGGGGCGGGGGCTCCTGCG
TTGTTGTCTTCGATGGAGGATTGGCAATGGCAATTCTGTGAATATCTATGGTGAGAATTGGCTTTCTTCTGGATCTTCCTTACAAATCAATTCGCCGCCTAGGCTCCCAT
TGGATAGTAAGGTCTGTTCGTTGTTTTCTGCTTCGGGACAGTGGGATTCTATCAAGATTCATAGTGTTTTCCATCCCGATGAGGCCCATGCCATTCTCTCTATTCCAGTG
GGGGATTGGGTGAGGGAAGATAAGCAAATTTGGCACTATGAGAGTAATGGAATTTTTTCGGTTCGCAGTGGTTACCGACTGGCCCAGATGGTGGATCTTCGAGGTATCCC
TTCCTCCTCTACTTCCTCAATGGTAGGGTGGTGGAAAGGTTTATGGCAGTTGTCTATCCCAAGTAAAATTCGTATATTTCTATGGCGATTATGTCTGAATCGTCTTCCGA
CTGGGGATAATTTGATATTGAGGAATATTGATGTGCCAAATCTTTGTTCTGACTGTGGTAGATCGGGCGAATCTGTGGTTCATGTGTTTTGGCGTTGCAAGGTTGCTAGG
AAAGAGTGGTTGCTGTCAAATTTTGGGCATCTTTGGCAATCGATTCCTTGGGATCCTTCGGTTTCCTTTTTACATATTCTGCAGGACATTAAAGACTTGATGGGCTGGCT
CAAATTTGAGGAGGTGGCCGTTTTCTTATGGACTTTATGGAATAACCGGAACATGCGAAAATTTCAAGGGAAAGAGGCGTCCTCTCCGGTGGGTTCTTGGGCGTCGGACT
ATCTTCTGTCTTATCAGACAGCCCAAGTCTCGTTGCGGATTCGTCGTCCGGTGGGTTGCATGCAATCGCCGGCCTCGTGGTCTCCCCCTCGGCCTGGTTATTTTAAAATA
AATGTGGATGCGAGTTTCTTGGCGGGGGAAAATCTTGCCACCGGTGGGGTTATTATCCGTGACGAACGGGGCGTCGTCTTCCTCACTGCGACTGCTTTCTTTGGGAATCT
TCCGTCCGTAGATTTCGCTGAGGGGTTCGCGATTAGAGAGGGGCTTCTTTTGGCTCGTGATTTGGGTTTCTTTCCGTGGATTCTTGAAACCGACTCTTTGCGTCTATTTC
GTTTGTTATCGTCGGTGACAGAAGATCTGTCGGAGTTGGGCGTTCTTGCGGCAGCTTTGCGAGGCTCCATGTTGCATGCTTCTTGCTCTTTTAGCTTTACTCCTCGTTCT
GGTAATACAGCTGCGCATGGGCTGGCCAAATTAGCTTTCTCTTCTCGTTCAAATATGGTTTGGTTGGAAGAATGGCCATCTGCCATTTCTGAAGTTGTACGTTCTGAAGC
TCTCATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTCCAGTTTAATTAGGCCTTTTACATCGCTTGAAGTTCATGAGGCCCTCCGACAAATTCATCCATCTAAATCTCCTGGCCCTGATGGTCTTTCAGGATCCTTTTA
TCGCAATCATTGGGAGTTGGTTGGTGGTGACGTTACACAATGTTGTTTACAGATTTTGAACCAGAGAGTTTCACCGGGGCCAATAAATGAGACGCTGATTGTTATGATTC
CCAAGGTGCAACATCCCAGGAAGGCTTCTGATTTTAGGCCTATATCTCTCTGTAATGTTGTCTATAAGTTAGTCTCTAAGGCCATTGTCAATCGAATGAAAGGAATTTTA
AATGCTATAATTTCCGCTAATCAAAGTGCTTTTATTCCTGGTCGCTGTGTTGTCGATAACGCTATACTGGGCTATGAATGTATTCATGCTCTCCAGGGTCGTCGTTCTGG
TAAAGTTGGTTGGGGTACTCTGAAGTTGGATATGAGCAAGGCCTACGATAGAGTTGAATGGGACTTTTTAATTCAGATGATGTTGAAGATGGGTTTTGCCCCAGAGTGGG
TCGACATGATTCGCCTTTGTATTTCTACTGTGCGTTACTCATTTAATCTGAATGGTTCTCGCCTGGGTAATATTACTCCGTCACGGGGTCTTCGGCAAGGGGACCCTTTA
TCTCCCTACTTGTTTCTTATCTGTGCAGAGGGATTGTCCTGTATTTTAAGACATGCGGAGACCACACGATATTTATCTGGTTTTACTATTTCTAAGGGTGGTCCCACTAT
ATCCCATTTTTTCTTTGCTGACGATAGCCTTCTCTTTTTTAAGGCTAATTGGCAGGAGAGTGCGTTTTTACTTTCCTTGTTACACTTATATGAGTCTGCATCTGGCCAAT
TAATTAATTATGAGAAGTCTGCGATTTCCTTTAGCCCAAATACGAATGTGCAGGTTCAGGAGGCCATTAGCCAACTTTTCGCCATTACTCATGTTCAATGTCACCAGAAG
TATCTTGGCCTTCCGTCTTTCATGCCCCGGAATCGGCGTGATACTTTGAGTTTTATTAAGGATAGGGTTTGGCGTCATCTGCAGGGTTGGAAGGGCAAACTTTTCTCTGT
TGGTGGTCGTGAAACTTTGATCAAATCAGTAGTCCAGGCCGTGCCGTGCTATACTATGAATTGCTTTCAGTTACCAAAATCGCTGATTGCTGATCTACATCGTATTATTG
CTCGATTTTGGTGGAATGGGTCTGCTGAATCTCATAAAATTCATTGGATCAGTTGGGATTCCATGTGGTTGGCGGTGCGGTTGCTCCAGTCTCCCTCCTCTTTGTTAGGG
AGAGTTTTGAAAGCTCAATATTTTAAACACAACAGTTTCCTTCAGGCCCAATTAGGATCACGTCCTTCTTTTATCTGGAGGAGTTTGTTGTGGGGGCGGGGGCTCCTGCG
TTGTTGTCTTCGATGGAGGATTGGCAATGGCAATTCTGTGAATATCTATGGTGAGAATTGGCTTTCTTCTGGATCTTCCTTACAAATCAATTCGCCGCCTAGGCTCCCAT
TGGATAGTAAGGTCTGTTCGTTGTTTTCTGCTTCGGGACAGTGGGATTCTATCAAGATTCATAGTGTTTTCCATCCCGATGAGGCCCATGCCATTCTCTCTATTCCAGTG
GGGGATTGGGTGAGGGAAGATAAGCAAATTTGGCACTATGAGAGTAATGGAATTTTTTCGGTTCGCAGTGGTTACCGACTGGCCCAGATGGTGGATCTTCGAGGTATCCC
TTCCTCCTCTACTTCCTCAATGGTAGGGTGGTGGAAAGGTTTATGGCAGTTGTCTATCCCAAGTAAAATTCGTATATTTCTATGGCGATTATGTCTGAATCGTCTTCCGA
CTGGGGATAATTTGATATTGAGGAATATTGATGTGCCAAATCTTTGTTCTGACTGTGGTAGATCGGGCGAATCTGTGGTTCATGTGTTTTGGCGTTGCAAGGTTGCTAGG
AAAGAGTGGTTGCTGTCAAATTTTGGGCATCTTTGGCAATCGATTCCTTGGGATCCTTCGGTTTCCTTTTTACATATTCTGCAGGACATTAAAGACTTGATGGGCTGGCT
CAAATTTGAGGAGGTGGCCGTTTTCTTATGGACTTTATGGAATAACCGGAACATGCGAAAATTTCAAGGGAAAGAGGCGTCCTCTCCGGTGGGTTCTTGGGCGTCGGACT
ATCTTCTGTCTTATCAGACAGCCCAAGTCTCGTTGCGGATTCGTCGTCCGGTGGGTTGCATGCAATCGCCGGCCTCGTGGTCTCCCCCTCGGCCTGGTTATTTTAAAATA
AATGTGGATGCGAGTTTCTTGGCGGGGGAAAATCTTGCCACCGGTGGGGTTATTATCCGTGACGAACGGGGCGTCGTCTTCCTCACTGCGACTGCTTTCTTTGGGAATCT
TCCGTCCGTAGATTTCGCTGAGGGGTTCGCGATTAGAGAGGGGCTTCTTTTGGCTCGTGATTTGGGTTTCTTTCCGTGGATTCTTGAAACCGACTCTTTGCGTCTATTTC
GTTTGTTATCGTCGGTGACAGAAGATCTGTCGGAGTTGGGCGTTCTTGCGGCAGCTTTGCGAGGCTCCATGTTGCATGCTTCTTGCTCTTTTAGCTTTACTCCTCGTTCT
GGTAATACAGCTGCGCATGGGCTGGCCAAATTAGCTTTCTCTTCTCGTTCAAATATGGTTTGGTTGGAAGAATGGCCATCTGCCATTTCTGAAGTTGTACGTTCTGAAGC
TCTCATTTAA
Protein sequenceShow/hide protein sequence
MNSSLIRPFTSLEVHEALRQIHPSKSPGPDGLSGSFYRNHWELVGGDVTQCCLQILNQRVSPGPINETLIVMIPKVQHPRKASDFRPISLCNVVYKLVSKAIVNRMKGIL
NAIISANQSAFIPGRCVVDNAILGYECIHALQGRRSGKVGWGTLKLDMSKAYDRVEWDFLIQMMLKMGFAPEWVDMIRLCISTVRYSFNLNGSRLGNITPSRGLRQGDPL
SPYLFLICAEGLSCILRHAETTRYLSGFTISKGGPTISHFFFADDSLLFFKANWQESAFLLSLLHLYESASGQLINYEKSAISFSPNTNVQVQEAISQLFAITHVQCHQK
YLGLPSFMPRNRRDTLSFIKDRVWRHLQGWKGKLFSVGGRETLIKSVVQAVPCYTMNCFQLPKSLIADLHRIIARFWWNGSAESHKIHWISWDSMWLAVRLLQSPSSLLG
RVLKAQYFKHNSFLQAQLGSRPSFIWRSLLWGRGLLRCCLRWRIGNGNSVNIYGENWLSSGSSLQINSPPRLPLDSKVCSLFSASGQWDSIKIHSVFHPDEAHAILSIPV
GDWVREDKQIWHYESNGIFSVRSGYRLAQMVDLRGIPSSSTSSMVGWWKGLWQLSIPSKIRIFLWRLCLNRLPTGDNLILRNIDVPNLCSDCGRSGESVVHVFWRCKVAR
KEWLLSNFGHLWQSIPWDPSVSFLHILQDIKDLMGWLKFEEVAVFLWTLWNNRNMRKFQGKEASSPVGSWASDYLLSYQTAQVSLRIRRPVGCMQSPASWSPPRPGYFKI
NVDASFLAGENLATGGVIIRDERGVVFLTATAFFGNLPSVDFAEGFAIREGLLLARDLGFFPWILETDSLRLFRLLSSVTEDLSELGVLAAALRGSMLHASCSFSFTPRS
GNTAAHGLAKLAFSSRSNMVWLEEWPSAISEVVRSEALI