; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g20260 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g20260
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr3:13659796..13662339
RNA-Seq ExpressionMoc03g20260
SyntenyMoc03g20260
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]1.4e-21749.09Show/hide
Query:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST
        +SCP+T QQ G+ ERKHR IV+ GLTLL  ASL ++FWD++F T VYL NRLP  + H   P+EVLF   P+YSFLKVF C C+P+LRPYN HK++ RS 
Subjt:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST

Query:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPS-SLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNH
         C FLGYS  +KGYKC+S +GRVY+S  V+FNE SFP S      +            L PS S P++SPTM          P P +  S  +  S +++
Subjt:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPS-SLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNH

Query:  IVQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKIL
        IV +   +   A  +  P    ++  P+ + + H    V  IA +S    +  + +                         N H MITR K+GI KPKI 
Subjt:  IVQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKIL

Query:  LAQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIR
        +A     EP SV  ALQ D W KAM  EYDAL RN+TWSLVP P  ++ IGCKWV+K K N DG++  YKARLVAKGFHQ    D+TETFS VVKP+T+R
Subjt:  LAQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIR

Query:  VLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFIL
        V+FT+AL+  W ++Q+D+NNAFL+G L EEVFM QP G   + +  LVCRL K +YGLKQAPRAWFE+L   L + GF ++++  SL  R       ++L
Subjt:  VLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFIL

Query:  VYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRS
        VYVDDI++ GS +  I +++  LN++FSLKDLG++ YFLG++VS+ T+ G+ LSQ KYI D+L KTKM       TP+  G  +    GD   D++ YRS
Subjt:  VYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRS

Query:  TVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQA
        TVGALQY T+T  E+++SVNKVCQFM  PT  HW+ VKRILRYL G L HG+  +K  +L L GF DADW SD DDR+STSG C+F   NLISW SKKQ 
Subjt:  TVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQA

Query:  IISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLS
        I+SRSS E E RSLA + A + W+++L ++L L    PP VWCDNL  V LSAN VLH+RTKH+E+D+YFVR+  +++ ++V H+PS+ Q+AD+ +K +S
Subjt:  IISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLS

Query:  PTRFLSLRSKLNVVDASTIGLRG
         T+F+  R KL + + ST+ LRG
Subjt:  PTRFLSLRSKLNVVDASTIGLRG

CAN83392.1 hypothetical protein VITISV_041406 [Vitis vinifera]1.2e-21649.46Show/hide
Query:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST
        +SCP+TS+Q GI+ERKHRHIV+ GLTLL+ ASL +++W DAF+TAV+LINRLP  V     P E LF  KPNYS LKVF CLC+P LRPYNKHK++ RS+
Subjt:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST

Query:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNHI
        PC FLGYS+ +KGYKCL+  GR+++SR V+F+E  FP +    +    P+   S   + LP  +PL+    P     +   P   A SS         H 
Subjt:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNHI

Query:  VQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLF--PNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKI
        +     S I + Q       ++S  PIL +           A   S+S+L+  P T PL ++S        + PV+   Q    H M+TR K GIFKP +
Subjt:  VQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLF--PNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKI

Query:  LLAQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTI
                EP + + A+    W +AM +E+ AL++N TWSLV  P ++  +GC+WVFK+KRN DGS+S YKARLVAKG+ Q    D+ ETFS VVKPTTI
Subjt:  LLAQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTI

Query:  RVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMD--VQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHC
        RV+  +A++  W +RQ+D+NNAFL+G L EEV+M QPPG D        LVC+L K +YGLKQAPRAWF++L + L   GFS++++  SL  R  +    
Subjt:  RVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMD--VQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHC

Query:  FILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYL
        F+LVYVDDIV+TGSSS  I  +++ L   FSLKDLG+LSYFLG+EV     GG+ LSQ+KYI D+L KTKM  A S+ TPM+ G  +SA  GD   +V+ 
Subjt:  FILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYL

Query:  YRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSK
        YRS VGALQY T+T  EIA+SVNKVCQFM  P   HW+AVKRILRYLNG  + GI+ +    + L GF DADW SD DDR+STSG C+F   +L+SWSSK
Subjt:  YRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSK

Query:  KQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSK
        KQ   SRSSTEAE RSLA +++ ++W+Q+L ++L       P +WCDN+  V LSAN VLHSRTKH+E+D+YFVR+  ++R+L V H+P+  QVAD+F+K
Subjt:  KQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSK

Query:  PLSPTRFLSLRSKLNVVDASTIGLRGG
        PLS   F  LR KL V   +++ L+ G
Subjt:  PLSPTRFLSLRSKLNVVDASTIGLRGG

RHN69202.1 putative RNA-directed DNA polymerase [Medicago truncatula]4.9e-21550.18Show/hide
Query:  SCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRSTP
        +CP+T  Q G VERKHRHIV+TGLTLLSHA + ++FWD AF TA YLINRLP  V    SP  +L    P+Y FLK F C C+P LRPYN HK +  S  
Subjt:  SCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRSTP

Query:  CIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNHIV
        C+FLGYSN++KGYKCL  SGR+++S+ V+FNE  FP       Q +      S LP     S  L +P    F    SH P      S P T   +N   
Subjt:  CIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNHIV

Query:  QSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHS---FNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGS-PPVSQQNQMVNAHSMITRVKAGIFKPK
          T  S  V     +  P T S     S+ SH +    N  PI   S ++    + E   S +SS +  S S PPV  +    N H+M TR K GI +P+
Subjt:  QSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHS---FNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGS-PPVSQQNQMVNAHSMITRVKAGIFKPK

Query:  ILLAQYL-ETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPT
        I     L   EP + KTALQ   W  AM++EY+AL+ N TWSLV  P ++  IGCKWVF++K N DG+++ YKARLVAKGFHQ T  DY ETFS VVKP 
Subjt:  ILLAQYL-ETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPT

Query:  TIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHC
        T+R + TLA+ Y WTL+Q+D+NNAFL+GVL+EEV+M QPPG +      LVC+L K +YGLKQAPRAWFERL   L + GF +S+   SL   H      
Subjt:  TIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHC

Query:  FILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYL
        FILVYVDDI+ITG+S L I  +V  LN++FSLKDLG L YFLG+EV +  SG + LSQ KYI D+L K  M  ANS+ +PM   + +S F      D   
Subjt:  FILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYL

Query:  YRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILF---RKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISW
        +RS VGALQYAT+T  EI+YSVNKVCQF+  P   HW+AVKRILRYL G L+HG++         + + GF DADW SDPDDR+STSG CIF   NL+SW
Subjt:  YRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILF---RKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISW

Query:  SSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADI
         ++KQ +++RSS EAE RSLA  SA ++WIQ+L  +L +     P V+CDNL AV L+ N VLHSRTKH+E+DI+FVR+  +++ L V H+P+  Q AD+
Subjt:  SSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADI

Query:  FSKPLSPTRFLSLRSKLNVVDASTIGLRG
         +KPLS  RFL LR KL V D  T+ L+G
Subjt:  FSKPLSPTRFLSLRSKLNVVDASTIGLRG

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.6e-21849.82Show/hide
Query:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST
        +SCP+TS+Q GI+ERKHRHIV+ GLTLL+ ASL +++W DAF+TAV+LINRLP  V     P E LF  KPNYS LKVF CLC+P LRPYNKHK++ RS+
Subjt:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST

Query:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNHI
        PC FLGYS+ +KGYKCL+  GR+++SR V+F+E  FP +    +    P+   S   + LP  +PL+    P     +   P   A SS         H 
Subjt:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNHI

Query:  VQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKILL
        +     S I + Q       ++S  PIL++    S ++P    SSS     P T PL ++S        + PV+   Q    H M+TR K GIFKPK+  
Subjt:  VQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKILL

Query:  AQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRV
              EP + + A+    W +AM +E+ AL++N TWSLV  P ++  +GC+WVFK+KRN DGS+S YKARLVAKG+ Q    D+ ETFS VVKPTTIRV
Subjt:  AQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRV

Query:  LFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMD--VQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFI
        +  +A++  W +RQ+D+NNAFL+G L EEV+M QPPG D        LVC+L K +YGLKQAPRAWF++L + L   GFS++++  SL  R  +    F+
Subjt:  LFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMD--VQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFI

Query:  LVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYR
        LVYVDDIV+TGSSS  I  +++ L   FSLKDLG+LSYFLG+EV     GG+ LSQ+KYI D+L KTKM  A S+ TPM+ G  +SA  GD   +V+ YR
Subjt:  LVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYR

Query:  STVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQ
        S VGALQY T+T  EIA+SVNKVCQFM  P   HW+AVKRILRYLNG  + GI+ +    + L GF DADW SD DDR+STSG C+F   +L+SWSSKKQ
Subjt:  STVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQ

Query:  AIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPL
           SRSSTEAE RSLA +++ ++W+Q+L ++L       P +WCDN+  V LSAN VLHSRTKH+E+D+YFVR+  ++R+L V H+P+  QVAD+F+KPL
Subjt:  AIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPL

Query:  SPTRFLSLRSKLNVVDASTIGLRGG
        S   F  LR KL V   +++ L+ G
Subjt:  SPTRFLSLRSKLNVVDASTIGLRGG

RVX14937.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.0e-22049.7Show/hide
Query:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST
        +SCP+T QQ G+ ERKHR IV+ GLTLL   SL ++FWD++F T VYL NRLP  V H   P+EVLF   P+YSFLKVF C C+P+LRPYN HK++ RS 
Subjt:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST

Query:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPS-SLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNH
         C FLGYS  +KGYKC+S +GRVY+SR V+FNE SFP S      +  P         L PS S P++SPTM          P P +  S  +  S +++
Subjt:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPS-SLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNH

Query:  IVQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKIL
        IV +   +   A  +  P    ++  P+ + + H    V  IA +S    +  + +                         N H MITR K+GI KPKI 
Subjt:  IVQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKIL

Query:  LAQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIR
        +A     EP SV  ALQ D W KAM  EYDAL RN+TWSLVP P  ++ IGCKWV+K K N DG++  YKARLVAKGFHQ    D+TETFS VVKP+TIR
Subjt:  LAQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIR

Query:  VLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFIL
        V+FT+AL+  W ++Q+D+NNAFL+G L EEVFM QP G   + +  LVCRL K +YGLKQAPRAWFE+L   L + GF ++++  SL  R       ++L
Subjt:  VLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFIL

Query:  VYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRS
        VYVDDI++ GS +  I +++  LN++FSLKDLG++ YFLG++VS+ T+ G+ LSQ KYI D+L KTKM       TP+  G  + A  GD   D++ YRS
Subjt:  VYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRS

Query:  TVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQA
        TVGALQY T+T  E+++SVNKVCQFM  PT  HW+AVKRILRYL G L HG+  +K  +L L GF DADW SD DDR+STSG C+F   NLISW SKKQ 
Subjt:  TVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQA

Query:  IISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLS
         +SRSSTEAE RSLA + A + W+++L ++L L    PP VWCDNL  V LSAN VLH+RTKH+E+D+YFV +  +++ ++V H+PS+ Q+AD+ +K +S
Subjt:  IISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLS

Query:  PTRFLSLRSKLNVVDASTIGLRG
         T+F+  R KL + + ST+ LRG
Subjt:  PTRFLSLRSKLNVVDASTIGLRG

TrEMBL top hitse value%identityAlignment
A0A2N9FKQ2 Integrase catalytic domain-containing protein3.9e-21848.34Show/hide
Query:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST
        +SCP+T QQ G +ERKHRH+V+TGL LLSHA + +++WDDAF+TA YLINRLP  +    +P E LF  KPNY FLKVF C C+P+LRPYNKHK++PRS 
Subjt:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST

Query:  PCIFLGYSNTYKGYKCLSF-SGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLP-LISPTMPFFDRQTSHCPIPIALSSDPQTFSSLN
         C+FLGYS  +KGYKCL   SGR+Y+SR V+F E  FP       Q   P   +       P  LP LI+PT P+  R T+                   
Subjt:  PCIFLGYSNTYKGYKCLSF-SGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLP-LISPTMPFFDRQTSHCPIPIALSSDPQTFSSLN

Query:  HIVQSTQVSSIVAQQSAAPLPPTTSQQ--PILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQN--QMVNAHSMITRVKAGIF
                        +APL P++S    P + DLS       P A S  ASD                +P+ +PP++ Q+   + ++H M+TR KA I 
Subjt:  HIVQSTQVSSIVAQQSAAPLPPTTSQQ--PILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQN--QMVNAHSMITRVKAGIF

Query:  KPK-------------ILLAQ--YLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQ
        KPK              LLA+     +EP    +A++   W +AM  E+DAL++N TW+LVPS   + ++GCKWVF++K  +DG+I  YKARLVAKGFHQ
Subjt:  KPK-------------ILLAQ--YLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQ

Query:  DTDLDYTETFSQVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSN
           +DYTETFS VVKPTT+R + +LAL+  W+LRQ+D+ NAFLHG LSEEV+MTQPPG +       VC+L K +YGLKQAPRAWF RL+ +L + GF+ 
Subjt:  DTDLDYTETFSQVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSN

Query:  SQAVFSLLFRHQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVG
        S +  SLL  H      + L+YVDDI+IT S +  ID ++  L  +F++KDLG L+YFLG+EV  P + G+ LSQ+KYI DIL +TKM EA  +++PM  
Subjt:  SQAVFSLLFRHQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVG

Query:  GSVISAFKGDIFHDVYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKST
         + +S   GD   D  LYRSTVGALQY ++T  +IA+SVNK+ QFMH PT LHWQ+VKR+LRYL   ++ G+  +      LQGF DADW  D DDR+ST
Subjt:  GSVISAFKGDIFHDVYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKST

Query:  SGFCIFFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRL
         G+C+F   NL+SWS KKQA ++RSSTEAE ++LA+V+A + W  TL  +L +S   PP +WCDN+GA +LS+N V H+RTKHVEID +FVRD+   R +
Subjt:  SGFCIFFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRL

Query:  QVCHLPSSAQVADIFSKPLSPTRFLSLRSKLNVVDASTIGLRGG
         +  L S  Q+ADIF+KPLS  RF  LR+KLNVV    +GLRGG
Subjt:  QVCHLPSSAQVADIFSKPLSPTRFLSLRSKLNVVDASTIGLRGG

A0A2N9GUG4 Integrase catalytic domain-containing protein3.9e-21848.34Show/hide
Query:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST
        +SCP+T QQ G +ERKHRH+V+TGL LLSHA + +++WDDAF+TA YLINRLP  +    +P E LF  KPNY FLKVF C C+P+LRPYNKHK++PRS 
Subjt:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST

Query:  PCIFLGYSNTYKGYKCLSF-SGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLP-LISPTMPFFDRQTSHCPIPIALSSDPQTFSSLN
         C+FLGYS  +KGYKCL   SGR+Y+SR V+F E  FP       Q   P   +       P  LP LI+PT P+  R T+                   
Subjt:  PCIFLGYSNTYKGYKCLSF-SGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLP-LISPTMPFFDRQTSHCPIPIALSSDPQTFSSLN

Query:  HIVQSTQVSSIVAQQSAAPLPPTTSQQ--PILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQN--QMVNAHSMITRVKAGIF
                        +APL P++S    P + DLS       P A S  ASD                +P+ +PP++ Q+   + ++H M+TR KA I 
Subjt:  HIVQSTQVSSIVAQQSAAPLPPTTSQQ--PILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQN--QMVNAHSMITRVKAGIF

Query:  KPK-------------ILLAQ--YLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQ
        KPK              LLA+     +EP    +A++   W +AM  E+DAL++N TW+LVPS   + ++GCKWVF++K  +DG+I  YKARLVAKGFHQ
Subjt:  KPK-------------ILLAQ--YLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQ

Query:  DTDLDYTETFSQVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSN
           +DYTETFS VVKPTT+R + +LAL+  W+LRQ+D+ NAFLHG LSEEV+MTQPPG +       VC+L K +YGLKQAPRAWF RL+ +L + GF+ 
Subjt:  DTDLDYTETFSQVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSN

Query:  SQAVFSLLFRHQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVG
        S +  SLL  H      + L+YVDDI+IT S +  ID ++  L  +F++KDLG L+YFLG+EV  P + G+ LSQ+KYI DIL +TKM EA  +++PM  
Subjt:  SQAVFSLLFRHQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVG

Query:  GSVISAFKGDIFHDVYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKST
         + +S   GD   D  LYRSTVGALQY ++T  +IA+SVNK+ QFMH PT LHWQ+VKR+LRYL   ++ G+  +      LQGF DADW  D DDR+ST
Subjt:  GSVISAFKGDIFHDVYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKST

Query:  SGFCIFFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRL
         G+C+F   NL+SWS KKQA ++RSSTEAE ++LA+V+A + W  TL  +L +S   PP +WCDN+GA +LS+N V H+RTKHVEID +FVRD+   R +
Subjt:  SGFCIFFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRL

Query:  QVCHLPSSAQVADIFSKPLSPTRFLSLRSKLNVVDASTIGLRGG
         +  L S  Q+ADIF+KPLS  RF  LR+KLNVV    +GLRGG
Subjt:  QVCHLPSSAQVADIFSKPLSPTRFLSLRSKLNVVDASTIGLRGG

A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-21849.82Show/hide
Query:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST
        +SCP+TS+Q GI+ERKHRHIV+ GLTLL+ ASL +++W DAF+TAV+LINRLP  V     P E LF  KPNYS LKVF CLC+P LRPYNKHK++ RS+
Subjt:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST

Query:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNHI
        PC FLGYS+ +KGYKCL+  GR+++SR V+F+E  FP +    +    P+   S   + LP  +PL+    P     +   P   A SS         H 
Subjt:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNHI

Query:  VQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKILL
        +     S I + Q       ++S  PIL++    S ++P    SSS     P T PL ++S        + PV+   Q    H M+TR K GIFKPK+  
Subjt:  VQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKILL

Query:  AQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRV
              EP + + A+    W +AM +E+ AL++N TWSLV  P ++  +GC+WVFK+KRN DGS+S YKARLVAKG+ Q    D+ ETFS VVKPTTIRV
Subjt:  AQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRV

Query:  LFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMD--VQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFI
        +  +A++  W +RQ+D+NNAFL+G L EEV+M QPPG D        LVC+L K +YGLKQAPRAWF++L + L   GFS++++  SL  R  +    F+
Subjt:  LFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMD--VQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFI

Query:  LVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYR
        LVYVDDIV+TGSSS  I  +++ L   FSLKDLG+LSYFLG+EV     GG+ LSQ+KYI D+L KTKM  A S+ TPM+ G  +SA  GD   +V+ YR
Subjt:  LVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYR

Query:  STVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQ
        S VGALQY T+T  EIA+SVNKVCQFM  P   HW+AVKRILRYLNG  + GI+ +    + L GF DADW SD DDR+STSG C+F   +L+SWSSKKQ
Subjt:  STVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQ

Query:  AIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPL
           SRSSTEAE RSLA +++ ++W+Q+L ++L       P +WCDN+  V LSAN VLHSRTKH+E+D+YFVR+  ++R+L V H+P+  QVAD+F+KPL
Subjt:  AIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPL

Query:  SPTRFLSLRSKLNVVDASTIGLRGG
        S   F  LR KL V   +++ L+ G
Subjt:  SPTRFLSLRSKLNVVDASTIGLRGG

A0A438K147 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-22049.7Show/hide
Query:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST
        +SCP+T QQ G+ ERKHR IV+ GLTLL   SL ++FWD++F T VYL NRLP  V H   P+EVLF   P+YSFLKVF C C+P+LRPYN HK++ RS 
Subjt:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST

Query:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPS-SLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNH
         C FLGYS  +KGYKC+S +GRVY+SR V+FNE SFP S      +  P         L PS S P++SPTM          P P +  S  +  S +++
Subjt:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPS-SLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNH

Query:  IVQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKIL
        IV +   +   A  +  P    ++  P+ + + H    V  IA +S    +  + +                         N H MITR K+GI KPKI 
Subjt:  IVQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKIL

Query:  LAQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIR
        +A     EP SV  ALQ D W KAM  EYDAL RN+TWSLVP P  ++ IGCKWV+K K N DG++  YKARLVAKGFHQ    D+TETFS VVKP+TIR
Subjt:  LAQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIR

Query:  VLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFIL
        V+FT+AL+  W ++Q+D+NNAFL+G L EEVFM QP G   + +  LVCRL K +YGLKQAPRAWFE+L   L + GF ++++  SL  R       ++L
Subjt:  VLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFIL

Query:  VYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRS
        VYVDDI++ GS +  I +++  LN++FSLKDLG++ YFLG++VS+ T+ G+ LSQ KYI D+L KTKM       TP+  G  + A  GD   D++ YRS
Subjt:  VYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRS

Query:  TVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQA
        TVGALQY T+T  E+++SVNKVCQFM  PT  HW+AVKRILRYL G L HG+  +K  +L L GF DADW SD DDR+STSG C+F   NLISW SKKQ 
Subjt:  TVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQA

Query:  IISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLS
         +SRSSTEAE RSLA + A + W+++L ++L L    PP VWCDNL  V LSAN VLH+RTKH+E+D+YFV +  +++ ++V H+PS+ Q+AD+ +K +S
Subjt:  IISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLS

Query:  PTRFLSLRSKLNVVDASTIGLRG
         T+F+  R KL + + ST+ LRG
Subjt:  PTRFLSLRSKLNVVDASTIGLRG

A5BFT3 Integrase catalytic domain-containing protein6.7e-21849.09Show/hide
Query:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST
        +SCP+T QQ G+ ERKHR IV+ GLTLL  ASL ++FWD++F T VYL NRLP  + H   P+EVLF   P+YSFLKVF C C+P+LRPYN HK++ RS 
Subjt:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRST

Query:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPS-SLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNH
         C FLGYS  +KGYKC+S +GRVY+S  V+FNE SFP S      +            L PS S P++SPTM          P P +  S  +  S +++
Subjt:  PCIFLGYSNTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPS-SLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNH

Query:  IVQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKIL
        IV +   +   A  +  P    ++  P+ + + H    V  IA +S    +  + +                         N H MITR K+GI KPKI 
Subjt:  IVQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKIL

Query:  LAQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIR
        +A     EP SV  ALQ D W KAM  EYDAL RN+TWSLVP P  ++ IGCKWV+K K N DG++  YKARLVAKGFHQ    D+TETFS VVKP+T+R
Subjt:  LAQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIR

Query:  VLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFIL
        V+FT+AL+  W ++Q+D+NNAFL+G L EEVFM QP G   + +  LVCRL K +YGLKQAPRAWFE+L   L + GF ++++  SL  R       ++L
Subjt:  VLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFIL

Query:  VYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRS
        VYVDDI++ GS +  I +++  LN++FSLKDLG++ YFLG++VS+ T+ G+ LSQ KYI D+L KTKM       TP+  G  +    GD   D++ YRS
Subjt:  VYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRS

Query:  TVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQA
        TVGALQY T+T  E+++SVNKVCQFM  PT  HW+ VKRILRYL G L HG+  +K  +L L GF DADW SD DDR+STSG C+F   NLISW SKKQ 
Subjt:  TVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQA

Query:  IISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLS
        I+SRSS E E RSLA + A + W+++L ++L L    PP VWCDNL  V LSAN VLH+RTKH+E+D+YFVR+  +++ ++V H+PS+ Q+AD+ +K +S
Subjt:  IISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLS

Query:  PTRFLSLRSKLNVVDASTIGLRG
         T+F+  R KL + + ST+ LRG
Subjt:  PTRFLSLRSKLNVVDASTIGLRG

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.3e-8928.07Show/hide
Query:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLP--CTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKH-KIEP
        ++ P+T Q  G+ ER  R I +   T++S A L   FW +A  TA YLINR+P    V    +P E+    KP    L+VF    Y  ++  NK  K + 
Subjt:  MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLP--CTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKH-KIEP

Query:  RSTPCIFLGYS-NTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRS--FLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTF
        +S   IF+GY  N +K +  +  + +  V+R V+ +E +              +N R+  F  + L  S    +   P   R+      P    ++ +  
Subjt:  RSTPCIFLGYS-NTYKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRS--FLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTF

Query:  SSLNHIVQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHS---FNVPPIAGSSSASDLFPNTEPLCSSSS--SDTLPSGSPPVSQQN------------
         ++  +  S        ++S     P  S++ I ++  + S    N+  +  S  ++  F N           +++  SG+P  S+++            
Subjt:  SSLNHIVQSTQVSSIVAQQSAAPLPPTTSQQPILSDLSHHS---FNVPPIAGSSSASDLFPNTEPLCSSSS--SDTLPSGSPPVSQQN------------

Query:  --------QMVNAHSMITRVKAGI--------FKPKILLAQYLETEPPSVKTALQC----DHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKI
                +++N  S   + K  I            +L A  +  + P+    +Q       W +A+  E +A   N+TW++   P +K ++  +WVF +
Subjt:  --------QMVNAHSMITRVKAGI--------FKPKILLAQYLETEPPSVKTALQC----DHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKI

Query:  KRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGL
        K N  G+   YKARLVA+GF Q   +DY ETF+ V + ++ R + +L + Y   + Q+D+  AFL+G L EE++M  P G+        VC+L K IYGL
Subjt:  KRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGL

Query:  KQAPRAWFERLSLFLHTLGFSNSQA--VFSLLFRHQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQ
        KQA R WFE     L    F NS       +L +    ++ ++L+YVDD+VI       ++     L  +F + DL ++ +F+G+ +       I+LSQ 
Subjt:  KQAPRAWFERLSLFLHTLGFSNSQA--VFSLLFRHQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQ

Query:  KYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRSTVGALQYATL-THLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFR
         Y+  IL K  M   N++STP+           D   +    RS +G L Y  L T  ++  +VN + ++        WQ +KR+LRYL G ++  ++F+
Subjt:  KYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRSTVGALQYATL-THLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFR

Query:  K--PPDLLLQGFADADWTSDPDDRKSTSGFCI-FFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLS
        K    +  + G+ D+DW     DRKST+G+    F  NLI W++K+Q  ++ SSTEAE  +L       +W++ L   + +    P  ++ DN G + ++
Subjt:  K--PPDLLLQGFADADWTSDPDDRKSTSGFCI-FFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLS

Query:  ANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLSPTRFLSLRSKLNVV
         N   H R KH++I  +F R+      + + ++P+  Q+ADIF+KPL   RF+ LR KL ++
Subjt:  ANLVLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLSPTRFLSLRSKLNVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-10130.95Show/hide
Query:  PYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRSTPCI
        P T Q  G+ ER +R IV+   ++L  A L   FW +A  TA YLINR P        P  V    + +YS LKVF C  +  +    + K++ +S PCI
Subjt:  PYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRSTPCI

Query:  FLGYSNTYKGYKCLS-FSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNHIVQ
        F+GY +   GY+       +V  SR V+F E                                                       S+ +T + ++  V+
Subjt:  FLGYSNTYKGYKCLS-FSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNHIVQ

Query:  STQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKILLAQ
        +  + + V         P+TS             N P  A S++        +P       + L  G   V    Q    H  + R +    +P++   +
Subjt:  STQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKILLAQ

Query:  YLET---------EPPSVKTAL---QCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFS
        Y  T         EP S+K  L   + +   KAM++E ++L +N T+ LV  P  K+ + CKWVFK+K++ D  +  YKARLV KGF Q   +D+ E FS
Subjt:  YLET---------EPPSVKTAL---QCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFS

Query:  QVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLF-R
         VVK T+IR + +LA +    + Q+D+  AFLHG L EE++M QP G +V G   +VC+L K +YGLKQAPR W+ +   F+ +  +  + +   + F R
Subjt:  QVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLF-R

Query:  HQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVE-VSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVIS----
          +     +L+YVDD++I G    +I  +   L+  F +KDLG     LG++ V   TS  ++LSQ+KYI  +L +  M  A  +STP+ G   +S    
Subjt:  HQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVE-VSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVIS----

Query:  ----AFKGDIFHDVYLYRSTVGALQYATL-THLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKST
              KG++      Y S VG+L YA + T  +IA++V  V +F+  P   HW+AVK ILRYL G     + F    D +L+G+ DAD   D D+RKS+
Subjt:  ----AFKGDIFHDVYLYRSTVGALQYATL-THLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKST

Query:  SGFCIFFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRL
        +G+   F G  ISW SK Q  ++ S+TEAE  +       ++W++    +L L       V+CD+  A+ LS N + H+RTKH+++  +++R++     L
Subjt:  SGFCIFFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRDLALQRRL

Query:  QVCHLPSSAQVADIFSKPLSPTRF
        +V  + ++   AD+ +K +   +F
Subjt:  QVCHLPSSAQVADIFSKPLSPTRF

P92519 Uncharacterized mitochondrial protein AtMg008107.4e-4944.98Show/hide
Query:  FILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPM---VGGSVISAFKGDIFHD
        ++L+YVDDI++TGSS+ +++ ++  L++ FS+KDLG + YFLG+++    S G+FLSQ KY   IL+   M +   +STP+   +  SV +A     + D
Subjt:  FILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPM---VGGSVISAFKGDIFHD

Query:  VYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISW
           +RS VGALQY TLT  +I+Y+VN VCQ MH PT+  +  +KR+LRY+ G + HG+   K   L +Q F D+DW      R+ST+GFC F   N+ISW
Subjt:  VYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISW

Query:  SSKKQAIISRSSTEAECRSLAHVSANLVW
        S+K+Q  +SRSSTE E R+LA  +A L W
Subjt:  SSKKQAIISRSSTEAECRSLAHVSANLVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.8e-18143.24Show/hide
Query:  SCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRSTP
        S P+T +  G+ ERKHRHIV+TGLTLLSHAS+   +W  AFA AVYLINRLP  +    SP + LFG  PNY  L+VF C CYP LRPYN+HK++ +S  
Subjt:  SCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRSTP

Query:  CIFLGYSNTYKGYKCLSF-SGRVYVSRHVLFNEYSFPA---------------------SPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTS
        C+FLGYS T   Y CL   + R+Y+SRHV F+E  FP                      SPH++  T  P+           ++ P  SP+ PF + Q S
Subjt:  CIFLGYSNTYKGYKCLSF-SGRVYVSRHVLFNEYSFPA---------------------SPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTS

Query:  HCPIPIALS----SDPQTFSSLNHIVQ------STQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPS-
           +  + S    S P+  +   +  Q       TQ  +  +Q ++   P   S   +   LS      P  + SSS S     T    SSS+S T PS 
Subjt:  HCPIPIALS----SDPQTFSSLNHIVQ------STQVSSIVAQQSAAPLPPTTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPS-

Query:  ---GSPPVSQ----QNQM-VNAHSMITRVKAGIFK--PKILLAQYL--ETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDK-KVIGCKWVF
             PP++Q     NQ  +N HSM TR KAGI K  PK  LA  L  E+EP +   AL+ + W  AM  E +A I N TW LVP PP    ++GC+W+F
Subjt:  ---GSPPVSQ----QNQM-VNAHSMITRVKAGIFK--PKILLAQYL--ETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDK-KVIGCKWVF

Query:  KIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIY
          K NSDGS++ YKARLVAKG++Q   LDY ETFS V+K T+IR++  +A+   W +RQ+D+NNAFL G L+++V+M+QPPG   +     VC+L+K +Y
Subjt:  KIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIY

Query:  GLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVS-YPTSGGIFLSQ
        GLKQAPRAW+  L  +L T+GF NS +  SL    +     ++LVYVDDI+ITG+   ++   +  L+ +FS+KD  +L YFLG+E    PT  G+ LSQ
Subjt:  GLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVS-YPTSGGIFLSQ

Query:  QKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFR
        ++YI D+L +T M  A  ++TPM     +S + G    D   YR  VG+LQY   T  +I+Y+VN++ QFMH PT  H QA+KRILRYL G  NHGI  +
Subjt:  QKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFR

Query:  KPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANL
        K   L L  ++DADW  D DD  ST+G+ ++   + ISWSSKKQ  + RSSTEAE RS+A+ S+ + WI +L  +L +    PP ++CDN+GA +L AN 
Subjt:  KPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANL

Query:  VLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLSPTRFLSLRSKLNV
        V HSR KH+ ID +F+R+      L+V H+ +  Q+AD  +KPLS T F +  SK+ V
Subjt:  VLHSRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLSPTRFLSLRSKLNV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.5e-17842.41Show/hide
Query:  SCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRSTP
        S P+T +  G+ ERKHRHIV+ GLTLLSHAS+   +W  AF+ AVYLINRLP  +    SP + LFG  PNY  LKVF C CYP LRPYN+HK+E +S  
Subjt:  SCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRSTP

Query:  CIFLGYSNTYKGYKCLSF-SGRVYVSRHVLFNEYSFP-------ASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQT
        C F+GYS T   Y CL   +GR+Y SRHV F+E  FP        S    Q++ +  N+ S     LP++ PL+ P  P         P P +  S   T
Subjt:  CIFLGYSNTYKGYKCLSF-SGRVYVSRHVLFNEYSFP-------ASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQT

Query:  FSSLNHIVQSTQVSSIVAQQSAAPL---PPTTSQ----------QPILSDLSHH--SFNVP----PIAGSSSASDLFPN-----TEPLCSSSSSDTLPS-
            +  + S+ +SS  + +  AP    P  T+Q           PIL++ + +  S N P    P+  S  +S   P      +EP   SSSS + P  
Subjt:  FSSLNHIVQSTQVSSIVAQQSAAPL---PPTTSQ----------QPILSDLSHH--SFNVP----PIAGSSSASDLFPN-----TEPLCSSSSSDTLPS-

Query:  ----GSPPVSQQNQM--VNAHSMITRVKAGIFKP--KILLAQYL--ETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDK-KVIGCKWVFKI
             +PP+ Q N    VN HSM TR K GI KP  K   A  L   +EP +   A++ D W +AM  E +A I N TW LVP PP    ++GC+W+F  
Subjt:  ----GSPPVSQQNQM--VNAHSMITRVKAGIFKP--KILLAQYL--ETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDK-KVIGCKWVFKI

Query:  KRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGL
        K NSDGS++ YKARLVAKG++Q   LDY ETFS V+K T+IR++  +A+   W +RQ+D+NNAFL G L++EV+M+QPPG   +     VCRL+K IYGL
Subjt:  KRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQGSIPLVCRLKKVIYGL

Query:  KQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKY
        KQAPRAW+  L  +L T+GF NS +  SL    +     ++LVYVDDI+ITG+ ++++   +  L+ +FS+K+   L YFLG+E       G+ LSQ++Y
Subjt:  KQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKY

Query:  ISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPP
          D+L +T M  A  ++TPM     ++   G    D   YR  VG+LQY   T  +++Y+VN++ Q+MH PT  HW A+KR+LRYL G  +HGI  +K  
Subjt:  ISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPP

Query:  DLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLH
         L L  ++DADW  D DD  ST+G+ ++   + ISWSSKKQ  + RSSTEAE RS+A+ S+ L WI +L  +L +    PP ++CDN+GA +L AN V H
Subjt:  DLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLH

Query:  SRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLSPTRFLSLRSKLNVV
        SR KH+ +D +F+R+      L+V H+ +  Q+AD  +KPLS   F +   K+ V+
Subjt:  SRTKHVEIDIYFVRDLALQRRLQVCHLPSSAQVADIFSKPLSPTRFLSLRSKLNVV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.2e-10742.13Show/hide
Query:  EPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRVLFTLAL
        EP +   A +   W  AM DE  A+    TW +   PP+KK IGCKWV+KIK NSDG+I  YKARLVAKG+ Q   +D+ ETFS V K T+++++  ++ 
Subjt:  EPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRVLFTLAL

Query:  AYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQ--GSIP--LVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFILVYV
         Y +TL Q+DI+NAFL+G L EE++M  PPG   +   S+P   VC LKK IYGLKQA R WF + S+ L   GF  S +  +   +        +LVYV
Subjt:  AYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDVQ--GSIP--LVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFILVYV

Query:  DDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRSTVG
        DDI+I  ++   +D + + L + F L+DLG L YFLG+E++  ++ GI + Q+KY  D+L +T +      S PM      SA  G  F D   YR  +G
Subjt:  DDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRSTVG

Query:  ALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQAIIS
         L Y  +T L+I+++VNK+ QF  AP + H QAV +IL Y+ G +  G+ +    ++ LQ F+DA + S  D R+ST+G+C+F   +LISW SKKQ ++S
Subjt:  ALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQAIIS

Query:  RSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRD
        +SS EAE R+L+  +  ++W+   F +L L    P  ++CDN  A+H++ N V H RTKH+E D + VR+
Subjt:  RSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFVRD

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.0e-1335.51Show/hide
Query:  YATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFC----IFFCGNLISWSSKKQAII
        Y T+T  ++ ++VN++ QF  A      QAV ++L Y+ G +  G+ +    DL L+ FAD+DW S PD R+S +GFC    ++F G L   S     ++
Subjt:  YATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFC----IFFCGNLISWSSKKQAII

Query:  SRSSTEA
         R + EA
Subjt:  SRSSTEA

ATMG00810.1 DNA/RNA polymerases superfamily protein5.3e-5044.98Show/hide
Query:  FILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPM---VGGSVISAFKGDIFHD
        ++L+YVDDI++TGSS+ +++ ++  L++ FS+KDLG + YFLG+++    S G+FLSQ KY   IL+   M +   +STP+   +  SV +A     + D
Subjt:  FILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGIFLSQQKYISDILHKTKMHEANSISTPM---VGGSVISAFKGDIFHD

Query:  VYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISW
           +RS VGALQY TLT  +I+Y+VN VCQ MH PT+  +  +KR+LRY+ G + HG+   K   L +Q F D+DW      R+ST+GFC F   N+ISW
Subjt:  VYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLLLQGFADADWTSDPDDRKSTSGFCIFFCGNLISW

Query:  SSKKQAIISRSSTEAECRSLAHVSANLVW
        S+K+Q  +SRSSTE E R+LA  +A L W
Subjt:  SSKKQAIISRSSTEAECRSLAHVSANLVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.2e-2749.6Show/hide
Query:  MITRVKAGIFK--PK--ILLAQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQD
        M+TR KAGI K  PK  + +   ++ EP SV  AL+   W +AM++E DAL RN TW LVP P ++ ++GCKWVFK K +SDG++   KARLVAKGFHQ+
Subjt:  MITRVKAGIFK--PK--ILLAQYLETEPPSVKTALQCDHWAKAMRDEYDALIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQD

Query:  TDLDYTETFSQVVKPTTIRVLFTLA
          + + ET+S VV+  TIR +  +A
Subjt:  TDLDYTETFSQVVKPTTIRVLFTLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTGTCCTTATACTTCTCAACAATACGGAATTGTTGAACGAAAACATCGTCACATTGTTGACACCGGTCTCACCCTACTTTCTCATGCTTCTCTTTCTATAGAGTT
TTGGGATGATGCTTTTGCCACTGCAGTTTATCTTATTAATCGTCTCCCTTGTACAGTTCATCATGGACTTTCACCTATGGAAGTTTTGTTTGGTTTGAAACCTAACTATT
CCTTCTTAAAAGTTTTCAGCTGTTTATGCTATCCTTCATTACGTCCTTACAATAAACACAAGATTGAACCTCGTTCTACTCCTTGTATTTTTCTTGGTTATAGCAACACT
TACAAAGGCTATAAATGTCTTTCATTTTCTGGTCGTGTGTATGTTTCAAGACATGTCCTGTTCAATGAATACTCTTTTCCGGCATCACCTCACTCATCGCAACAAACAAT
TGCCCCTATTAATTTTCGGTCCTTTTTGCCTATTTTGTTGCCATCTTCCCTACCCCTCATCTCACCCACTATGCCCTTTTTTGACCGTCAAACATCTCATTGTCCTATTC
CTATAGCTCTATCTTCTGACCCCCAAACTTTTTCTTCACTTAATCATATTGTCCAATCCACTCAAGTCTCATCTATTGTCGCACAACAATCTGCTGCCCCACTGCCACCT
ACTACCTCCCAACAACCTATTTTGTCTGATCTTTCACATCACTCATTTAATGTCCCACCTATAGCTGGCTCTTCTTCCGCCAGTGATTTGTTTCCTAATACTGAGCCTTT
ATGTAGTTCATCTTCTTCTGATACATTGCCTTCGGGCTCACCTCCAGTATCCCAACAAAATCAAATGGTGAATGCTCATTCCATGATCACTCGAGTTAAGGCGGGAATCT
TCAAGCCTAAGATTTTGCTTGCTCAATATTTGGAAACAGAACCTCCTTCTGTGAAGACTGCACTTCAGTGTGATCATTGGGCCAAGGCAATGCGTGACGAATATGATGCT
TTAATTAGAAATGATACTTGGTCTCTTGTTCCCTCACCACCTGATAAGAAAGTTATTGGTTGCAAATGGGTATTCAAGATCAAAAGGAACTCTGATGGTTCTATCTCTTG
GTATAAAGCTCGTCTTGTAGCTAAAGGTTTTCATCAAGATACTGATCTTGACTATACTGAAACCTTTAGCCAGGTTGTTAAACCCACCACTATTCGTGTGCTGTTTACAC
TTGCTCTTGCTTATGGCTGGACTCTGAGACAGGTGGATATCAACAATGCCTTTCTTCATGGTGTTTTAAGTGAAGAGGTTTTCATGACTCAACCTCCTGGCATGGATGTT
CAAGGATCTATTCCGCTTGTGTGCCGGTTGAAAAAGGTCATTTATGGTCTTAAACAGGCTCCTCGGGCTTGGTTTGAACGACTCAGTTTATTTCTTCATACTCTTGGTTT
CTCTAACTCTCAAGCTGTTTTTTCTCTTCTCTTTCGACATCAGGATGGTCAGCACTGCTTTATCTTGGTTTATGTTGACGATATTGTGATTACTGGTAGTTCTTCCTTAG
TCATTGACACGGTTGTGACTACTCTTAATAATCAGTTTTCTCTTAAGGATCTTGGACAGCTGAGTTACTTTCTTGGTGTTGAGGTGTCATATCCTACATCTGGGGGTATT
TTTCTTTCTCAACAAAAGTATATTTCAGATATACTTCACAAGACAAAAATGCATGAAGCTAATTCTATTTCAACTCCGATGGTTGGGGGCTCTGTGATTTCAGCTTTTAA
AGGTGATATTTTTCATGATGTCTATTTGTATCGTAGTACAGTCGGTGCTTTACAATATGCTACACTTACGCATCTTGAAATTGCATATAGTGTTAATAAGGTGTGCCAAT
TCATGCATGCTCCTACTATGCTGCATTGGCAAGCTGTTAAACGTATTTTACGTTATCTCAATGGAGCACTTAATCATGGTATACTATTTCGCAAGCCACCTGATCTTCTA
CTCCAAGGATTTGCTGATGCCGATTGGACTTCTGACCCAGATGATCGTAAGTCAACATCGGGTTTCTGTATATTTTTTTGTGGGAATTTGATCTCTTGGTCTTCTAAGAA
ACAAGCCATTATTTCTCGTTCTAGTACTGAGGCGGAATGTCGAAGTCTAGCACATGTTTCTGCTAATTTGGTATGGATTCAGACCTTATTTGCTGATTTGGCTTTGTCTT
TTCCTTGTCCACCAACTGTTTGGTGTGATAATCTTGGGGCTGTTCATCTTAGTGCAAATCTTGTTCTTCATTCCAGAACTAAACATGTTGAAATCGATATATATTTTGTT
CGAGATTTAGCTCTTCAGCGACGTTTACAGGTGTGTCATCTTCCTTCGTCTGCTCAAGTGGCTGACATTTTTTCTAAGCCCCTCTCTCCTACTCGATTTCTTTCATTACG
TTCCAAGCTCAATGTTGTGGATGCTTCCACCATTGGCTTGAGGGGGGGGGGGGGGGGGTGGGTGTTAAGAGAGCCCATTGATGAAAACAGAGGATTAGTTGTTATTCTGT
TTTTACTGTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTTGTCCTTATACTTCTCAACAATACGGAATTGTTGAACGAAAACATCGTCACATTGTTGACACCGGTCTCACCCTACTTTCTCATGCTTCTCTTTCTATAGAGTT
TTGGGATGATGCTTTTGCCACTGCAGTTTATCTTATTAATCGTCTCCCTTGTACAGTTCATCATGGACTTTCACCTATGGAAGTTTTGTTTGGTTTGAAACCTAACTATT
CCTTCTTAAAAGTTTTCAGCTGTTTATGCTATCCTTCATTACGTCCTTACAATAAACACAAGATTGAACCTCGTTCTACTCCTTGTATTTTTCTTGGTTATAGCAACACT
TACAAAGGCTATAAATGTCTTTCATTTTCTGGTCGTGTGTATGTTTCAAGACATGTCCTGTTCAATGAATACTCTTTTCCGGCATCACCTCACTCATCGCAACAAACAAT
TGCCCCTATTAATTTTCGGTCCTTTTTGCCTATTTTGTTGCCATCTTCCCTACCCCTCATCTCACCCACTATGCCCTTTTTTGACCGTCAAACATCTCATTGTCCTATTC
CTATAGCTCTATCTTCTGACCCCCAAACTTTTTCTTCACTTAATCATATTGTCCAATCCACTCAAGTCTCATCTATTGTCGCACAACAATCTGCTGCCCCACTGCCACCT
ACTACCTCCCAACAACCTATTTTGTCTGATCTTTCACATCACTCATTTAATGTCCCACCTATAGCTGGCTCTTCTTCCGCCAGTGATTTGTTTCCTAATACTGAGCCTTT
ATGTAGTTCATCTTCTTCTGATACATTGCCTTCGGGCTCACCTCCAGTATCCCAACAAAATCAAATGGTGAATGCTCATTCCATGATCACTCGAGTTAAGGCGGGAATCT
TCAAGCCTAAGATTTTGCTTGCTCAATATTTGGAAACAGAACCTCCTTCTGTGAAGACTGCACTTCAGTGTGATCATTGGGCCAAGGCAATGCGTGACGAATATGATGCT
TTAATTAGAAATGATACTTGGTCTCTTGTTCCCTCACCACCTGATAAGAAAGTTATTGGTTGCAAATGGGTATTCAAGATCAAAAGGAACTCTGATGGTTCTATCTCTTG
GTATAAAGCTCGTCTTGTAGCTAAAGGTTTTCATCAAGATACTGATCTTGACTATACTGAAACCTTTAGCCAGGTTGTTAAACCCACCACTATTCGTGTGCTGTTTACAC
TTGCTCTTGCTTATGGCTGGACTCTGAGACAGGTGGATATCAACAATGCCTTTCTTCATGGTGTTTTAAGTGAAGAGGTTTTCATGACTCAACCTCCTGGCATGGATGTT
CAAGGATCTATTCCGCTTGTGTGCCGGTTGAAAAAGGTCATTTATGGTCTTAAACAGGCTCCTCGGGCTTGGTTTGAACGACTCAGTTTATTTCTTCATACTCTTGGTTT
CTCTAACTCTCAAGCTGTTTTTTCTCTTCTCTTTCGACATCAGGATGGTCAGCACTGCTTTATCTTGGTTTATGTTGACGATATTGTGATTACTGGTAGTTCTTCCTTAG
TCATTGACACGGTTGTGACTACTCTTAATAATCAGTTTTCTCTTAAGGATCTTGGACAGCTGAGTTACTTTCTTGGTGTTGAGGTGTCATATCCTACATCTGGGGGTATT
TTTCTTTCTCAACAAAAGTATATTTCAGATATACTTCACAAGACAAAAATGCATGAAGCTAATTCTATTTCAACTCCGATGGTTGGGGGCTCTGTGATTTCAGCTTTTAA
AGGTGATATTTTTCATGATGTCTATTTGTATCGTAGTACAGTCGGTGCTTTACAATATGCTACACTTACGCATCTTGAAATTGCATATAGTGTTAATAAGGTGTGCCAAT
TCATGCATGCTCCTACTATGCTGCATTGGCAAGCTGTTAAACGTATTTTACGTTATCTCAATGGAGCACTTAATCATGGTATACTATTTCGCAAGCCACCTGATCTTCTA
CTCCAAGGATTTGCTGATGCCGATTGGACTTCTGACCCAGATGATCGTAAGTCAACATCGGGTTTCTGTATATTTTTTTGTGGGAATTTGATCTCTTGGTCTTCTAAGAA
ACAAGCCATTATTTCTCGTTCTAGTACTGAGGCGGAATGTCGAAGTCTAGCACATGTTTCTGCTAATTTGGTATGGATTCAGACCTTATTTGCTGATTTGGCTTTGTCTT
TTCCTTGTCCACCAACTGTTTGGTGTGATAATCTTGGGGCTGTTCATCTTAGTGCAAATCTTGTTCTTCATTCCAGAACTAAACATGTTGAAATCGATATATATTTTGTT
CGAGATTTAGCTCTTCAGCGACGTTTACAGGTGTGTCATCTTCCTTCGTCTGCTCAAGTGGCTGACATTTTTTCTAAGCCCCTCTCTCCTACTCGATTTCTTTCATTACG
TTCCAAGCTCAATGTTGTGGATGCTTCCACCATTGGCTTGAGGGGGGGGGGGGGGGGGTGGGTGTTAAGAGAGCCCATTGATGAAAACAGAGGATTAGTTGTTATTCTGT
TTTTACTGTGTTAG
Protein sequenceShow/hide protein sequence
MSCPYTSQQYGIVERKHRHIVDTGLTLLSHASLSIEFWDDAFATAVYLINRLPCTVHHGLSPMEVLFGLKPNYSFLKVFSCLCYPSLRPYNKHKIEPRSTPCIFLGYSNT
YKGYKCLSFSGRVYVSRHVLFNEYSFPASPHSSQQTIAPINFRSFLPILLPSSLPLISPTMPFFDRQTSHCPIPIALSSDPQTFSSLNHIVQSTQVSSIVAQQSAAPLPP
TTSQQPILSDLSHHSFNVPPIAGSSSASDLFPNTEPLCSSSSSDTLPSGSPPVSQQNQMVNAHSMITRVKAGIFKPKILLAQYLETEPPSVKTALQCDHWAKAMRDEYDA
LIRNDTWSLVPSPPDKKVIGCKWVFKIKRNSDGSISWYKARLVAKGFHQDTDLDYTETFSQVVKPTTIRVLFTLALAYGWTLRQVDINNAFLHGVLSEEVFMTQPPGMDV
QGSIPLVCRLKKVIYGLKQAPRAWFERLSLFLHTLGFSNSQAVFSLLFRHQDGQHCFILVYVDDIVITGSSSLVIDTVVTTLNNQFSLKDLGQLSYFLGVEVSYPTSGGI
FLSQQKYISDILHKTKMHEANSISTPMVGGSVISAFKGDIFHDVYLYRSTVGALQYATLTHLEIAYSVNKVCQFMHAPTMLHWQAVKRILRYLNGALNHGILFRKPPDLL
LQGFADADWTSDPDDRKSTSGFCIFFCGNLISWSSKKQAIISRSSTEAECRSLAHVSANLVWIQTLFADLALSFPCPPTVWCDNLGAVHLSANLVLHSRTKHVEIDIYFV
RDLALQRRLQVCHLPSSAQVADIFSKPLSPTRFLSLRSKLNVVDASTIGLRGGGGGWVLREPIDENRGLVVILFLLC