CuGenDBv2

Sgr024924 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024924
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE2
Genome location: tig00002486:4227567..4230227
RNA-Seq Expression: Sgr024924
Synteny: Sgr024924
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR036397 - Ribonuclease H superfamily
  IPR043502 - DNA/RNA polymerase superfamily
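
The genome location field above follows a contig:start..end pattern. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The names GenomeLocation and parse_location are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location string."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Split a 'contig:start..end' string into contig name and integer bounds."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return GenomeLocation(contig, int(start), int(end))


if __name__ == "__main__":
    loc = parse_location("tig00002486:4227567..4230227")
    print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2661 positions on contig tig00002486.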


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]; e value: 6.8e-268; identity: 53.59%
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL LI  D+WGP++  S +G+RYY+ FVD +SRF+WI+ L++KS+   TF++F+T +E    L I+ +QTD GGEFRA   YL  NGI HR SCP+T QQ
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NG+ ERKHR IV+ GLTLL  ASLPL+FWD++F T VY  NRLP+ +LH   P+E LF + P YSFLK FGC CFP LRPYN+HKLQ+RS  CTFLGYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVT-TQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPA
         HKGYKCM S+GR++IS  V F+E SFPYS     K+  V+ C  + VS     P T  ++ +  P +LS        S       S   +  +DN +  
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVT-TQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPA

Query:  LQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNGQ---------SNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQ
           +PNS  T    A V +       Q  VS I++            N HPM+TR+K GI KPK+ +      EP SV  AL+   W +AM  EY AL +
Subjt:  LQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNGQ---------SNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQ

Query:  NKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMN
        N TWSLVP P+  Q IGCKWV+K K N DG+V +YKARLVAKGFHQ A  D+TETFSPVVKP T+RV+FT+AL+  W ++QLD+NNAFL+G L E+VFM 
Subjt:  NKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMN

Query:  QPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGL
        QP GF D   P  VCRLHKALYGLKQAPRAW+E+L   L+S GF  +K+D SL  R       Y+L+YVDDI++ GS ++ I+SLI+ L+ +FSLKDLG 
Subjt:  QPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGL

Query:  LHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHW
        +HYFLGI+VS+ NN GL LSQ+KY+ DLL K  M    P RTP+ +G  +    G+   D+  YRS VGALQYVTITRPELS+SVNKVCQFM +PT  HW
Subjt:  LHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHW

Query:  QAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVI
        + VKRILRYL+G+   G+ LKK S+L L GF DADWASD DDR+STSG C+ LG NL++W SKKQ ++SRSS E E+RSLA   AE+ WL++LL EL + 
Subjt:  QAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVI

Query:  PSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNVIDSIAIGLRGGVK
         ++PP++WCDNL  V LSANPVLH+RTKH+ELD+YFVR+ V+++ + V+H+P+ DQLAD+ TK +S+  F   R KL + +   + LRG V+
Subjt:  PSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNVIDSIAIGLRGGVK
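
In the alignment blocks, the unlabelled middle row marks each column: a letter where query and subject carry the same residue, `+` for a conservative substitution, and a space otherwise. As a rough sketch, the Python below estimates percent identity for one block from the query row and that middle row alone. The function name midline_identity is hypothetical, and the result covers only the block passed in, so it will not necessarily match the whole-alignment figure reported for the hit.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline repeats the query residue.

    Both strings are the sequence portion only, without the 'Query:' label
    or the leading indentation of the middle row.
    """
    # Trailing spaces on the midline are often lost in copied text; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query row")
    if not query:
        raise ValueError("empty alignment block")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    # Gap columns ('-') stay in the denominator, as in alignment-length identity.
    return 100.0 * identical / len(query)


if __name__ == "__main__":
    # Short illustrative fragment taken from the start of the first block.
    print(midline_identity("PLLLIESD", "PL LI  D"))
```

Summing identical columns and total columns over every block of a hit, then dividing once, gives a figure comparable to the reported % identity.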

CAN83392.1 hypothetical protein VITISV_041406 [Vitis vinifera]; e value: 5.8e-259; identity: 53.1%
Query:  SRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQNGIVERKHRHIVDMGL
        S  GF YYVSF D YSR+TW+Y L++KS     F+ F+   E   G  ++  QTD GGEFR+L  Y   NGI HR SCP+TS+QNGI+ERKHRHIV++GL
Subjt:  SRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQNGIVERKHRHIVDMGL

Query:  TLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSNIHKGYKCMDSSGRIFI
        TLL+QASLPL++W DAFSTAV+ INRLP+ VL    P E LF +KP YS LK FGCLCFP LRPYN HKL FRSSPCTFLGYS+ HKGYKC++  GR+FI
Subjt:  TLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSNIHKGYKCMDSSGRIFI

Query:  SRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSN-------------STGSTSTLPAVDNTLPALQSS
        SR V FDE  FP++   + K   +       +  +P+      ++    + L +  A +S+ L+ N             +T S+ST+P +  +     SS
Subjt:  SRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSN-------------STGSTSTLPAVDNTLPALQSS

Query:  PNSNL--TAPFNADVPTGTDTCTSQPFVSIISNGQSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPS
            L  T P + +     ++  ++P            H MVTRSK+GIFKP V   +    EP + +EA+  P W +AM  E+ AL++NKTWSLV  P+
Subjt:  PNSNL--TAPFNADVPTGTDTCTSQPFVSIISNGQSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPS

Query:  DHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQPPGFTDSST-
        +   +GC+WVFK+KRN DGSVSRYKARLVAKG+ QV   D+ ETFSPVVKP TIRV+  +A++  W +RQLD+NNAFL+G L E+V+M+QPPGF   +  
Subjt:  DHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQPPGFTDSST-

Query:  -PTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEV
            VC+LHKALYGLKQAPRAW+++L   L   GF  +K+D SL  R    S  ++L+YVDDI++TGSSS EI  LIS L   FSLKDLG L YFLGIEV
Subjt:  -PTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEV

Query:  SYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRY
            +GGL LSQ KY+ DLL K  M  A  + TPM+SG  +SA  G+  ++V  YRS+VGALQY+TITRPE+++SVNKVCQFM  P   HW+AVKRILRY
Subjt:  SYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRY

Query:  LKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVIPSRPPILWC
        L G+   G++LK    + L GF DADW SD DDR+STSG C+ LG +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+LL EL    +  P++WC
Subjt:  LKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVIPSRPPILWC

Query:  DNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV
        DN+  V LSANPVLHSRTKH+ELD+YFVR+ V++R+L+V H+P  DQ+AD+FTKPLS   F +LR KL V
Subjt:  DNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]; e value: 1.3e-266; identity: 53.55%
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL L+ SD+WGP+   S  GF YYVSFVD YSR+TW+YFL++KS     F+ F+   E   G  ++  QTD GGEFR+L  Y   NGI HR SCP+TS+Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NGI+ERKHRHIV++GLTLL+QASLPL++W DAFSTAV+ INRLP+ VL    P E LF +KP YS LK FGCLCFP LRPYN HKL FRSSPCTFLGYS+
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSN-------------STGST
         HKGYKC++  GR+FISR V FDE  FP++   + K   +       +  +P+      ++    + L +  A +S+ L+ N             +T S+
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSN-------------STGST

Query:  STLPAVDNTLPALQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNG---QSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNE
        ST+P ++ +     S P+S+        +P  T+  + +P  SI +         H MVTRSK+GIFKPKV   +    EP + +EA+  P W +AM  E
Subjt:  STLPAVDNTLPALQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNG---QSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNE

Query:  YAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILS
        + AL++NKTWSLV  P++   +GC+WVFK+KRN DGSVSRYKARLVAKG+ QV   D+ ETFSPVVKP TIRV+  +A++  W +RQLD+NNAFL+G L 
Subjt:  YAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILS

Query:  EKVFMNQPPGFTDSST--PTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQ
        E+V+M+QPPGF   +      VC+LHKALYGLKQAPRAW+++L   L   GF  +K+D SL  R    S  ++L+YVDDI++TGSSS EI  LIS L   
Subjt:  EKVFMNQPPGFTDSST--PTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQ

Query:  FSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFM
        FSLKDLG L YFLGIEV    +GGL LSQ KY+ DLL K  M  A  + TPM+SG  +SA  G+  ++V  YRS+VGALQY+TITRPE+++SVNKVCQFM
Subjt:  FSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFM

Query:  HSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQA
          P   HW+AVKRILRYL G+   G++LK    + L GF DADW SD DDR+STSG C+ LG +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+
Subjt:  HSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQA

Query:  LLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV
        LL EL    +  P++WCDN+  V LSANPVLHSRTKH+ELD+YFVR+ V++R+L+V H+P  DQ+AD+FTKPLS   F +LR KL V
Subjt:  LLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]; e value: 3.7e-258; identity: 52.54%
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL L+ SD+WGP+   S  GF YYVSFVD YSR+TW+YFL++KS     F+ F+   E   G  ++  QTD GGEFR+L  Y   NGI HR SCP+TS+Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NGI+ERKHRHIV++GLTLL+QASLPL++W DAFSTAV+ INRLP+ VL    P E LF +KP YS LK FGCLCFP LRPYN HKL FRSSPCTFLGYS+
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSN-------------STGST
         HKGYKC++  GR+FISR V FDE  FP++   + K   +       +  +P+      ++    + L +  A +S+ L+ N             +T S+
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSN-------------STGST

Query:  STLPAVDNTLPALQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNG---QSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNE
        ST+P ++ +     S P+S+        +P  T+  + +P  SI +         H MVTRSK+GIFKPKV   +    EP + +EA+  P W +AM  E
Subjt:  STLPAVDNTLPALQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNG---QSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNE

Query:  YAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILS
        + AL++NKTWSLV  P++   +GC+WVFK+KRN DGSVSRYKARLVAKG+ QV   D+ ETFSPVVKP TIRV+  +A++  W +RQLD+NNAFL+G L 
Subjt:  YAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILS

Query:  EKVFMNQPPGFTDSST--PTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQ
        E+V+M+QPPGF   +      VC+LHKALYGLKQAPRAW+++L   L   GF  +K+D SL  R    S  ++L+YVDDI++TGSSS EI  LIS L   
Subjt:  EKVFMNQPPGFTDSST--PTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQ

Query:  FSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFM
        FSLKDLG L YFLGIE                  DLL K  M  A  + TPM+SG  +SA  G+  ++V  YRS+VGALQY+TITRPE+++SVNKVCQFM
Subjt:  FSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFM

Query:  HSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQA
          P   HW+AVKRILRYL G+   G++LK    + L GF DADW SD DDR+STSG C+ LG +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+
Subjt:  HSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQA

Query:  LLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV
        LL EL    +  P++WCDN+  V LSANPVLHSRTKH+ELD+YFVR+ V++R+L+V H+P  DQ+AD+FTKPLS   F +LR KL V
Subjt:  LLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV

RVX14937.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]; e value: 6.6e-271; identity: 54.25%
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL LI SD+WGP+   S +G+RYY+ FVD +SRF+WI+ L++KS+   TF++F+T +E    L I+ +QTD GGEFRA   YL  NGI HR SCP+T QQ
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NG+ ERKHR IV+ GLTLL   SLPL+FWD++F T VY  NRLP+ VLH   P+E LF + P YSFLK FGC CFP LRPYN+HKLQ+RS  CTFLGYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVT-TQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPA
         HKGYKCM S+GR++ISR V F+E SFPYS     K+  V+ C  + VS     P T  ++ +  P +LS        S       S   +  +DN +  
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVT-TQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPA

Query:  LQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNGQ---------SNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQ
           +PNS  T    A V +       Q  VS I++            N HPM+TR+K GI KPK+ +      EP SV  AL+   W +AM  EY AL +
Subjt:  LQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNGQ---------SNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQ

Query:  NKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMN
        N TWSLVP P+  Q IGCKWV+K K N DG+V +YKARLVAKGFHQ A  D+TETFSPVVKP TIRV+FT+AL+  W ++QLD+NNAFL+G L E+VFM 
Subjt:  NKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMN

Query:  QPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGL
        QP GF D   P  VCRLHKALYGLKQAPRAW+E+L   L+S GF  +K+D SL  R   +   Y+L+YVDDI++ GS ++ I+SLI+ L+ +FSLKDLG 
Subjt:  QPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGL

Query:  LHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHW
        +HYFLGI+VS+ NN GL LSQ+KY+ DLL K  M    P RTP+ +G  + A  G+  +D+  YRS VGALQYVTITRPELS+SVNKVCQFM +PT  HW
Subjt:  LHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHW

Query:  QAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVI
        +AVKRILRYL+G+   G+ LKK S+L L GF DADWASD DDR+STSG C+ LG NL++W SKKQ  +SRSSTEAE+RSLA   AE+ WL++LL EL + 
Subjt:  QAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVI

Query:  PSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNVIDSIAIGLRGGVKEN
         ++PP++WCDNL  V LSANPVLH+RTKH+ELD+YFV + V+++ + V+H+P+ DQLAD+ TK +S+  F   R KL + +   + LRG V+E+
Subjt:  PSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNVIDSIAIGLRGGVKEN

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 6.2e-267; identity: 53.55%
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL L+ SD+WGP+   S  GF YYVSFVD YSR+TW+YFL++KS     F+ F+   E   G  ++  QTD GGEFR+L  Y   NGI HR SCP+TS+Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NGI+ERKHRHIV++GLTLL+QASLPL++W DAFSTAV+ INRLP+ VL    P E LF +KP YS LK FGCLCFP LRPYN HKL FRSSPCTFLGYS+
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSN-------------STGST
         HKGYKC++  GR+FISR V FDE  FP++   + K   +       +  +P+      ++    + L +  A +S+ L+ N             +T S+
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSN-------------STGST

Query:  STLPAVDNTLPALQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNG---QSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNE
        ST+P ++ +     S P+S+        +P  T+  + +P  SI +         H MVTRSK+GIFKPKV   +    EP + +EA+  P W +AM  E
Subjt:  STLPAVDNTLPALQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNG---QSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNE

Query:  YAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILS
        + AL++NKTWSLV  P++   +GC+WVFK+KRN DGSVSRYKARLVAKG+ QV   D+ ETFSPVVKP TIRV+  +A++  W +RQLD+NNAFL+G L 
Subjt:  YAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILS

Query:  EKVFMNQPPGFTDSST--PTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQ
        E+V+M+QPPGF   +      VC+LHKALYGLKQAPRAW+++L   L   GF  +K+D SL  R    S  ++L+YVDDI++TGSSS EI  LIS L   
Subjt:  EKVFMNQPPGFTDSST--PTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQ

Query:  FSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFM
        FSLKDLG L YFLGIEV    +GGL LSQ KY+ DLL K  M  A  + TPM+SG  +SA  G+  ++V  YRS+VGALQY+TITRPE+++SVNKVCQFM
Subjt:  FSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFM

Query:  HSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQA
          P   HW+AVKRILRYL G+   G++LK    + L GF DADW SD DDR+STSG C+ LG +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+
Subjt:  HSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQA

Query:  LLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV
        LL EL    +  P++WCDN+  V LSANPVLHSRTKH+ELD+YFVR+ V++R+L+V H+P  DQ+AD+FTKPLS   F +LR KL V
Subjt:  LLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 1.8e-258; identity: 52.54%
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL L+ SD+WGP+   S  GF YYVSFVD YSR+TW+YFL++KS     F+ F+   E   G  ++  QTD GGEFR+L  Y   NGI HR SCP+TS+Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NGI+ERKHRHIV++GLTLL+QASLPL++W DAFSTAV+ INRLP+ VL    P E LF +KP YS LK FGCLCFP LRPYN HKL FRSSPCTFLGYS+
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSN-------------STGST
         HKGYKC++  GR+FISR V FDE  FP++   + K   +       +  +P+      ++    + L +  A +S+ L+ N             +T S+
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSN-------------STGST

Query:  STLPAVDNTLPALQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNG---QSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNE
        ST+P ++ +     S P+S+        +P  T+  + +P  SI +         H MVTRSK+GIFKPKV   +    EP + +EA+  P W +AM  E
Subjt:  STLPAVDNTLPALQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNG---QSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNE

Query:  YAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILS
        + AL++NKTWSLV  P++   +GC+WVFK+KRN DGSVSRYKARLVAKG+ QV   D+ ETFSPVVKP TIRV+  +A++  W +RQLD+NNAFL+G L 
Subjt:  YAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILS

Query:  EKVFMNQPPGFTDSST--PTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQ
        E+V+M+QPPGF   +      VC+LHKALYGLKQAPRAW+++L   L   GF  +K+D SL  R    S  ++L+YVDDI++TGSSS EI  LIS L   
Subjt:  EKVFMNQPPGFTDSST--PTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQ

Query:  FSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFM
        FSLKDLG L YFLGIE                  DLL K  M  A  + TPM+SG  +SA  G+  ++V  YRS+VGALQY+TITRPE+++SVNKVCQFM
Subjt:  FSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFM

Query:  HSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQA
          P   HW+AVKRILRYL G+   G++LK    + L GF DADW SD DDR+STSG C+ LG +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+
Subjt:  HSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQA

Query:  LLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV
        LL EL    +  P++WCDN+  V LSANPVLHSRTKH+ELD+YFVR+ V++R+L+V H+P  DQ+AD+FTKPLS   F +LR KL V
Subjt:  LLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV

A0A438K147 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 3.2e-271; identity: 54.25%
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL LI SD+WGP+   S +G+RYY+ FVD +SRF+WI+ L++KS+   TF++F+T +E    L I+ +QTD GGEFRA   YL  NGI HR SCP+T QQ
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NG+ ERKHR IV+ GLTLL   SLPL+FWD++F T VY  NRLP+ VLH   P+E LF + P YSFLK FGC CFP LRPYN+HKLQ+RS  CTFLGYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVT-TQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPA
         HKGYKCM S+GR++ISR V F+E SFPYS     K+  V+ C  + VS     P T  ++ +  P +LS        S       S   +  +DN +  
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVT-TQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPA

Query:  LQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNGQ---------SNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQ
           +PNS  T    A V +       Q  VS I++            N HPM+TR+K GI KPK+ +      EP SV  AL+   W +AM  EY AL +
Subjt:  LQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNGQ---------SNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQ

Query:  NKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMN
        N TWSLVP P+  Q IGCKWV+K K N DG+V +YKARLVAKGFHQ A  D+TETFSPVVKP TIRV+FT+AL+  W ++QLD+NNAFL+G L E+VFM 
Subjt:  NKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMN

Query:  QPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGL
        QP GF D   P  VCRLHKALYGLKQAPRAW+E+L   L+S GF  +K+D SL  R   +   Y+L+YVDDI++ GS ++ I+SLI+ L+ +FSLKDLG 
Subjt:  QPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGL

Query:  LHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHW
        +HYFLGI+VS+ NN GL LSQ+KY+ DLL K  M    P RTP+ +G  + A  G+  +D+  YRS VGALQYVTITRPELS+SVNKVCQFM +PT  HW
Subjt:  LHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHW

Query:  QAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVI
        +AVKRILRYL+G+   G+ LKK S+L L GF DADWASD DDR+STSG C+ LG NL++W SKKQ  +SRSSTEAE+RSLA   AE+ WL++LL EL + 
Subjt:  QAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVI

Query:  PSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNVIDSIAIGLRGGVKEN
         ++PP++WCDNL  V LSANPVLH+RTKH+ELD+YFV + V+++ + V+H+P+ DQLAD+ TK +S+  F   R KL + +   + LRG V+E+
Subjt:  PSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNVIDSIAIGLRGGVKEN

A5BFT3 Integrase catalytic domain-containing protein; e value: 3.3e-268; identity: 53.59%
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL LI  D+WGP++  S +G+RYY+ FVD +SRF+WI+ L++KS+   TF++F+T +E    L I+ +QTD GGEFRA   YL  NGI HR SCP+T QQ
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NG+ ERKHR IV+ GLTLL  ASLPL+FWD++F T VY  NRLP+ +LH   P+E LF + P YSFLK FGC CFP LRPYN+HKLQ+RS  CTFLGYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVT-TQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPA
         HKGYKCM S+GR++IS  V F+E SFPYS     K+  V+ C  + VS     P T  ++ +  P +LS        S       S   +  +DN +  
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVT-TQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPA

Query:  LQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNGQ---------SNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQ
           +PNS  T    A V +       Q  VS I++            N HPM+TR+K GI KPK+ +      EP SV  AL+   W +AM  EY AL +
Subjt:  LQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNGQ---------SNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQ

Query:  NKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMN
        N TWSLVP P+  Q IGCKWV+K K N DG+V +YKARLVAKGFHQ A  D+TETFSPVVKP T+RV+FT+AL+  W ++QLD+NNAFL+G L E+VFM 
Subjt:  NKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMN

Query:  QPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGL
        QP GF D   P  VCRLHKALYGLKQAPRAW+E+L   L+S GF  +K+D SL  R       Y+L+YVDDI++ GS ++ I+SLI+ L+ +FSLKDLG 
Subjt:  QPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGL

Query:  LHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHW
        +HYFLGI+VS+ NN GL LSQ+KY+ DLL K  M    P RTP+ +G  +    G+   D+  YRS VGALQYVTITRPELS+SVNKVCQFM +PT  HW
Subjt:  LHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHW

Query:  QAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVI
        + VKRILRYL+G+   G+ LKK S+L L GF DADWASD DDR+STSG C+ LG NL++W SKKQ ++SRSS E E+RSLA   AE+ WL++LL EL + 
Subjt:  QAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVI

Query:  PSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNVIDSIAIGLRGGVK
         ++PP++WCDNL  V LSANPVLH+RTKH+ELD+YFVR+ V+++ + V+H+P+ DQLAD+ TK +S+  F   R KL + +   + LRG V+
Subjt:  PSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNVIDSIAIGLRGGVK

A5BMC8 Integrase catalytic domain-containing protein; e value: 6.2e-259; identity: 53.1%
Query:  SRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQNGIVERKHRHIVDMGL
        S  GF YYVSF D YSR+TW+Y L++KS     F+ F+   E   G  ++  QTD GGEFR+L  Y   NGI HR SCP+TS+QNGI+ERKHRHIV++GL
Subjt:  SRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQNGIVERKHRHIVDMGL

Query:  TLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSNIHKGYKCMDSSGRIFI
        TLL+QASLPL++W DAFSTAV+ INRLP+ VL    P E LF +KP YS LK FGCLCFP LRPYN HKL FRSSPCTFLGYS+ HKGYKC++  GR+FI
Subjt:  TLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSNIHKGYKCMDSSGRIFI

Query:  SRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSN-------------STGSTSTLPAVDNTLPALQSS
        SR V FDE  FP++   + K   +       +  +P+       +    + L +  A +S+ L+ N             +T S+ST+P +  +     SS
Subjt:  SRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSN-------------STGSTSTLPAVDNTLPALQSS

Query:  PNSNL--TAPFNADVPTGTDTCTSQPFVSIISNGQSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPS
            L  T P + +     ++  ++P            H MVTRSK+GIFKP V   +    EP + +EA+  P W +AM  E+ AL++NKTWSLV  P+
Subjt:  PNSNL--TAPFNADVPTGTDTCTSQPFVSIISNGQSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPS

Query:  DHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQPPGFTDSST-
        +   +GC+WVFK+KRN DGSVSRYKARLVAKG+ QV   D+ ETFSPVVKP TIRV+  +A++  W +RQLD+NNAFL+G L E+V+M+QPPGF   +  
Subjt:  DHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQPPGFTDSST-

Query:  -PTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEV
            VC+LHKALYGLKQAPRAW+++L   L   GF  +K+D SL  R    S  ++L+YVDDI++TGSSS EI  LIS L   FSLKDLG L YFLGIEV
Subjt:  -PTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEV

Query:  SYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRY
            +GGL LSQ KY+ DLL K  M  A  + TPM+SG  +SA  G+  ++V  YRS+VGALQY+TITRPE+++SVNKVCQFM  P   HW+AVKRILRY
Subjt:  SYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRY

Query:  LKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVIPSRPPILWC
        L G+   G++LK    + L GF DADW SD DDR+STSG C+ LG +LV+W SKKQ   SRSSTEAE+RSLA+ ++E++WLQ+LL EL    +  P++WC
Subjt:  LKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVIPSRPPILWC

Query:  DNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV
        DN+  V LSANPVLHSRTKH+ELD+YFVR+ V++R+L+V H+P  DQ+AD+FTKPLS   F +LR KL V
Subjt:  DNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV

SwissProt top hits (columns: hit, e value, % identity, alignment)
P04146 Copia protein; e value: 1.5e-116; identity: 30.9%
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEF--RALTPYLRSNGISHRFSCPYTS
        PL ++ SDV GP    + +   Y+V FVD ++ +   Y ++ KSDV+S F  F    E    L +  +  D G E+    +  +    GIS+  + P+T 
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEF--RALTPYLRSNGISHRFSCPYTS

Query:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDIS--PMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFL
        Q NG+ ER  R I +   T++S A L   FW +A  TA Y INR+PS  L D S  P E     KP    L+ FG   +  ++     K   +S    F+
Subjt:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDIS--PMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFL

Query:  GYSNIHKGYKCMDSSGRIFI-SRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCD-----ADNSNSLNSNSTGSTSTL
        GY     G+K  D+    FI +R V  DE +   S + K ++ ++     +   + P       + T+ P     CD      D+  S N N    +  +
Subjt:  GYSNIHKGYKCMDSSGRIFI-SRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCD-----ADNSNSLNSNSTGSTSTL

Query:  PAVD--------NTLPALQSSPNSN---------------LTAPFNADVPT--------------GTDTCTSQPFVSIISNGQSNI--HPMVTRSKDGIF
           +        + +  L+ S  SN               L     +  P               G D  T    + II+     +   P ++ +++   
Subjt:  PAVD--------NTLPALQSSPNSN---------------LTAPFNADVPT--------------GTDTCTSQPFVSIISNGQSNI--HPMVTRSKDGIF

Query:  KPKVLLTEYT---DVEPPSVKEAL---RCPHWLQAMKNEYAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTET
          KV+L  +T   DV P S  E         W +A+  E  A   N TW++  RP +  ++  +WVF +K N  G+  RYKARLVA+GF Q   +DY ET
Subjt:  KPKVLLTEYT---DVEPPSVKEAL---RCPHWLQAMKNEYAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTET

Query:  FSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLY
        F+PV +  + R + +L + +  ++ Q+D+  AFL+G L E+++M  P G + +S    VC+L+KA+YGLKQA R W+E     L    F  S  D  +  
Subjt:  FSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLY

Query:  RHKG--TSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSG-NVVSA
          KG      Y+L+YVDD++I     + +++    L ++F + DL  + +F+GI +    +  ++LSQS YV  +L K +M   N + TP+ S  N    
Subjt:  RHKG--TSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSG-NVVSA

Query:  FTGEKFNDVRLYRSIVGALQYVTI-TRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKK--PSDLTLSGFADADWASDPDDRKSTSGF
         + E  N     RS++G L Y+ + TRP+L+ +VN + ++        WQ +KR+LRYLKG+    ++ KK    +  + G+ D+DWA    DRKST+G+
Subjt:  FTGEKFNDVRLYRSIVGALQYVTI-TRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKK--PSDLTLSGFADADWASDPDDRKSTSGF

Query:  CM-MLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLV
           M   NL+ W +K+Q+ ++ SSTEAE+ +L  A  E +WL+ LL  + +    P  ++ DN G + ++ NP  H R KH+++  +F R+ V    + +
Subjt:  CM-MLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLV

Query:  QHLPAFDQLADIFTKPLSALSFHRLRSKLNVI
        +++P  +QLADIFTKPL A  F  LR KL ++
Subjt:  QHLPAFDQLADIFTKPLSALSFHRLRSKLNVI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    7.3e-132    34.02
Query:  LIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEF--RALTPYLRSNGISHRFSCPYTSQQN
        L+ SDV GP    S  G +Y+V+F+D  SR  W+Y L++K  V+  F  F   +E+  G  ++ +++D GGE+  R    Y  S+GI H  + P T Q N
Subjt:  LIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEF--RALTPYLRSNGISHRFSCPYTSQQN

Query:  GIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPL-YSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        G+ ER +R IV+   ++L  A LP  FW +A  TA Y INR PS  L    P ER++  K + YS LK FGC  F  +      KL  +S PC F+GY +
Subjt:  GIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPL-YSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDS-SGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPA
           GY+  D    ++  SR V F E+        +  +    +  N ++ +    P T                      ++N T + ST   V      
Subjt:  IHKGYKCMDS-SGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPA

Query:  LQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNGQSNIHPMVTRSKDGIFKPKVLLTEYT----DVEPPSVKEALRCP---HWLQAMKNEYAALLQNK
         Q      +       +  G         V   + G+    P+    +  +   +   TEY     D EP S+KE L  P     ++AM+ E  +L +N 
Subjt:  LQSSPNSNLTAPFNADVPTGTDTCTSQPFVSIISNGQSNIHPMVTRSKDGIFKPKVLLTEYT----DVEPPSVKEALRCP---HWLQAMKNEYAALLQNK

Query:  TWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQP
        T+ LV  P   + + CKWVFK+K++ D  + RYKARLV KGF Q   +D+ E FSPVVK  +IR + +LA +   ++ QLD+  AFLHG L E+++M QP
Subjt:  TWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQP

Query:  PGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLY-RHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLL
         GF  +     VC+L+K+LYGLKQAPR WY +  SF+ S  +  + +D  + + R    +   +L+YVDD++I G     I+ L   L K F +KDLG  
Subjt:  PGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLY-RHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLL

Query:  HYFLGIE-VSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPM-----VSGNVVSAFTGEKFNDVRL-YRSIVGALQYVTI-TRPELSYSVNKVCQFMH
           LG++ V    +  L+LSQ KY+  +L + +M  A P+ TP+     +S  +      EK N  ++ Y S VG+L Y  + TRP+++++V  V +F+ 
Subjt:  HYFLGIE-VSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPM-----VSGNVVSAFTGEKFNDVRL-YRSIVGALQYVTI-TRPELSYSVNKVCQFMH

Query:  SPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQAL
        +P   HW+AVK ILRYL+G+ +   L    SD  L G+ DAD A D D+RKS++G+     G  ++W SK Q  ++ S+TEAE+ +      E+IWL+  
Subjt:  SPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQAL

Query:  LKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTK
        L+EL  +  +  +++CD+  A+ LS N + H+RTKH+++  +++R++V    L V  +   +  AD+ TK
Subjt:  LKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTK

P92519 Uncharacterized mitochondrial protein AtMg00810    2.7e-57    48.47
Query:  YILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRL
        Y+L+YVDDI++TGSS++ ++ LI  L   FS+KDLG +HYFLGI++   +  GLFLSQ+KY   +L+ A M +  P+ TP+    + S+ +  K+ D   
Subjt:  YILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRL

Query:  YRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSK
        +RSIVGALQY+T+TRP++SY+VN VCQ MH PTL  +  +KR+LRY+KG+   G+ + K S L +  F D+DWA     R+ST+GFC  LG N+++W +K
Subjt:  YRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSK

Query:  KQSVISRSSTEAEFRSLANASAELIWLQA
        +Q  +SRSSTE E+R+LA  +AEL W  A
Subjt:  KQSVISRSSTEAEFRSLANASAELIWLQA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    2.0e-214    42.98
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL  I SDVW   +  S + +RYYV FVD ++R+TW+Y L+ KS V  TFI+F+  +E      I    +D GGEF AL  Y   +GISH  S P+T + 
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NG+ ERKHRHIV+ GLTLLS AS+P  +W  AF+ AVY INRLP+ +L   SP ++LFGT P Y  L+ FGC C+P LRPYN HKL  +S  C FLGYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMD-SSGRIFISRHVQFDENSFPY-----------------SISYKPKSSYVTR-----------------------------------CDNAV
            Y C+   + R++ISRHV+FDEN FP+                 S  + P ++  TR                                    D++ 
Subjt:  IHKGYKCMD-SSGRIFISRHVQFDENSFPY-----------------SISYKPKSSYVTR-----------------------------------CDNAV

Query:  VSHLPITP------MTCEVTTQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPALQS----SPNSNLTAPFNADVPTGTDTCTSQPFVSIISNGQS
         S  P +P            T +P    +    + N+  +N T  + +  A   + PA  S    SP ++ ++   +  P         P   I++N   
Subjt:  VSHLPITP------MTCEVTTQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPALQS----SPNSNLTAPFNADVPTGTDTCTSQPFVSIISNGQS

Query:  ---NIHPMVTRSKDGIFKP----KVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPSDH-QVIGCKWVFKIKRNTDGSVSRYKARL
           N H M TR+K GI KP     + ++   + EP +  +AL+   W  AM +E  A + N TW LVP P  H  ++GC+W+F  K N+DGS++RYKARL
Subjt:  ---NIHPMVTRSKDGIFKP----KVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPSDH-QVIGCKWVFKIKRNTDGSVSRYKARL

Query:  VAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFL
        VAKG++Q   +DY ETFSPV+K  +IR++  +A+   W +RQLD+NNAFL G L++ V+M+QPPGF D   P  VC+L KALYGLKQAPRAWY  L ++L
Subjt:  VAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFL

Query:  VSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANP
        +++GF  S +DTSL    +G S  Y+L+YVDDI+ITG+  + + + +  L ++FS+KD   LHYFLGIE       GL LSQ +Y+ DLL + +M  A P
Subjt:  VSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANP

Query:  IRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASD
        + TPM     +S ++G K  D   YR IVG+LQY+  TRP++SY+VN++ QFMH PT  H QA+KRILRYL G+ + G+ LKK + L+L  ++DADWA D
Subjt:  IRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASD

Query:  PDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRD
         DD  ST+G+ + LG + ++W SKKQ  + RSSTEAE+RS+AN S+E+ W+ +LL EL +  +RPP+++CDN+GA +L ANPV HSR KH+ +D +F+R+
Subjt:  PDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRD

Query:  LVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV
         V    L V H+   DQLAD  TKPLS  +F    SK+ V
Subjt:  LVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    1.8e-215    43.81
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL  I SDVW   +  S + +RYYV FVD ++R+TW+Y L+ KS V  TFI F++ +E      I  + +D GGEF  L  YL  +GISH  S P+T + 
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NG+ ERKHRHIV+MGLTLLS AS+P  +W  AFS AVY INRLP+ +L   SP ++LFG  P Y  LK FGC C+P LRPYN HKL+ +S  C F+GYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMD-SSGRIFISRHVQFDENSFPYS-ISYKPKSSYVTRCDNAV--VSH--LPITPMTCEV---------------TTQKPVMLSSCDADN--SN
            Y C+   +GR++ SRHVQFDE  FP+S  ++   +S   R D+A    SH  LP TP+                   ++  P+  +   + N  S+
Subjt:  IHKGYKCMD-SSGRIFISRHVQFDENSFPYS-ISYKPKSSYVTRCDNAV--VSH--LPITPMTCEV---------------TTQKPVMLSSCDADN--SN

Query:  SLNSNST---------------------GSTSTLPAVDNTLPALQS--SPNSNLTAP----------------FNADVPTGTDTCT--------SQPFVS
        S++S S+                      S S  P ++N  P   S  SPN N   P                   + P+ + T T        + P + 
Subjt:  SLNSNST---------------------GSTSTLPAVDNTLPALQS--SPNSNLTAP----------------FNADVPTGTDTCT--------SQPFVS

Query:  IISNGQSNIHPMVTRSKDGIFKPKVLLTEYTDV----EPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPSDH-QVIGCKWVFKIKRNTDGSVSRY
        + +    N H M TR+KDGI KP    +  T +    EP +  +A++   W QAM +E  A + N TW LVP P     ++GC+W+F  K N+DGS++RY
Subjt:  IISNGQSNIHPMVTRSKDGIFKPKVLLTEYTDV----EPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPSDH-QVIGCKWVFKIKRNTDGSVSRY

Query:  KARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERL
        KARLVAKG++Q   +DY ETFSPV+K  +IR++  +A+   W +RQLD+NNAFL G L+++V+M+QPPGF D   P  VCRL KA+YGLKQAPRAWY  L
Subjt:  KARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERL

Query:  SSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMF
         ++L+++GF  S +DTSL    +G S  Y+L+YVDDI+ITG+ +  +   +  L ++FS+K+   LHYFLGIE       GL LSQ +Y  DLL + +M 
Subjt:  SSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMF

Query:  EANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADAD
         A P+ TPM +   ++  +G K  D   YR IVG+LQY+  TRP+LSY+VN++ Q+MH PT  HW A+KR+LRYL G+   G+ LKK + L+L  ++DAD
Subjt:  EANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADAD

Query:  WASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIY
        WA D DD  ST+G+ + LG + ++W SKKQ  + RSSTEAE+RS+AN S+EL W+ +LL EL +  S PP+++CDN+GA +L ANPV HSR KH+ LD +
Subjt:  WASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRSLANASAELIWLQALLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIY

Query:  FVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNVI
        F+R+ V    L V H+   DQLAD  TKPLS ++F     K+ VI
Subjt:  FVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNVI

Arabidopsis top hits    e value    %identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    7.1e-114    41.15
Query:  EPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLAL
        EP +  EA     W  AM +E  A+    TW +   P + + IGCKWV+KIK N+DG++ RYKARLVAKG+ Q   +D+ ETFSPV K  +++++  ++ 
Subjt:  EPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVADVDYTETFSPVVKPVTIRVLFTLAL

Query:  AFGWQLRQLDINNAFLHGILSEKVFMNQPPGFT----DSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYV
         + + L QLDI+NAFL+G L E+++M  PPG+     DS  P AVC L K++YGLKQA R W+ + S  L+  GF  S +D +   +   T    +L+YV
Subjt:  AFGWQLRQLDINNAFLHGILSEKVFMNQPPGFT----DSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKGTSRCYILIYV

Query:  DDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVG
        DDIII  ++ + +  L S L   F L+DLG L YFLG+E++  +  G+ + Q KY  DLL +  +    P   PM      SA +G  F D + YR ++G
Subjt:  DDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVG

Query:  ALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVIS
         L Y+ ITR ++S++VNK+ QF  +P L H QAV +IL Y+KG+   G+     +++ L  F+DA + S  D R+ST+G+CM LG +L++W SKKQ V+S
Subjt:  ALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVIS

Query:  RSSTEAEFRSLANASAELIWLQALLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALS
        +SS EAE+R+L+ A+ E++WL    +EL +  S+P +L+CDN  A+H++ N V H RTKH+E D + VR+  + +  L     A+D+  D FT+ LS + 
Subjt:  RSSTEAEFRSLANASAELIWLQALLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALS

Query:  FHRLRSKLNVIDSIAIGLRG
           LR  +  I S+  GL G
Subjt:  FHRLRSKLNVIDSIAIGLRG

ATMG00240.1 Gag-Pol-related retrotransposon family protein    8.1e-17    42.27
Query:  YVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVIS
        Y+TITRP+L+++VN++ QF  +      QAV ++L Y+KG+   G+     SDL L  FAD+DWAS PD R+S +GFC ++   L   G+ ++S++S
Subjt:  YVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVIS

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein    5.8e-07    38.24
Query:  HRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCF
        +R I++   ++L +  LP  F  DA +TAV+ IN+ PST ++   P E  F + P YS+L+ FGC+ +
Subjt:  HRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCF
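
The %identity column can be related to the alignment text itself. In BLAST-style output the middle line carries a letter where query and subject residues are identical and a `+` where the substitution scores as similar. The following is a minimal sketch, assuming that convention holds for this page and that %identity is computed over all alignment columns; the function name `alignment_stats` is illustrative and not part of any database tool. It uses the ATMG00710.1 alignment above as input.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Summarise one BLAST-style alignment block.

    query   -- the aligned query row (gaps, if any, count as columns)
    midline -- the row between Query and Subjt: letters mark identical
               residues, '+' marks similar residues (assumed convention)
    """
    columns = len(query)
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return {
        "columns": columns,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / columns,
    }


# Rows copied from the ATMG00710.1 alignment shown above.
QUERY = "HRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCF"
MIDLINE = "+R I++   ++L +  LP  F  DA +TAV+ IN+ PST ++   P E  F + P YS+L+ FGC+ +"

stats = alignment_stats(QUERY, MIDLINE)
print(f"{stats['identities']}/{stats['columns']} identical "
      f"({stats['percent_identity']:.2f}%)")
```

For this block the count is 26 identical residues over 68 columns, which agrees with the 38.24 %identity listed for the hit. For multi-block alignments the counts would need to be summed over all blocks before dividing.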

ATMG00810.1 DNA/RNA polymerases superfamily protein    1.9e-58    48.47
Query:  YILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRL
        Y+L+YVDDI++TGSS++ ++ LI  L   FS+KDLG +HYFLGI++   +  GLFLSQ+KY   +L+ A M +  P+ TP+    + S+ +  K+ D   
Subjt:  YILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRL

Query:  YRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSK
        +RSIVGALQY+T+TRP++SY+VN VCQ MH PTL  +  +KR+LRY+KG+   G+ + K S L +  F D+DWA     R+ST+GFC  LG N+++W +K
Subjt:  YRSIVGALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSK

Query:  KQSVISRSSTEAEFRSLANASAELIWLQA
        +Q  +SRSSTE E+R+LA  +AEL W  A
Subjt:  KQSVISRSSTEAEFRSLANASAELIWLQA

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    3.2e-29    53.6
Query:  MVTRSKDGIFK--PKVLLTEYTDV--EPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQV
        M+TRSK GI K  PK  LT  T +  EP SV  AL+ P W QAM+ E  AL +NKTW LVP P +  ++GCKWVFK K ++DG++ R KARLVAKGFHQ 
Subjt:  MVTRSKDGIFK--PKVLLTEYTDV--EPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQV

Query:  ADVDYTETFSPVVKPVTIRVLFTLA
          + + ET+SPVV+  TIR +  +A
Subjt:  ADVDYTETFSPVVKPVTIRVLFTLA


Sequences
CDS sequence
ATGCCTCTTCTTCTTATTGAATCTGATGTATGGGGTCCTTCTGTCAAACCATCTCGCAATGGTTTTCGATATTATGTTAGCTTTGTTGATGTGTATTCCAGATTTACATG
GATATATTTTCTTCAATCTAAGTCAGATGTCTATTCCACTTTTATCTCTTTTCGTACTCATATAGAAAAACTTCTTGGATTACCCATTCGCATGATTCAAACTGATGGAG
GGGGGGAATTTCGCGCTCTCACCCCTTATTTACGCTCCAATGGTATTTCTCATCGCTTTTCTTGTCCTTATACGTCTCAGCAAAACGGTATTGTTGAACGGAAACATCGT
CATATAGTTGACATGGGTCTAACCTTACTTTCTCAAGCATCCTTACCACTTGAGTTTTGGGATGATGCTTTCTCCACAGCAGTTTATACCATTAATCGGCTGCCTTCTAC
AGTCCTGCATGACATCAGTCCAATGGAGCGTTTGTTCGGTACGAAACCCCTTTACTCTTTTCTTAAAACCTTTGGCTGCCTGTGTTTTCCTTGCTTACGTCCATATAACT
CTCATAAACTTCAATTTCGTTCTTCTCCTTGTACCTTTCTTGGCTATAGCAATATCCATAAAGGCTATAAATGCATGGATTCTTCTGGGCGAATTTTCATTTCCAGACAT
GTTCAATTTGATGAAAATTCTTTTCCTTATAGTATATCCTACAAGCCCAAGTCTTCTTATGTGACTAGATGTGATAATGCGGTAGTTTCTCACTTACCTATTACTCCTAT
GACATGTGAAGTTACTACCCAAAAACCTGTTATGCTCTCATCTTGTGATGCTGACAATTCAAACTCACTTAATAGTAATAGTACTGGTTCCACTAGTACTTTGCCAGCTG
TTGATAATACTTTACCAGCTCTTCAGTCTTCTCCAAACAGCAATTTAACAGCTCCATTTAATGCTGATGTGCCTACTGGTACAGACACCTGTACCTCACAGCCTTTTGTA
TCTATAATATCCAATGGTCAATCTAATATACATCCAATGGTCACACGTTCAAAGGATGGAATTTTCAAACCAAAGGTGTTACTTACTGAATATACAGATGTAGAACCTCC
TTCTGTCAAAGAAGCATTACGTTGTCCTCATTGGCTTCAAGCTATGAAGAATGAATATGCAGCTCTTCTTCAAAACAAGACTTGGTCTCTAGTTCCTCGACCATCTGATC
ATCAAGTCATTGGCTGCAAATGGGTATTCAAGATCAAAAGAAATACTGATGGTTCAGTCTCGAGGTACAAGGCTCGTTTAGTTGCGAAAGGTTTTCATCAAGTAGCTGAT
GTTGATTATACTGAAACTTTTAGCCCAGTTGTTAAACCGGTTACCATTCGTGTCTTGTTTACACTGGCTCTAGCATTTGGATGGCAGCTTCGACAACTTGATATAAACAA
TGCATTCTTGCATGGTATCCTCTCTGAGAAGGTTTTTATGAATCAGCCCCCTGGCTTCACTGATTCAAGCACTCCAACAGCAGTCTGTAGACTTCATAAAGCACTTTATG
GTCTCAAGCAAGCTCCTCGAGCTTGGTATGAACGGTTGTCGTCCTTTCTCGTCTCTCTTGGATTCAAATGTTCCAAGGCGGATACTTCTCTCCTATATCGACATAAAGGT
ACCTCTCGCTGTTATATTCTCATCTATGTAGATGATATAATCATCACTGGTTCTTCTTCATCAGAGATATCTTCCTTAATATCTCTGTTACATAAACAATTTTCTCTTAA
AGATCTTGGCCTGTTGCATTACTTTTTGGGTATTGAGGTATCATACCCCAACAATGGTGGTCTTTTTCTTTCCCAAAGTAAATATGTCGCTGATTTACTTCATAAAGCTC
ACATGTTTGAGGCAAATCCCATCCGTACTCCTATGGTGAGTGGTAATGTGGTATCTGCTTTTACTGGTGAAAAATTCAACGATGTTAGACTATACCGAAGCATCGTAGGT
GCTTTACAATATGTGACTATCACCAGACCCGAACTCTCATATAGTGTTAACAAGGTCTGTCAGTTCATGCACTCTCCCACTTTGGTTCATTGGCAAGCTGTCAAACGCAT
TCTACGTTATCTAAAGGGATCATTTTCTTCTGGCATGCTACTTAAGAAGCCTTCTGATCTAACTTTGAGTGGATTTGCTGATGCTGATTGGGCATCTGACCCAGATGATC
GAAAATCTACCTCAGGATTCTGTATGATGCTTGGTGGTAATCTTGTAGCTTGGGGATCAAAGAAGCAGTCAGTAATTTCTAGATCGAGTACAGAAGCTGAATTTCGCAGT
CTTGCAAATGCTTCTGCAGAGTTAATATGGCTTCAAGCTCTCCTTAAAGAACTTATTGTAATCCCTTCTCGTCCACCAATCCTTTGGTGTGATAACCTTGGAGCAGTGCA
TCTTAGTGCCAATCCGGTGCTTCATTCCAGAACAAAGCATGTGGAATTGGACATCTACTTTGTTCGTGACCTTGTTCTTCAAAGACGTTTATTGGTTCAACACCTCCCAG
CATTTGATCAGCTAGCTGATATCTTTACTAAGCCACTTTCAGCCTTATCGTTTCATCGTTTAAGGTCCAAGCTCAATGTCATTGATTCTATTGCCATTGGCTTGAGGGGG
GGTGTTAAGGAAAATCACTGA
mRNA sequence
ATGCCTCTTCTTCTTATTGAATCTGATGTATGGGGTCCTTCTGTCAAACCATCTCGCAATGGTTTTCGATATTATGTTAGCTTTGTTGATGTGTATTCCAGATTTACATG
GATATATTTTCTTCAATCTAAGTCAGATGTCTATTCCACTTTTATCTCTTTTCGTACTCATATAGAAAAACTTCTTGGATTACCCATTCGCATGATTCAAACTGATGGAG
GGGGGGAATTTCGCGCTCTCACCCCTTATTTACGCTCCAATGGTATTTCTCATCGCTTTTCTTGTCCTTATACGTCTCAGCAAAACGGTATTGTTGAACGGAAACATCGT
CATATAGTTGACATGGGTCTAACCTTACTTTCTCAAGCATCCTTACCACTTGAGTTTTGGGATGATGCTTTCTCCACAGCAGTTTATACCATTAATCGGCTGCCTTCTAC
AGTCCTGCATGACATCAGTCCAATGGAGCGTTTGTTCGGTACGAAACCCCTTTACTCTTTTCTTAAAACCTTTGGCTGCCTGTGTTTTCCTTGCTTACGTCCATATAACT
CTCATAAACTTCAATTTCGTTCTTCTCCTTGTACCTTTCTTGGCTATAGCAATATCCATAAAGGCTATAAATGCATGGATTCTTCTGGGCGAATTTTCATTTCCAGACAT
GTTCAATTTGATGAAAATTCTTTTCCTTATAGTATATCCTACAAGCCCAAGTCTTCTTATGTGACTAGATGTGATAATGCGGTAGTTTCTCACTTACCTATTACTCCTAT
GACATGTGAAGTTACTACCCAAAAACCTGTTATGCTCTCATCTTGTGATGCTGACAATTCAAACTCACTTAATAGTAATAGTACTGGTTCCACTAGTACTTTGCCAGCTG
TTGATAATACTTTACCAGCTCTTCAGTCTTCTCCAAACAGCAATTTAACAGCTCCATTTAATGCTGATGTGCCTACTGGTACAGACACCTGTACCTCACAGCCTTTTGTA
TCTATAATATCCAATGGTCAATCTAATATACATCCAATGGTCACACGTTCAAAGGATGGAATTTTCAAACCAAAGGTGTTACTTACTGAATATACAGATGTAGAACCTCC
TTCTGTCAAAGAAGCATTACGTTGTCCTCATTGGCTTCAAGCTATGAAGAATGAATATGCAGCTCTTCTTCAAAACAAGACTTGGTCTCTAGTTCCTCGACCATCTGATC
ATCAAGTCATTGGCTGCAAATGGGTATTCAAGATCAAAAGAAATACTGATGGTTCAGTCTCGAGGTACAAGGCTCGTTTAGTTGCGAAAGGTTTTCATCAAGTAGCTGAT
GTTGATTATACTGAAACTTTTAGCCCAGTTGTTAAACCGGTTACCATTCGTGTCTTGTTTACACTGGCTCTAGCATTTGGATGGCAGCTTCGACAACTTGATATAAACAA
TGCATTCTTGCATGGTATCCTCTCTGAGAAGGTTTTTATGAATCAGCCCCCTGGCTTCACTGATTCAAGCACTCCAACAGCAGTCTGTAGACTTCATAAAGCACTTTATG
GTCTCAAGCAAGCTCCTCGAGCTTGGTATGAACGGTTGTCGTCCTTTCTCGTCTCTCTTGGATTCAAATGTTCCAAGGCGGATACTTCTCTCCTATATCGACATAAAGGT
ACCTCTCGCTGTTATATTCTCATCTATGTAGATGATATAATCATCACTGGTTCTTCTTCATCAGAGATATCTTCCTTAATATCTCTGTTACATAAACAATTTTCTCTTAA
AGATCTTGGCCTGTTGCATTACTTTTTGGGTATTGAGGTATCATACCCCAACAATGGTGGTCTTTTTCTTTCCCAAAGTAAATATGTCGCTGATTTACTTCATAAAGCTC
ACATGTTTGAGGCAAATCCCATCCGTACTCCTATGGTGAGTGGTAATGTGGTATCTGCTTTTACTGGTGAAAAATTCAACGATGTTAGACTATACCGAAGCATCGTAGGT
GCTTTACAATATGTGACTATCACCAGACCCGAACTCTCATATAGTGTTAACAAGGTCTGTCAGTTCATGCACTCTCCCACTTTGGTTCATTGGCAAGCTGTCAAACGCAT
TCTACGTTATCTAAAGGGATCATTTTCTTCTGGCATGCTACTTAAGAAGCCTTCTGATCTAACTTTGAGTGGATTTGCTGATGCTGATTGGGCATCTGACCCAGATGATC
GAAAATCTACCTCAGGATTCTGTATGATGCTTGGTGGTAATCTTGTAGCTTGGGGATCAAAGAAGCAGTCAGTAATTTCTAGATCGAGTACAGAAGCTGAATTTCGCAGT
CTTGCAAATGCTTCTGCAGAGTTAATATGGCTTCAAGCTCTCCTTAAAGAACTTATTGTAATCCCTTCTCGTCCACCAATCCTTTGGTGTGATAACCTTGGAGCAGTGCA
TCTTAGTGCCAATCCGGTGCTTCATTCCAGAACAAAGCATGTGGAATTGGACATCTACTTTGTTCGTGACCTTGTTCTTCAAAGACGTTTATTGGTTCAACACCTCCCAG
CATTTGATCAGCTAGCTGATATCTTTACTAAGCCACTTTCAGCCTTATCGTTTCATCGTTTAAGGTCCAAGCTCAATGTCATTGATTCTATTGCCATTGGCTTGAGGGGG
GGTGTTAAGGAAAATCACTGA
Protein sequence
MPLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQNGIVERKHR
HIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFGTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSNIHKGYKCMDSSGRIFISRH
VQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPVMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPALQSSPNSNLTAPFNADVPTGTDTCTSQPFV
SIISNGQSNIHPMVTRSKDGIFKPKVLLTEYTDVEPPSVKEALRCPHWLQAMKNEYAALLQNKTWSLVPRPSDHQVIGCKWVFKIKRNTDGSVSRYKARLVAKGFHQVAD
VDYTETFSPVVKPVTIRVLFTLALAFGWQLRQLDINNAFLHGILSEKVFMNQPPGFTDSSTPTAVCRLHKALYGLKQAPRAWYERLSSFLVSLGFKCSKADTSLLYRHKG
TSRCYILIYVDDIIITGSSSSEISSLISLLHKQFSLKDLGLLHYFLGIEVSYPNNGGLFLSQSKYVADLLHKAHMFEANPIRTPMVSGNVVSAFTGEKFNDVRLYRSIVG
ALQYVTITRPELSYSVNKVCQFMHSPTLVHWQAVKRILRYLKGSFSSGMLLKKPSDLTLSGFADADWASDPDDRKSTSGFCMMLGGNLVAWGSKKQSVISRSSTEAEFRS
LANASAELIWLQALLKELIVIPSRPPILWCDNLGAVHLSANPVLHSRTKHVELDIYFVRDLVLQRRLLVQHLPAFDQLADIFTKPLSALSFHRLRSKLNVIDSIAIGLRG
GVKENH
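
The protein sequence corresponds to a codon-by-codon translation of the CDS sequence. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame; the `translate` helper is illustrative and not taken from the database. Only the first and last codons of the CDS listed above are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons and last 7 codons of the CDS sequence above.
cds_start = "ATGCCTCTTCTTCTTATTGAATCTGATGTATGGGGTCCT"
cds_end = "GGTGTTAAGGAAAATCACTGA"

print(translate(cds_start))  # MPLLLIESDVWGP
print(translate(cds_end))    # GVKENH
```

Both fragments reproduce the start and end of the protein sequence listed above. The final `TGA` is a stop codon and is not written into the protein.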