CuGenDBv2

CSPI04G13690 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G13690
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr4:11653706..11657053
RNA-Seq Expression: CSPI04G13690
Synteny: CSPI04G13690
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR036397 - Ribonuclease H superfamily
  IPR036875 - Zinc finger, CCHC-type superfamily
  IPR043502 - DNA/RNA polymerase superfamily
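
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string can be split into its parts, the hypothetical helper `parse_location` below (not part of CuGenDB) parses it and computes the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr4:11653706..11657053' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Chr4:11653706..11657053")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3348 bp on Chr4; with 0-based half-open coordinates the span would instead be `end - start`.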


Homology
GenBank top hits | e value | % identity | Alignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus] | e value: 0.0e+00 | % identity: 67.99
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESKAE------------------------------KAESVTSYFMRLKKITIELG
        MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCES  E                              KAESVTSYFMRLKKI  ELG
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESKAE------------------------------KAESVTSYFMRLKKITIELG

Query:  LLLPFSLDV----------------------------------KIPSLDNAFTRVIRIESSSTGVSIPQPSSAIFSKNNNHRAPQRNSIDHRKPEFVEIV
        LLLPFS DV                                  KIPSLD+AFTRV+RIESS T VSIPQPSSA+FSKNNN RAPQRNS DHRKPE VEIV
Subjt:  LLLPFSLDV----------------------------------KIPSLDNAFTRVIRIESSSTGVSIPQPSSAIFSKNNNHRAPQRNSIDHRKPEFVEIV

Query:  CNYCRKSGHMKHDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNS
        CNYCRK GHMK DCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNS
Subjt:  CNYCRKSGHMKHDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNS

Query:  HLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSS-----------------------------------DRVTKKIIGRGYESGGLYLFDHQV
        HLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSS                                   DRVTKKIIGRGYESGGLYLFDHQV
Subjt:  HLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSS-----------------------------------DRVTKKIIGRGYESGGLYLFDHQV

Query:  SQAVACPVVPSPFEVHCRLGHPSFLSL------------------------------RVDKRAIAPFELVHSDIWGPCPV--------------------
        SQAVACPVVPSPFEVHCRLGHPS   L                              RVDKRAIAPFELVHSDIWGPCPV                    
Subjt:  SQAVACPVVPSPFEVHCRLGHPSFLSL------------------------------RVDKRAIAPFELVHSDIWGPCPV--------------------

Query:  ----------------------------------------------------------SSCADIPSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVI
                                                                  SSCAD PSQNGVAERK+RHLLET RALSFQMHVSK F VD +
Subjt:  ----------------------------------------------------------SSCADIPSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVI

Query:  STACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC-----------------------------------------------------DTPFTSL
        STACF INRM SSVLNGEIPYRVLFPTKHLFPIAPKIFGC                                                     DTPFTS 
Subjt:  STACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC-----------------------------------------------------DTPFTSL

Query:  PSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFI
        PSS CQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFI
Subjt:  PSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFI

Query:  TSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIR
        TSLESTSIPNSVHEALSHP WQNAMIEEMTALDDNGTWDLVSRPAGKKA GCKWVFAVKMNPDGT+ARLKARLVAKGYAQIYGTDYS T+S VAKLTSIR
Subjt:  TSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIR

Query:  LFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIALLV
        LFL MAATNKWSLHQLDIKN F+HGDLQEEVYMEQPPGFVAQ ESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGI LLV
Subjt:  LFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIALLV

Query:  VYVDDIVITGNDAL---------------------------------------------------RKLGAKPSGTPMMSNQQLVKEEELCKDPERYRRLV
        VYVDDIVITGNDAL                                                    KLGAKPSGTPMM NQQLVKE ELCKDPERYRRLV
Subjt:  VYVDDIVITGNDAL---------------------------------------------------RKLGAKPSGTPMMSNQQLVKEEELCKDPERYRRLV

Query:  GKLNYLTVT
        GKLNYLTVT
Subjt:  GKLNYLTVT
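
In the alignment blocks, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the hypothetical helper `midline_stats` below tallies these for one fragment copied from the alignment above. It is an illustration only: the % identity reported on the page is computed over the whole alignment, so a single fragment will not reproduce it.

```python
def midline_stats(query: str, midline: str) -> dict[str, int]:
    """Count identities, positives, and query gaps for one aligned segment."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    positives = identical + midline.count("+")       # '+' marks conservative substitutions
    return {
        "length": len(query),
        "identical": identical,
        "positives": positives,
        "query_gaps": query.count("-"),
    }


# Fragment taken from the second row of the XP_031744753.1 alignment.
query = "KIPSLDNAFTRVIRIESSSTGVSIPQPSSAIFSKNNNHRAPQRNSIDHRKPEFVEIV"
midline = "KIPSLD+AFTRV+RIESS T VSIPQPSSA+FSKNNN RAPQRNS DHRKPE VEIV"

stats = midline_stats(query, midline)
print(stats)
print(f"identity over fragment: {100 * stats['identical'] / stats['length']:.2f}%")
```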

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus] | e value: 2.0e-258 | % identity: 66.09
Query:  GSGTIHLTPSFSLS-SDRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSFLSL------------------------------RVD
        G G+ H   +  +   DRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPS   L                              RVD
Subjt:  GSGTIHLTPSFSLS-SDRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSFLSL------------------------------RVD

Query:  KRAIAPFELVHSDIWGPCPV------------------------------------------------------------------------------SS
        KRAIAPFELVHSDIWGPCPV                                                                              SS
Subjt:  KRAIAPFELVHSDIWGPCPV------------------------------------------------------------------------------SS

Query:  CADIPSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC--------------------
        CAD PSQNGVAERK+RHLLET RALSFQMHVSK F VD +STACF INRM SSVLNGEIPYRVLFPTKHLFPIAPKIFGC                    
Subjt:  CADIPSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC--------------------

Query:  ---------------------------------DTPFTSLPSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSC
                                         DTPFTS PSS CQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSC
Subjt:  ---------------------------------DTPFTSLPSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSC

Query:  DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKM
        DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHP WQNAMIEEMTALDDNGTWDLVSRPAGKKA GCKWVFAVKM
Subjt:  DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKM

Query:  NPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQ
        NPDGT+ARLKARLVAKGYAQIYGTDYS T+S VAKLTSIRLFL MAATNKWSLHQLDIKN F+HGDLQEEVYMEQPPGFVAQ ESDKVCRLRKSLYGLKQ
Subjt:  NPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQ

Query:  SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGNDAL----------------------------------------------
        SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGI LLVVYVDDIVITGNDAL                                              
Subjt:  SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGNDAL----------------------------------------------

Query:  -----RKLGAKPSGTPMMSNQQLVKEEELCKDPERYRRLVGKLNYLTVT
              KLGAKPSGTPMM NQQLVKE ELCKDPERYRRLVGKLNYLTVT
Subjt:  -----RKLGAKPSGTPMMSNQQLVKEEELCKDPERYRRLVGKLNYLTVT

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus] | e value: 2.0e-258 | % identity: 66.09
Query:  GSGTIHLTPSFSLS-SDRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSFLSL------------------------------RVD
        G G+ H   +  +   DRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPS   L                              RVD
Subjt:  GSGTIHLTPSFSLS-SDRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSFLSL------------------------------RVD

Query:  KRAIAPFELVHSDIWGPCPV------------------------------------------------------------------------------SS
        KRAIAPFELVHSDIWGPCPV                                                                              SS
Subjt:  KRAIAPFELVHSDIWGPCPV------------------------------------------------------------------------------SS

Query:  CADIPSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC--------------------
        CAD PSQNGVAERK+RHLLET RALSFQMHVSK F VD +STACF INRM SSVLNGEIPYRVLFPTKHLFPIAPKIFGC                    
Subjt:  CADIPSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC--------------------

Query:  ---------------------------------DTPFTSLPSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSC
                                         DTPFTS PSS CQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSC
Subjt:  ---------------------------------DTPFTSLPSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSC

Query:  DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKM
        DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHP WQNAMIEEMTALDDNGTWDLVSRPAGKKA GCKWVFAVKM
Subjt:  DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKM

Query:  NPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQ
        NPDGT+ARLKARLVAKGYAQIYGTDYS T+S VAKLTSIRLFL MAATNKWSLHQLDIKN F+HGDLQEEVYMEQPPGFVAQ ESDKVCRLRKSLYGLKQ
Subjt:  NPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQ

Query:  SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGNDAL----------------------------------------------
        SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGI LLVVYVDDIVITGNDAL                                              
Subjt:  SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGNDAL----------------------------------------------

Query:  -----RKLGAKPSGTPMMSNQQLVKEEELCKDPERYRRLVGKLNYLTVT
              KLGAKPSGTPMM NQQLVKE ELCKDPERYRRLVGKLNYLTVT
Subjt:  -----RKLGAKPSGTPMMSNQQLVKEEELCKDPERYRRLVGKLNYLTVT

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus] | e value: 2.0e-258 | % identity: 66.09
Query:  GSGTIHLTPSFSLS-SDRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSFLSL------------------------------RVD
        G G+ H   +  +   DRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPS   L                              RVD
Subjt:  GSGTIHLTPSFSLS-SDRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSFLSL------------------------------RVD

Query:  KRAIAPFELVHSDIWGPCPV------------------------------------------------------------------------------SS
        KRAIAPFELVHSDIWGPCPV                                                                              SS
Subjt:  KRAIAPFELVHSDIWGPCPV------------------------------------------------------------------------------SS

Query:  CADIPSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC--------------------
        CAD PSQNGVAERK+RHLLET RALSFQMHVSK F VD +STACF INRM SSVLNGEIPYRVLFPTKHLFPIAPKIFGC                    
Subjt:  CADIPSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC--------------------

Query:  ---------------------------------DTPFTSLPSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSC
                                         DTPFTS PSS CQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSC
Subjt:  ---------------------------------DTPFTSLPSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSC

Query:  DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKM
        DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHP WQNAMIEEMTALDDNGTWDLVSRPAGKKA GCKWVFAVKM
Subjt:  DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKM

Query:  NPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQ
        NPDGT+ARLKARLVAKGYAQIYGTDYS T+S VAKLTSIRLFL MAATNKWSLHQLDIKN F+HGDLQEEVYMEQPPGFVAQ ESDKVCRLRKSLYGLKQ
Subjt:  NPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQ

Query:  SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGNDAL----------------------------------------------
        SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGI LLVVYVDDIVITGNDAL                                              
Subjt:  SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGNDAL----------------------------------------------

Query:  -----RKLGAKPSGTPMMSNQQLVKEEELCKDPERYRRLVGKLNYLTVT
              KLGAKPSGTPMM NQQLVKE ELCKDPERYRRLVGKLNYLTVT
Subjt:  -----RKLGAKPSGTPMMSNQQLVKEEELCKDPERYRRLVGKLNYLTVT

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus] | e value: 8.9e-259 | % identity: 66.98
Query:  LSSDRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSFLSL------------------------------RVDKRAIAPFELVHSD
        L  DRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPS   L                              RVDKRAIAPFELVHSD
Subjt:  LSSDRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSFLSL------------------------------RVDKRAIAPFELVHSD

Query:  IWGPCPV------------------------------------------------------------------------------SSCADIPSQNGVAER
        IWGPCPV                                                                              SSCAD PSQNGVAER
Subjt:  IWGPCPV------------------------------------------------------------------------------SSCADIPSQNGVAER

Query:  KSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC---------------------------------
        K+RHLLET RALSFQMHVSK F VD +STACF INRM SSVLNGEIPYRVLFPTKHLFPIAPKIFGC                                 
Subjt:  KSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC---------------------------------

Query:  --------------------DTPFTSLPSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALR
                            DTPFTS PSS CQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALR
Subjt:  --------------------DTPFTSLPSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALR

Query:  KGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARL
        KGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHP WQNAMIEEMTALDDNGTWDLVSRPAGKKA GCKWVFAVKMNPDGT+ARLKARL
Subjt:  KGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARL

Query:  VAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQAL
        VAKGYAQIYGTDYS T+S VAKLTSIRLFL MAATNKWSLHQLDIKN F+HGDLQEEVYMEQPPGFVAQ ESDKVCRLRKSLYGLKQSPRAWFGKFSQAL
Subjt:  VAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQAL

Query:  VCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGNDAL---------------------------------------------------RKLGAKPS
        VCFGMKKSTSDHSVFYRRSEKGI LLVVYVDDIVITGNDAL                                                    KLGAKPS
Subjt:  VCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGNDAL---------------------------------------------------RKLGAKPS

Query:  GTPMMSNQQLVKEEELCKDPERYRRLVGKLNYLTVT
        GTPMM NQQLVKE ELCKDPERYRRLVGKLNYLTVT
Subjt:  GTPMMSNQQLVKEEELCKDPERYRRLVGKLNYLTVT

TrEMBL top hits | e value | % identity | Alignment
A0A438GAA6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 5.4e-193 | % identity: 39.52
Query:  LRDDARLYLQIKNSIESEIIGLVDHCESKAE------------------------------KAESVTSYFMRLKKITIELGLLLPFSLDVK---------
        ++DDARL+LQ+KNSI S+I+GL+ HCE   E                               A+S+T+YFM  KK+  EL  L+PFS DV+         
Subjt:  LRDDARLYLQIKNSIESEIIGLVDHCESKAE------------------------------KAESVTSYFMRLKKITIELGLLLPFSLDVK---------

Query:  -------------------------IPSLDNAFTRVIRIE----SSSTGVSIPQPSSAIFSKNNNHRAPQRNSIDHRKPEFVEIVCNYCRKSGHMKHDCR
                                 I SL   F+RV+R E    S  T V + +  +A  ++  N+R   R   +        IVC YC ++GH K +CR
Subjt:  -------------------------IPSLDNAFTRVIRIE----SSSTGVSIPQPSSAIFSKNNNHRAPQRNSIDHRKPEFVEIVCNYCRKSGHMKHDCR

Query:  KLLYKNSQRSQHAQIAST-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPA
        KL  +N +R Q A +A++      D     VT++A+EF+K+  YQ++L+A   STP+ S +A     CL++SS KW+IDSGAT HMTGN   FS      
Subjt:  KLLYKNSQRSQHAQIAST-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPA

Query:  PFPSVTLADGSTSSVLGSGTIHLTPSFSLSS-----------------------------------DRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVP
          P VT+ADGST  + GSGT+  T S +LSS                                   D +TK+  G+G+ S GLY+ D  V + VAC    
Subjt:  PFPSVTLADGSTSSVLGSGTIHLTPSFSLSS-----------------------------------DRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVP

Query:  SPFEVHCRLGHPSF------------------------------LSLRVDKRAIAPFELVHSDIWGPCPV------------------------------
        SP E HCRLGHPS                               L  R++KRA + FELVHSD+WGPCPV                              
Subjt:  SPFEVHCRLGHPSF------------------------------LSLRVDKRAIAPFELVHSDIWGPCPV------------------------------

Query:  ------------------------------------------------SSCADIPSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRM
                                                        +SC D PSQNGVAERK+RHLLET RAL FQM V K F  D +STACF INRM
Subjt:  ------------------------------------------------SSCADIPSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRM

Query:  SSSVLNGEIPYRVLFPTKHLFPIAPKIFGC-----------------------------------------------------DTPFTSLPSSSCQGEDD
         + VL  +IPY+V+ P K LFP+AP+IFGC                                                     DT F S P+SS   ED+
Subjt:  SSSVLNGEIPYRVLFPTKHLFPIAPKIFGC-----------------------------------------------------DTPFTSLPSSSCQGEDD

Query:  NLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISY
           +Y+V +  P            SL+   P       P++P I QVYSRR  P  +D+C P+  PSS DP+   DLPI+LRKGKR C   Y +++F+SY
Subjt:  NLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISY

Query:  HQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYT
          LS S+   + S++S S+P +V EAL+HP W+NAM+EE+ AL+DN TW LV  P GKK  GCKWVFAVK+NPDG++ARLKARLVA+GYAQ YG DYS T
Subjt:  HQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYT

Query:  YSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY
        +S VAKL S+RLF+ +AA+ +W +HQLDIKN F+HGDL+EEVY+EQPPGFVAQ E  KVCRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY
Subjt:  YSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY

Query:  RRSEKGIALLVVYVDDIVITGND---------------------------------------------------ALRKLGAKPSGTPMMSNQQLVKEE-E
        ++S  GI LLVVYVDDIVITGND                                                      K+ AKP  TPM+ N QL+ ++ +
Subjt:  RRSEKGIALLVVYVDDIVITGND---------------------------------------------------ALRKLGAKPSGTPMMSNQQLVKEE-E

Query:  LCKDPERYRRLVGKLNYLTVT
           +PERYRR+VGKLNYLTVT
Subjt:  LCKDPERYRRLVGKLNYLTVT

A0A438HEX0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 4.2e-198 | % identity: 41.62
Query:  LRDDARLYLQIKNSIESEIIGLVDHCESKAE------------------------------KAESVTSYFMRLKKITIELGLLLPFSLDVK---------
        ++DDARL+LQ+KNSI S+I+GL+ HCE   E                               A+S+T+YFM  KK+  EL  L+PFS DV+         
Subjt:  LRDDARLYLQIKNSIESEIIGLVDHCESKAE------------------------------KAESVTSYFMRLKKITIELGLLLPFSLDVK---------

Query:  -------------------------IPSLDNAFTRVIRIE----SSSTGVSIPQPSSAIFSKNNNHRAPQRNSIDHRKPEFVEIVCNYCRKSGHMKHDCR
                                 I SL   F+RV+R E    S  T V + +  +A  ++  N+R   R   +        IVC YC ++GH K + R
Subjt:  -------------------------IPSLDNAFTRVIRIE----SSSTGVSIPQPSSAIFSKNNNHRAPQRNSIDHRKPEFVEIVCNYCRKSGHMKHDCR

Query:  KLLYKNSQRSQHAQIAST-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPA
        KL  +N +R Q A +A++      D     VT++A+EF+K+  YQ++L+A   STP+ S +A     CL++SS KW+IDSGAT HMTGN   FS      
Subjt:  KLLYKNSQRSQHAQIAST-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPA

Query:  PFPSVTLADGSTSSVLGSGTIHLTPS-------------FSLSSDRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSF--------
          P VT+ADGST  + GSGT+  T S             F+L SD +TK+  G+G+ S GLY+ D  V + VAC    SP E HCRLGHPS         
Subjt:  PFPSVTLADGSTSSVLGSGTIHLTPS-------------FSLSSDRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSF--------

Query:  ----------------------LSLRVDKRAIAPFELVHSDIWGPCPV-----------------------------------------SSCADIPSQNG
                              L  R++KRA + FELVHSD+WGPCPV                                         +SC D PSQNG
Subjt:  ----------------------LSLRVDKRAIAPFELVHSDIWGPCPV-----------------------------------------SSCADIPSQNG

Query:  VAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC-----------------------------
        VAERK+RHLLET RAL FQM V K F  D +STACF INRM + VL G+IPY+V+ P K LF +AP+IFGC                             
Subjt:  VAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC-----------------------------

Query:  ------------------------DTPFTSLPSSSCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDS
                                DT F S P+SS   ED+   +Y+V +  P            SL+   P       P++P I QVYSRR  P  +D+
Subjt:  ------------------------DTPFTSLPSSSCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDS

Query:  CPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKK
        C P+  PSS DP+   DLPI+LRKGKR C   Y +++F+SY  LS S+   + S++S S+P +V EAL+HP W+NAM+EE+ AL DN TW LV  P GKK
Subjt:  CPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKK

Query:  ATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKV
          GCKWVFAVK+NPDG++ARLKARLVA+GYAQ YG DYS T+S VAKL S+RLF+ +AA+ +W +HQLDIKN F+HGDL+EEVY+EQPPGFVAQ E  KV
Subjt:  ATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKV

Query:  CRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGND------------------------------------
        CRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI LLVVYVDDIVITGND                                    
Subjt:  CRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGND------------------------------------

Query:  ---------------ALRKLGAKPSGTPMMSNQQLVKEE-ELCKDPERYRRLVGKLNYLTVT
                          K+ AKP  TPM+ N QL+ ++ +   +PERYRR+VGKLNYLTVT
Subjt:  ---------------ALRKLGAKPSGTPMMSNQQLVKEE-ELCKDPERYRRLVGKLNYLTVT

A0A438HPS2 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.2e-192 | % identity: 39.64
Query:  DDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESKAE------------------------------KAESVTSYFMRLKKITIELGL
        DDH+TE+PP D   +K W++DDARL LQ+KNSI S+I+GL  HCE   E                               A+S+T+YFM  KK+  EL  
Subjt:  DDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESKAE------------------------------KAESVTSYFMRLKKITIELGL

Query:  LLPFSLDVK----------------------------------IPSLDNAFTRVIRIE----SSSTGVSIPQPSSAIFSKNNNHRAPQRNSIDHRKPEFV
        L+PFS DV+                                  I SL   F+RV+R E    S  T V + +  +A  ++  N+R   R + ++R  +  
Subjt:  LLPFSLDVK----------------------------------IPSLDNAFTRVIRIE----SSSTGVSIPQPSSAIFSKNNNHRAPQRNSIDHRKPEFV

Query:  EIVCNYCRKSGHMKHDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGA
         IVC YC ++GH K +CRKL  +N +R Q A +A++      D  +  VT++A+EF+K+  YQ++L+A   STP+ S +A     CL++SS KW+IDSGA
Subjt:  EIVCNYCRKSGHMKHDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGA

Query:  TAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSS-----------------------------------DRVTKKIIGRGYESGG
        T HMTGN   FS        P VT+ADGST  +  SGT+  T S +LSS                                   D +TK+  G+G+ S G
Subjt:  TAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSS-----------------------------------DRVTKKIIGRGYESGG

Query:  LYLFDHQVSQAVACPVVPSPFEVHCRLGHP------------------------------SFLSLRVDKRAIAPFELVHSDIWGPCPV------------
        LY+ D  V + VAC    SP E HCRLGHP                              S L  R++KRA + FELVHSD+WG CPV            
Subjt:  LYLFDHQVSQAVACPVVPSPFEVHCRLGHP------------------------------SFLSLRVDKRAIAPFELVHSDIWGPCPV------------

Query:  ------------------------------------------------------------------SSCADIPSQNGVAERKSRHLLETTRALSFQMHVS
                                                                          +SC D PSQNGVAERK+RHLLETTRAL FQM V 
Subjt:  ------------------------------------------------------------------SSCADIPSQNGVAERKSRHLLETTRALSFQMHVS

Query:  KTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC----------------------------------------------------
        K F VD +STACF IN M + VL G+IPY+V+ P K LFP+ P+IFGC                                                    
Subjt:  KTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC----------------------------------------------------

Query:  -DTPFTSLPSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYH
         DT F S P+SS   ED+   +Y+V +           SRP + Q  S    P  +D+C P+  PSS DP+   DL I+LRKGKR C   Y +++F+SY 
Subjt:  -DTPFTSLPSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYH

Query:  QLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYTY
         LS S+   + S++S S+P +V EAL+HP W+NA++EE+ AL+DN TW LV  P GKK  GCKWVFAVK+NPDG++ARLKARLVAKGYAQ YG DYS T+
Subjt:  QLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYTY

Query:  SLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYR
        S VAKL S+RLF+ +AA+ +W +HQLDIKN F+HGDL+EEVY+EQPPGFVAQ E  KVCRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY+
Subjt:  SLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYR

Query:  RSEKGIALLVVYVDDIVITGND---------------------------------------------------ALRKLGAKPSGTPMMSNQQLVKEE-EL
        +S  GI LLVVYVDDIVITGND                                                      K+ AKP  TPM+ N QL+ ++ + 
Subjt:  RSEKGIALLVVYVDDIVITGND---------------------------------------------------ALRKLGAKPSGTPMMSNQQLVKEE-EL

Query:  CKDPERYRRLVGKLNYLTVT
          +PERYRR+VGKLNYLTVT
Subjt:  CKDPERYRRLVGKLNYLTVT

A0A5D3E5M8 Copia protein | e value: 1.9e-206 | % identity: 56.88
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSI--ESEIIGLVDHCES---KAEKAESVTSYFMRLKKITIELGLLLPFSLDV--------KIPSLDNA
        MDDHMTED P+DAK KKDWLRDDARLYLQIKNSI  + ++  + + C       +KAESVT+YFMRLKKIT EL LLLPFS DV        KIPSLDNA
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSI--ESEIIGLVDHCES---KAEKAESVTSYFMRLKKITIELGLLLPFSLDV--------KIPSLDNA

Query:  FTRVIRIESSSTGVSIPQPSSAIFSKNNNHRAPQ-------RNSIDHRKPEFVEIVCNYCRKSGHMKHDCRKLLYKNSQRSQHAQIASTCDIPEASVTIS
        FTRV+R ESS  GVSIPQ S+++ SKNNN RAP+         S DHRKP+  EIVCNYCRK  H K DCRKLLYKNSQ+SQHAQIASTCDIPEAS+TIS
Subjt:  FTRVIRIESSSTGVSIPQPSSAIFSKNNNHRAPQ-------RNSIDHRKPEFVEIVCNYCRKSGHMKHDCRKLLYKNSQRSQHAQIASTCDIPEASVTIS

Query:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSDRV
        A+E AK QNYQ+SLQASSSSTPIASTV PGN KCLLTSSTKW                                                       DRV
Subjt:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSDRV

Query:  TKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSFLSLRVDKRAIAPFELVHSDIWGPCPVSSCADIPSQNGVAERKSRHLLETTRALSF
        TKKIIG+GYESGGLYLFDHQVSQAVACPV+PSPFE                                    SSC + PSQ        R+LLET RALSF
Subjt:  TKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSFLSLRVDKRAIAPFELVHSDIWGPCPVSSCADIPSQNGVAERKSRHLLETTRALSF

Query:  QMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC-----------------------------------------------
        QMHV KTF  DV+STACF INRM SS+LNGEIPYRVLFPTK LFPI PKIFGC                                               
Subjt:  QMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC-----------------------------------------------

Query:  -----DTPFTSLPSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS
             DTPFTS PSSSC+GEDDNLFIYE+T P     T+ PPSRPL S+VYS +PP QPSDSCP SM PSSCD  PSDDLPIALRK              
Subjt:  -----DTPFTSLPSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS

Query:  YHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSY
                                  ALSHP W+NAMIEEMTALDDNGTWDLVSRP GKKA GCKWVF++K+N +GT+ R KARLVAK YAQ YG DYS 
Subjt:  YHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSY

Query:  TYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPP
        T+SLV KLTSIRLFL MAAT+ WSLHQL+IKNVF+HGDLQE+VY+EQPP
Subjt:  TYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPP

B0FBS2 Uncharacterized protein | e value: 2.2e-199 | % identity: 39.95
Query:  DDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESKAE------------------------------KAESVTSYFMRLKKITIELGL
        DDH+TE+PP D   +K W++DDARL+LQ+KNSI S+I+GL+ HCE   E                               A+S+T+YFM  KK+  EL  
Subjt:  DDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESKAE------------------------------KAESVTSYFMRLKKITIELGL

Query:  LLPFSLDVK----------------------------------IPSLDNAFTRVIRIE----SSSTGVSIPQPSSAIFSKNNNHRAPQRNSIDHRKPEFV
        L+PFS DV+                                  I SL   F+RV+R E    S  T V I +  +A  ++  N+R   R   +       
Subjt:  LLPFSLDVK----------------------------------IPSLDNAFTRVIRIE----SSSTGVSIPQPSSAIFSKNNNHRAPQRNSIDHRKPEFV

Query:  EIVCNYCRKSGHMKHDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGA
         IVC YC ++GH K +CRKL  +N +R Q A +A++      D     VT++A+EF+K+  YQ++L+A   STP+ S +A     CL++SS KW+IDSGA
Subjt:  EIVCNYCRKSGHMKHDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGA

Query:  TAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSS-----------------------------------DRVTKKIIGRGYESGG
        T HMTGN   FS        P VT+ADGST  + GSGT+  T S +LSS                                   D +TK+  G+G+ S G
Subjt:  TAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSS-----------------------------------DRVTKKIIGRGYESGG

Query:  LYLFDHQVSQAVACPVVPSPFEVHCRLGHPSF------------------------------LSLRVDKRAIAPFELVHSDIWGPCPV------------
        LY+ D  V + VAC    SP E HCRLGHPS                               L  R++KRA + FELVHSD+WGPCPV            
Subjt:  LYLFDHQVSQAVACPVVPSPFEVHCRLGHPSF------------------------------LSLRVDKRAIAPFELVHSDIWGPCPV------------

Query:  ------------------------------------------------------------------SSCADIPSQNGVAERKSRHLLETTRALSFQMHVS
                                                                          +SC D PSQNGVAERK+RHLLET RAL FQM V 
Subjt:  ------------------------------------------------------------------SSCADIPSQNGVAERKSRHLLETTRALSFQMHVS

Query:  KTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC----------------------------------------------------
        K F  D +STACF INRM + VL G+IPY+V+ P K LFP+AP+IFGC                                                    
Subjt:  KTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC----------------------------------------------------

Query:  -DTPFTSLPSSSCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALR
         DT F S P+SS   ED+   +Y+V +  P            SL+   P       P++P I QVYSRR  P  +D+C P+  PSS DP+   DLPI+LR
Subjt:  -DTPFTSLPSSSCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALR

Query:  KGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKA
        KGKR C   Y +++F+SY  LS S+   + S++S S+P +V EAL+HP W+NAM+EE+ AL+DN TW LV  P GKK  GCKWVFAVK+NPDG++ARLKA
Subjt:  KGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKA

Query:  RLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQ
        RLVA+GYAQ YG DYS T+S VAKL S+RLF+ +AA+ +W +HQLDIKN F+HGDL+EEVY+EQPPGFVAQ E  KVCRL+K+LYGLKQSPRAWFGKFS+
Subjt:  RLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQ

Query:  ALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGND---------------------------------------------------ALRKLGAK
         +  FGM KS  DHSVFY++S  GI LLVVYVDDIVITGND                                                      K+ AK
Subjt:  ALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGND---------------------------------------------------ALRKLGAK

Query:  PSGTPMMSNQQLVKEE-ELCKDPERYRRLVGKLNYLTVT
        P  TPM+ N QL+ ++ +   +PERYRR+VGKLNYLTVT
Subjt:  PSGTPMMSNQQLVKEE-ELCKDPERYRRLVGKLNYLTVT

SwissProt top hits (columns: e value, % identity, alignment)
P04146 Copia protein (e value: 1.2e-35; identity: 34.73%)
Query:  DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLES--TSIPNSVHEAL---SHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWV
        +P  +D + I  R+ +R  T P    ISY++   S    + +  +    +PNS  E         W+ A+  E+ A   N TW +  RP  K     +WV
Subjt:  DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLES--TSIPNSVHEAL---SHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWV

Query:  FAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSL
        F+VK N  G   R KARLVA+G+ Q Y  DY  T++ VA+++S R  L +       +HQ+D+K  F++G L+EE+YM  P G      SD VC+L K++
Subjt:  FAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSL

Query:  YGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY--RRSEKGIALLVVYVDDIVITGNDALR
        YGLKQ+ R WF  F QAL       S+ D  ++   + +      +++YVDD+VI   D  R
Subjt:  YGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY--RRSEKGIALLVVYVDDIVITGNDALR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 8.0e-53; identity: 34.84%)
Query:  PSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGCDTPFTSLP----------SSSC--
        P  NGVAER +R ++E  R++     + K+F  + + TAC+ INR  S  L  EIP RV +  K +     K+FGC   F  +P          S  C  
Subjt:  PSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGCDTPFTSLP----------SSSC--

Query:  QGEDDNLFIYEVTSPTPS----------LSTDVPPSRPLISQVYSRRPP---PQPSDSCPPSMLPSSCD--------PAPSDDLPIALRKGKRKCTYPVS
         G  D  F Y +  P               ++V  +  +  +V +   P     PS S  P+   S+ D        P    +    L +G  +  +P  
Subjt:  QGEDDNLFIYEVTSPTPS----------LSTDVPPSRPLISQVYSRRPP---PQPSDSCPPSMLPSSCD--------PAPSDDLPIALRKGKRKCTYPVS

Query:  SFISYHQLSPSTYAFITSLESTSI----------PNSVHEALSHP---RWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKA
            +  L  S    + S    S           P S+ E LSHP   +   AM EEM +L  NGT+ LV  P GK+   CKWVF +K + D  + R KA
Subjt:  SFISYHQLSPSTYAFITSLESTSI----------PNSVHEALSHP---RWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKA

Query:  RLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQ
        RLV KG+ Q  G D+   +S V K+TSIR  L +AA+    + QLD+K  F+HGDL+EE+YMEQP GF    +   VC+L KSLYGLKQ+PR W+ KF  
Subjt:  RLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQ

Query:  ALVCFGMKKSTSDHSVFYRR-SEKGIALLVVYVDDIVITGND
         +      K+ SD  V+++R SE    +L++YVDD++I G D
Subjt:  ALVCFGMKKSTSDHSVFYRR-SEKGIALLVVYVDDIVITGND

P92520 Uncharacterized mitochondrial protein AtMg00820 (e value: 9.3e-17; identity: 44.44%)
Query:  HQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYT
        ++L+P  Y+   +      P SV  AL  P W  AM EE+ AL  N TW LV  P  +   GCKWVF  K++ DGT+ RLKARLVAKG+ Q  G  +  T
Subjt:  HQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYT

Query:  YSLVAKLTSIRLFLFMA
        YS V +  +IR  L +A
Subjt:  YSLVAKLTSIRLFLFMA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.9e-50; identity: 30.02%)
Query:  PSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGCD-----TPFTS------------L
        P  NG++ERK RH++ET   L     + KT+     + A + INR+ + +L  E P++ LF T   +    ++FGC       P+              L
Subjt:  PSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGCD-----TPFTS------------L

Query:  PSSSCQGE-------------------DDNLFIYE------------------VTSPTPSLSTDVP--PSRPLISQVYSRRPPPQPS-------------
          S  Q                     D+N F +                   V SP  +L T  P  P+       ++  PP  PS             
Subjt:  PSSSCQGE-------------------DDNLFIYE------------------VTSPTPSLSTDVP--PSRPLISQVYSRRPPPQPS-------------

Query:  DSCPPSMLPSSCDPA------------------------------PSDDLPIALRK------------------GKRKCTYPVSSFISYHQLSP------
        DS   S  PSS +P                               P+++ P  L +                       T P    I  H   P      
Subjt:  DSCPPSMLPSSCDPA------------------------------PSDDLPIALRK------------------GKRKCTYPVSSFISYHQLSP------

Query:  ------------------------STYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGK-KATGCKWVFAVKMNPDGTMARL
                                  Y+   SL + S P +  +AL   RW+NAM  E+ A   N TWDLV  P       GC+W+F  K N DG++ R 
Subjt:  ------------------------STYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGK-KATGCKWVFAVKMNPDGTMARL

Query:  KARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKF
        KARLVAKGY Q  G DY+ T+S V K TSIR+ L +A    W + QLD+ N F+ G L ++VYM QPPGF+ +   + VC+LRK+LYGLKQ+PRAW+ + 
Subjt:  KARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKF

Query:  SQALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGND
           L+  G   S SD S+F  +  K I  ++VYVDDI+ITGND
Subjt:  SQALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGND

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 7.2e-54; identity: 34.44%)
Query:  DTPFTSLPSSSCQGEDDNLFIYEVTS---PTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYH
        + P  + PS +   ++  L    ++S   PTPS S   P S    S   +  PP  P    PP +  ++  P  +  +    + G RK            
Subjt:  DTPFTSLPSSSCQGEDDNLFIYEVTS---PTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYH

Query:  QLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLV-SRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYT
              Y++ TSL + S P +  +A+   RW+ AM  E+ A   N TWDLV   P      GC+W+F  K N DG++ R KARLVAKGY Q  G DY+ T
Subjt:  QLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLV-SRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYT

Query:  YSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY
        +S V K TSIR+ L +A    W + QLD+ N F+ G L +EVYM QPPGFV +   D VCRLRK++YGLKQ+PRAW+ +    L+  G   S SD S+F 
Subjt:  YSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY

Query:  RRSEKGIALLVVYVDDIVITGNDAL---------------------------------------------------RKLGAKPSGTPMMSNQQL-VKEEE
         +  + I  ++VYVDDI+ITGND +                                                     L AKP  TPM ++ +L +    
Subjt:  RRSEKGIALLVVYVDDIVITGNDAL---------------------------------------------------RKLGAKPSGTPMMSNQQL-VKEEE

Query:  LCKDPERYRRLVGKLNYLTVT
           DP  YR +VG L YL  T
Subjt:  LCKDPERYRRLVGKLNYLTVT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.2e-03; identity: 34.21%)
Query:  PSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC
        P  NG++ERK RH++E    L     V KT+     S A + INR+ + +L  + P++ LF     +    K+FGC
Subjt:  PSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGC

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 3.0e-55; identity: 37.13%)
Query:  YPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQ
        + +S F+SY ++SP  ++F+  +     P++ +EA     W  AM +E+ A++   TW++ + P  KK  GCKWV+ +K N DGT+ R KARLVAKGY Q
Subjt:  YPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQ

Query:  IYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVA-QWES---DKVCRLRKSLYGLKQSPRAWFGKFSQALVCF
          G D+  T+S V KLTS++L L ++A   ++LHQLDI N F++GDL EE+YM+ PPG+ A Q +S   + VC L+KS+YGLKQ+ R WF KFS  L+ F
Subjt:  IYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKNVFVHGDLQEEVYMEQPPGFVA-QWES---DKVCRLRKSLYGLKQSPRAWFGKFSQALVCF

Query:  GMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGN---------------------------------------------------DALRKLGAKPSGTP
        G  +S SDH+ F + +      ++VYVDDI+I  N                                                   D    LG KPS  P
Subjt:  GMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGN---------------------------------------------------DALRKLGAKPSGTP

Query:  MMSNQQL-VKEEELCKDPERYRRLVGKLNYLTVT
        M  +            D + YRRL+G+L YL +T
Subjt:  MMSNQQL-VKEEELCKDPERYRRLVGKLNYLTVT

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 6.6e-18; identity: 44.44%)
Query:  HQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYT
        ++L+P  Y+   +      P SV  AL  P W  AM EE+ AL  N TW LV  P  +   GCKWVF  K++ DGT+ RLKARLVAKG+ Q  G  +  T
Subjt:  HQLSPSTYAFITSLESTSIPNSVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYT

Query:  YSLVAKLTSIRLFLFMA
        YS V +  +IR  L +A
Subjt:  YSLVAKLTSIRLFLFMA


Sequences
CDS sequence
ATGGATGATCATATGACTGAAGATCCCCCAAAAGATGCAAAGCAGAAGAAGGATTGGCTTCGTGATGATGCCCGTTTATATCTTCAGATCAAGAATTCAATTGAGAGTGA
GATAATTGGATTGGTTGATCACTGTGAGTCTAAAGCTGAGAAAGCTGAGTCTGTCACCAGCTACTTTATGCGGCTTAAGAAGATCACTATCGAGCTTGGCTTGTTATTAC
CTTTTAGTCTTGATGTTAAAATTCCATCATTAGATAATGCCTTCACTCGCGTCATTCGCATTGAAAGCTCTTCGACTGGTGTGTCTATTCCTCAACCCAGTAGTGCTATC
TTTAGCAAGAACAATAACCATCGGGCACCTCAGAGGAATAGTATTGATCATCGAAAACCAGAGTTTGTAGAGATTGTTTGTAACTACTGTCGTAAGTCAGGCCATATGAA
ACATGATTGTCGGAAATTGCTATATAAGAATAGTCAACGATCTCAACATGCTCAGATAGCCTCCACATGCGATATACCAGAGGCGTCAGTTACTATTTCTGCAGATGAGT
TTGCTAAGTTTCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGTAATATAAAGTGTCTTCTTACATCATCTACC
AAATGGGTCATAGACTCTGGTGCCACAGCTCATATGACAGGTAATTCTCACCTATTTTCTAGACCGTTGTCCCCTGCCCCTTTCCCATCTGTTACATTGGCCGATGGCTC
CACATCTTCTGTTCTTGGCTCTGGCACTATTCACCTTACCCCATCATTTTCTCTCTCTTCTGATCGTGTGACGAAGAAGATTATTGGTAGAGGATATGAGTCAGGAGGCC
TTTATCTCTTTGATCATCAAGTATCGCAAGCTGTGGCGTGTCCTGTCGTTCCCTCTCCTTTTGAAGTCCATTGTCGTTTAGGTCATCCATCTTTTTTGAGTCTTCGAGTC
GATAAACGAGCAATTGCTCCATTTGAGTTAGTTCATTCTGATATTTGGGGTCCGTGTCCAGTTTCTTCCTGTGCTGACATTCCATCTCAAAATGGTGTTGCAGAGCGGAA
AAGTAGGCATTTACTTGAAACTACCCGTGCTTTATCGTTTCAAATGCATGTTTCAAAAACCTTTTGCGTGGACGTTATCTCTACAGCTTGTTTTTTTATTAATAGAATGT
CTTCCTCTGTTCTTAATGGTGAGATTCCCTATCGTGTTCTTTTTCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGATACACCATTTACTTCATTA
CCATCGAGTTCGTGTCAGGGGGAGGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTTGTCTACTGATGTGCCTCCTTCCCGCCCGTTGATTTCTCA
AGTCTACTCCCGACGACCTCCACCACAACCTTCAGACTCATGTCCTCCATCAATGCTTCCTTCATCATGTGATCCAGCGCCAAGTGATGATCTTCCCATTGCTCTTCGCA
AAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAGTCCACATCTATTCCTAAC
TCTGTTCATGAAGCTTTGTCTCATCCTCGCTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCTCGTCCTGCAGGAAA
GAAGGCCACTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGAACAATGGCTCGATTAAAGGCTCGCCTTGTTGCCAAAGGTTATGCTCAAATCTATGGAA
CTGATTATTCATATACATACTCTCTGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTTCATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTGACATTAAGAAT
GTTTTTGTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGTGGGAGAGTGATAAAGTATGTCGCCTTCGAAAATCTCTGTATGG
TTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATCTGATCATTCAGTTTTCTATCGCCGATCTGAGA
AGGGCATAGCTCTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAATGATGCATTGAGAAAATTAGGCGCCAAACCAAGTGGCACTCCAATGATGTCAAATCAG
CAACTTGTTAAAGAAGAAGAATTATGTAAAGATCCTGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAACAGTGACTTGA
mRNA sequence
ATGGATGATCATATGACTGAAGATCCCCCAAAAGATGCAAAGCAGAAGAAGGATTGGCTTCGTGATGATGCCCGTTTATATCTTCAGATCAAGAATTCAATTGAGAGTGA
GATAATTGGATTGGTTGATCACTGTGAGTCTAAAGCTGAGAAAGCTGAGTCTGTCACCAGCTACTTTATGCGGCTTAAGAAGATCACTATCGAGCTTGGCTTGTTATTAC
CTTTTAGTCTTGATGTTAAAATTCCATCATTAGATAATGCCTTCACTCGCGTCATTCGCATTGAAAGCTCTTCGACTGGTGTGTCTATTCCTCAACCCAGTAGTGCTATC
TTTAGCAAGAACAATAACCATCGGGCACCTCAGAGGAATAGTATTGATCATCGAAAACCAGAGTTTGTAGAGATTGTTTGTAACTACTGTCGTAAGTCAGGCCATATGAA
ACATGATTGTCGGAAATTGCTATATAAGAATAGTCAACGATCTCAACATGCTCAGATAGCCTCCACATGCGATATACCAGAGGCGTCAGTTACTATTTCTGCAGATGAGT
TTGCTAAGTTTCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGTAATATAAAGTGTCTTCTTACATCATCTACC
AAATGGGTCATAGACTCTGGTGCCACAGCTCATATGACAGGTAATTCTCACCTATTTTCTAGACCGTTGTCCCCTGCCCCTTTCCCATCTGTTACATTGGCCGATGGCTC
CACATCTTCTGTTCTTGGCTCTGGCACTATTCACCTTACCCCATCATTTTCTCTCTCTTCTGATCGTGTGACGAAGAAGATTATTGGTAGAGGATATGAGTCAGGAGGCC
TTTATCTCTTTGATCATCAAGTATCGCAAGCTGTGGCGTGTCCTGTCGTTCCCTCTCCTTTTGAAGTCCATTGTCGTTTAGGTCATCCATCTTTTTTGAGTCTTCGAGTC
GATAAACGAGCAATTGCTCCATTTGAGTTAGTTCATTCTGATATTTGGGGTCCGTGTCCAGTTTCTTCCTGTGCTGACATTCCATCTCAAAATGGTGTTGCAGAGCGGAA
AAGTAGGCATTTACTTGAAACTACCCGTGCTTTATCGTTTCAAATGCATGTTTCAAAAACCTTTTGCGTGGACGTTATCTCTACAGCTTGTTTTTTTATTAATAGAATGT
CTTCCTCTGTTCTTAATGGTGAGATTCCCTATCGTGTTCTTTTTCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGATACACCATTTACTTCATTA
CCATCGAGTTCGTGTCAGGGGGAGGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTTGTCTACTGATGTGCCTCCTTCCCGCCCGTTGATTTCTCA
AGTCTACTCCCGACGACCTCCACCACAACCTTCAGACTCATGTCCTCCATCAATGCTTCCTTCATCATGTGATCCAGCGCCAAGTGATGATCTTCCCATTGCTCTTCGCA
AAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAGTCCACATCTATTCCTAAC
TCTGTTCATGAAGCTTTGTCTCATCCTCGCTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCTCGTCCTGCAGGAAA
GAAGGCCACTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGAACAATGGCTCGATTAAAGGCTCGCCTTGTTGCCAAAGGTTATGCTCAAATCTATGGAA
CTGATTATTCATATACATACTCTCTGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTTCATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTGACATTAAGAAT
GTTTTTGTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGTGGGAGAGTGATAAAGTATGTCGCCTTCGAAAATCTCTGTATGG
TTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATCTGATCATTCAGTTTTCTATCGCCGATCTGAGA
AGGGCATAGCTCTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAATGATGCATTGAGAAAATTAGGCGCCAAACCAAGTGGCACTCCAATGATGTCAAATCAG
CAACTTGTTAAAGAAGAAGAATTATGTAAAGATCCTGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAACAGTGACTTGA
Protein sequence
MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESKAEKAESVTSYFMRLKKITIELGLLLPFSLDVKIPSLDNAFTRVIRIESSSTGVSIPQPSSAI
FSKNNNHRAPQRNSIDHRKPEFVEIVCNYCRKSGHMKHDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSST
KWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSDRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHCRLGHPSFLSLRV
DKRAIAPFELVHSDIWGPCPVSSCADIPSQNGVAERKSRHLLETTRALSFQMHVSKTFCVDVISTACFFINRMSSSVLNGEIPYRVLFPTKHLFPIAPKIFGCDTPFTSL
PSSSCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPN
SVHEALSHPRWQNAMIEEMTALDDNGTWDLVSRPAGKKATGCKWVFAVKMNPDGTMARLKARLVAKGYAQIYGTDYSYTYSLVAKLTSIRLFLFMAATNKWSLHQLDIKN
VFVHGDLQEEVYMEQPPGFVAQWESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIALLVVYVDDIVITGNDALRKLGAKPSGTPMMSNQ
QLVKEEELCKDPERYRRLVGKLNYLTVT