; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G13740 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G13740
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr5:13693652..13695381
RNA-Seq ExpressionCSPI05G13740
SyntenyCSPI05G13740
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0000325 - plant-type vacuole (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]4.3e-31097.24Show/hide
Query:  SDSKIPPLDDAFTRVLRIVESSPTGVSIPQPSSALFGKNNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIASTCDIPEA
        SDSKIP LDDAFTRVLRI ESSPT VSIPQPSSALF KNNNP+APQRNSTDH+KPESVEIV NYCRK GH+KRDCRKLLYKNSQRSQHAQIASTCDIPEA
Subjt:  SDSKIPPLDDAFTRVLRIVESSPTGVSIPQPSSALFGKNNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIASTCDIPEA

Query:  SVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSL
        SVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSL
Subjt:  SVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSL

Query:  SSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSS
        SSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQ VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSS
Subjt:  SSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSS

Query:  LNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAG
        LNCDSCQFAKFHRLSSSPRVDKRAIAPFEL H DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAG
Subjt:  LNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAG

Query:  EYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFG
        EYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSK FWVD VSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFG
Subjt:  EYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFG

Query:  CVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY
        CVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQK YRCYCPTLKRY
Subjt:  CVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]9.9e-18097.73Show/hide
Query:  FQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DI
        F+DRVTKKIIGRGYESGGLYLFDHQVSQ VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFEL H DI
Subjt:  FQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DI

Query:  WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERK
        WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERK
Subjt:  WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERK

Query:  NRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYR
        NRHLLETARALSFQMHVSK FWVD VSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQK YR
Subjt:  NRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYR

Query:  CYCPTLKRY
        CYCPTLKRY
Subjt:  CYCPTLKRY

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]9.9e-18097.73Show/hide
Query:  FQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DI
        F+DRVTKKIIGRGYESGGLYLFDHQVSQ VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFEL H DI
Subjt:  FQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DI

Query:  WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERK
        WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERK
Subjt:  WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERK

Query:  NRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYR
        NRHLLETARALSFQMHVSK FWVD VSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQK YR
Subjt:  NRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYR

Query:  CYCPTLKRY
        CYCPTLKRY
Subjt:  CYCPTLKRY

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]9.9e-18097.73Show/hide
Query:  FQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DI
        F+DRVTKKIIGRGYESGGLYLFDHQVSQ VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFEL H DI
Subjt:  FQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DI

Query:  WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERK
        WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERK
Subjt:  WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERK

Query:  NRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYR
        NRHLLETARALSFQMHVSK FWVD VSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQK YR
Subjt:  NRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYR

Query:  CYCPTLKRY
        CYCPTLKRY
Subjt:  CYCPTLKRY

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]1.7e-17998.05Show/hide
Query:  QDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIW
        QDRVTKKIIGRGYESGGLYLFDHQVSQ VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFEL H DIW
Subjt:  QDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIW

Query:  GPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKN
        GPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKN
Subjt:  GPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKN

Query:  RHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRC
        RHLLETARALSFQMHVSK FWVD VSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQK YRC
Subjt:  RHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRC

Query:  YCPTLKRY
        YCPTLKRY
Subjt:  YCPTLKRY

TrEMBL top hitse value%identityAlignment
A0A438DZQ8 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-16755.58Show/hide
Query:  SDSKIPPLDDAFTRVLRIVESSPTGVSIPQPSSALFGKNNNPQAPQR-------NSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIAS
        S S I  L + F+RVLR        VS  Q ++ L  K  N +  +R        + ++   +S  IV  YC ++GH K++CRKL  +N +R Q   +A+
Subjt:  SDSKIPPLDDAFTRVLRIVESSPTGVSIPQPSSALFGKNNNPQAPQR-------NSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIAS

Query:  T-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLG
        +      D     VT++A EF+K+  YQ++L+A   STP+ S +A     CL++SS KW+IDSGAT HMTGN   FS        P VT+ADGST  + G
Subjt:  T-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLG

Query:  SGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVL
        SGT+  T S +LSSVL+LPNL+FNLIS S+LT +LNC V FF  +C+FQD +TK+  G+G+ S GLY+ D  V + VAC    SP E HCRLGHPSL VL
Subjt:  SGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVL

Query:  KKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFN
        KKL P+F +L SL+C+SC FAK HR S  PR++KR  + FEL H D+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++
Subjt:  KKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFN

Query:  VSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPT
        VS+K LR+DN  EY S+S  +Y+  NGI+HQ+SC DTPSQNGVAERKNRHLLETARAL FQM V K FW D VSTACFLINRMP+ VL G+IPY+ + P 
Subjt:  VSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPT

Query:  KHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY
        K LFP+AP+IFGC C+VRD RP  TKLDPK+L+C+FLGYSR+QK YRC+ P L +Y
Subjt:  KHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY

A0A438GAA6 Retrovirus-related Pol polyprotein from transposon TNT 1-949.4e-16855.8Show/hide
Query:  SDSKIPPLDDAFTRVLR---IVESSPTGVSIPQPSSALFGKNNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIAST---
        S S I  L + F+RVLR   +  S  T V + +  +A   +  N +   R   +     +  IV  YC ++GH K++CRKL  +N +R Q A +A++   
Subjt:  SDSKIPPLDDAFTRVLR---IVESSPTGVSIPQPSSALFGKNNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIAST---

Query:  --CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTI
           D     VT++A+EF+K+  YQ++L+A   STP+ S +A     CL++SS KW+IDSGAT HMTGN   FS        P VT+ADGST  + GSGT+
Subjt:  --CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTI

Query:  HLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLY
          T S +LSSVL+LPNL+FNLIS S+LT +LNC V FF  +C+FQD +TK+  G+G+ S GLY+ D  V + VAC    SP E HCRLGHPSL VLKKL 
Subjt:  HLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLY

Query:  PEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIK
        P+F +L SL+C+SC FAK HR S  PR++KRA + FEL H D+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K
Subjt:  PEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIK

Query:  TLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLF
         LR+DN  EY S+S  +Y+  NGI+HQ+SC DTPSQNGVAERKNRHLLETARAL FQM V K FW D VSTACFLINRMP+ VL  +IPY+V+ P K LF
Subjt:  TLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLF

Query:  PIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY
        P+AP+IFGC C+VRD RP   KLDPK+L+C+FLGYSR+QK YRC+ P L +Y
Subjt:  PIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY

A0A438H537 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-16655.76Show/hide
Query:  SDSKIPPLDDAFTRVLRIVESSPTGVSIPQPSSALFGKNNNPQAPQR-------NSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIAS
        S S I  L + F+RVLR        VS  Q ++ L  K  N +  +R        + +++  +S  IV  Y  ++GH K++CRKL  +N +R Q A +A+
Subjt:  SDSKIPPLDDAFTRVLRIVESSPTGVSIPQPSSALFGKNNNPQAPQR-------NSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIAS

Query:  T-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLG
        +      D     VT++A+EF+K+  YQ++L+A   STP+ + V  G   CL++SS KW+IDSGAT HMTGN   FS        P VT+ADGST  + G
Subjt:  T-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLG

Query:  SGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVL
        SGT+  T S +LSSVL+LPNL+FNLIS S+LT DLNC V FF  +C+FQD +TK+  G+G+ S GLY+ D  V + VAC    SP E HCRLGHPSL VL
Subjt:  SGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVL

Query:  KKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFN
        KKL P+F +L SL+C+SC FAK HR S  PR++KRA + FEL H D+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++
Subjt:  KKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFN

Query:  VSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPT
        VS+K LR+DN  EY S+S  +Y+  NGI+HQ+SC DTPSQNGVAERKNRHLLETARAL FQM V K FW D VS ACFLINRMP+ VL G+I Y+V+ P 
Subjt:  VSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPT

Query:  KHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY
        K LFP+AP+IFGC C+VRD RP  TKLDPK+L+C+FLGYSR+QK YRC+ P L +Y
Subjt:  KHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY

A0A438IRR9 Retrovirus-related Pol polyprotein from transposon TNT 1-944.2e-16855.98Show/hide
Query:  SDSKIPPLDDAFTRVLR---IVESSPTGVSIPQPSSALFGKNNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIAST---
        S S I  L + F+RVLR   +  S  T V + +  +A   +  N +   R   +     S  IV  YC ++GH K++CRKL  +N +R Q A +A++   
Subjt:  SDSKIPPLDDAFTRVLR---IVESSPTGVSIPQPSSALFGKNNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIAST---

Query:  --CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTI
           D     VT++A+EF+K+  YQ++L+A   STP+ S +A     CL++SS KW+IDSGAT HMTGN   FS        P VT+ADGST  + GSGT+
Subjt:  --CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTI

Query:  HLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLY
          T S +LSSVL+LPNL+FNLIS S+LT +LN  V FF  +C+FQD +TK+  G+G+ S GLY+ D  V + VAC    SP E HC+LGHPSL VLKKL 
Subjt:  HLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLY

Query:  PEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIK
        P+F +L SL+C+SC FAK HR S  PR++KRA + FEL H D+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K
Subjt:  PEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIK

Query:  TLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLF
         LR+DN  EY S+S  +Y+ +NGI+HQ+SC DTPSQNGVAERKNRHLLETARAL FQM V K FW D VSTACFLINRMP+ VL G+IPY+V+ P K LF
Subjt:  TLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLF

Query:  PIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY
        P+AP+IFGC C+VRD RP  TKLDPK+L+C+FLGYSR+QK YRC+ P L +Y
Subjt:  PIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY

B0FBS2 Uncharacterized protein1.0e-16956.52Show/hide
Query:  SDSKIPPLDDAFTRVLR---IVESSPTGVSIPQPSSALFGKNNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIAST---
        S S I  L + F+RVLR   +  S  T V I +  +A   +  N +   R   +     S  IV  YC ++GH K++CRKL  +N +R Q A +A++   
Subjt:  SDSKIPPLDDAFTRVLR---IVESSPTGVSIPQPSSALFGKNNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIAST---

Query:  --CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTI
           D     VT++A+EF+K+  YQ++L+A   STP+ S +A     CL++SS KW+IDSGAT HMTGN   FS        P VT+ADGST  + GSGT+
Subjt:  --CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTI

Query:  HLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLY
          T S +LSSVL+LPNL+FNLIS S+LT +LNC V FF  +C+FQD +TK+  G+G+ S GLY+ D  V + VAC    SP E HCRLGHPSL VLKKL 
Subjt:  HLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLY

Query:  PEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIK
        P+F +L SL+C+SC FAK HR S  PR++KRA + FEL H D+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K
Subjt:  PEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIK

Query:  TLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLF
         LR+DN  EY S+S  +Y+  NGI+HQ+SC DTPSQNGVAERKNRHLLETARAL FQM V K FW D VSTACFLINRMP+ VL G+IPY+V+ P K LF
Subjt:  TLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLF

Query:  PIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY
        P+AP+IFGC C+VRD RP  TKLDPK+L+C+FLGYSR+QK YRC+ P L +Y
Subjt:  PIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY

SwissProt top hitse value%identityAlignment
O13527 Truncated transposon Ty1-A Gag-Pol polyprotein2.7e-1825Show/hide
Query:  VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKF--HRLSSSPRVD-KRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLT
        + C  +P   ++   L + ++    +   ++ S     C  C   K   HR     R+  + +  PF+  H DI+GP   + ++   YF++F D+ ++L 
Subjt:  VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKF--HRLSSSPRVD-KRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLT

Query:  WLYLMKNRSE--LLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVD
        W+Y + +R E  +L  F      IKNQF  S+  ++ D   EY + +L  +L +NGI    +       +GVAER NR LL+  R       +    W  
Subjt:  WLYLMKNRSE--LLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVD

Query:  VVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKR
         +  +  + N + S                 +  + P  FG    V D  P ++K+ P+ +    L  SR    Y  Y P+LK+
Subjt:  VVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKR

P04146 Copia protein5.3e-2725.43Show/hide
Query:  SALFGKNNNPQAPQRNSTDHQKPESV-------EIVRNYCRKSGHIKRDC----RKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQA
        +A+   NNN            KP+ +       ++  ++C + GHIK+DC    R L  KN +  +  Q A                             
Subjt:  SALFGKNNNPQAPQRNSTDHQKPESV-------EIVRNYCRKSGHIKRDC----RKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQA

Query:  SSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLA-DGSTSSVLGSGTIHL--TPSFSLSSVLHLPNLSFNLISTSQ
          +S  IA  V   N   ++  +  +V+DSGA+ H+  +  L++  +   P   + +A  G        G + L      +L  VL     + NL+S  +
Subjt:  SSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLA-DGSTSSVLGSGTIHL--TPSFSLSSVLHLPNLSFNLISTSQ

Query:  LTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEV-HCRLGHPSLFVL-----KKLYPEFRSLSSLN-----CDSCQ
        L      +    SG  + ++ +   ++        + + + Q   + A     + F + H R GH S   L     K ++ +   L++L      C+ C 
Subjt:  LTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEV-HCRLGHPSLFVL-----KKLYPEFRSLSSLN-----CDSCQ

Query:  FAKFHRLSSSPRVDKRAI-APFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHS
          K  RL      DK  I  P  + H D+ GP   V+     YFV FVD  +     YL+K +S++ S F  F  + +  FN+ +  L  DN  EY S+ 
Subjt:  FAKFHRLSSSPRVDKRAI-APFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHS

Query:  LGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVL--NGEIPYRVLFPTKHLFPIAPKIFGCVCF
        +  +  + GI +  +   TP  NGV+ER  R + E AR +     + K+FW + V TA +LINR+PS  L  + + PY  ++  K  +    ++FG   +
Subjt:  LGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVL--NGEIPYRVLFPTKHLFPIAPKIFGCVCF

Query:  VRDVRPHHTKLDPKSLKCIFLGY
        V  ++    K D KS K IF+GY
Subjt:  VRDVRPHHTKLDPKSLKCIFLGY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.5e-5630.75Show/hide
Query:  NNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIK
        NN  ++  R  + ++    V    N C + GH KRDC      N ++ +        D   A++  + D    F N +E     S               
Subjt:  NNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIK

Query:  CLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFS----LSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYC
              ++WV+D+ A+ H T    LF R ++   F +V + + S S + G G I +  +      L  V H+P+L  NLIS   L  D      + S + 
Subjt:  CLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFS----LSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYC

Query:  LFQDRVTK--KIIGRGYESGGLYLFDHQVSQ--VVACPVVPSPFEVHCRLGHPS-----LFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAI
          + R+TK   +I +G   G LY  + ++ Q  + A     S    H R+GH S     +   K L    +  +   CD C F K HR+S     +++  
Subjt:  LFQDRVTK--KIIGRGYESGGLYLFDHQVSQ--VVACPVVPSPFEVHCRLGHPS-----LFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAI

Query:  APFELAHDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTP
            +  D+ GP  + S  G +YFVTF+DD SR  W+Y++K + ++   F  FH  ++ +    +K LR+DN GEY S     Y   +GI H+ +   TP
Subjt:  APFELAHDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTP

Query:  SQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLG
          NGVAER NR ++E  R++     + K+FW + V TAC+LINR PS  L  EIP RV +  K +     K+FGC  F    +   TKLD KS+ CIF+G
Subjt:  SQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLG

Query:  YSRVQKDYRCYCPTLKR
        Y   +  YR + P  K+
Subjt:  YSRVQKDYRCYCPTLKR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-5332.37Show/hide
Query:  SADEFAKFQNYQESLQASSSSTPIASTVAPGNIKC-LLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPS---FSL
        SA   ++ Q++  S+ +    +P        N+      SS  W++DSGAT H+T + +  S          V +ADGST  +  +G+  L+      +L
Subjt:  SADEFAKFQNYQESLQASSSSTPIASTVAPGNIKC-LLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPS---FSL

Query:  SSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEV----HCRLGHPSLFVLKKLYPEFR
         ++L++PN+  NLIS  +L +     V FF      +D  T   + +G     LY +    SQ V+    PS        H RLGHP+  +L  +   + 
Subjt:  SSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEV----HCRLGHPSLFVLKKLYPEFR

Query:  SLSSLN-------CDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNV
        SLS LN       C  C   K +++  S +    +  P E  + D+W   P++S   +RY+V FVD  +R TWLY +K +S++   F  F   ++N+F  
Subjt:  SLSSLN-------CDSCQFAKFHRLSSSPRVDKRAIAPFELAH-DIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNV

Query:  SIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTK
         I T  +DN GE+   +L  Y  ++GI H +S   TP  NG++ERK+RH++ET   L     + KT+W    + A +LINR+P+ +L  E P++ LF T 
Subjt:  SIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTK

Query:  HLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKDYRC
          +    ++FGC C+   +RP++  KLD KS +C+FLGYS  Q  Y C
Subjt:  HLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKDYRC

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.8e-5230.09Show/hide
Query:  IPQPSSALFGKNNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTP
        +P  ++ +  +N N    Q N  D+         RNY   +              S   Q       C I       SA    +   +Q +     S++P
Subjt:  IPQPSSALFGKNNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTP

Query:  IASTVAPGNIKCLLT-SSTKWVIDSGATAHMTG--NSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHL---TPSFSLSSVLHLPNLSFNLISTSQLTH
                N+      ++  W++DSGAT H+T   N+  F +P +      V +ADGST  +  +G+  L   + S  L+ VL++PN+  NLIS  +L +
Subjt:  IASTVAPGNIKCLLT-SSTKWVIDSGATAHMTG--NSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHL---TPSFSLSSVLHLPNLSFNLISTSQLTH

Query:  DLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVP----SPFEVHCRLGHPSLFVLKKLYPEFR------SLSSLNCDSCQFAKF
             V FF      +D  T   + +G     LY +    SQ V+    P    +    H RLGHPSL +L  +           S   L+C  C   K 
Subjt:  DLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVP----SPFEVHCRLGHPSLFVLKKLYPEFR------SLSSLNCDSCQFAKF

Query:  HRLSSSPRVDKRAIAPFELAHDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLC
        H++  S      +     +  D+W   P++S   +RY+V FVD  +R TWLY +K +S++   F  F + ++N+F   I TL +DN GE+    L  YL 
Subjt:  HRLSSSPRVDKRAIAPFELAHDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLC

Query:  ENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHH
        ++GI H +S   TP  NG++ERK+RH++E    L     V KT+W    S A +LINR+P+ +L  + P++ LF     +    K+FGC C+   +RP++
Subjt:  ENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHH

Query:  T-KLDPKSLKCIFLGYSRVQKDYRC-YCPTLKRYT
          KL+ KS +C F+GYS  Q  Y C + PT + YT
Subjt:  T-KLDPKSLKCIFLGYSRVQKDYRC-YCPTLKRYT

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.6e-0530.23Show/hide
Query:  NRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLK
        NR ++E  R++  +  + KTF  D  +TA  +IN+ PS+ +N  +P  V F +   +    + FGCV +   +     KL P++ K
Subjt:  NRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTCTTACCTGAATTTCTCTGACTCCAAGATTCCACCATTAGATGATGCCTTCACTCGCGTCCTTCGCATTGTTGAAAGCTCTCCGACTGGTGTGTCTATTCCTCA
ACCCAGTAGTGCTCTCTTTGGCAAGAACAATAACCCTCAGGCACCTCAGAGGAATAGTACTGATCATCAAAAACCAGAGTCTGTAGAGATTGTTCGTAACTACTGTCGTA
AGTCAGGCCATATAAAACGTGATTGTCGGAAATTGCTATATAAGAATAGTCAACGATCTCAACATGCTCAGATAGCCTCCACATGCGATATACCAGAGGCGTCAGTTACT
ATTTCTGCAGATGAGTTTGCTAAGTTTCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGTAATATAAAGTGTCT
TCTTACATCATCTACCAAATGGGTCATAGACTCTGGTGCCACAGCTCATATGACAGGTAATTCTCACCTATTTTCTAGACCGTTGTCCCCTGCCCCTTTCCCATCTGTTA
CATTGGCCGATGGCTCCACATCTTCTGTTCTTGGCTCTGGCACTATTCACCTTACCCCATCCTTTTCTCTCTCTTCTGTGTTACATTTGCCTAACTTGTCCTTTAATTTA
ATTTCTACTAGTCAACTTACTCATGACCTAAATTGTGTTGTCATGTTCTTTTCTGGTTATTGCTTGTTTCAGGATCGTGTGACGAAGAAGATTATTGGTAGAGGATATGA
GTCAGGAGGCCTTTATCTCTTTGATCATCAAGTATCGCAAGTTGTGGCGTGTCCTGTCGTTCCCTCTCCTTTTGAAGTCCATTGTCGTTTAGGTCATCCATCTTTGTTTG
TGTTGAAGAAACTTTATCCAGAATTTAGGTCTTTGTCCTCTTTAAATTGTGATTCGTGTCAATTTGCGAAATTTCATCGTCTTAGTTCGAGTCCTCGAGTCGATAAACGA
GCAATTGCTCCATTTGAGTTAGCTCATGATATTTGGGGTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACTTTTGTTGACGATCATTCTCGTCTAAC
TTGGTTATATTTAATGAAAAATCGTTCTGAGTTATTATCTCATTTTTGTGCCTTTCATACTGAAATAAAAAACCAATTTAATGTCTCTATCAAAACTTTGCGTACTGATA
ATGCGGGTGAATATTTTTCTCATTCTCTTGGCTCTTACTTGTGTGAAAATGGCATTATTCATCAATCTTCCTGTGCTGACACTCCATCTCAAAATGGTGTTGCAGAGCGG
AAAAATAGGCATTTACTTGAAACTGCCCGTGCTTTATCGTTTCAAATGCATGTTTCAAAAACCTTTTGGGTGGATGTTGTCTCTACAGCTTGTTTTTTGATTAATAGAAT
GCCTTCCTCTGTTCTTAATGGTGAGATTCCCTATCGTGTTCTTTTTCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGTCTGTTTTGTTCGTGACG
TTCGTCCTCATCATACTAAGTTAGATCCCAAATCCTTGAAGTGTATCTTCTTGGGTTATTCACGTGTTCAAAAGGATTATCGTTGTTATTGTCCTACCCTTAAAAGATAC
ACCCTTTACTTCATCACCATCGAGTTTGTGTCAGGGGGAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTCTTACCTGAATTTCTCTGACTCCAAGATTCCACCATTAGATGATGCCTTCACTCGCGTCCTTCGCATTGTTGAAAGCTCTCCGACTGGTGTGTCTATTCCTCA
ACCCAGTAGTGCTCTCTTTGGCAAGAACAATAACCCTCAGGCACCTCAGAGGAATAGTACTGATCATCAAAAACCAGAGTCTGTAGAGATTGTTCGTAACTACTGTCGTA
AGTCAGGCCATATAAAACGTGATTGTCGGAAATTGCTATATAAGAATAGTCAACGATCTCAACATGCTCAGATAGCCTCCACATGCGATATACCAGAGGCGTCAGTTACT
ATTTCTGCAGATGAGTTTGCTAAGTTTCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGTAATATAAAGTGTCT
TCTTACATCATCTACCAAATGGGTCATAGACTCTGGTGCCACAGCTCATATGACAGGTAATTCTCACCTATTTTCTAGACCGTTGTCCCCTGCCCCTTTCCCATCTGTTA
CATTGGCCGATGGCTCCACATCTTCTGTTCTTGGCTCTGGCACTATTCACCTTACCCCATCCTTTTCTCTCTCTTCTGTGTTACATTTGCCTAACTTGTCCTTTAATTTA
ATTTCTACTAGTCAACTTACTCATGACCTAAATTGTGTTGTCATGTTCTTTTCTGGTTATTGCTTGTTTCAGGATCGTGTGACGAAGAAGATTATTGGTAGAGGATATGA
GTCAGGAGGCCTTTATCTCTTTGATCATCAAGTATCGCAAGTTGTGGCGTGTCCTGTCGTTCCCTCTCCTTTTGAAGTCCATTGTCGTTTAGGTCATCCATCTTTGTTTG
TGTTGAAGAAACTTTATCCAGAATTTAGGTCTTTGTCCTCTTTAAATTGTGATTCGTGTCAATTTGCGAAATTTCATCGTCTTAGTTCGAGTCCTCGAGTCGATAAACGA
GCAATTGCTCCATTTGAGTTAGCTCATGATATTTGGGGTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACTTTTGTTGACGATCATTCTCGTCTAAC
TTGGTTATATTTAATGAAAAATCGTTCTGAGTTATTATCTCATTTTTGTGCCTTTCATACTGAAATAAAAAACCAATTTAATGTCTCTATCAAAACTTTGCGTACTGATA
ATGCGGGTGAATATTTTTCTCATTCTCTTGGCTCTTACTTGTGTGAAAATGGCATTATTCATCAATCTTCCTGTGCTGACACTCCATCTCAAAATGGTGTTGCAGAGCGG
AAAAATAGGCATTTACTTGAAACTGCCCGTGCTTTATCGTTTCAAATGCATGTTTCAAAAACCTTTTGGGTGGATGTTGTCTCTACAGCTTGTTTTTTGATTAATAGAAT
GCCTTCCTCTGTTCTTAATGGTGAGATTCCCTATCGTGTTCTTTTTCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGTCTGTTTTGTTCGTGACG
TTCGTCCTCATCATACTAAGTTAGATCCCAAATCCTTGAAGTGTATCTTCTTGGGTTATTCACGTGTTCAAAAGGATTATCGTTGTTATTGTCCTACCCTTAAAAGATAC
ACCCTTTACTTCATCACCATCGAGTTTGTGTCAGGGGGAGGATGA
Protein sequenceShow/hide protein sequence
MDSYLNFSDSKIPPLDDAFTRVLRIVESSPTGVSIPQPSSALFGKNNNPQAPQRNSTDHQKPESVEIVRNYCRKSGHIKRDCRKLLYKNSQRSQHAQIASTCDIPEASVT
ISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNL
ISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKR
AIAPFELAHDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAER
KNRHLLETARALSFQMHVSKTFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKDYRCYCPTLKRY
TLYFITIEFVSGGG