CuGenDBv2

CSPI05G12940 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G12940
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Zinc finger, CCHC-type
Genome location: Chr5:12333306..12334547
RNA-Seq Expression: CSPI05G12940
Synteny: CSPI05G12940
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR025724 - GAG-pre-integrase domain
    IPR036397 - Ribonuclease H superfamily
    IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
EEC84282.1 hypothetical protein OsI_30754 [Oryza sativa Indica Group] | e-value: 5.5e-111 | % identity: 49.03
Query:  GHRFGGTDQGRWRGRGRGTERQNSAR--GTSNTRNGTRDKSHIKFFTCNKMGHYASEC-HGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDVIL
        G+R  G  +GR RGRG G  R ++ R  G   +  G RDKSHIK + C + GHY+++C H K    EAHL    +  P L++AV+++      +    ++
Subjt:  GHRFGGTDQGRWRGRGRGTERQNSAR--GTSNTRNGTRDKSHIKFFTCNKMGHYASEC-HGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDVIL

Query:  LNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTE
        +++ER+ P++   D      D+W+L+NGASNHMTG R  F++LD S T  VKF D ST++I GKG+++F CKN DQ  LQ+V+YIP  C N++SL Q+TE
Subjt:  LNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTE

Query:  NENKVQMTEDVMKMSDRSG-KLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQV
          ++V M EDV+++ D+S  +L+M V+RT   LY+I LK    VCLLT + +P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ KQ+
Subjt:  NENKVQMTEDVMKMSDRSG-KLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQV

Query:  RLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCK
        R PF   + +R E+PLELLH D+CGPI+P T+AGN+YF+LIVDD +RWMWM++++ K  A EAF K K L EN    +IKTL++DRGGEFLS  F Q C+
Subjt:  RLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCK

Query:  KEGIQGHLIAPYSP
        + GIQ HL APYSP
Subjt:  KEGIQGHLIAPYSP

XP_020258980.1 uncharacterized protein LOC109835417 [Asparagus officinalis] | e-value: 5.0e-112 | % identity: 50.36
Query:  GHRFGGTDQGRWRGRG---------RGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASECH--GKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLR
        G   G   +G + GRG          G  RQ     TS    G RDKS+IK F C   GH+ SECH   ++++ EAHLT A +EEP LMMA+ +E +   
Subjt:  GHRFGGTDQGRWRGRG---------RGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASECH--GKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLR

Query:  CDQKDVILLNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNI
             V+LLN+E+++P++    +   +N+ WYL+NGASNHMTG R+ F+ELDE+    VKF DGS ++I GKG+++F+C N DQ+ L EVYYIP    NI
Subjt:  CDQKDVILLNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNI

Query:  ISLRQMTENENKVQMTEDVMKMSDRSGKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEA
        ISL QMTE+ + V ++ + +++ DR+G LLM V R+   LY+I LKT K  CL+ ++ DP      RLGHVNF+ LK++ +K++  G+P +  PN+LCE 
Subjt:  ISLRQMTENENKVQMTEDVMKMSDRSGKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEA

Query:  CVITKQVRLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSA
        C++ KQ RLPF  Q+ +R +KPLELLHAD+CGPI+P TLAGN YFLLIVDD +RWMW+YML+AKS A E FKK K ++EN  ++K+KTL+TDRGGEFLS 
Subjt:  CVITKQVRLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSA

Query:  GFTQFCKKEGIQGHLIAPYSP
         FT+FC+  GI+ HL APY+P
Subjt:  GFTQFCKKEGIQGHLIAPYSP

XP_020271888.1 uncharacterized protein LOC109847051 [Asparagus officinalis] | e-value: 7.9e-118 | % identity: 51.78
Query:  GHRFGGTDQGRWRGRG---------RGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASECH--GKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLR
        G   G   +G + GRG          G  RQ     TS    G RDKS+IK F C   GH+ SECH   ++H+ EAHLT A +EEP LMMA+ +E +   
Subjt:  GHRFGGTDQGRWRGRG---------RGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASECH--GKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLR

Query:  CDQKDVILLNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNI
              ILLN+E+++P++    +    NDVWYL+NGASNHMTG R+ F+ELDE+    VKF DGS ++I GKG+++F+C N DQ+ L EVYYIP    NI
Subjt:  CDQKDVILLNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNI

Query:  ISLRQMTENENKVQMTEDVMKMSDRSGKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEA
        ISL QMTE+ + V ++ + +++ DR+G LLM V R+   LY+I LKT K  CL+ ++ DP WLWH RLGHVNF+ LK++ +K++  G+P +  PN+LCE 
Subjt:  ISLRQMTENENKVQMTEDVMKMSDRSGKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEA

Query:  CVITKQVRLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSA
        C++ KQ RL F  Q+ +R +KPLELLHAD+CGPISP TLAGN+YFLL+VDD +RWMW+YML+AKS A E FKK K  +EN  ++K+KTL+TDRGGEFLS 
Subjt:  CVITKQVRLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSA

Query:  GFTQFCKKEGIQGHLIAPYSP
         FT+FC+  GI+ HL APY+P
Subjt:  GFTQFCKKEGIQGHLIAPYSP

XP_031741708.1 uncharacterized protein LOC116403903 [Cucumis sativus] | e-value: 1.0e-133 | % identity: 82.12
Query:  MNKGHRFGGTDQGRWRGRGRGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASECHGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDVIL
        MNKG  FGGTD+GRWRGRGRGTE QNSA GTSNTRNGTRDKS IK FTCNKM HYASEC GK  DDEAHLTC  EEE   MM VSQEGT  RCDQ++ IL
Subjt:  MNKGHRFGGTDQGRWRGRGRGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASECHGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDVIL

Query:  LNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTE
        L KERLLPEMY ND+NGENN VWYL+NGASNHMTG RE F ELD+SFT RVKF DG  IQ M KG VMFECKN DQKALQEVYYIPK C NIISL QMTE
Subjt:  LNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTE

Query:  NENKVQMTEDVMKMSDRSGKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQVR
        N NKVQMTEDVMK+SDRSGKLLMSVKRTQ  LYKITLKTLKQVCLLTSL+DPTWLWHVRLGHVNF+DLKL+ EKKLVVGVPLVTQPNKLC AC+ITKQ +
Subjt:  NENKVQMTEDVMKMSDRSGKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQVR

Query:  LP
        LP
Subjt:  LP

XP_031741713.1 uncharacterized protein LOC116403908 [Cucumis sativus] | e-value: 1.1e-116 | % identity: 83.72
Query:  MNKGHRFGGTDQGRWRGRGRGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASECHGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDVIL
        MNKG RF G D+GRWRGRGRGTERQNSA GTSNT NGTRDKSHIK FTCNKMGHYA E  GK HD+E+HLTC VEEEP LMMA+SQEGT  RCDQ+D IL
Subjt:  MNKGHRFGGTDQGRWRGRGRGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASECHGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDVIL

Query:  LNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTE
        LNKERLLPEMYCND+NGENNDVWYL+NGASNHM G RE FQELDESFT +VKF DG TIQIM KGTVMFECKN DQKALQEVYYIPK C NIISL QMTE
Subjt:  LNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTE

Query:  NENKVQMTEDVMKMSDRSGKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHV
        N NKVQM ED+MK+SDRSGKLLMSVK TQ  LYKITLKTLKQVCLLTSL+DPTWLWHV
Subjt:  NENKVQMTEDVMKMSDRSGKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHV

TrEMBL top hits (e-value, % identity, alignment)
A0A0P0XB91 Os08g0125300 protein | e-value: 1.0e-110 | % identity: 48.56
Query:  NKGHRFGGTDQGRWRGRGRGTERQNSAR--GTSNTRNGTRDKSHIKFFTCNKMGHYASEC-HGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDV
        ++G+R  G  +GR RG G G  R ++ R  G   +  G RDKSHIK + C + GHY+++C H K    EAHL    +  P L++AV+++      +    
Subjt:  NKGHRFGGTDQGRWRGRGRGTERQNSAR--GTSNTRNGTRDKSHIKFFTCNKMGHYASEC-HGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDV

Query:  ILLNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQM
        +++++ER+ P++   D      D+W+L+NGASNHMTG R  F++LD S T  VKF D ST++I GKG+++F CKN DQ  LQ+V+YIP  C N++SL Q+
Subjt:  ILLNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQM

Query:  TENENKVQMTEDVMKMSDRSG-KLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITK
        TE  ++V M EDV+++ D+S  +L+M V+RT   LY+I LK    VCLLT + +P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ K
Subjt:  TENENKVQMTEDVMKMSDRSG-KLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITK

Query:  QVRLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQF
        Q+R PF   + +R E+PLELLH D+CGPI+P T+AGN+YF+LIVDD +RWMWM++++ K  A EAF K K L EN    +IKTL++DRGGEFLS  F Q 
Subjt:  QVRLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQF

Query:  CKKEGIQGHLIAPYSP
        C++ GIQ HL APYSP
Subjt:  CKKEGIQGHLIAPYSP

B8BDZ6 Uncharacterized protein | e-value: 2.7e-111 | % identity: 49.03
Query:  GHRFGGTDQGRWRGRGRGTERQNSAR--GTSNTRNGTRDKSHIKFFTCNKMGHYASEC-HGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDVIL
        G+R  G  +GR RGRG G  R ++ R  G   +  G RDKSHIK + C + GHY+++C H K    EAHL    +  P L++AV+++      +    ++
Subjt:  GHRFGGTDQGRWRGRGRGTERQNSAR--GTSNTRNGTRDKSHIKFFTCNKMGHYASEC-HGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDVIL

Query:  LNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTE
        +++ER+ P++   D      D+W+L+NGASNHMTG R  F++LD S T  VKF D ST++I GKG+++F CKN DQ  LQ+V+YIP  C N++SL Q+TE
Subjt:  LNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTE

Query:  NENKVQMTEDVMKMSDRSG-KLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQV
          ++V M EDV+++ D+S  +L+M V+RT   LY+I LK    VCLLT + +P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ KQ+
Subjt:  NENKVQMTEDVMKMSDRSG-KLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQV

Query:  RLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCK
        R PF   + +R E+PLELLH D+CGPI+P T+AGN+YF+LIVDD +RWMWM++++ K  A EAF K K L EN    +IKTL++DRGGEFLS  F Q C+
Subjt:  RLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCK

Query:  KEGIQGHLIAPYSP
        + GIQ HL APYSP
Subjt:  KEGIQGHLIAPYSP

Q0J8A6 Os08g0125300 protein | e-value: 1.0e-110 | % identity: 48.56
Query:  NKGHRFGGTDQGRWRGRGRGTERQNSAR--GTSNTRNGTRDKSHIKFFTCNKMGHYASEC-HGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDV
        ++G+R  G  +GR RG G G  R ++ R  G   +  G RDKSHIK + C + GHY+++C H K    EAHL    +  P L++AV+++      +    
Subjt:  NKGHRFGGTDQGRWRGRGRGTERQNSAR--GTSNTRNGTRDKSHIKFFTCNKMGHYASEC-HGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDV

Query:  ILLNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQM
        +++++ER+ P++   D      D+W+L+NGASNHMTG R  F++LD S T  VKF D ST++I GKG+++F CKN DQ  LQ+V+YIP  C N++SL Q+
Subjt:  ILLNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQM

Query:  TENENKVQMTEDVMKMSDRSG-KLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITK
        TE  ++V M EDV+++ D+S  +L+M V+RT   LY+I LK    VCLLT + +P WLWH RLGHVNF  +KL+ +K +  G+P +T PN+LC+AC++ K
Subjt:  TENENKVQMTEDVMKMSDRSG-KLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITK

Query:  QVRLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQF
        Q+R PF   + +R E+PLELLH D+CGPI+P T+AGN+YF+LIVDD +RWMWM++++ K  A EAF K K L EN    +IKTL++DRGGEFLS  F Q 
Subjt:  QVRLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQF

Query:  CKKEGIQGHLIAPYSP
        C++ GIQ HL APYSP
Subjt:  CKKEGIQGHLIAPYSP

Q7XMW2 OSJNBb0040D15.12 protein | e-value: 1.8e-107 | % identity: 49.26
Query:  GRWRGRGRGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASEC-HGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRC-DQKDVILLNKERLLPEM
        GR RGRGRG     S+ G+S      RDKSHIK F C + GHY+++C H K    EAHL    +  P L++AV++      C D    +++++ER+ P +
Subjt:  GRWRGRGRGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASEC-HGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRC-DQKDVILLNKERLLPEM

Query:  YCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTENENKVQMTED
           ++     D+WYL+NGASNHM+G R  F+ELDE+ T +V+F D S++QIMG+G+++F CKN DQ  L +VYYIP  C N++SL Q+TE  ++V M  D
Subjt:  YCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTENENKVQMTED

Query:  VMKMSDRS-GKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQVRLPFHRQSTY
         +++ D++  +L+M V+RT   LY+I L+    VCLL SL DP WLWH RLGHVNF+ LKL+ +K++V GVP V  PN+LC+AC++ KQVR  F   + Y
Subjt:  VMKMSDRS-GKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQVRLPFHRQSTY

Query:  RVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCKKEGIQGHLIA
        R E PLELLH D+CGPI+P T AGN+YF+LIVDD +RWMW++++++K  A  A +K K L EN     IKTL+TDRG EFLS  F + C   GI+ HL A
Subjt:  RVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCKKEGIQGHLIA

Query:  PYSP
        PYSP
Subjt:  PYSP

Q94I37 Putative retroelement | e-value: 4.0e-107 | % identity: 48.55
Query:  KGHRFGGTDQGRWRGRGRGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASEC-HGKDHDDEAHLTCAVEEEPGLMMAVSQE-GTCLRCDQKDVIL
        +G   G  ++GR RGRGRG   Q S  G+S    G RDKSHIK F C + GHY+++C H K    EAHL    +  P L++AV++      R D    ++
Subjt:  KGHRFGGTDQGRWRGRGRGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASEC-HGKDHDDEAHLTCAVEEEPGLMMAVSQE-GTCLRCDQKDVIL

Query:  LNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTE
        +++ER+ P +   ++     D+WYL+NGASNHM+G R  F+ELDE+ T +V+F D S++QIMG G+++F CKN DQ  L +VYYIP  C N++SL Q+TE
Subjt:  LNKERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTE

Query:  NENKVQMTEDVMKMSDRS-GKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQV
          ++V M  D +K+ D++  +L+M V+RT   LY+I L+   QVCLL SL +P WLWH R+GHVNF+ LKL+ +K++  GVP V  PN+LC+AC++ KQV
Subjt:  NENKVQMTEDVMKMSDRS-GKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQV

Query:  RLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCK
        R PF   + YR E PLELLH D+CGPI+P T AGN+YF+LIVDD + WMW+++++ K  A   F+K K L +N     IKTL+TDRGGEFLS  F + C 
Subjt:  RLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCK

Query:  KEGIQGHLIAPYSP
           I+ HL APYSP
Subjt:  KEGIQGHLIAPYSP

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein | e-value: 1.0e-27 | % identity: 27.36
Query:  NDQNGENNDVWYLNNGASNHMTGPRETFQELDESF--TRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTENENKVQMTED
        N+ +  +N  + L++GAS+H+      + +  E     +      G  I    +G V    +N  +  L++V +  ++  N++S++++ E    ++  + 
Subjt:  NDQNGENNDVWYLNNGASNHMTGPRETFQELDESF--TRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTENENKVQMTED

Query:  VMKMSDRSGKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPN---KLCEACVITKQVRLPFHR-Q
         + +S      LM VK +   L  + +   +   +    K+   LWH R GH++   L  I+ K +     L+       ++CE C+  KQ RLPF + +
Subjt:  VMKMSDRSGKLLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPN---KLCEACVITKQVRLPFHR-Q

Query:  STYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCKKEGIQGH
            +++PL ++H+D+CGPI+P TL    YF++ VD  T +   Y+++ KSD F  F+      E     K+  L  D G E+LS    QFC K+GI  H
Subjt:  STYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCKKEGIQGH

Query:  LIAPYSP
        L  P++P
Subjt:  LIAPYSP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 2.1e-33 | % identity: 25.36
Query:  GRGRGTERQNSARGTSNTRNGTRDKSHIKF---FTCNKMGHYASEC----------HGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDVILLNK
        GRGR  +R ++  G S  R  ++++S  +    + CN+ GH+  +C           G+ +DD  +    V+    +++ +++E  C+            
Subjt:  GRGRGTERQNSARGTSNTRNGTRDKSHIKF---FTCNKMGHYASEC----------HGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDVILLNK

Query:  ERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTENEN
                    +G  ++ W ++  AS+H T  R+ F          VK  + S  +I G G +  +        L++V ++P    N+IS   +  +  
Subjt:  ERLLPEMYCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTENEN

Query:  KVQMTEDVMKMSDRSGKLLMSVKRTQKYLYKITLKTLKQVCL--LTSLKD--PTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQV
        +        +++  S  +   V R   Y      +T  ++C   L + +D     LWH R+GH++   L+++ +K L+      T   K C+ C+  KQ 
Subjt:  KVQMTEDVMKMSDRSGKLLMSVKRTQKYLYKITLKTLKQVCL--LTSLKD--PTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQV

Query:  RLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCK
        R+ F + S+ R    L+L+++D+CGP+   ++ GNKYF+  +DD++R +W+Y+L+ K   F+ F+K   L+E +T  K+K L++D GGE+ S  F ++C 
Subjt:  RLPFHRQSTYRVEKPLELLHADICGPISPCTLAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCK

Query:  KEGIQGHLIAPYSP
          GI+     P +P
Subjt:  KEGIQGHLIAPYSP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e-value: 1.4e-21 | % identity: 28.1
Query:  WYLNNGASNHMTGPRETFQELDESFT--RRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISL-RQMTENENKVQMTEDVMKMSDRSG
        W L++GA++H+T        L + +T    V   DGSTI I   G+     K S    L  + Y+P   +N+IS+ R    N   V+      ++ D + 
Subjt:  WYLNNGASNHMTGPRETFQELDESFT--RRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISL-RQMTENENKVQMTEDVMKMSDRSG

Query:  KLLMSVKRTQKYLYKITLKTLKQVCLLT--SLKDPTWLWHVRLGH---------VNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQVRLPFHRQST
         + +   +T+  LY+  + + + V L    S K     WH RLGH         ++ Y L ++      +           C  C+I K  ++PF  QST
Subjt:  KLLMSVKRTQKYLYKITLKTLKQVCLLT--SLKDPTWLWHVRLGH---------VNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQVRLPFHRQST

Query:  YRVEKPLELLHADICGPISPCTLAGN-KYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCKKEGIQGHL
            +PLE +++D+    SP     N +Y+++ VD  TR+ W+Y L+ KS   E F   K L+EN+ + +I T  +D GGEF++    ++  + GI    
Subjt:  YRVEKPLELLHADICGPISPCTLAGN-KYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCKKEGIQGHL

Query:  IAPYSP
          P++P
Subjt:  IAPYSP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e-value: 3.8e-22 | % identity: 28.71
Query:  NNDVWYLNNGASNHMTGPRETFQELDESFT--RRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISL-RQMTENENKVQMTEDVMKMS
        N + W L++GA++H+T          + +T    V   DGSTI I   G+      +S    L +V Y+P   +N+IS+ R    N   V+      ++ 
Subjt:  NNDVWYLNNGASNHMTGPRETFQELDESFT--RRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISL-RQMTENENKVQMTEDVMKMS

Query:  DRSGKLLMSVKRTQKYLYKITLKTLKQVCLLTS--LKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKL--CEACVITKQVRLPFHRQSTYRV
        D +  + +   +T+  LY+  + + + V +  S   K     WH RLGH +   L  +        +P++   +KL  C  C I K  ++PF   ST   
Subjt:  DRSGKLLMSVKRTQKYLYKITLKTLKQVCLLTS--LKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKL--CEACVITKQVRLPFHRQSTYRV

Query:  EKPLELLHADICGPISPCTLAGN-KYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCKKEGIQGHLIAP
         KPLE +++D+    SP     N +Y+++ VD  TR+ W+Y L+ KS   + F   K L+EN+ + +I TL +D GGEF+      +  + GI      P
Subjt:  EKPLELLHADICGPISPCTLAGN-KYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCKKEGIQGHLIAP

Query:  YSP
        ++P
Subjt:  YSP

Arabidopsis top hits (e-value, % identity, alignment)
No hits found
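
The % identity column can be related to the alignment blocks above. In BLAST-style output, the middle line of each block usually shows a letter where the two residues are identical, a `+` for a conservative substitution, and a space otherwise. The following is a minimal Python sketch under that assumption. The function name `midline_stats` is hypothetical, and the example uses only the final 14-column block of the EEC84282.1 alignment, so its result is not expected to match the 49.03 reported for the full alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block."""
    # Pad the midline: trailing spaces are often lost when a page is copied.
    midline = midline.ljust(len(query))
    # Assumed convention: a letter marks an identical pair, '+' a similar pair.
    identities = sum(1 for c in midline if c.isalpha())
    positives = identities + midline.count("+")
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
        "pct_identity": round(100.0 * identities / len(query), 2),
    }


# Last block of the EEC84282.1 alignment (query line and middle line).
stats = midline_stats("KEGIQGHLIAPYSP", "+ GIQ HL APYSP")
print(stats)
```

To approximate a figure for a whole hit, the counts from every block of that hit would be summed before dividing, since a per-block percentage is not comparable to the reported value.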

Sequences
CDS sequence
ATGAACAAAGGACACAGATTTGGTGGTACCGATCAAGGAAGATGGCGAGGACGTGGTCGTGGCACTGAGCGTCAAAATAGTGCGAGAGGCACTAGCAACACTAGAAATGG
CACTCGTGATAAAAGTCACATTAAGTTTTTCACTTGCAACAAGATGGGACATTACGCGTCAGAATGTCATGGAAAAGATCATGACGACGAAGCTCATCTAACTTGTGCTG
TCGAAGAAGAACCAGGTTTGATGATGGCCGTGTCCCAGGAGGGGACATGCCTTAGATGTGATCAGAAGGATGTCATACTACTCAACAAAGAGCGGTTGTTGCCAGAGATG
TATTGCAACGACCAGAATGGAGAAAATAATGATGTTTGGTATCTTAACAACGGTGCTAGTAACCACATGACTGGCCCCCGTGAGACGTTCCAAGAATTAGATGAAAGCTT
CACTAGGAGAGTGAAATTTGACGATGGATCAACCATTCAGATCATGGGAAAAGGAACGGTCATGTTCGAGTGCAAGAACAGTGATCAGAAGGCTCTCCAAGAGGTGTATT
ACATTCCAAAGTCCTGTAGGAACATCATTAGCCTCAGACAAATGACAGAAAATGAAAACAAGGTACAAATGACAGAAGATGTCATGAAAATGTCTGACAGGAGTGGGAAG
CTTTTGATGTCGGTGAAGCGAACTCAAAAATATTTATACAAGATAACTTTGAAGACGCTCAAGCAAGTCTGCCTTCTGACAAGCCTAAAAGATCCAACATGGTTATGGCA
CGTGAGACTTGGTCATGTAAATTTTTATGACTTGAAGCTCATAAGGGAGAAGAAATTGGTAGTTGGAGTACCACTAGTGACTCAACCGAACAAGTTATGTGAAGCGTGCG
TGATTACCAAACAAGTCAGATTGCCTTTCCACCGTCAATCAACATATAGAGTAGAGAAGCCATTAGAACTCCTCCATGCTGATATATGCGGACCGATTTCACCATGTACT
CTTGCTGGAAACAAGTATTTTCTGTTGATTGTTGACGATTCCACGAGATGGATGTGGATGTATATGTTGGAGGCAAAAAGTGACGCATTTGAAGCATTCAAGAAATGCAA
ACTCTTAATGGAGAACAAGACGGAGTACAAGATCAAAACGCTCCAGACGGATCGAGGTGGTGAGTTCTTATCTGCAGGGTTCACTCAATTTTGCAAAAAAGAAGGAATCC
AAGGACACCTCATCGCTCCATATTCACCATAA
mRNA sequence
ATGAACAAAGGACACAGATTTGGTGGTACCGATCAAGGAAGATGGCGAGGACGTGGTCGTGGCACTGAGCGTCAAAATAGTGCGAGAGGCACTAGCAACACTAGAAATGG
CACTCGTGATAAAAGTCACATTAAGTTTTTCACTTGCAACAAGATGGGACATTACGCGTCAGAATGTCATGGAAAAGATCATGACGACGAAGCTCATCTAACTTGTGCTG
TCGAAGAAGAACCAGGTTTGATGATGGCCGTGTCCCAGGAGGGGACATGCCTTAGATGTGATCAGAAGGATGTCATACTACTCAACAAAGAGCGGTTGTTGCCAGAGATG
TATTGCAACGACCAGAATGGAGAAAATAATGATGTTTGGTATCTTAACAACGGTGCTAGTAACCACATGACTGGCCCCCGTGAGACGTTCCAAGAATTAGATGAAAGCTT
CACTAGGAGAGTGAAATTTGACGATGGATCAACCATTCAGATCATGGGAAAAGGAACGGTCATGTTCGAGTGCAAGAACAGTGATCAGAAGGCTCTCCAAGAGGTGTATT
ACATTCCAAAGTCCTGTAGGAACATCATTAGCCTCAGACAAATGACAGAAAATGAAAACAAGGTACAAATGACAGAAGATGTCATGAAAATGTCTGACAGGAGTGGGAAG
CTTTTGATGTCGGTGAAGCGAACTCAAAAATATTTATACAAGATAACTTTGAAGACGCTCAAGCAAGTCTGCCTTCTGACAAGCCTAAAAGATCCAACATGGTTATGGCA
CGTGAGACTTGGTCATGTAAATTTTTATGACTTGAAGCTCATAAGGGAGAAGAAATTGGTAGTTGGAGTACCACTAGTGACTCAACCGAACAAGTTATGTGAAGCGTGCG
TGATTACCAAACAAGTCAGATTGCCTTTCCACCGTCAATCAACATATAGAGTAGAGAAGCCATTAGAACTCCTCCATGCTGATATATGCGGACCGATTTCACCATGTACT
CTTGCTGGAAACAAGTATTTTCTGTTGATTGTTGACGATTCCACGAGATGGATGTGGATGTATATGTTGGAGGCAAAAAGTGACGCATTTGAAGCATTCAAGAAATGCAA
ACTCTTAATGGAGAACAAGACGGAGTACAAGATCAAAACGCTCCAGACGGATCGAGGTGGTGAGTTCTTATCTGCAGGGTTCACTCAATTTTGCAAAAAAGAAGGAATCC
AAGGACACCTCATCGCTCCATATTCACCATAA
Protein sequence
MNKGHRFGGTDQGRWRGRGRGTERQNSARGTSNTRNGTRDKSHIKFFTCNKMGHYASECHGKDHDDEAHLTCAVEEEPGLMMAVSQEGTCLRCDQKDVILLNKERLLPEM
YCNDQNGENNDVWYLNNGASNHMTGPRETFQELDESFTRRVKFDDGSTIQIMGKGTVMFECKNSDQKALQEVYYIPKSCRNIISLRQMTENENKVQMTEDVMKMSDRSGK
LLMSVKRTQKYLYKITLKTLKQVCLLTSLKDPTWLWHVRLGHVNFYDLKLIREKKLVVGVPLVTQPNKLCEACVITKQVRLPFHRQSTYRVEKPLELLHADICGPISPCT
LAGNKYFLLIVDDSTRWMWMYMLEAKSDAFEAFKKCKLLMENKTEYKIKTLQTDRGGEFLSAGFTQFCKKEGIQGHLIAPYSP
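
The fields of this record can be cross-checked against each other: the genome location gives an interval on Chr5, the CDS begins ATGAACAAAGGACAC, and the protein begins MNKGH. The following is a minimal Python sketch of such a check. It assumes the standard genetic code (NCBI translation table 1) and 1-based inclusive coordinates; `interval_length` and `translate` are hypothetical helper names, and only the first 15 bases of the CDS are translated here.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def interval_length(location: str) -> int:
    """Length of a 'ChrN:start..end' interval, assuming 1-based inclusive ends."""
    _, span = location.split(":")
    start, end = (int(x) for x in span.split(".."))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


length = interval_length("Chr5:12333306..12334547")
print(length)                        # bases covered by the genomic interval
print(translate("ATGAACAAAGGACAC"))  # first five codons of the CDS
```

If the gene has a single exon, the interval length should equal the length of the CDS, which can be checked by joining the CDS lines and passing the result to `len`; passing the same joined string to `translate` should reproduce the protein sequence.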