; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G014680 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G014680
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionTetratricopeptide repeat (TPR)-like superfamily protein
Genome locationCG_Chr07:31091479..31112134
RNA-Seq ExpressionClCG07G014680
SyntenyClCG07G014680
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144414.1 uncharacterized protein LOC101220521 isoform X3 [Cucumis sativus]2.1e-16989.33Show/hide
Query:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG
        ADKA +SG  KGRVY+PRKPI KQSSTVPTQAPAVS R+DGNSY+KSLDLQFE+RLEAVKRSALEKKKAD KKEFGAIDYDAPVESE+KTIGLGTK+GIG
Subjt:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG

Query:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL
        VAVLVFG VFALGDFLPSGS   VKDSVVEN+KLSREEESNLKNMLKEYE TLRSNPKDPTA+EGAAVTS ELGEYA+AASLLEDLIKEKSDD DIFRLL
Subjt:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL

Query:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD
        GEVKYKLKDYDGSVAAYKSAT L +DVNFEVLRGLTN+LLAAGKPDEAVQFLLD R+ L +VKLG   EGKEMETKLSIDPVQV+LLLGKSYSDWGHV D
Subjt:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD

Query:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        AVSVYDQLISSHP+DFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
Subjt:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

XP_008465106.1 PREDICTED: uncharacterized protein LOC103502794 [Cucumis melo]1.0e-17190.45Show/hide
Query:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG
        ADKA +SGN KGRVY+PRKPI KQSSTVPTQAPAVS R+DGNSY+KSLDLQFE+RLEAVKRSALEKKKAD KKEFGAIDYDAPVESE+KTIG GTKIGIG
Subjt:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG

Query:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL
        VAVLVFG VFALGDFLPSGS    KDSVVEN+KLSREEESNLKNMLKEYE TLRSNPKDPTA+EGAAVTS ELGEYA+AASLLEDLIKEKSDD DIFRLL
Subjt:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL

Query:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD
        GEVKYKLKDYDGSVAAYKSAT LS+DVNFEVLRGLTN+LLAAGKPDE+VQFLLDCRE LKSVKLG   EGKEMETKLSIDPVQV+LLLGKSYSDWGHV D
Subjt:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD

Query:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        AVSVYDQLISSHP+DFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
Subjt:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

XP_022980644.1 uncharacterized protein LOC111479948 isoform X1 [Cucurbita maxima]1.3e-16386.8Show/hide
Query:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG
        A+KA NS N KGRV +PRKPI KQSSTVPTQAPAV+ R D NSY+KSLD  FEKRLEAVKRSALEKKKAD KKEFGAIDYDAPVE E+KTIGLGTKIGIG
Subjt:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG

Query:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL
        VAVLVFGLVFALGDFLPSGSIS V+DSVV + KLS+EEESNL+NMLKEYE TLRSNPKDPTAMEGAAVTS ELG+YA AASLLEDLIK KSDD DIFRLL
Subjt:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL

Query:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD
        GEVKYKLKDYDGS+AAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLD RERLKSV LGSMAEG++M+TKL IDPVQVELLLGK+YSDWGHV D
Subjt:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD

Query:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        AVSVYDQLI+SHP+DFRGYLAKGIILKENG +GDAERMFIQARFFAPENAKMLV+R
Subjt:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

XP_031738478.1 uncharacterized protein LOC101220521 isoform X1 [Cucumis sativus]2.2e-16686.14Show/hide
Query:  ADKASNSGNPKGRVYEPRKPISKQSSTVPT------------QAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEK
        ADKA +SG  KGRVY+PRKPI KQSSTVPT            +APAVS R+DGNSY+KSLDLQFE+RLEAVKRSALEKKKAD KKEFGAIDYDAPVESE+
Subjt:  ADKASNSGNPKGRVYEPRKPISKQSSTVPT------------QAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEK

Query:  KTIGLGTKIGIGVAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIK
        KTIGLGTK+GIGVAVLVFG VFALGDFLPSGS   VKDSVVEN+KLSREEESNLKNMLKEYE TLRSNPKDPTA+EGAAVTS ELGEYA+AASLLEDLIK
Subjt:  KTIGLGTKIGIGVAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIK

Query:  EKSDDFDIFRLLGEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLL
        EKSDD DIFRLLGEVKYKLKDYDGSVAAYKSAT L +DVNFEVLRGLTN+LLAAGKPDEAVQFLLD R+ L +VKLG   EGKEMETKLSIDPVQV+LLL
Subjt:  EKSDDFDIFRLLGEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLL

Query:  GKSYSDWGHVGDAVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        GKSYSDWGHV DAVSVYDQLISSHP+DFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
Subjt:  GKSYSDWGHVGDAVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

XP_031738479.1 uncharacterized protein LOC101220521 isoform X2 [Cucumis sativus]3.4e-16787.81Show/hide
Query:  ADKASNSGNPKGRVYEPRKPISKQSSTVPT-----QAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGT
        ADKA +SG  KGRVY+PRKPI KQSSTVPT     +APAVS R+DGNSY+KSLDLQFE+RLEAVKRSALEKKKAD KKEFGAIDYDAPVESE+KTIGLGT
Subjt:  ADKASNSGNPKGRVYEPRKPISKQSSTVPT-----QAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGT

Query:  KIGIGVAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFD
        K+GIGVAVLVFG VFALGDFLPSGS   VKDSVVEN+KLSREEESNLKNMLKEYE TLRSNPKDPTA+EGAAVTS ELGEYA+AASLLEDLIKEKSDD D
Subjt:  KIGIGVAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFD

Query:  IFRLLGEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDW
        IFRLLGEVKYKLKDYDGSVAAYKSAT L +DVNFEVLRGLTN+LLAAGKPDEAVQFLLD R+ L +VKLG   EGKEMETKLSIDPVQV+LLLGKSYSDW
Subjt:  IFRLLGEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDW

Query:  GHVGDAVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        GHV DAVSVYDQLISSHP+DFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
Subjt:  GHVGDAVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

TrEMBL top hitse value%identityAlignment
A0A1S3CN41 uncharacterized protein LOC1035027944.9e-17290.45Show/hide
Query:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG
        ADKA +SGN KGRVY+PRKPI KQSSTVPTQAPAVS R+DGNSY+KSLDLQFE+RLEAVKRSALEKKKAD KKEFGAIDYDAPVESE+KTIG GTKIGIG
Subjt:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG

Query:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL
        VAVLVFG VFALGDFLPSGS    KDSVVEN+KLSREEESNLKNMLKEYE TLRSNPKDPTA+EGAAVTS ELGEYA+AASLLEDLIKEKSDD DIFRLL
Subjt:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL

Query:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD
        GEVKYKLKDYDGSVAAYKSAT LS+DVNFEVLRGLTN+LLAAGKPDE+VQFLLDCRE LKSVKLG   EGKEMETKLSIDPVQV+LLLGKSYSDWGHV D
Subjt:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD

Query:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        AVSVYDQLISSHP+DFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
Subjt:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

A0A6J1GV96 uncharacterized protein LOC111457503 isoform X21.5e-16085.96Show/hide
Query:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG
        A+KA NS N KGRV +PRKPI KQSSTVPTQAPAV+ R D NSY+ SLD  FEKRLEAVKRSALEKKKAD KKEFGAIDYDAPVE E+KTIGLGTKIGIG
Subjt:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG

Query:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL
        VAVLVFGLVFALGDFLPSG  S V+DS V + KLS+EEESNL+NMLKEYE TLRSNPKDPTAMEGAAVTS ELG+YA AASLLEDLIK KSDD DIFRLL
Subjt:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL

Query:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD
        GEVKYKLKDYDGS+AAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLD RERLKSV LGSMAEG+EM+TKL IDPVQVELLLGK+YSDWGHV D
Subjt:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD

Query:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        AVSVYDQLI+SHP+DFRGYLAKGIILKENG +GDAERMFIQARFFAPENAKMLV+R
Subjt:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

A0A6J1GWF7 uncharacterized protein LOC111457503 isoform X17.1e-16386.52Show/hide
Query:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG
        A+KA NS N KGRV +PRKPI KQSSTVPTQAPAV+ R D NSY+ SLD  FEKRLEAVKRSALEKKKAD KKEFGAIDYDAPVE E+KTIGLGTKIGIG
Subjt:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG

Query:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL
        VAVLVFGLVFALGDFLPSGSIS V+DS V + KLS+EEESNL+NMLKEYE TLRSNPKDPTAMEGAAVTS ELG+YA AASLLEDLIK KSDD DIFRLL
Subjt:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL

Query:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD
        GEVKYKLKDYDGS+AAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLD RERLKSV LGSMAEG+EM+TKL IDPVQVELLLGK+YSDWGHV D
Subjt:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD

Query:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        AVSVYDQLI+SHP+DFRGYLAKGIILKENG +GDAERMFIQARFFAPENAKMLV+R
Subjt:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

A0A6J1IX11 uncharacterized protein LOC111479948 isoform X21.3e-16186.24Show/hide
Query:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG
        A+KA NS N KGRV +PRKPI KQSSTVPTQAPAV+ R D NSY+KSLD  FEKRLEAVKRSALEKKKAD KKEFGAIDYDAPVE E+KTIGLGTKIGIG
Subjt:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG

Query:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL
        VAVLVFGLVFALGDFLPSG  S V+DSVV + KLS+EEESNL+NMLKEYE TLRSNPKDPTAMEGAAVTS ELG+YA AASLLEDLIK KSDD DIFRLL
Subjt:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL

Query:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD
        GEVKYKLKDYDGS+AAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLD RERLKSV LGSMAEG++M+TKL IDPVQVELLLGK+YSDWGHV D
Subjt:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD

Query:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        AVSVYDQLI+SHP+DFRGYLAKGIILKENG +GDAERMFIQARFFAPENAKMLV+R
Subjt:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

A0A6J1IZV0 uncharacterized protein LOC111479948 isoform X16.4e-16486.8Show/hide
Query:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG
        A+KA NS N KGRV +PRKPI KQSSTVPTQAPAV+ R D NSY+KSLD  FEKRLEAVKRSALEKKKAD KKEFGAIDYDAPVE E+KTIGLGTKIGIG
Subjt:  ADKASNSGNPKGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIG

Query:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL
        VAVLVFGLVFALGDFLPSGSIS V+DSVV + KLS+EEESNL+NMLKEYE TLRSNPKDPTAMEGAAVTS ELG+YA AASLLEDLIK KSDD DIFRLL
Subjt:  VAVLVFGLVFALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLL

Query:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD
        GEVKYKLKDYDGS+AAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLD RERLKSV LGSMAEG++M+TKL IDPVQVELLLGK+YSDWGHV D
Subjt:  GEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGD

Query:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        AVSVYDQLI+SHP+DFRGYLAKGIILKENG +GDAERMFIQARFFAPENAKMLV+R
Subjt:  AVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-0530.91Show/hide
Query:  LSKKFEIKDLGHLRYFLEMEVA--RSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSK------PGVNPDDGPVDRDRYQRLVGKLIY-LTHTRPD
        LSK F++KDLG  +  L M++   R+S+ + ++Q+K+   +L+   M   +P   P+  + K      P    + G + +  Y   VG L+Y +  TRPD
Subjt:  LSKKFEIKDLGHLRYFLEMEVA--RSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSK------PGVNPDDGPVDRDRYQRLVGKLIY-LTHTRPD

Query:  ISFAVCVADK
        I+ AV V  +
Subjt:  ISFAVCVADK

P92519 Uncharacterized mitochondrial protein AtMg008101.8e-0936.46Show/hide
Query:  LSKKFEIKDLGHLRYFLEMEVARSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSKPGVNPDDGPVDRDRYQRLVGKLIYLTHTRPDISFAV
        LS  F +KDLG + YFL +++     G+ ++Q K+   +L   GM   +P   P+       V+    P D   ++ +VG L YLT TRPDIS+AV
Subjt:  LSKKFEIKDLGHLRYFLEMEVARSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSKPGVNPDDGPVDRDRYQRLVGKLIYLTHTRPDISFAV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.5e-1239.58Show/hide
Query:  LSKKFEIKDLGHLRYFLEMEVARSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSKPGVNPDDGPVDRDRYQRLVGKLIYLTHTRPDISFAV
        LS++F +KD   L YFL +E  R   G+ ++Q+++ LDLL  T M   +P   PM  + K  +       D   Y+ +VG L YL  TRPDIS+AV
Subjt:  LSKKFEIKDLGHLRYFLEMEVARSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSKPGVNPDDGPVDRDRYQRLVGKLIYLTHTRPDISFAV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.0e-1238Show/hide
Query:  TQQLLSKKFEIKDLGHLRYFLEMEVARSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSKPGVNPDDGPVDRDRYQRLVGKLIYLTHTRPDISFAV
        T   LS++F +K+   L YFL +E  R  +G+ ++Q+++TLDLL  T M   +P   PM  + K  ++      D   Y+ +VG L YL  TRPD+S+AV
Subjt:  TQQLLSKKFEIKDLGHLRYFLEMEVARSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSKPGVNPDDGPVDRDRYQRLVGKLIYLTHTRPDISFAV

Arabidopsis top hitse value%identityAlignment
AT1G78915.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.9e-11260.98Show/hide
Query:  KGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIGVAVLVFGLVF
        K    + RK  SKQS +VP +AP ++ + +G S  +S D+ F++RLE ++RSALE+KK +  KEFG IDYDAPV+S++KTIGLGTK+G+G+AV+VFGLVF
Subjt:  KGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIGVAVLVFGLVF

Query:  ALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLLGEVKYKLKDY
        ALGDFLP+GS S  K++ V   ++S EE++ L+  LKE+E TL   P+D  A+EGAAVT  ELG+Y+RAA+ LE L KE+  D D+FRLLGEV Y+L +Y
Subjt:  ALGDFLPSGSISSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLLGEVKYKLKDY

Query:  DGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGDAVSVYDQLIS
        +GS+AAYK +  +SK ++ EV RGL NA LAA KPDEAV+FLLD RERL + K  S  +    ET L  DP+QVELLLGK+YSDWGH+ DA++VYDQLIS
Subjt:  DGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGDAVSVYDQLIS

Query:  SHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        +HP+DFRGYLAKGIIL+ENG  GDAERMFIQARFFAP  AK LVDR
Subjt:  SHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

AT1G78915.2 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-10958.4Show/hide
Query:  KGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIGVAVLVFGLVF
        K    + RK  SKQS +VP +AP ++ + +G S  +S D+ F++RLE ++RSALE+KK +  KEFG IDYDAPV+S++KTIGLGTK+G+G+AV+VFGLVF
Subjt:  KGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIGVAVLVFGLVF

Query:  ALGDFLPSGSISSV-----------------KDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDD
        ALGDFLP+G IS V                 K++ V   ++S EE++ L+  LKE+E TL   P+D  A+EGAAVT  ELG+Y+RAA+ LE L KE+  D
Subjt:  ALGDFLPSGSISSV-----------------KDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDD

Query:  FDIFRLLGEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYS
         D+FRLLGEV Y+L +Y+GS+AAYK +  +SK ++ EV RGL NA LAA KPDEAV+FLLD RERL + K  S  +    ET L  DP+QVELLLGK+YS
Subjt:  FDIFRLLGEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYS

Query:  DWGHVGDAVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        DWGH+ DA++VYDQLIS+HP+DFRGYLAKGIIL+ENG  GDAERMFIQARFFAP  AK LVDR
Subjt:  DWGHVGDAVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

AT1G78915.3 Tetratricopeptide repeat (TPR)-like superfamily protein1.7e-10857.38Show/hide
Query:  KGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIGVAVLVFGLVF
        K    + RK  SKQS +VP +AP ++ + +G S  +S D+ F++RLE ++RSALE+KK +  KEFG IDYDAPV+S++KTIGLGTK+G+G+AV+VFGLVF
Subjt:  KGRVYEPRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIGVAVLVFGLVF

Query:  ALGDFLPSGSI--------------------SSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEK
        ALGDFLP+G                      S  K++ V   ++S EE++ L+  LKE+E TL   P+D  A+EGAAVT  ELG+Y+RAA+ LE L KE+
Subjt:  ALGDFLPSGSI--------------------SSVKDSVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEK

Query:  SDDFDIFRLLGEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGK
          D D+FRLLGEV Y+L +Y+GS+AAYK +  +SK ++ EV RGL NA LAA KPDEAV+FLLD RERL + K  S  +    ET L  DP+QVELLLGK
Subjt:  SDDFDIFRLLGEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGK

Query:  SYSDWGHVGDAVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR
        +YSDWGH+ DA++VYDQLIS+HP+DFRGYLAKGIIL+ENG  GDAERMFIQARFFAP  AK LVDR
Subjt:  SYSDWGHVGDAVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFAPENAKMLVDR

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.6e-1847.92Show/hide
Query:  LSKKFEIKDLGHLRYFLEMEVARSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSKPGVNPDDGPVDRDRYQRLVGKLIYLTHTRPDISFAV
        L   F+++DLG L+YFL +E+ARS+ GI+I Q+K+ LDLL ETG+ G +P+  PM+ +     +     VD   Y+RL+G+L+YL  TR DISFAV
Subjt:  LSKKFEIKDLGHLRYFLEMEVARSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSKPGVNPDDGPVDRDRYQRLVGKLIYLTHTRPDISFAV

ATMG00810.1 DNA/RNA polymerases superfamily protein1.3e-1036.46Show/hide
Query:  LSKKFEIKDLGHLRYFLEMEVARSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSKPGVNPDDGPVDRDRYQRLVGKLIYLTHTRPDISFAV
        LS  F +KDLG + YFL +++     G+ ++Q K+   +L   GM   +P   P+       V+    P D   ++ +VG L YLT TRPDIS+AV
Subjt:  LSKKFEIKDLGHLRYFLEMEVARSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSKPGVNPDDGPVDRDRYQRLVGKLIYLTHTRPDISFAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCTACCATAGAGTTGACAAGCCACGTCATAACCATGGAGTTTTCCGCATCCCACACAGTGAATAAGGGGTCATCTGGACTAGGAGCGACTTTTTCTCTTGTGAG
GTCCACCGGAGAGTCGCCGGAACTGGTCAGAAATCACCCGGAGACTTGTGCAGACCTGGTCGATCGGCGGAGCAGAGGTGAACCTAGGCGGTGTCGGCGACGTCGGCATC
GGCGTCAGCTAGCTACAGCTTGTTCGATGTCTGGGCTTCACGAACGACGAGATCCACGGACGACCGGTGCTGGGAGTTGTCCGGTGACTCAGCAACTCCTGTCAAAGAAA
TTTGAGATTAAAGACTTGGGACATCTAAGGTACTTCCTCGAGATGGAAGTAGCCAGATCCAGCAAAGGTATCTCTATCACTCAACAGAAATTCACTTTAGATCTCTTAAA
GGAGACTGGAATGAGTGGTTATAGACCGGCTGATAGACCTATGGAAGCAAATTCAAAACCTGGAGTTAATCCTGATGATGGACCAGTTGATCGAGATAGGTATCAACGGT
TGGTGGGAAAGTTAATATACTTGACTCACACCCGACCAGATATTAGCTTTGCTGTTTGCGTGGCCGACAAAGCCAGCAACTCGGGCAACCCGAAGGGTAGGGTGTACGAA
CCAAGGAAACCCATTTCAAAACAATCTAGTACAGTACCCACACAGGCACCTGCTGTGAGTGTCCGTTCTGATGGAAACTCTTATAGCAAATCGTTGGACCTTCAATTCGA
AAAACGACTTGAAGCAGTTAAAAGATCAGCACTTGAGAAGAAAAAGGCAGACACTAAAAAAGAATTTGGAGCAATTGACTATGACGCACCAGTGGAATCAGAAAAGAAAA
CAATTGGACTTGGTACCAAGATTGGAATAGGTGTAGCTGTTTTGGTCTTTGGCTTGGTTTTTGCGCTTGGAGACTTTCTGCCCTCTGGAAGTATCAGTTCTGTTAAGGAT
TCTGTGGTGGAAAATGTCAAACTATCGAGAGAAGAAGAAAGTAATCTTAAGAATATGCTCAAAGAATACGAGGCTACACTTCGTAGCAACCCAAAAGATCCAACTGCTAT
GGAAGGTGCGGCAGTTACCTCAGTTGAATTAGGTGAATATGCACGAGCAGCCTCTTTGCTTGAAGACTTGATAAAGGAGAAGTCGGATGATTTTGACATTTTCCGCTTGC
TTGGGGAAGTAAAATATAAGCTTAAAGATTATGATGGGAGTGTTGCAGCATACAAGAGTGCCACAATGTTATCCAAAGATGTCAATTTTGAGGTTCTGCGTGGTCTTACA
AATGCGTTACTTGCTGCTGGGAAACCAGATGAGGCTGTTCAATTCCTTTTGGACTGTCGTGAACGTCTTAAAAGTGTAAAATTAGGAAGTATGGCTGAGGGCAAGGAGAT
GGAAACAAAATTATCGATTGATCCTGTTCAAGTTGAGTTACTGCTTGGAAAATCATACTCAGATTGGGGACATGTTGGTGATGCTGTATCTGTTTATGATCAACTTATCT
CCAGCCACCCTGATGACTTCCGTGGTTACTTAGCTAAGGGAATTATTCTAAAGGAAAATGGAAGATCTGGAGATGCTGAGAGGATGTTCATCCAAGCCCGATTCTTTGCT
CCGGAGAATGCCAAGATGCTTGTAGACCGGAGATGGAGAATCAAACCTCCAATCTCAAGGTCAATAGTACGTACTTATGCCATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCTACCATAGAGTTGACAAGCCACGTCATAACCATGGAGTTTTCCGCATCCCACACAGTGAATAAGGGGTCATCTGGACTAGGAGCGACTTTTTCTCTTGTGAG
GTCCACCGGAGAGTCGCCGGAACTGGTCAGAAATCACCCGGAGACTTGTGCAGACCTGGTCGATCGGCGGAGCAGAGGTGAACCTAGGCGGTGTCGGCGACGTCGGCATC
GGCGTCAGCTAGCTACAGCTTGTTCGATGTCTGGGCTTCACGAACGACGAGATCCACGGACGACCGGTGCTGGGAGTTGTCCGGTGACTCAGCAACTCCTGTCAAAGAAA
TTTGAGATTAAAGACTTGGGACATCTAAGGTACTTCCTCGAGATGGAAGTAGCCAGATCCAGCAAAGGTATCTCTATCACTCAACAGAAATTCACTTTAGATCTCTTAAA
GGAGACTGGAATGAGTGGTTATAGACCGGCTGATAGACCTATGGAAGCAAATTCAAAACCTGGAGTTAATCCTGATGATGGACCAGTTGATCGAGATAGGTATCAACGGT
TGGTGGGAAAGTTAATATACTTGACTCACACCCGACCAGATATTAGCTTTGCTGTTTGCGTGGCCGACAAAGCCAGCAACTCGGGCAACCCGAAGGGTAGGGTGTACGAA
CCAAGGAAACCCATTTCAAAACAATCTAGTACAGTACCCACACAGGCACCTGCTGTGAGTGTCCGTTCTGATGGAAACTCTTATAGCAAATCGTTGGACCTTCAATTCGA
AAAACGACTTGAAGCAGTTAAAAGATCAGCACTTGAGAAGAAAAAGGCAGACACTAAAAAAGAATTTGGAGCAATTGACTATGACGCACCAGTGGAATCAGAAAAGAAAA
CAATTGGACTTGGTACCAAGATTGGAATAGGTGTAGCTGTTTTGGTCTTTGGCTTGGTTTTTGCGCTTGGAGACTTTCTGCCCTCTGGAAGTATCAGTTCTGTTAAGGAT
TCTGTGGTGGAAAATGTCAAACTATCGAGAGAAGAAGAAAGTAATCTTAAGAATATGCTCAAAGAATACGAGGCTACACTTCGTAGCAACCCAAAAGATCCAACTGCTAT
GGAAGGTGCGGCAGTTACCTCAGTTGAATTAGGTGAATATGCACGAGCAGCCTCTTTGCTTGAAGACTTGATAAAGGAGAAGTCGGATGATTTTGACATTTTCCGCTTGC
TTGGGGAAGTAAAATATAAGCTTAAAGATTATGATGGGAGTGTTGCAGCATACAAGAGTGCCACAATGTTATCCAAAGATGTCAATTTTGAGGTTCTGCGTGGTCTTACA
AATGCGTTACTTGCTGCTGGGAAACCAGATGAGGCTGTTCAATTCCTTTTGGACTGTCGTGAACGTCTTAAAAGTGTAAAATTAGGAAGTATGGCTGAGGGCAAGGAGAT
GGAAACAAAATTATCGATTGATCCTGTTCAAGTTGAGTTACTGCTTGGAAAATCATACTCAGATTGGGGACATGTTGGTGATGCTGTATCTGTTTATGATCAACTTATCT
CCAGCCACCCTGATGACTTCCGTGGTTACTTAGCTAAGGGAATTATTCTAAAGGAAAATGGAAGATCTGGAGATGCTGAGAGGATGTTCATCCAAGCCCGATTCTTTGCT
CCGGAGAATGCCAAGATGCTTGTAGACCGGAGATGGAGAATCAAACCTCCAATCTCAAGGTCAATAGTACGTACTTATGCCATTTAA
Protein sequenceShow/hide protein sequence
MSSTIELTSHVITMEFSASHTVNKGSSGLGATFSLVRSTGESPELVRNHPETCADLVDRRSRGEPRRCRRRRHRRQLATACSMSGLHERRDPRTTGAGSCPVTQQLLSKK
FEIKDLGHLRYFLEMEVARSSKGISITQQKFTLDLLKETGMSGYRPADRPMEANSKPGVNPDDGPVDRDRYQRLVGKLIYLTHTRPDISFAVCVADKASNSGNPKGRVYE
PRKPISKQSSTVPTQAPAVSVRSDGNSYSKSLDLQFEKRLEAVKRSALEKKKADTKKEFGAIDYDAPVESEKKTIGLGTKIGIGVAVLVFGLVFALGDFLPSGSISSVKD
SVVENVKLSREEESNLKNMLKEYEATLRSNPKDPTAMEGAAVTSVELGEYARAASLLEDLIKEKSDDFDIFRLLGEVKYKLKDYDGSVAAYKSATMLSKDVNFEVLRGLT
NALLAAGKPDEAVQFLLDCRERLKSVKLGSMAEGKEMETKLSIDPVQVELLLGKSYSDWGHVGDAVSVYDQLISSHPDDFRGYLAKGIILKENGRSGDAERMFIQARFFA
PENAKMLVDRRWRIKPPISRSIVRTYAI