CuGenDBv2

ClCG01G011365 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG01G011365
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Tetratricopeptide repeat (TPR)-like superfamily protein
Genome location: CG_Chr01:18641979..18647473
RNA-Seq Expression: ClCG01G011365
Synteny: ClCG01G011365
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily
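The genome location above is given in a `seqid:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state its convention), the string can be split into its parts and the genomic span computed as follows. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced only for this illustration.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates, so both ends are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("CG_Chr01:18641979..18647473")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 5,495 bp.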


Homology
GenBank top hits | e value | %identity | Alignment
XP_011654469.1 putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial [Cucumis sativus] | e value: 1.5e-141 | %identity: 76.85
Query:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV
        ++E V NG   W     FF WASKQHGY+HNC  FNAIASI SHAR+N PLRA+AMDVLNFRCSMTPRA  VFLR  GSVGLVEEAN+LF QVRSM LC+
Subjt:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV

Query:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD
        PN+YSYNCLLEILSK N+IDSIENRL EMKDFGWEVDKYTLTPVL AYCNAGKFDKALIV+NDMHERGWVDGYVFSILALAF KWGEVDR MQFIDRM D
Subjt:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD

Query:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER
        QNL LN KTFYALIHGFVKESR DMALKL+EKMLKL FT D+SIYDVLIGGLCKK AFEKAMALF+KMK+ GI PDV+ILAKL+ASS EERVVIMLLGER
Subjt:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER

Query:  PKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQ
        PKDI+ EG    I  +++ +  LV   +V+    + Q
Subjt:  PKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQ

XP_022141713.1 putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial [Momordica charantia] | e value: 3.1e-139 | %identity: 76.81
Query:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV
        ++E V NG   W     FFIWASKQ+GY+HNCY +NAIASI S ARQN PLRA+AMDVLNFRCSMTP A  VFLR  GSVGLVEEANFLF QVR+MGLCV
Subjt:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV

Query:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD
        PNSYSYNCLLEILSKAN++DSIE RL EMK  G EVDKYTLTPVLKAY N+GKFDKAL VYNDMHERGWVDGY FSILALAF KWGEVDR M+FIDRMGD
Subjt:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD

Query:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER
        QN GLNEKTFYALIHGFVKESR DMALKL+EKM KL F+PDISIYDVLIG LCKKG FEKAMALFWKMKL GI+PD++ILAKLIAS SEERVVIMLL ER
Subjt:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER

Query:  PKDISYEGFLPDIVAYSAAMDGLVKIHEVDRA
        P+DI+ E     I+ Y++ +   V    +DRA
Subjt:  PKDISYEGFLPDIVAYSAAMDGLVKIHEVDRA

XP_022964323.1 putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial isoform X3 [Cucurbita moschata] | e value: 1.7e-145 | %identity: 65.57
Query:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV
        ++E V NG   W     FFIWASKQHGY+HNCY FN IASI SHARQN PLRAIA DVLN RCSMTP A  +FLR  GSVGLVEEANFLF QVR MGLCV
Subjt:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV

Query:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD
        PNSY+YNCLLEILSKANAIDSIENRLREMK +G EVDKYTLTPVL+AYCNAGKFDKAL VYND+HERGW+DGYVFSIL LAF KWGEVDR M+ I+R GD
Subjt:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD

Query:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER
        QN  L EKTFYALIHGFVKESR DMA+KL+EKM KL F PDISIYDVLIGGLCKKG+FEKAM LFWKMKL GI PD++ILAKLIASSSEER +IMLL ER
Subjt:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER

Query:  PKDI--------SY------------------------------------------------------------------------------EGFLPDIV
        PKD+        SY                                                                              EGFLPDIV
Subjt:  PKDI--------SY------------------------------------------------------------------------------EGFLPDIV

Query:  AYSAAMDGLVKIHEVDRAFEMFQDICT
        AYSAAMDGLVKIHEVDRAFEMFQDICT
Subjt:  AYSAAMDGLVKIHEVDRAFEMFQDICT

XP_022990710.1 putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial isoform X3 [Cucurbita maxima] | e value: 7.6e-146 | %identity: 65.57
Query:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV
        ++E V NG   W     FFIWASKQHGY+HNCY FN IASI SHARQN PLRAIAMDV+N RCSMTP A  +FLR  GSVGLVEEANFLF QVR MGLCV
Subjt:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV

Query:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD
        PNSY+YNCLLEILSKANAIDSIENRLREMK +G EVDKYTLTPVLKAYCN GKFDKAL VYND+HERGW+DGYVFSIL LAF KWGEVDR M+ I+R GD
Subjt:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD

Query:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER
        QN  L EKTFYALIHGFVKESR DMA+KL+EKM KL F PDISIYDVLIGGLCKKG+FEKAM LFWKMKL GI PD++ILAKLIASSSEER +IMLL ER
Subjt:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER

Query:  PKDI--------SY------------------------------------------------------------------------------EGFLPDIV
        PKD+        SY                                                                              EGFLPDIV
Subjt:  PKDI--------SY------------------------------------------------------------------------------EGFLPDIV

Query:  AYSAAMDGLVKIHEVDRAFEMFQDICT
        AYSAAMDGLVKIHEVDRAFEMFQDICT
Subjt:  AYSAAMDGLVKIHEVDRAFEMFQDICT

XP_038894372.1 LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial [Benincasa hispida] | e value: 8.6e-150 | %identity: 80.88
Query:  RGLSSNNGIIVLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLF
        + L+S     ++E V NG   W     FF WASKQHGY+HNCY FNAIASI SHARQN PLRAIA DVLNFRCSMTP A  VFLR  G+VGLVEEANFLF
Subjt:  RGLSSNNGIIVLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLF

Query:  YQVRSMGLCVPNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDR
         QVRSM LCVPNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNA +FDKALIVY++MHERGWVDGYVFSILALAF KWGEVDR
Subjt:  YQVRSMGLCVPNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDR

Query:  VMQFIDRMGDQNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEE
         MQFIDRMGDQNLGLNEKTFYALIHGFVKESR DMALKL EKMLKL FTPDISIYDVLIGGLCKKGAFEKAMALF KMKLFGI PDV+ILAKLIASSSEE
Subjt:  VMQFIDRMGDQNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEE

Query:  RVVIMLLGERPKDISYEGFLPDIVAYSAAMDGLVKIHEVD
        RVVIMLLGERPKDI+ EG    I+ Y++ +  LV   +V+
Subjt:  RVVIMLLGERPKDISYEGFLPDIVAYSAAMDGLVKIHEVD

TrEMBL top hits | e value | %identity | Alignment
A0A0A0KMI5 Uncharacterized protein | e value: 7.2e-142 | %identity: 76.85
Query:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV
        ++E V NG   W     FF WASKQHGY+HNC  FNAIASI SHAR+N PLRA+AMDVLNFRCSMTPRA  VFLR  GSVGLVEEAN+LF QVRSM LC+
Subjt:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV

Query:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD
        PN+YSYNCLLEILSK N+IDSIENRL EMKDFGWEVDKYTLTPVL AYCNAGKFDKALIV+NDMHERGWVDGYVFSILALAF KWGEVDR MQFIDRM D
Subjt:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD

Query:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER
        QNL LN KTFYALIHGFVKESR DMALKL+EKMLKL FT D+SIYDVLIGGLCKK AFEKAMALF+KMK+ GI PDV+ILAKL+ASS EERVVIMLLGER
Subjt:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER

Query:  PKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQ
        PKDI+ EG    I  +++ +  LV   +V+    + Q
Subjt:  PKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQ

A0A6J1CLB5 putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial | e value: 1.5e-139 | %identity: 76.81
Query:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV
        ++E V NG   W     FFIWASKQ+GY+HNCY +NAIASI S ARQN PLRA+AMDVLNFRCSMTP A  VFLR  GSVGLVEEANFLF QVR+MGLCV
Subjt:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV

Query:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD
        PNSYSYNCLLEILSKAN++DSIE RL EMK  G EVDKYTLTPVLKAY N+GKFDKAL VYNDMHERGWVDGY FSILALAF KWGEVDR M+FIDRMGD
Subjt:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD

Query:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER
        QN GLNEKTFYALIHGFVKESR DMALKL+EKM KL F+PDISIYDVLIG LCKKG FEKAMALFWKMKL GI+PD++ILAKLIAS SEERVVIMLL ER
Subjt:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER

Query:  PKDISYEGFLPDIVAYSAAMDGLVKIHEVDRA
        P+DI+ E     I+ Y++ +   V    +DRA
Subjt:  PKDISYEGFLPDIVAYSAAMDGLVKIHEVDRA

A0A6J1HIJ6 putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial isoform X1 | e value: 9.7e-139 | %identity: 75.3
Query:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV
        ++E V NG   W     FFIWASKQHGY+HNCY FN IASI SHARQN PLRAIA DVLN RCSMTP A  +FLR  GSVGLVEEANFLF QVR MGLCV
Subjt:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV

Query:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD
        PNSY+YNCLLEILSKANAIDSIENRLREMK +G EVDKYTLTPVL+AYCNAGKFDKAL VYND+HERGW+DGYVFSIL LAF KWGEVDR M+ I+R GD
Subjt:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD

Query:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER
        QN  L EKTFYALIHGFVKESR DMA+KL+EKM KL F PDISIYDVLIGGLCKKG+FEKAM LFWKMKL GI PD++ILAKLIASSSEER +IMLL ER
Subjt:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER

Query:  PKDISYEGFLPDIVAYSAAMDGLVKIHEVDRA
        PKD++ EG    I+ Y++ +   V +  +D+A
Subjt:  PKDISYEGFLPDIVAYSAAMDGLVKIHEVDRA

A0A6J1HKH1 putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial isoform X3 | e value: 8.2e-146 | %identity: 65.57
Query:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV
        ++E V NG   W     FFIWASKQHGY+HNCY FN IASI SHARQN PLRAIA DVLN RCSMTP A  +FLR  GSVGLVEEANFLF QVR MGLCV
Subjt:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV

Query:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD
        PNSY+YNCLLEILSKANAIDSIENRLREMK +G EVDKYTLTPVL+AYCNAGKFDKAL VYND+HERGW+DGYVFSIL LAF KWGEVDR M+ I+R GD
Subjt:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD

Query:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER
        QN  L EKTFYALIHGFVKESR DMA+KL+EKM KL F PDISIYDVLIGGLCKKG+FEKAM LFWKMKL GI PD++ILAKLIASSSEER +IMLL ER
Subjt:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER

Query:  PKDI--------SY------------------------------------------------------------------------------EGFLPDIV
        PKD+        SY                                                                              EGFLPDIV
Subjt:  PKDI--------SY------------------------------------------------------------------------------EGFLPDIV

Query:  AYSAAMDGLVKIHEVDRAFEMFQDICT
        AYSAAMDGLVKIHEVDRAFEMFQDICT
Subjt:  AYSAAMDGLVKIHEVDRAFEMFQDICT

A0A6J1JQU8 putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial isoform X3 | e value: 3.7e-146 | %identity: 65.57
Query:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV
        ++E V NG   W     FFIWASKQHGY+HNCY FN IASI SHARQN PLRAIAMDV+N RCSMTP A  +FLR  GSVGLVEEANFLF QVR MGLCV
Subjt:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV

Query:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD
        PNSY+YNCLLEILSKANAIDSIENRLREMK +G EVDKYTLTPVLKAYCN GKFDKAL VYND+HERGW+DGYVFSIL LAF KWGEVDR M+ I+R GD
Subjt:  PNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRMGD

Query:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER
        QN  L EKTFYALIHGFVKESR DMA+KL+EKM KL F PDISIYDVLIGGLCKKG+FEKAM LFWKMKL GI PD++ILAKLIASSSEER +IMLL ER
Subjt:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGER

Query:  PKDI--------SY------------------------------------------------------------------------------EGFLPDIV
        PKD+        SY                                                                              EGFLPDIV
Subjt:  PKDI--------SY------------------------------------------------------------------------------EGFLPDIV

Query:  AYSAAMDGLVKIHEVDRAFEMFQDICT
        AYSAAMDGLVKIHEVDRAFEMFQDICT
Subjt:  AYSAAMDGLVKIHEVDRAFEMFQDICT

SwissProt top hits | e value | %identity | Alignment
P0C8Q6 Putative pentatricopeptide repeat-containing protein At5g08310, mitochondrial | e value: 5.9e-85 | %identity: 46.09
Query:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV
        V+E V NG + W     FF WASKQ GY+++ YA+NA+ASI S ARQN  L+A+ +DVLN RC M+P A   F+R  G+ GLV+EA+ +F +VR MGLCV
Subjt:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV

Query:  PNSYSYNCLLEILSKAN--AIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRM
        PN+Y+YNCLLE +SK+N  +++ +E RL+EM+D G+  DK+TLTPVL+ YCN GK ++AL V+N++  RGW+D ++ +IL ++F KWG+VD+  + I+ +
Subjt:  PNSYSYNCLLEILSKAN--AIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRM

Query:  GDQNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEE----RVVI
         ++++ LN KT+  LIHGFVKESR D A +L EKM ++    DI++YDVLIGGLCK    E A++L+ ++K  GI PD  IL KL+ S SEE    R+  
Subjt:  GDQNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEE----RVVI

Query:  MLLGERPKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDI
        +++G+  K          ++ Y +  +G ++   V  A+   Q++
Subjt:  MLLGERPKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDI

Q9FIX3 Pentatricopeptide repeat-containing protein At5g39710 | e value: 1.8e-25 | %identity: 27.34
Query:  FYQVRSMGL--CVPNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYV-FSILALAFRKWG
        F  +RSM L    PN  SYN ++  L +   +  +   L EM   G+ +D+ T   ++K YC  G F +AL+++ +M   G     + ++ L  +  K G
Subjt:  FYQVRSMGL--CVPNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYV-FSILALAFRKWG

Query:  EVDRVMQFIDRMGDQNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIAS
         ++R M+F+D+M  + L  NE+T+  L+ GF ++   + A +++ +M    F+P +  Y+ LI G C  G  E A+A+   MK  G+ PDV   + +++ 
Subjt:  EVDRVMQFIDRMGDQNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIAS

Query:  SSEERVVIMLLGERPKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDICTV
              V   L  + +++  +G  PD + YS+ + G  +      A ++++++  V
Subjt:  SSEERVVIMLLGERPKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDICTV

Q9LPX2 Pentatricopeptide repeat-containing protein At1g12775, mitochondrial | e value: 4.9e-23 | %identity: 28.1
Query:  NSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYV-FSILALAFRKWGEVDRVMQFIDRMGD
        ++  Y+ +++ L K  ++D+  N   EM+  G++ D  T   ++  +CNAG++D    +  DM +R      V FS+L  +F K G++    Q +  M  
Subjt:  NSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYV-FSILALAFRKWGEVDRVMQFIDRMGD

Query:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDV---KILAKLIASSSEERVVIMLL
        + +  N  T+ +LI GF KE+R + A+++++ M+     PDI  +++LI G CK    +  + LF +M L G++ +      L +    S +  V   L 
Subjt:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDV---KILAKLIASSSEERVVIMLL

Query:  GERPKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDI
            +++      PDIV+Y   +DGL    E+++A E+F  I
Subjt:  GERPKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDI

Q9LR67 Pentatricopeptide repeat-containing protein At1g03560, mitochondrial | e value: 4.2e-30 | %identity: 26.63
Query:  FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCVPNSYSYNCLLEILSKAN
        FF W+ KQ  Y HN   + ++  + + A+    +R ++ ++  F   MT  A+   ++SFG +G+VEE  +++ +++  G+  P  Y+YN L+  L  A 
Subjt:  FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCVPNSYSYNCLLEILSKAN

Query:  AIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGW-VDGYVFSILALAFRKWGEVDRVMQFIDRMGDQNLGLNEKTFYALIHG
         +DS E     M+    + D  T   ++K YC AG+  KA+    DM  RG   D   +  +  A     +    +     M ++ + +    F  +I G
Subjt:  AIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGW-VDGYVFSILALAFRKWGEVDRVMQFIDRMGDQNLGLNEKTFYALIHG

Query:  FVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGERPKDISYEGFLPDIVAY
          KE + +    + E M++    P+++IY VLI G  K G+ E A+ L  +M   G  PDV   + ++    +   V   L +      ++G   + + Y
Subjt:  FVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGERPKDISYEGFLPDIVAY

Query:  SAAMDGLVKIHEVDRAFEMFQDI
        S+ +DGL K   VD A  +F+++
Subjt:  SAAMDGLVKIHEVDRAFEMFQDI

Q9SVH3 Pentatricopeptide repeat-containing protein At4g20740 | e value: 4.2e-22 | %identity: 26.85
Query:  FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCVPNSYSYNCLLEILSKAN
        FF WA KQ GYKH+  A+NA A       +N   RA                                A+ L   + S G   P+   +  L+ + +   
Subjt:  FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCVPNSYSYNCLLEILSKAN

Query:  AIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWV-DGYVFSILALAFRKWGEVDRVMQFIDRMGDQNLGLNEKTFYALIHG
            +     +MK FG++   +    ++ A    G FD AL VY D  E G V +   F IL     K G ++ +++ + RM +     +   + A+I  
Subjt:  AIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWV-DGYVFSILALAFRKWGEVDRVMQFIDRMGDQNLGLNEKTFYALIHG

Query:  FVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLI---ASSSEERVVIMLLGERPKDISYEGFLPDI
         V E   D +L++ ++M + +  PD+  Y  L+ GLCK G  E+   LF +MK   I+ D +I   LI    +  + R    L     +D+   G++ DI
Subjt:  FVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLI---ASSSEERVVIMLLGERPKDISYEGFLPDI

Query:  VAYSAAMDGLVKIHEVDRAFEMFQ
          Y+A + GL  +++VD+A+++FQ
Subjt:  VAYSAAMDGLVKIHEVDRAFEMFQ

Arabidopsis top hits | e value | %identity | Alignment
AT1G03560.1 Pentatricopeptide repeat (PPR-like) superfamily protein | e value: 3.0e-31 | %identity: 26.63
Query:  FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCVPNSYSYNCLLEILSKAN
        FF W+ KQ  Y HN   + ++  + + A+    +R ++ ++  F   MT  A+   ++SFG +G+VEE  +++ +++  G+  P  Y+YN L+  L  A 
Subjt:  FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCVPNSYSYNCLLEILSKAN

Query:  AIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGW-VDGYVFSILALAFRKWGEVDRVMQFIDRMGDQNLGLNEKTFYALIHG
         +DS E     M+    + D  T   ++K YC AG+  KA+    DM  RG   D   +  +  A     +    +     M ++ + +    F  +I G
Subjt:  AIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGW-VDGYVFSILALAFRKWGEVDRVMQFIDRMGDQNLGLNEKTFYALIHG

Query:  FVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGERPKDISYEGFLPDIVAY
          KE + +    + E M++    P+++IY VLI G  K G+ E A+ L  +M   G  PDV   + ++    +   V   L +      ++G   + + Y
Subjt:  FVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGERPKDISYEGFLPDIVAY

Query:  SAAMDGLVKIHEVDRAFEMFQDI
        S+ +DGL K   VD A  +F+++
Subjt:  SAAMDGLVKIHEVDRAFEMFQDI

AT1G12775.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 3.5e-24 | %identity: 28.1
Query:  NSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYV-FSILALAFRKWGEVDRVMQFIDRMGD
        ++  Y+ +++ L K  ++D+  N   EM+  G++ D  T   ++  +CNAG++D    +  DM +R      V FS+L  +F K G++    Q +  M  
Subjt:  NSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYV-FSILALAFRKWGEVDRVMQFIDRMGD

Query:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDV---KILAKLIASSSEERVVIMLL
        + +  N  T+ +LI GF KE+R + A+++++ M+     PDI  +++LI G CK    +  + LF +M L G++ +      L +    S +  V   L 
Subjt:  QNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDV---KILAKLIASSSEERVVIMLL

Query:  GERPKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDI
            +++      PDIV+Y   +DGL    E+++A E+F  I
Subjt:  GERPKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDI

AT4G20740.1 Pentatricopeptide repeat (PPR-like) superfamily protein | e value: 3.0e-23 | %identity: 26.85
Query:  FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCVPNSYSYNCLLEILSKAN
        FF WA KQ GYKH+  A+NA A       +N   RA                                A+ L   + S G   P+   +  L+ + +   
Subjt:  FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCVPNSYSYNCLLEILSKAN

Query:  AIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWV-DGYVFSILALAFRKWGEVDRVMQFIDRMGDQNLGLNEKTFYALIHG
            +     +MK FG++   +    ++ A    G FD AL VY D  E G V +   F IL     K G ++ +++ + RM +     +   + A+I  
Subjt:  AIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWV-DGYVFSILALAFRKWGEVDRVMQFIDRMGDQNLGLNEKTFYALIHG

Query:  FVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLI---ASSSEERVVIMLLGERPKDISYEGFLPDI
         V E   D +L++ ++M + +  PD+  Y  L+ GLCK G  E+   LF +MK   I+ D +I   LI    +  + R    L     +D+   G++ DI
Subjt:  FVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLI---ASSSEERVVIMLLGERPKDISYEGFLPDI

Query:  VAYSAAMDGLVKIHEVDRAFEMFQ
          Y+A + GL  +++VD+A+++FQ
Subjt:  VAYSAAMDGLVKIHEVDRAFEMFQ

AT5G08310.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 4.2e-86 | %identity: 46.09
Query:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV
        V+E V NG + W     FF WASKQ GY+++ YA+NA+ASI S ARQN  L+A+ +DVLN RC M+P A   F+R  G+ GLV+EA+ +F +VR MGLCV
Subjt:  VLEKVFNGREIW-----FFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEEANFLFYQVRSMGLCV

Query:  PNSYSYNCLLEILSKAN--AIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRM
        PN+Y+YNCLLE +SK+N  +++ +E RL+EM+D G+  DK+TLTPVL+ YCN GK ++AL V+N++  RGW+D ++ +IL ++F KWG+VD+  + I+ +
Subjt:  PNSYSYNCLLEILSKAN--AIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFIDRM

Query:  GDQNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEE----RVVI
         ++++ LN KT+  LIHGFVKESR D A +L EKM ++    DI++YDVLIGGLCK    E A++L+ ++K  GI PD  IL KL+ S SEE    R+  
Subjt:  GDQNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEE----RVVI

Query:  MLLGERPKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDI
        +++G+  K          ++ Y +  +G ++   V  A+   Q++
Subjt:  MLLGERPKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDI

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 1.3e-26 | %identity: 27.34
Query:  FYQVRSMGL--CVPNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYV-FSILALAFRKWG
        F  +RSM L    PN  SYN ++  L +   +  +   L EM   G+ +D+ T   ++K YC  G F +AL+++ +M   G     + ++ L  +  K G
Subjt:  FYQVRSMGL--CVPNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYV-FSILALAFRKWG

Query:  EVDRVMQFIDRMGDQNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIAS
         ++R M+F+D+M  + L  NE+T+  L+ GF ++   + A +++ +M    F+P +  Y+ LI G C  G  E A+A+   MK  G+ PDV   + +++ 
Subjt:  EVDRVMQFIDRMGDQNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIAS

Query:  SSEERVVIMLLGERPKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDICTV
              V   L  + +++  +G  PD + YS+ + G  +      A ++++++  V
Subjt:  SSEERVVIMLLGERPKDISYEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDICTV
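Each alignment block above pairs a Query row with an unlabeled middle row. As a rough sketch, assuming the usual BLAST-style convention (a letter in the middle row marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap), the identities in one displayed row can be counted as below. The function name `midline_stats` is hypothetical. The %identity values on this page are presumably computed over each full alignment, so a single displayed row is not expected to reproduce them.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment row.

    `query` is the sequence text of a Query row and `midline` the text of
    the row beneath it. Trailing spaces lost from the midline are restored
    by padding it to the query length.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query row")
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical
    positives = identities + midline.count("+")        # '+' = conservative
    length = len(query)                                # columns, gaps included
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": round(100.0 * identities / length, 2),
    }


# Last row of the Momordica charantia alignment, copied from the page.
row_query = "PKDISYEGFLPDIVAYSAAMDGLVKIHEVDRA"
row_midline = "P+DI+ E     I+ Y++ +   V    +DRA"
print(midline_stats(row_query, row_midline))
```

Taking the alignment length from the Query row rather than from the middle row is a deliberate choice here, since text extraction tends to strip trailing spaces.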


Sequences
CDS sequence
ATGCATTTTCCAGTGCAAGCACCTAAAGGTGAATTTGGTGTCTCTCTGGTCAGTAATAGAAGCAATCATCCCTACTGTCATAGAATAAGAGCACCTGGCTCTGCCCATTC
ACAAGGAGTGAGGGAGGACCAAACGAAGGAATGGAGATCCATGCCAAAATCCACTCTCAAACCTTGGTCCTTTGTCAAGCCTGACCCTCGGGGTCTTTATGGTCTGGAGA
TCTATAGGACATACACGAACACAAACTTCACAAGCAATGCATTTATCAACTTCAAAATGGATTCGACCACGGTCAGGACTAAGGTAACGCATGGTCAGGGTGAACGCATA
GAAATAACCCGAGAGCTTGAAGGAAAAGAAGTGACGCGTAGAGGAGCAATCAAGGAATCTTCAAAGATAGGAGGAAGATCATGGACGCATGACAGTATGCGGTTCAGGCT
GACGCATGGTATTACGCAAAAAGGAGCGACTGATGACTCTGAAAGAAGAATTCTGGAGAGAAAGAGAAGCAAGAAGGCAGAAAGGAGGAATCTTAGCGTCCGTAGTAGGA
GTCTGTGGCCAGGAAAGAAGATAGGTGAAGCGTTCAAGTTGGACCTTGCGTCCAAGAGAAGTCGATACATCCAAAAGAGAAGTAAGAAGAGTTGGGCTGCACTTTGCGGA
AGTAATCAACCTTGCGTTGAAGAGCAGGGTAGCATGACAGTTCCCGTTGCCAATCGAAGTCGAGGTTTGTCAAGCAATAACGGTATTATCGTCCTAGAAAAAGTGTTTAA
TGGAAGGGAAATATGGTTCTTCATTTGGGCCTCAAAACAACATGGGTACAAGCATAATTGTTATGCTTTCAATGCCATTGCATCAATCCAATCACATGCTCGACAAAACA
CCCCACTAAGAGCCATTGCTATGGATGTCCTTAACTTTCGTTGTTCGATGACCCCTCGAGCTTCGAGAGTCTTTTTAAGAAGTTTTGGAAGCGTGGGGTTGGTTGAGGAA
GCTAACTTTTTGTTTTATCAGGTTAGATCAATGGGTCTCTGTGTTCCAAATAGTTATAGTTATAACTGTTTGTTGGAAATTTTGTCCAAGGCAAATGCTATTGATTCCAT
CGAGAACAGGTTGAGGGAGATGAAAGATTTCGGGTGGGAAGTGGATAAGTACACATTGACACCTGTTTTGAAGGCTTATTGCAATGCTGGAAAATTTGACAAAGCTCTAA
TTGTGTATAATGATATGCATGAGAGAGGGTGGGTTGATGGGTATGTCTTTTCTATCTTGGCATTAGCTTTTAGAAAGTGGGGTGAGGTAGATAGAGTAATGCAATTCATA
GACAGAATGGGAGATCAAAATCTTGGGTTAAATGAAAAGACATTCTATGCATTGATTCATGGTTTTGTGAAGGAATCCAGAAATGATATGGCTCTTAAGTTGATTGAGAA
AATGCTGAAACTGGATTTTACTCCTGATATTTCAATCTATGATGTGCTAATAGGAGGACTTTGTAAGAAAGGGGCATTTGAGAAAGCAATGGCTTTGTTTTGGAAGATGA
AGTTGTTTGGAATTATGCCTGACGTTAAGATACTTGCAAAGCTGATAGCATCTTCTTCTGAAGAAAGAGTTGTAATCATGTTACTTGGGGAAAGACCAAAAGATATAAGT
TATGAAGGCTTCCTGCCTGATATAGTTGCCTACTCTGCTGCCATGGATGGGCTAGTCAAGATTCACGAAGTGGATCGTGCTTTCGAGATGTTCCAAGACATTTGTACTGT
GGTTATCGTCCAGATGTGGTTTCTTATAACATAA
mRNA sequence
ATGCATTTTCCAGTGCAAGCACCTAAAGGTGAATTTGGTGTCTCTCTGGTCAGTAATAGAAGCAATCATCCCTACTGTCATAGAATAAGAGCACCTGGCTCTGCCCATTC
ACAAGGAGTGAGGGAGGACCAAACGAAGGAATGGAGATCCATGCCAAAATCCACTCTCAAACCTTGGTCCTTTGTCAAGCCTGACCCTCGGGGTCTTTATGGTCTGGAGA
TCTATAGGACATACACGAACACAAACTTCACAAGCAATGCATTTATCAACTTCAAAATGGATTCGACCACGGTCAGGACTAAGGTAACGCATGGTCAGGGTGAACGCATA
GAAATAACCCGAGAGCTTGAAGGAAAAGAAGTGACGCGTAGAGGAGCAATCAAGGAATCTTCAAAGATAGGAGGAAGATCATGGACGCATGACAGTATGCGGTTCAGGCT
GACGCATGGTATTACGCAAAAAGGAGCGACTGATGACTCTGAAAGAAGAATTCTGGAGAGAAAGAGAAGCAAGAAGGCAGAAAGGAGGAATCTTAGCGTCCGTAGTAGGA
GTCTGTGGCCAGGAAAGAAGATAGGTGAAGCGTTCAAGTTGGACCTTGCGTCCAAGAGAAGTCGATACATCCAAAAGAGAAGTAAGAAGAGTTGGGCTGCACTTTGCGGA
AGTAATCAACCTTGCGTTGAAGAGCAGGGTAGCATGACAGTTCCCGTTGCCAATCGAAGTCGAGGTTTGTCAAGCAATAACGGTATTATCGTCCTAGAAAAAGTGTTTAA
TGGAAGGGAAATATGGTTCTTCATTTGGGCCTCAAAACAACATGGGTACAAGCATAATTGTTATGCTTTCAATGCCATTGCATCAATCCAATCACATGCTCGACAAAACA
CCCCACTAAGAGCCATTGCTATGGATGTCCTTAACTTTCGTTGTTCGATGACCCCTCGAGCTTCGAGAGTCTTTTTAAGAAGTTTTGGAAGCGTGGGGTTGGTTGAGGAA
GCTAACTTTTTGTTTTATCAGGTTAGATCAATGGGTCTCTGTGTTCCAAATAGTTATAGTTATAACTGTTTGTTGGAAATTTTGTCCAAGGCAAATGCTATTGATTCCAT
CGAGAACAGGTTGAGGGAGATGAAAGATTTCGGGTGGGAAGTGGATAAGTACACATTGACACCTGTTTTGAAGGCTTATTGCAATGCTGGAAAATTTGACAAAGCTCTAA
TTGTGTATAATGATATGCATGAGAGAGGGTGGGTTGATGGGTATGTCTTTTCTATCTTGGCATTAGCTTTTAGAAAGTGGGGTGAGGTAGATAGAGTAATGCAATTCATA
GACAGAATGGGAGATCAAAATCTTGGGTTAAATGAAAAGACATTCTATGCATTGATTCATGGTTTTGTGAAGGAATCCAGAAATGATATGGCTCTTAAGTTGATTGAGAA
AATGCTGAAACTGGATTTTACTCCTGATATTTCAATCTATGATGTGCTAATAGGAGGACTTTGTAAGAAAGGGGCATTTGAGAAAGCAATGGCTTTGTTTTGGAAGATGA
AGTTGTTTGGAATTATGCCTGACGTTAAGATACTTGCAAAGCTGATAGCATCTTCTTCTGAAGAAAGAGTTGTAATCATGTTACTTGGGGAAAGACCAAAAGATATAAGT
TATGAAGGCTTCCTGCCTGATATAGTTGCCTACTCTGCTGCCATGGATGGGCTAGTCAAGATTCACGAAGTGGATCGTGCTTTCGAGATGTTCCAAGACATTTGTACTGT
GGTTATCGTCCAGATGTGGTTTCTTATAACATAA
Protein sequence
MHFPVQAPKGEFGVSLVSNRSNHPYCHRIRAPGSAHSQGVREDQTKEWRSMPKSTLKPWSFVKPDPRGLYGLEIYRTYTNTNFTSNAFINFKMDSTTVRTKVTHGQGERI
EITRELEGKEVTRRGAIKESSKIGGRSWTHDSMRFRLTHGITQKGATDDSERRILERKRSKKAERRNLSVRSRSLWPGKKIGEAFKLDLASKRSRYIQKRSKKSWAALCG
SNQPCVEEQGSMTVPVANRSRGLSSNNGIIVLEKVFNGREIWFFIWASKQHGYKHNCYAFNAIASIQSHARQNTPLRAIAMDVLNFRCSMTPRASRVFLRSFGSVGLVEE
ANFLFYQVRSMGLCVPNSYSYNCLLEILSKANAIDSIENRLREMKDFGWEVDKYTLTPVLKAYCNAGKFDKALIVYNDMHERGWVDGYVFSILALAFRKWGEVDRVMQFI
DRMGDQNLGLNEKTFYALIHGFVKESRNDMALKLIEKMLKLDFTPDISIYDVLIGGLCKKGAFEKAMALFWKMKLFGIMPDVKILAKLIASSSEERVVIMLLGERPKDIS
YEGFLPDIVAYSAAMDGLVKIHEVDRAFEMFQDICTVVIVQMWFLIT
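The protein sequence corresponds to the CDS read codon by codon. A minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1), is shown below. `translate` is a hypothetical helper written for this illustration and is not part of any CuGenDB tooling. The example input is the first 30 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Codons containing characters outside A/C/G/T are reported as 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed on this page.
print(translate("ATGCATTTTCCAGTGCAAGCACCTAAAGGT"))  # MHFPVQAPKG
```

The output matches the first ten residues of the protein sequence listed above (`MHFPVQAPKG`).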