; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022776 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022776
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr05:28160243..28162105
RNA-Seq ExpressionHG10022776
SyntenyHG10022776
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN45331.2 hypothetical protein Csa_015776 [Cucumis sativus]6.3e-8937.98Show/hide
Query:  MASKT-VASASSKPN---HLLSSSLFTHYSPKIPLSNHPKCQPQPLASPENFNSR-ISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFN
        M SKT + SAS KPN       SS      P+   +NHP         P  FN + ISF H L  FL+NC+TG ITATQA HFFDLM+RS P PPISSFN
Subjt:  MASKT-VASASSKPN---HLLSSSLFTHYSPKIPLSNHPKCQPQPLASPENFNSR-ISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFN

Query:  HLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY-----------------------
         LLGGLAKINHYS++F LY KM LAG+ P+ FTL+IL NCLCNVNRV E LAAMA I R GYIPN +TY TLIKGL                        
Subjt:  HLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY-----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKA
                                                                    GLCKNGCL+EA+E FN LKS N+KL+IES++CLIDGLCKA
Subjt:  ------------------------------------------------------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKA

Query:  GKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVAD
        GKLETAWELFEKL QEGLQ DVVTY+IMIHGFCK GQVDKANILF+KMEENGCTP+IITYNTLL G C+SNKS+EVV+LLH+M+Q+D+SPDA  C IV D
Subjt:  GKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVAD

Query:  MLCKDVKYQECLDLVQDFPAQERR
        ML KD KYQECLDL+  F  QERR
Subjt:  MLCKDVKYQECLDLVQDFPAQERR

XP_011659000.1 pentatricopeptide repeat-containing protein At1g62720 isoform X2 [Cucumis sativus]2.5e-9049.08Show/hide
Query:  SKTVASASS----KPNHLLSSSLFTH------YSPKIPLSNHPKCQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPIS
        + TVASASS         L SSLFTH       +P+I  +N+PK      ASPE    RISFQH +P FL  C+TG I+ TQA  FFDLMMRS     I 
Subjt:  SKTVASASS----KPNHLLSSSLFTH------YSPKIPLSNHPKCQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPIS

Query:  SFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY--------------------
        SFN LL GLAKI HYS+VF LY +MHLAG+ P+  TLNIL+NCLCNVNR+ EGLAAMA I R GYIP+ +T+TTLIKGL                     
Subjt:  SFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY--------------------

Query:  -----------------------------------------------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCK--------
                                                             GLCK G   EAI LFN +    ++ N+ +FS LID LCK        
Subjt:  -----------------------------------------------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCK--------

Query:  --------AGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPD
                 GKLETAWELFEKL +EG+QPD + YS MIHGFCK GQVDKANILFQKMEENGC+P++ITY+ L+RG  ESNK E+VVQLLHRM++KDV PD
Subjt:  --------AGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPD

Query:  ADICTIVADMLCKDVKYQECLDLVQDFPAQERRH
          I  IV DM+CKD KY+E LDL+Q F  Q+ R+
Subjt:  ADICTIVADMLCKDVKYQECLDLVQDFPAQERRH

XP_011659273.2 pentatricopeptide repeat-containing protein At3g22470, mitochondrial [Cucumis sativus]4.8e-8938.14Show/hide
Query:  MASKT-VASASSKPNHLLSSSLFTHYS------PKIPLSNHPKCQPQPLASPENFN-SRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPIS
        M SKT + S S KPN    S L TH S      P+   ++H    P P+  P  FN   ISF H L  FL+NC+TG ITA QA HFFDLMMRS+P PPIS
Subjt:  MASKT-VASASSKPNHLLSSSLFTHYS------PKIPLSNHPKCQPQPLASPENFN-SRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPIS

Query:  SFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY--------------------
        SFN LLGGLAKINHYS++F LY +M LAG+ P+ FTL+IL NCLCNVNRV E LAAMA I R GYIPN +TYTTLIKGL                     
Subjt:  SFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGL
                                                                       GLCKN CLFEA+ELFN LKS N KLNIE++SCLIDGL
Subjt:  ---------------------------------------------------------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGL

Query:  CKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTI
        CKAGKLETAWELFEKLSQEGLQPDVVTY+IMIHGFCK GQVD ANILF+KMEENGCTP+II YNTLL G CE NK EEV++LLH+MVQKDVSP+A  CTI
Subjt:  CKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTI

Query:  VADMLCKDVKYQECLDLVQDFPAQ
        V DMLCKD KY++ +DL+  FP Q
Subjt:  VADMLCKDVKYQECLDLVQDFPAQ

XP_038877920.1 pentatricopeptide repeat-containing protein At1g63330-like isoform X1 [Benincasa hispida]4.6e-11644.34Show/hide
Query:  MASKTVASASS----KPNHLLSSSLFTHYSPKIPLSNHPKCQPQPLASPE----NFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPIS
        MASK+VASASS         LSSSLFT  SP IP SNHPKC PQPLASPE    NFN+ + FQ RL TFLQNCRTGKIT T+ALHFFDLMM S+PTPP+S
Subjt:  MASKTVASASS----KPNHLLSSSLFTHYSPKIPLSNHPKCQPQPLASPE----NFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPIS

Query:  SFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY--------------------
        SFN LLGGLAKI HYS+VFQLYYKMHLAG  PNFFTLNILMNCLCNVNRV EGLAAMARIFR GY+PNKMTYTTLIKGL                     
Subjt:  SFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGL
                                                                       GLCKNGCLFEAIE FN LKS NLKL+IESFSCLIDGL
Subjt:  ---------------------------------------------------------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGL

Query:  CKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTI
        CKAGKLETAWE FEKLSQEGLQPDVVTYSIMIHGFCK GQVDKANILFQKM+ENGCTPNIITY+TLL G CESNKS+EVVQ   +MVQKDV P+A ICTI
Subjt:  CKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTI

Query:  VADMLCKDVKYQECLDLVQDFPAQERR
        V DM+CKD KYQECLDL++ FP Q+R+
Subjt:  VADMLCKDVKYQECLDLVQDFPAQERR

XP_038896203.1 pentatricopeptide repeat-containing protein At1g63330-like [Benincasa hispida]2.0e-10340.76Show/hide
Query:  MASKTVASASSK---PNHLLSSSLFTHYSPKIPL-------SNHPKCQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPP
        MASKT+ SASS     +    SSLFTH SP IP        SNHPK  PQPL SP+NF +RIS QHRLPTFLQNCR GKITAT+ALHFFDLM+RSNPTPP
Subjt:  MASKTVASASSK---PNHLLSSSLFTHYSPKIPL-------SNHPKCQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPP

Query:  ISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY------------------
          SFN LLGGLA+I HYS+VF LY KM LAG+ PNFFTLNIL+NCLCNVNRV EG AAMA I R GYIP+K+TY+TLIKGL                   
Subjt:  ISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLID
                                                                         GLCKNGCLFEA+E FN LKS NLKL+I  F+ LID
Subjt:  -----------------------------------------------------------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLID

Query:  GLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADIC
        GLCK GKLETAWELF+KLSQEGLQP+VVTY+IMIHGFC+ GQVD ANILFQ MEEN CTPN+IT NTLLRG CESNKS+EVV+LLHRMVQ+DV PD   C
Subjt:  GLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADIC

Query:  TIVADMLCKDVKYQECLDLVQDFPAQER
        TIV DMLCKD KY+ECLDL+  FP Q+R
Subjt:  TIVADMLCKDVKYQECLDLVQDFPAQER

TrEMBL top hitse value%identityAlignment
A0A0A0K5M0 Uncharacterized protein4.4e-11260.59Show/hide
Query:  MASKT-VASASSKPN---HLLSSSLFTHYSPKIPLSNHPKCQPQPLASPENFNSR-ISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFN
        M SKT + SAS KPN       SS      P+   +NHP         P  FN + ISF H L  FL+NC+TG ITATQA HFFDLM+RS P PPISSFN
Subjt:  MASKT-VASASSKPN---HLLSSSLFTHYSPKIPLSNHPKCQPQPLASPENFNSR-ISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFN

Query:  HLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY-----------------------
         LLGGLAKINHYS++F LY KM LAG+ P+ FTL+IL NCLCNVNRV E +     + + G  PN  TY TL+ GL+                       
Subjt:  HLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY-----------------------

Query:  ---------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEEN
                 GLCKNGCL+EA+E FN LKS N+KL+IES++CLIDGLCKAGKLETAWELFEKL QEGLQ DVVTY+IMIHGFCK GQVDKANILF+KMEEN
Subjt:  ---------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEEN

Query:  GCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDFPAQERR
        GCTP+IITYNTLL G C+SNKS+EVV+LLH+M+Q+D+SPDA  C IV DML KD KYQECLDL+  F  QERR
Subjt:  GCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDFPAQERR

A0A0A0K784 Uncharacterized protein1.3e-11160.59Show/hide
Query:  MASKT-VASASSKPNHLLSSSLFTHYS------PKIPLSNHPKCQPQPLASPENFN-SRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPIS
        M SKT + S S KPN    S L TH S      P+   ++H    P P+  P  FN   ISF H L  FL+NC+TG ITA QA HFFDLMMRS+P PPIS
Subjt:  MASKT-VASASSKPNHLLSSSLFTHYS------PKIPLSNHPKCQPQPLASPENFN-SRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPIS

Query:  SFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY--------------------
        SFN LLGGLAKINHYS++F LY +M LAG+ P+ FTL+IL NCLCNVNRV E +     + + G  PN  TY TL+ GL+                    
Subjt:  SFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY--------------------

Query:  ------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKM
                    GLCKN CLFEA+ELFN LKS N KLNIE++SCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTY+IMIHGFCK GQVD ANILF+KM
Subjt:  ------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKM

Query:  EENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDFPAQ
        EENGCTP+II YNTLL G CE NK EEV++LLH+MVQKDVSP+A  CTIV DMLCKD KY++ +DL+  FP Q
Subjt:  EENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDFPAQ

A0A5A7TFD7 Pentatricopeptide repeat-containing protein4.5e-8538.57Show/hide
Query:  SKTVASASS----KPNHLLSSSLFTHYSPKIPLSNHPKCQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLL
        + TVASASS         + SSLFTH SP IP SN          S    + R+S QH LP F+ NC+ G ITATQAL FF LMMRS     I SFN LL
Subjt:  SKTVASASS----KPNHLLSSSLFTHYSPKIPLSNHPKCQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLL

Query:  GGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY--------------------------
        G LAKI HYS+VF LY KMHLAG+ PNFFTL+IL+NCLCNVNRV E L+AMA I R GYIP+ +TYT+LIKGL                           
Subjt:  GGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY--------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQV
                              GLCKNGCLFEA+ELFN LKS N+KL+IESF+CLIDGLCKA KLETAWELFEKLSQEGLQPDVVTY IMI+GFCKDGQV
Subjt:  ----------------------GLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQV

Query:  DKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDFPAQERR
        D ANILFQ MEENGCTPN+ TY+ L+ G  ++NK EEVVQLLH+M+QKDVS  A I TIV DM+ KD +Y+E LD++Q FP Q+ +
Subjt:  DKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDFPAQERR

A0A5A7UUW8 Pentatricopeptide repeat-containing protein1.3e-8437.72Show/hide
Query:  RLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSG
        ++P FL+NC+TG ITA QA HFFDLM+RS P  PISSFN LLGGLAKINHYS++F LY KM LAG+ P+  TLNIL+NCLCNVNRV E LAA+A I R G
Subjt:  RLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSG

Query:  YIPNKMTYTTLIKGLY------------------------------------------------------------------------------------
        YIP+ +TY TLIKGL                                                                                     
Subjt:  YIPNKMTYTTLIKGLY------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------G
                                                                                                           G
Subjt:  ---------------------------------------------------------------------------------------------------G

Query:  LCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYN
        LCKNGCLFEA+ELF  LKS N KL IE++SCLIDGLCKAGKLETAWELFEKLSQEGLQP+VVTY+IMI GFCK G VDKANILF+KMEENGCTP+IITY+
Subjt:  LCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYN

Query:  TLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDFPAQER
         LLR  C+SNKSEEVV+LLH+MVQ+DVSPD  ICTIV DMLCKD KY+ECLDL+  FP QER
Subjt:  TLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDFPAQER

A0A6J1DSW3 pentatricopeptide repeat-containing protein At1g63330-like1.3e-8737.42Show/hide
Query:  ASKTVASASSKPNHLLSSSLFTHYSPKIP-----------------LSNHPKCQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMR
        AS  VA +SS P     SSLFT  SP+I                  LS  PK    P  SPE   + ISFQ R  TFLQNC+TG +TA +AL FFDLM+R
Subjt:  ASKTVASASSKPNHLLSSSLFTHYSPKIP-----------------LSNHPKCQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMR

Query:  SNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY------------
        + PTP +SSFN LLGGLAKI HYSEV  LY +M LAGILPN+ TLNIL+NCLCNVNRV EGLAAMA I R GYIPN +TYT+LIKGL             
Subjt:  SNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLY------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------GLCKNGCLFEAIELFNVLKSCNLKLNIE
                                                                                GLCKN CL EAIELFN LK  NLKLNIE
Subjt:  ------------------------------------------------------------------------GLCKNGCLFEAIELFNVLKSCNLKLNIE

Query:  SFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDV
         F+CLIDGLCKAGKLETAWELF+K S EGL P+VVTYSIMIHG CKDGQ++KA  LF+KMEENGCTPNIITYNTL+RG  E+NK EEVV+LLHRMV+K+V
Subjt:  SFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDV

Query:  SPDADICTIVADMLCKDVKYQECLDLVQDFPAQERR
         PDA  CTIV DML +D KYQECL+L+  FPAQE R
Subjt:  SPDADICTIVADMLCKDVKYQECLDLVQDFPAQERR

SwissProt top hitse value%identityAlignment
P0C7Q7 Putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial8.1e-3929.18Show/hide
Query:  NSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAA
        N  + F+ RL + + +     I    A+  F  M+RS P P +  F+     +A+   ++ V     ++ L GI  N +TLNI++NC C   +     + 
Subjt:  NSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAA

Query:  MARIFRSGYIPNKMTYTTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDG
        + ++ + GY P+  T+ TLIKGL+     G + EA+ L + +     + ++ +++ +++G+C++G    A +L  K+ +  ++ DV TYS +I   C+DG
Subjt:  MARIFRSGYIPNKMTYTTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDG

Query:  QVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQD
         +D A  LF++ME  G   +++TYN+L+RGLC++ K  +   LL  MV +++ P+     ++ D+  K+ K QE  +L ++
Subjt:  QVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQD

Q0WKV3 Pentatricopeptide repeat-containing protein At1g12300, mitochondrial2.8e-3931.51Show/hide
Query:  SFQHRLPTFLQNCRTG--KITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMA
        +F  R  ++ +  R+G   I A  A+  F  M+ S P P +  F+ L   +AK   Y  V  L  +M L GI  N +TL+I++NC C   ++    +AM 
Subjt:  SFQHRLPTFLQNCRTG--KITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMA

Query:  RIFRSGYIPNKMTYTTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVT--------------
        +I + GY PN +T++TLI    GLC  G + EA+EL + +     K ++ + + L++GLC +GK   A  L +K+ + G QP+ VT              
Subjt:  RIFRSGYIPNKMTYTTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVT--------------

Query:  ---------------------YSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLC
                             YSI+I G CK G +D A  LF +ME  G T NIITYN L+ G C + + ++  +LL  M+++ ++P+    +++ D   
Subjt:  ---------------------YSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLC

Query:  KDVKYQECLDL
        K+ K +E  +L
Subjt:  KDVKYQECLDL

Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial8.4e-3629.85Show/hide
Query:  CRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTY
        C+ G+I   +A H   LM     TP + S++ ++ G  +     +V++L   M   G+ PN +    ++  LC + ++ E   A + + R G +P+ + Y
Subjt:  CRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTY

Query:  TTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENG
        TTLI    G CK G +  A + F  + S ++  ++ +++ +I G C+ G +  A +LF ++  +GL+PD VT++ +I+G+CK G +  A  +   M + G
Subjt:  TTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENG

Query:  CTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDFPA
        C+PN++TY TL+ GLC+    +   +LLH M +  + P+      + + LCK    +E + LV +F A
Subjt:  CTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDFPA

Q9ASZ8 Pentatricopeptide repeat-containing protein At1g126203.4e-3728.88Show/hide
Query:  CQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCL
        C  +  +S  +   ++S++ RL + + +     I    A+  F  M RS P P +  F+ L   +A+   Y  V  L  +M L GI  N +TL+I++NC 
Subjt:  CQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCL

Query:  CNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGL--------------------------------YGLCKNGCLFEAIELFNVLKSCNLKLNIESFSC
        C   ++    +AM +I + GY P+ +T++TLI GL                                 GLC NG + +A+ L + +     + N  ++  
Subjt:  CNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGL--------------------------------YGLCKNGCLFEAIELFNVLKSCNLKLNIESFSC

Query:  LIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDA
        ++  +CK+G+   A EL  K+ +  ++ D V YSI+I G CKDG +D A  LF +ME  G   +II Y TL+RG C + + ++  +LL  M+++ ++PD 
Subjt:  LIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDA

Query:  DICTIVADMLCKDVKYQECLDL
           + + D   K+ K +E  +L
Subjt:  DICTIVADMLCKDVKYQECLDL

Q9LPX2 Pentatricopeptide repeat-containing protein At1g12775, mitochondrial7.6e-3729.45Show/hide
Query:  ITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIK
        I A  A+  F  M++S P P +  FN L   +AK   Y  V  L  +M   GI  + +TL+I++NC C   ++    + M +I + GY P+ + + TL+ 
Subjt:  ITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIK

Query:  GL--------------------------------YGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTY
        GL                                 GLC NG + +A+ L + +     + N  ++  +++ +CK+G+   A EL  K+ +  ++ D V Y
Subjt:  GL--------------------------------YGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTY

Query:  SIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQD
        SI+I G CKDG +D A  LF +ME  G   +IITYNTL+ G C + + ++  +LL  M+++ +SP+    +++ D   K+ K +E   L+++
Subjt:  SIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQD

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein5.9e-3729.85Show/hide
Query:  CRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTY
        C+ G+I   +A H   LM     TP + S++ ++ G  +     +V++L   M   G+ PN +    ++  LC + ++ E   A + + R G +P+ + Y
Subjt:  CRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTY

Query:  TTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENG
        TTLI    G CK G +  A + F  + S ++  ++ +++ +I G C+ G +  A +LF ++  +GL+PD VT++ +I+G+CK G +  A  +   M + G
Subjt:  TTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENG

Query:  CTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDFPA
        C+PN++TY TL+ GLC+    +   +LLH M +  + P+      + + LCK    +E + LV +F A
Subjt:  CTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDFPA

AT1G12300.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-4031.51Show/hide
Query:  SFQHRLPTFLQNCRTG--KITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMA
        +F  R  ++ +  R+G   I A  A+  F  M+ S P P +  F+ L   +AK   Y  V  L  +M L GI  N +TL+I++NC C   ++    +AM 
Subjt:  SFQHRLPTFLQNCRTG--KITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMA

Query:  RIFRSGYIPNKMTYTTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVT--------------
        +I + GY PN +T++TLI    GLC  G + EA+EL + +     K ++ + + L++GLC +GK   A  L +K+ + G QP+ VT              
Subjt:  RIFRSGYIPNKMTYTTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVT--------------

Query:  ---------------------YSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLC
                             YSI+I G CK G +D A  LF +ME  G T NIITYN L+ G C + + ++  +LL  M+++ ++P+    +++ D   
Subjt:  ---------------------YSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLC

Query:  KDVKYQECLDL
        K+ K +E  +L
Subjt:  KDVKYQECLDL

AT1G12620.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-3828.88Show/hide
Query:  CQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCL
        C  +  +S  +   ++S++ RL + + +     I    A+  F  M RS P P +  F+ L   +A+   Y  V  L  +M L GI  N +TL+I++NC 
Subjt:  CQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCL

Query:  CNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGL--------------------------------YGLCKNGCLFEAIELFNVLKSCNLKLNIESFSC
        C   ++    +AM +I + GY P+ +T++TLI GL                                 GLC NG + +A+ L + +     + N  ++  
Subjt:  CNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGL--------------------------------YGLCKNGCLFEAIELFNVLKSCNLKLNIESFSC

Query:  LIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDA
        ++  +CK+G+   A EL  K+ +  ++ D V YSI+I G CKDG +D A  LF +ME  G   +II Y TL+RG C + + ++  +LL  M+++ ++PD 
Subjt:  LIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDA

Query:  DICTIVADMLCKDVKYQECLDL
           + + D   K+ K +E  +L
Subjt:  DICTIVADMLCKDVKYQECLDL

AT1G12700.1 ATP binding;nucleic acid binding;helicases5.7e-4029.18Show/hide
Query:  NSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAA
        N  + F+ RL + + +     I    A+  F  M+RS P P +  F+     +A+   ++ V     ++ L GI  N +TLNI++NC C   +     + 
Subjt:  NSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAA

Query:  MARIFRSGYIPNKMTYTTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDG
        + ++ + GY P+  T+ TLIKGL+     G + EA+ L + +     + ++ +++ +++G+C++G    A +L  K+ +  ++ DV TYS +I   C+DG
Subjt:  MARIFRSGYIPNKMTYTTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTYSIMIHGFCKDG

Query:  QVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQD
         +D A  LF++ME  G   +++TYN+L+RGLC++ K  +   LL  MV +++ P+     ++ D+  K+ K QE  +L ++
Subjt:  QVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQD

AT1G12775.1 Pentatricopeptide repeat (PPR) superfamily protein5.4e-3829.45Show/hide
Query:  ITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIK
        I A  A+  F  M++S P P +  FN L   +AK   Y  V  L  +M   GI  + +TL+I++NC C   ++    + M +I + GY P+ + + TL+ 
Subjt:  ITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEVFQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIK

Query:  GL--------------------------------YGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTY
        GL                                 GLC NG + +A+ L + +     + N  ++  +++ +CK+G+   A EL  K+ +  ++ D V Y
Subjt:  GL--------------------------------YGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWELFEKLSQEGLQPDVVTY

Query:  SIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQD
        SI+I G CKDG +D A  LF +ME  G   +IITYNTL+ G C + + ++  +LL  M+++ +SP+    +++ D   K+ K +E   L+++
Subjt:  SIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCGAAGACTGTGGCTTCAGCTTCTTCCAAACCTAACCATTTACTCTCCTCCTCTCTATTTACTCACTACTCTCCAAAAATTCCATTGTCAAATCATCCTAAATG
CCAACCCCAACCCCTTGCATCACCGGAAAACTTCAATTCACGGATTTCCTTTCAACACCGGCTTCCGACGTTCTTACAGAATTGCAGAACAGGTAAGATTACTGCAACCC
AAGCATTACATTTCTTTGACTTAATGATGCGTTCAAATCCTACCCCTCCCATATCTTCATTCAATCATTTACTTGGTGGACTTGCTAAGATTAACCACTACTCTGAGGTT
TTTCAGCTGTATTATAAAATGCACCTAGCTGGAATTTTGCCTAATTTCTTCACGCTCAATATTTTGATGAATTGCCTTTGTAATGTGAATCGTGTTATCGAAGGTCTTGC
GGCCATGGCGAGGATTTTCAGGAGTGGTTATATTCCTAATAAAATGACATATACGACCTTGATTAAGGGCTTGTATGGGTTGTGTAAGAATGGCTGTTTATTTGAAGCGA
TCGAACTTTTTAATGTGCTGAAATCATGCAACTTGAAATTGAATATTGAAAGCTTTAGTTGTCTAATTGATGGCCTATGCAAAGCAGGGAAACTTGAAACTGCTTGGGAG
CTTTTCGAAAAACTATCCCAGGAAGGGCTTCAACCAGATGTTGTGACTTATTCCATTATGATCCATGGGTTTTGTAAAGATGGACAAGTAGATAAGGCAAATATTTTGTT
TCAAAAGATGGAAGAAAATGGTTGTACTCCCAACATAATTACTTATAATACCCTTTTGCGTGGTTTATGCGAGAGTAATAAATCAGAGGAGGTGGTTCAACTTCTTCATA
GGATGGTTCAGAAGGATGTGTCGCCAGATGCTGACATTTGCACCATAGTCGCAGACATGCTTTGCAAAGATGTAAAATATCAAGAATGTCTTGACTTGGTTCAAGATTTT
CCTGCCCAAGAGCGTCGACATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCGAAGACTGTGGCTTCAGCTTCTTCCAAACCTAACCATTTACTCTCCTCCTCTCTATTTACTCACTACTCTCCAAAAATTCCATTGTCAAATCATCCTAAATG
CCAACCCCAACCCCTTGCATCACCGGAAAACTTCAATTCACGGATTTCCTTTCAACACCGGCTTCCGACGTTCTTACAGAATTGCAGAACAGGTAAGATTACTGCAACCC
AAGCATTACATTTCTTTGACTTAATGATGCGTTCAAATCCTACCCCTCCCATATCTTCATTCAATCATTTACTTGGTGGACTTGCTAAGATTAACCACTACTCTGAGGTT
TTTCAGCTGTATTATAAAATGCACCTAGCTGGAATTTTGCCTAATTTCTTCACGCTCAATATTTTGATGAATTGCCTTTGTAATGTGAATCGTGTTATCGAAGGTCTTGC
GGCCATGGCGAGGATTTTCAGGAGTGGTTATATTCCTAATAAAATGACATATACGACCTTGATTAAGGGCTTGTATGGGTTGTGTAAGAATGGCTGTTTATTTGAAGCGA
TCGAACTTTTTAATGTGCTGAAATCATGCAACTTGAAATTGAATATTGAAAGCTTTAGTTGTCTAATTGATGGCCTATGCAAAGCAGGGAAACTTGAAACTGCTTGGGAG
CTTTTCGAAAAACTATCCCAGGAAGGGCTTCAACCAGATGTTGTGACTTATTCCATTATGATCCATGGGTTTTGTAAAGATGGACAAGTAGATAAGGCAAATATTTTGTT
TCAAAAGATGGAAGAAAATGGTTGTACTCCCAACATAATTACTTATAATACCCTTTTGCGTGGTTTATGCGAGAGTAATAAATCAGAGGAGGTGGTTCAACTTCTTCATA
GGATGGTTCAGAAGGATGTGTCGCCAGATGCTGACATTTGCACCATAGTCGCAGACATGCTTTGCAAAGATGTAAAATATCAAGAATGTCTTGACTTGGTTCAAGATTTT
CCTGCCCAAGAGCGTCGACATTGA
Protein sequenceShow/hide protein sequence
MASKTVASASSKPNHLLSSSLFTHYSPKIPLSNHPKCQPQPLASPENFNSRISFQHRLPTFLQNCRTGKITATQALHFFDLMMRSNPTPPISSFNHLLGGLAKINHYSEV
FQLYYKMHLAGILPNFFTLNILMNCLCNVNRVIEGLAAMARIFRSGYIPNKMTYTTLIKGLYGLCKNGCLFEAIELFNVLKSCNLKLNIESFSCLIDGLCKAGKLETAWE
LFEKLSQEGLQPDVVTYSIMIHGFCKDGQVDKANILFQKMEENGCTPNIITYNTLLRGLCESNKSEEVVQLLHRMVQKDVSPDADICTIVADMLCKDVKYQECLDLVQDF
PAQERRH