CuGenDBv2

Clc02G03250 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G03250
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: ClcChr02:2851329..2856686
RNA-Seq Expression: Clc02G03250
Synteny: Clc02G03250
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR001841 - Zinc finger, RING-type
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR013083 - Zinc finger, RING/FYVE/PHD-type
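The genome location field can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the `seqid:start..end` layout shown above and assuming the coordinates are 1-based and inclusive (the page does not state this); the function names `parse_location` and `span_length` are illustrative only.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("ClcChr02:2851329..2856686")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 5358 bp on ClcChr02. This is the genomic span, so it includes any introns and is not expected to equal the CDS length.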


Homology
GenBank top hits (e value, % identity, alignment)
KAG7011839.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.4e-143; % identity: 73.61)
Query:  MGAL--QFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR----------------
        MGAL  Q QSRLRVSKI SSTI  WER FPLL DPLR  D SLSP +SRCFSSNSMDDDW FSKS+PR SPPQRRSPPD RE R                
Subjt:  MGAL--QFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR----------------

Query:  -RNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQP
         RNQRFNRHSEGS SRFA+EG +S  R+ES SQKDFSFLEKFKLNTDNQS+S EK E SSSA+ SE    KQ                         SQP
Subjt:  -RNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQP

Query:  QRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLY
        Q  PEADEIFR+MK+TGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAEK DEAIRIFRKMQNNGISPNAFS+G+LIQGLY
Subjt:  QRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLY

Query:  KCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV
        K K++EDAV+FC EMLESGH+PNLTTFVGLID LC EKGVDEAH+V+ETLKQKGFLINE  LREFLNKR PFSPHIWE+
Subjt:  KCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV

XP_022952706.1 pentatricopeptide repeat-containing protein At4g38150-like [Cucurbita moschata] (e value: 4.1e-143; % identity: 73.35)
Query:  MGAL--QFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR----------------
        MGAL  Q QSRLRVSKI SSTI  WER FPLL DPLR  D SLSP +SRCFSSNSMDDDW FS+S+PR SPPQRRSPPD RE R                
Subjt:  MGAL--QFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR----------------

Query:  -RNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQP
         RNQRFNRHSEGS SRFA++G +S  R+ES SQKDFSFLEKFKLNTDNQS+S EK E SSSA++SE    KQ                         SQP
Subjt:  -RNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQP

Query:  QRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLY
        Q  PEADEIFR+MK+TGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFS+G+LIQGLY
Subjt:  QRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLY

Query:  KCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV
        K K++EDAV+FCIEMLESGH+PNLTTFVGLID LC EKGVDEAH+V+ETLKQKGFLINE  LR+FLNKR PFSPHIWE+
Subjt:  KCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV

XP_022969144.1 pentatricopeptide repeat-containing protein At4g38150-like [Cucurbita maxima] (e value: 8.3e-144; % identity: 74.14)
Query:  MGAL--QFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR----------------
        MGAL  Q QSRLRVSKI  STI  WER FPLL DPLR  D SLSP +SRCFSSNSMDDDW FS+S+PR SPPQRRSPPD RE R                
Subjt:  MGAL--QFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR----------------

Query:  -RNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQP
         RNQRFNRH EGS SRFA+EG +S +RSES SQKDFSFLEKFKLNTDN S+S EKTE SSSA+ SE    KQ                         SQP
Subjt:  -RNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQP

Query:  QRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLY
        Q  PEADEIFR+MK+TGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAE LDEAIRIFRKMQNNGISPNAFS+GVLIQGLY
Subjt:  QRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLY

Query:  KCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV
        KCKK+EDAV+FCIEMLESGH+PNLTTFVGLID LC EKGVDEAH+V+ETLKQKGFLINE +LREFLNKR PFSPHIWE+
Subjt:  KCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV

XP_023554525.1 pentatricopeptide repeat-containing protein At4g38150-like isoform X1 [Cucurbita pepo subsp. pepo] (e value: 9.2e-143; % identity: 73.35)
Query:  MGAL--QFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR----------------
        MGAL  Q QSRLRVSKI SSTI  WER FPLL DPLR  D SLSP +SRCFSSNSMDDDW FS+S+PR SPPQRRSPPD RE R                
Subjt:  MGAL--QFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR----------------

Query:  -RNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQP
         RNQRFNRHSEGS SRFA+EG +S +R+ES SQKDFSFLEKFKLNTDNQS+S EK E SSSA+ SE    +Q                         SQP
Subjt:  -RNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQP

Query:  QRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLY
        Q  PEADEIFR+MK+TGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFS+G+LIQGLY
Subjt:  QRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLY

Query:  KCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV
        K K++EDAV+FCIEMLESGH+PNLTTFVGLID LC EKG DEAH+V+ETLKQKGFLINE +LREFLNKR PFSPHIWE+
Subjt:  KCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV

XP_038888880.1 pentatricopeptide repeat-containing protein At4g38150 isoform X1 [Benincasa hispida] (e value: 2.3e-149; % identity: 77.66)
Query:  ALQFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRS-PPDHREPR-----------------RN
        A QFQSRLRV KI SSTI KWER+FPLL D LRM +FSLSPAQSRCF+SNS+DDDWAFSKS+PR SPPQRRS PPDHRE R                 RN
Subjt:  ALQFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRS-PPDHREPR-----------------RN

Query:  QRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQPQRS
        QRFNR SEGS SRFA+E SIS QR+ES +QKDFSFLE+FKLNTDNQSSS EKTEESSSARLSE   +KQ                         SQPQR 
Subjt:  QRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQPQRS

Query:  PEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKCK
        PEADEIFR+MK+TGLIPNAVAMLDGLCKDGLIQEAMKLF LIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGI PNAFSFGVLIQGLYKCK
Subjt:  PEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKCK

Query:  KIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV
        K+EDAVAFCIEMLESGH+PNL TFVGLID LCNEKGVDEAHSVVETLKQKGFLINE +LRE LNKR PFSPHIWEV
Subjt:  KIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6C0 Uncharacterized protein (e value: 7.1e-141; % identity: 74.6)
Query:  MGALQFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR-------------RNQRF
        MGA Q QSRL+VSKI SSTI KWE +FPLL +PLR  DFSL PAQSR F+SNSMDDD AFS S+P  +PPQRR PPDHRE R             RNQRF
Subjt:  MGALQFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR-------------RNQRF

Query:  NRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEE-SSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQPQRSPE
        NRHSEGS SRF +EGS S QRSES SQKDFSFLEKFKLNTDNQSS KEKTEE SSSA +SE   +KQ                         S+PQR PE
Subjt:  NRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEE-SSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQPQRSPE

Query:  ADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKCKKI
        ADEIF +MK+TGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEK DEAIRIFRKMQ+NGI PNAFSFGVLIQGLYKCKK+
Subjt:  ADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKCKKI

Query:  EDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV
        +DAVAFC EMLESGH PNLTTFVGLID LCNEKGVDEAHSVVET KQKGFLI+E +LRE LNKR PFSP IW+V
Subjt:  EDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV

A0A1S3CFW0 pentatricopeptide repeat-containing protein At4g38150 (e value: 1.3e-139; % identity: 74.06)
Query:  MGALQFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR-------------RNQRF
        M A QFQSRL+VSKI SS+I KWE +FPLL +PLR  DFSLSPA SR FSSNSMDDD AFS+S+PR +PPQRR PPDHRE R             RNQRF
Subjt:  MGALQFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR-------------RNQRF

Query:  NRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEE-SSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQPQRSPE
        NRHSEGS SRF +EGS S QRSES SQKDFSFLEKFKLNTDNQSS  EKTEE SSSA +SE   +KQ                         SQ QR P 
Subjt:  NRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEE-SSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQPQRSPE

Query:  ADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKCKKI
        ADEIFR+MK++GLIPNAVAMLDGLCKDGLIQEAMKLF LIREKGTIPEVVIYTAVVDGFCKAEK DEAIRIFRKMQN GISPNAFSFGVLIQGLYKCKK+
Subjt:  ADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKCKKI

Query:  EDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV
        +DAVAFC EMLESGH PNLTTFVGLID LC+EKGVDEAH+VVET KQKGFLI+E +LREFLNKR PFSP IW+V
Subjt:  EDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV

A0A5D3D5J5 Pentatricopeptide repeat-containing protein (e value: 1.3e-139; % identity: 74.06)
Query:  MGALQFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR-------------RNQRF
        M A QFQSRL+VSKI SS+I KWE +FPLL +PLR  DFSLSPA SR FSSNSMDDD AFS+S+PR +PPQRR PPDHRE R             RNQRF
Subjt:  MGALQFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR-------------RNQRF

Query:  NRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEE-SSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQPQRSPE
        NRHSEGS SRF +EGS S QRSES SQKDFSFLEKFKLNTDNQSS  EKTEE SSSA +SE   +KQ                         SQ QR P 
Subjt:  NRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEE-SSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQPQRSPE

Query:  ADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKCKKI
        ADEIFR+MK++GLIPNAVAMLDGLCKDGLIQEAMKLF LIREKGTIPEVVIYTAVVDGFCKAEK DEAIRIFRKMQN GISPNAFSFGVLIQGLYKCKK+
Subjt:  ADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKCKKI

Query:  EDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV
        +DAVAFC EMLESGH PNLTTFVGLID LC+EKGVDEAH+VVET KQKGFLI+E +LREFLNKR PFSP IW+V
Subjt:  EDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV

A0A6J1GKZ0 pentatricopeptide repeat-containing protein At4g38150-like (e value: 2.0e-143; % identity: 73.35)
Query:  MGAL--QFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR----------------
        MGAL  Q QSRLRVSKI SSTI  WER FPLL DPLR  D SLSP +SRCFSSNSMDDDW FS+S+PR SPPQRRSPPD RE R                
Subjt:  MGAL--QFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR----------------

Query:  -RNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQP
         RNQRFNRHSEGS SRFA++G +S  R+ES SQKDFSFLEKFKLNTDNQS+S EK E SSSA++SE    KQ                         SQP
Subjt:  -RNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQP

Query:  QRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLY
        Q  PEADEIFR+MK+TGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFS+G+LIQGLY
Subjt:  QRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLY

Query:  KCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV
        K K++EDAV+FCIEMLESGH+PNLTTFVGLID LC EKGVDEAH+V+ETLKQKGFLINE  LR+FLNKR PFSPHIWE+
Subjt:  KCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV

A0A6J1I1Q2 pentatricopeptide repeat-containing protein At4g38150-like (e value: 4.0e-144; % identity: 74.14)
Query:  MGAL--QFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR----------------
        MGAL  Q QSRLRVSKI  STI  WER FPLL DPLR  D SLSP +SRCFSSNSMDDDW FS+S+PR SPPQRRSPPD RE R                
Subjt:  MGAL--QFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPR----------------

Query:  -RNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQP
         RNQRFNRH EGS SRFA+EG +S +RSES SQKDFSFLEKFKLNTDN S+S EKTE SSSA+ SE    KQ                         SQP
Subjt:  -RNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQP

Query:  QRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLY
        Q  PEADEIFR+MK+TGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAE LDEAIRIFRKMQNNGISPNAFS+GVLIQGLY
Subjt:  QRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLY

Query:  KCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV
        KCKK+EDAV+FCIEMLESGH+PNLTTFVGLID LC EKGVDEAH+V+ETLKQKGFLINE +LREFLNKR PFSPHIWE+
Subjt:  KCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHIWEV

SwissProt top hits (e value, % identity, alignment)
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial (e value: 3.4e-23; % identity: 33.13)
Query:  EADEIFRQMKKTGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYK
        EA+E F +M + G++P+ V    ++DG CK G I+ A K F  +  +   P+V+ YTA++ GFC+   + EA ++F +M   G+ P++ +F  LI G  K
Subjt:  EADEIFRQMKKTGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYK

Query:  CKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLN
           ++DA      M+++G +PN+ T+  LIDGLC E  +D A+ ++  + + G   N  +    +N
Subjt:  CKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLN

Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial (e value: 1.7e-22; % identity: 29.8)
Query:  EADEIFRQMKKTGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYK
        EA E++ +M   G+ P+ +   +++DG CK+  + EA ++F L+  KG  P++V Y+ +++ +CKA+++D+ +R+FR++ + G+ PN  ++  L+ G  +
Subjt:  EADEIFRQMKKTGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYK

Query:  CKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQ
          K+  A     EM+  G  P++ T+  L+DGLC+   +++A  + E +++
Subjt:  CKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQ

Q9FNL2 Pentatricopeptide repeat-containing protein At5g46100 (e value: 1.7e-22; % identity: 36)
Query:  EIFRQMKKTGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKCKK
        +IF +M K G  P++     ++ GLC+ G I EA KLF  + EK   P VV YT++++G C ++ +DEA+R   +M++ GI PN F++  L+ GL K  +
Subjt:  EIFRQMKKTGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKCKK

Query:  IEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKG
           A+     M+  G  PN+ T+  LI GLC E+ + EA  +++ +  +G
Subjt:  IEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKG

Q9LPX2 Pentatricopeptide repeat-containing protein At1g12775, mitochondrial (e value: 3.8e-22; % identity: 35.53)
Query:  ADEIFRQMKKTGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKC
        A E+ R+M++  +  +AV    ++DGLCKDG +  A  LF  +  KG   +++ Y  ++ GFC A + D+  ++ R M    ISPN  +F VLI    K 
Subjt:  ADEIFRQMKKTGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKC

Query:  KKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKG
         K+ +A     EM++ G  PN  T+  LIDG C E  ++EA  +V+ +  KG
Subjt:  KKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKG

Q9SZL5 Pentatricopeptide repeat-containing protein At4g38150 (e value: 1.6e-68; % identity: 48.34)
Query:  FSKSQPRGSPPQRRSPPD--HREPRRNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQ
        F  +   G   ++++PP+     P R +R +      P+R A     +L +S++ +  D  FLE+FKL  +  S    K E+               +P 
Subjt:  FSKSQPRGSPPQRRSPPD--HREPRRNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQ

Query:  PSPEADETFGQMKKTEESSSASQPQRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRI
        P PE                        ++DEIF++MK+ GLIPNAVAMLDGLCKDGL+QEAMKLFGL+R+KGTIPEVVIYTAVV+ FCKA K+++A RI
Subjt:  PSPEADETFGQMKKTEESSSASQPQRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRI

Query:  FRKMQNNGISPNAFSFGVLIQGLYKCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHI
        FRKMQNNGI+PNAFS+GVL+QGLY C  ++DAVAFC EMLESGH+PN+ TFV L+D LC  KGV++A S ++TL QKGF +N  +++EF++KR PF    
Subjt:  FRKMQNNGISPNAFSFGVLIQGLYKCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHI

Query:  WE
        WE
Subjt:  WE

Arabidopsis top hits (e value, % identity, alignment)
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 2.4e-24; % identity: 33.13)
Query:  EADEIFRQMKKTGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYK
        EA+E F +M + G++P+ V    ++DG CK G I+ A K F  +  +   P+V+ YTA++ GFC+   + EA ++F +M   G+ P++ +F  LI G  K
Subjt:  EADEIFRQMKKTGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYK

Query:  CKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLN
           ++DA      M+++G +PN+ T+  LIDGLC E  +D A+ ++  + + G   N  +    +N
Subjt:  CKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLN

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 2.4e-24; % identity: 33.13)
Query:  EADEIFRQMKKTGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYK
        EA+E F +M + G++P+ V    ++DG CK G I+ A K F  +  +   P+V+ YTA++ GFC+   + EA ++F +M   G+ P++ +F  LI G  K
Subjt:  EADEIFRQMKKTGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYK

Query:  CKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLN
           ++DA      M+++G +PN+ T+  LIDGLC E  +D A+ ++  + + G   N  +    +N
Subjt:  CKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLN

AT4G38150.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.1e-69; % identity: 48.34)
Query:  FSKSQPRGSPPQRRSPPD--HREPRRNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQ
        F  +   G   ++++PP+     P R +R +      P+R A     +L +S++ +  D  FLE+FKL  +  S    K E+               +P 
Subjt:  FSKSQPRGSPPQRRSPPD--HREPRRNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQ

Query:  PSPEADETFGQMKKTEESSSASQPQRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRI
        P PE                        ++DEIF++MK+ GLIPNAVAMLDGLCKDGL+QEAMKLFGL+R+KGTIPEVVIYTAVV+ FCKA K+++A RI
Subjt:  PSPEADETFGQMKKTEESSSASQPQRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRI

Query:  FRKMQNNGISPNAFSFGVLIQGLYKCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHI
        FRKMQNNGI+PNAFS+GVL+QGLY C  ++DAVAFC EMLESGH+PN+ TFV L+D LC  KGV++A S ++TL QKGF +N  +++EF++KR PF    
Subjt:  FRKMQNNGISPNAFSFGVLIQGLYKCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHI

Query:  WE
        WE
Subjt:  WE

AT4G38150.2 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.1e-69; % identity: 48.34)
Query:  FSKSQPRGSPPQRRSPPD--HREPRRNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQ
        F  +   G   ++++PP+     P R +R +      P+R A     +L +S++ +  D  FLE+FKL  +  S    K E+               +P 
Subjt:  FSKSQPRGSPPQRRSPPD--HREPRRNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQ

Query:  PSPEADETFGQMKKTEESSSASQPQRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRI
        P PE                        ++DEIF++MK+ GLIPNAVAMLDGLCKDGL+QEAMKLFGL+R+KGTIPEVVIYTAVV+ FCKA K+++A RI
Subjt:  PSPEADETFGQMKKTEESSSASQPQRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRI

Query:  FRKMQNNGISPNAFSFGVLIQGLYKCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHI
        FRKMQNNGI+PNAFS+GVL+QGLY C  ++DAVAFC EMLESGH+PN+ TFV L+D LC  KGV++A S ++TL QKGF +N  +++EF++KR PF    
Subjt:  FRKMQNNGISPNAFSFGVLIQGLYKCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFLNKRTPFSPHI

Query:  WE
        WE
Subjt:  WE

AT5G03560.2 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 4.4e-26; % identity: 30.94)
Query:  RRSPPDHREPRRNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTD-NQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMK
        R+ PP+ R  R   R  R+S      F   G +  ++  S         E  ++NT  N S S + ++E +  +++E + KKQ   +P P   + F +  
Subjt:  RRSPPDHREPRRNQRFNRHSEGSPSRFADEGSISLQRSESFSQKDFSFLEKFKLNTD-NQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMK

Query:  KTEESSSASQPQRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNA
          EE      P+      EIF +M+  G    AV M D L KDG   EA++LF  I++K  +P+VV +TA+V+ +  A +  E +++F +M  +G+SPNA
Subjt:  KTEESSSASQPQRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNA

Query:  FSFGVLIQGLYKCKKI-EDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFL
        +++ VLI+GL    K  +DA  + +EM+ +G +PN  T+  + +    E   + A  +++ +K KGF+ +E ++RE L
Subjt:  FSFGVLIQGLYKCKKI-EDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINENSLREFL
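In the alignments above, the unlabelled middle line appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below, which assumes that convention and that the query and middle lines are the same length, counts identities and positives for one alignment block. The function name `alignment_identity` is hypothetical, and the example uses only the first 15 columns of the A0A0A0K6C0 alignment, so its result is not comparable to the whole-alignment % identity reported for that hit.

```python
def alignment_identity(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identical, positives, percent identity) for one alignment block.

    Assumes BLAST-style midline symbols: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    percent_identity = 100 * identical / len(query)
    return identical, positives, percent_identity


# First 15 alignment columns of the A0A0A0K6C0 hit (query line, middle line).
print(alignment_identity("MGALQFQSRLRVSKI", "MGA Q QSRL+VSKI"))
```

Whitespace in the middle line is significant, so this only works on alignment text whose column spacing has been preserved exactly.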


Sequences
CDS sequence
ATGGGGGCGTTACAATTTCAATCTCGTCTTCGAGTTTCGAAGATTTTTTCTTCAACAATTTGGAAATGGGAAAGAGAATTTCCTCTGCTCCGCGATCCATTGCGAATGTC
GGATTTCTCACTCTCACCGGCGCAATCTCGCTGTTTTAGCTCCAATTCCATGGATGACGATTGGGCCTTCTCAAAGTCCCAGCCGAGAGGCAGCCCGCCACAACGGCGTT
CTCCTCCGGATCATCGCGAACCTCGTCGGAATCAACGATTTAATAGGCACTCGGAAGGTTCCCCGTCTCGATTCGCAGATGAAGGGTCAATTTCGCTGCAAAGGAGCGAA
TCTTTTTCTCAGAAGGATTTTAGCTTTCTTGAAAAATTCAAGCTGAACACTGATAATCAGAGTAGTAGTAAAGAGAAGACTGAGGAGAGCTCTTCTGCTCGGTTATCTGA
AATAACGTGGAAGAAGCAGTTGCAACCGCAGCCTTCTCCAGAAGCCGATGAGACATTCGGGCAAATGAAGAAGACTGAGGAGAGCTCTTCTGCTTCCCAACCGCAGCGTT
CTCCAGAAGCCGATGAGATATTCAGGCAAATGAAGAAGACTGGTCTGATTCCCAACGCTGTCGCTATGCTTGATGGGCTTTGTAAAGATGGACTTATTCAAGAAGCAATG
AAACTATTTGGTTTGATTCGTGAAAAGGGTACAATTCCAGAAGTTGTGATTTACACTGCTGTCGTTGATGGGTTTTGCAAGGCGGAGAAGCTTGATGAAGCAATTAGGAT
TTTCAGGAAAATGCAGAATAATGGCATTTCTCCCAATGCCTTTAGTTTTGGCGTCTTGATACAGGGACTGTACAAATGCAAAAAAATAGAGGATGCTGTAGCATTTTGCA
TTGAGATGTTAGAATCTGGACATACTCCAAATCTAACCACTTTTGTTGGCTTAATTGATGGGTTATGCAATGAGAAGGGCGTGGACGAAGCTCATAGTGTCGTAGAAACC
TTGAAACAAAAGGGATTCTTGATTAATGAGAATTCTTTAAGGGAATTTTTGAATAAAAGAACCCCATTTTCACCACATATCTGGGAAGTGAAGCTACCAATCACATCACT
GAGAAGAAACCACAAGTTGAAATCATTGGATCTCAGTAGCCAGAATGTGAGAAGAGTGAAGAATATGGAACAGACACACCATCTGGCTTTGGAGGGGAGAATGAGAAAGG
AAGCCCACGTTCCAAGAATGATAGGCTTGGCATTATGTAGAGGTTCAATGTGCATAGCCACCATTGTCTTCTTAACTTGTGTGTGGATCCCTCTTCTTCAACTGAAGGCA
GCCATAGCGAGAATGTTCGGCGTTCTATCGAGCAGGACGTGTCAGACAGTCGCAACGGCGTCGCCACCATTTGAGGTAGAACTTCTGGTTTGTAGGTTCAAAGAGCTGCA
ACGTGCAGGCGGATCGGAGGAGGAGATTTGCTCCGTTTGCTTGTCGGAGTTCACCGAAGAAGATTTGGTGAGCCAATTGCATAGATGTAGCCATGTTTTCCATCTAGAAT
GTATTGAGAATTGGCTACAAAGAAACCACTTTACTTGCCCTCTTTGCAGATCCCTCCTCTTCCAAGCATTGTAA
mRNA sequence
ATGGGGGCGTTACAATTTCAATCTCGTCTTCGAGTTTCGAAGATTTTTTCTTCAACAATTTGGAAATGGGAAAGAGAATTTCCTCTGCTCCGCGATCCATTGCGAATGTC
GGATTTCTCACTCTCACCGGCGCAATCTCGCTGTTTTAGCTCCAATTCCATGGATGACGATTGGGCCTTCTCAAAGTCCCAGCCGAGAGGCAGCCCGCCACAACGGCGTT
CTCCTCCGGATCATCGCGAACCTCGTCGGAATCAACGATTTAATAGGCACTCGGAAGGTTCCCCGTCTCGATTCGCAGATGAAGGGTCAATTTCGCTGCAAAGGAGCGAA
TCTTTTTCTCAGAAGGATTTTAGCTTTCTTGAAAAATTCAAGCTGAACACTGATAATCAGAGTAGTAGTAAAGAGAAGACTGAGGAGAGCTCTTCTGCTCGGTTATCTGA
AATAACGTGGAAGAAGCAGTTGCAACCGCAGCCTTCTCCAGAAGCCGATGAGACATTCGGGCAAATGAAGAAGACTGAGGAGAGCTCTTCTGCTTCCCAACCGCAGCGTT
CTCCAGAAGCCGATGAGATATTCAGGCAAATGAAGAAGACTGGTCTGATTCCCAACGCTGTCGCTATGCTTGATGGGCTTTGTAAAGATGGACTTATTCAAGAAGCAATG
AAACTATTTGGTTTGATTCGTGAAAAGGGTACAATTCCAGAAGTTGTGATTTACACTGCTGTCGTTGATGGGTTTTGCAAGGCGGAGAAGCTTGATGAAGCAATTAGGAT
TTTCAGGAAAATGCAGAATAATGGCATTTCTCCCAATGCCTTTAGTTTTGGCGTCTTGATACAGGGACTGTACAAATGCAAAAAAATAGAGGATGCTGTAGCATTTTGCA
TTGAGATGTTAGAATCTGGACATACTCCAAATCTAACCACTTTTGTTGGCTTAATTGATGGGTTATGCAATGAGAAGGGCGTGGACGAAGCTCATAGTGTCGTAGAAACC
TTGAAACAAAAGGGATTCTTGATTAATGAGAATTCTTTAAGGGAATTTTTGAATAAAAGAACCCCATTTTCACCACATATCTGGGAAGTGAAGCTACCAATCACATCACT
GAGAAGAAACCACAAGTTGAAATCATTGGATCTCAGTAGCCAGAATGTGAGAAGAGTGAAGAATATGGAACAGACACACCATCTGGCTTTGGAGGGGAGAATGAGAAAGG
AAGCCCACGTTCCAAGAATGATAGGCTTGGCATTATGTAGAGGTTCAATGTGCATAGCCACCATTGTCTTCTTAACTTGTGTGTGGATCCCTCTTCTTCAACTGAAGGCA
GCCATAGCGAGAATGTTCGGCGTTCTATCGAGCAGGACGTGTCAGACAGTCGCAACGGCGTCGCCACCATTTGAGGTAGAACTTCTGGTTTGTAGGTTCAAAGAGCTGCA
ACGTGCAGGCGGATCGGAGGAGGAGATTTGCTCCGTTTGCTTGTCGGAGTTCACCGAAGAAGATTTGGTGAGCCAATTGCATAGATGTAGCCATGTTTTCCATCTAGAAT
GTATTGAGAATTGGCTACAAAGAAACCACTTTACTTGCCCTCTTTGCAGATCCCTCCTCTTCCAAGCATTGTAA
Protein sequence
MGALQFQSRLRVSKIFSSTIWKWEREFPLLRDPLRMSDFSLSPAQSRCFSSNSMDDDWAFSKSQPRGSPPQRRSPPDHREPRRNQRFNRHSEGSPSRFADEGSISLQRSE
SFSQKDFSFLEKFKLNTDNQSSSKEKTEESSSARLSEITWKKQLQPQPSPEADETFGQMKKTEESSSASQPQRSPEADEIFRQMKKTGLIPNAVAMLDGLCKDGLIQEAM
KLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGISPNAFSFGVLIQGLYKCKKIEDAVAFCIEMLESGHTPNLTTFVGLIDGLCNEKGVDEAHSVVET
LKQKGFLINENSLREFLNKRTPFSPHIWEVKLPITSLRRNHKLKSLDLSSQNVRRVKNMEQTHHLALEGRMRKEAHVPRMIGLALCRGSMCIATIVFLTCVWIPLLQLKA
AIARMFGVLSSRTCQTVATASPPFEVELLVCRFKELQRAGGSEEEICSVCLSEFTEEDLVSQLHRCSHVFHLECIENWLQRNHFTCPLCRSLLFQAL
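The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly; the `translate` helper is illustrative and not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGGGGCGTTACAATTTCAA"))
print(translate("CAAGCATTGTAA"))
```

The two fragments translate to `MGALQFQ` and `QAL`, matching the start and end of the listed protein sequence. Running `translate` on the full CDS, with its line breaks removed, gives a complete comparison.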