CuGenDBv2

ClCG05G002175 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G002175
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Pentatricopeptide repeat-containing protein
Genome location: CG_Chr05:2103413..2111256
RNA-Seq Expression: ClCG05G002175
Synteny: ClCG05G002175
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR001810 - F-box domain
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR033443 - Pentacotripeptide-repeat region of PRORP
  IPR036047 - F-box-like domain superfamily
  IPR044657 - Pentatricopeptide repeat-containing protein NFD5-like
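
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the string can be split into its parts and the locus span computed. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(loc: str):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


seqid, start, end = parse_location("CG_Chr05:2103413..2111256")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)  # CG_Chr05 2103413 2111256 7844
```

Under that assumption the locus covers 7844 bp on CG_Chr05, which is longer than the mRNA sequence listed further down and would be consistent with the gene containing introns.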


Homology
GenBank top hits (e value, %identity, alignment)
KAG6582566.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 6.0e-195 | %identity: 86.91
Query:  NILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMK
        NIL+Q+HPKQPLVNG   SSYSCY RG   +   VL  RRRC QLA VAAIVEE H LES REKPRFRWVEVGSDITEMQK+AISQLP KMTKRCKA+MK
Subjt:  NILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMK

Query:  QLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITL
        Q+ICFSPQ GNLSDMLAAWVRIMKP+RADWLSVLKH+RI +HPLYIEVAEAALVE TFEA+TRDYTKIIHY+GKRNQLEDAE+ILL M+ERGFACDQITL
Subjt:  QLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITL

Query:  TTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP
        TTMIHIYSKAD+LNLAK+TFE++KLLE+PLD+RSY AMIMA++RAGMPEEGENILKEMD KDI AGSEVYKALLRAYSMA NAEGAQRVFDAIQLAAIPP
Subjt:  TTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP

Query:  DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTV
        D+KLCGLLINAYLMAGQSQKAQIAFDNMRRAG+EPSDKCIALVLSAYEKENRLNAALELLIDLEK+ LMVGKEASE+LAAWLKRLGVVEEVELVLREY V
Subjt:  DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTV

Query:  KEASG
        KEASG
Subjt:  KEASG

XP_022924339.1 pentatricopeptide repeat-containing protein At1g01970 [Cucurbita moschata] | e value: 3.0e-194 | %identity: 86.67
Query:  NILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMK
        NIL+Q+HPKQPLVNG   SSYSCY RG   +   VL  RRRC QL  VAAIVEE HKLES REKPRFRW+EVGSDITEMQK+AISQLP KMTKRCKA+MK
Subjt:  NILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMK

Query:  QLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITL
        Q+ICFSPQ GNLSDMLAAWVRIMKP+RADWLSVLKH+RI +HPLYIEVAEAALVE TFEA+TRDYTKIIHY+GKRNQLEDAE+ILL MRERGFACDQITL
Subjt:  QLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITL

Query:  TTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP
        TTMIHIYSKAD+L+LAK+TFEELKLLE+PLD+RSY AMIMA++RAGMPEEGENILKEMD KDI AGSEVYKALLRAYSMA NAEGAQRVFDAIQLAAIPP
Subjt:  TTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP

Query:  DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTV
        D+KLCGLLINAYLMA QSQKAQIAFDNMRRAG+EPSDKCIALVLSAYEKENRLNAALELLIDLEK+ L+VGKEASE+LAAWLKRLGVVEEVELVLREY V
Subjt:  DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTV

Query:  KEASG
        KEASG
Subjt:  KEASG

XP_022980308.1 pentatricopeptide repeat-containing protein At1g01970 [Cucurbita maxima] | e value: 3.5e-195 | %identity: 87.41
Query:  NILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMK
        NIL+Q+HPKQPLVNG   SSYSCY RG   +   VL  RR C QLATVAAIVEE HKLES REKPRFRWVEVGSDITEMQK+AISQLP KMTKRCKA+MK
Subjt:  NILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMK

Query:  QLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITL
        Q+ICF PQ G+LSDMLAAWVRIMKP+RADWLSVLKH+RI +HPLYIEVAEAALVE TFEA+TRDYTKIIHY+GK+NQLEDAE+ILL MRERGFACDQITL
Subjt:  QLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITL

Query:  TTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP
        TTMIHIYSKAD+LNLAK+TFEELKLLE+PLD+RSY AMIMA+VRAGMPEEGENILKEMD KDI AGSEVYKALLRAYSMA NAEGAQRVFDAIQLAAIPP
Subjt:  TTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP

Query:  DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTV
        D+KLCGLLINAYLMAGQSQKAQIAFDNMRRAG+EPSDKCIALVLSAYEKENRLNAALELLIDLEK+ LMVGKEASE+LAAWLKRLGVVEEVELVLREY V
Subjt:  DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTV

Query:  KEASG
        KEASG
Subjt:  KEASG

XP_023526280.1 pentatricopeptide repeat-containing protein At1g01970 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 7.1e-196 | %identity: 87.41
Query:  NILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMK
        NIL+Q+HPKQPLVNG   SSYSCY RG   +   VL  RRRC QLATVAAIVEE H LES REKPRFRWVEVGSDITEMQK+AISQLP KMTKRCKA+MK
Subjt:  NILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMK

Query:  QLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITL
        Q+ICFSPQ GNLSDMLAAWVRIMKP+RADWLSVLKH+RI +HPLYIEVAEAALVE TFEA+TRDYTKIIHY+GKRNQLEDAE+ILL MRERGFACDQITL
Subjt:  QLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITL

Query:  TTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP
        TTMIHIYSKAD+L+LAK+TFEELKLLE+PLD+RSY AMIMA++RAGMPEEGENILKEMD KDI AGSEVYKALLRAYSMA NA+GAQRVFDAIQLAAIPP
Subjt:  TTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP

Query:  DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTV
        D+KLCGLLINAYLMAGQSQKAQIAFDNMRRAG+EPSDKCIALVLSAYEKENRLNAALELLIDLEK+ LMVGKEASE+LAAWLKRLGVVEEVELVLREY V
Subjt:  DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTV

Query:  KEASG
        KEASG
Subjt:  KEASG

XP_038903030.1 pentatricopeptide repeat-containing protein At1g01970 [Benincasa hispida] | e value: 7.8e-203 | %identity: 89.24
Query:  LQSSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCK
        + +S+I +QLHPKQPLVNG P SSY+ YWRGSI QT  VL SRRRCS+LATVAAIVEE HKLE+EREKPRFRWVEVGSDITE+QK+AISQLP KMTKRCK
Subjt:  LQSSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCK

Query:  ALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACD
        ALMKQ+ICFSPQKG+LSDML AWVRIMKPERADWLSVLKH+RISNHPLYIEVAEAALVEITFEANTRDYTKIIH++GKRNQLEDAEK+LL MRERG ACD
Subjt:  ALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACD

Query:  QITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLA
        QITLTTMIHIYSKAD+LNLAK+TFEELKLLE+ LD+RSYGAMIMAYVRAGMPEEGENILKEMDAK I AGSEVYKALLRAYSMAGNAEGAQRVFDAIQLA
Subjt:  QITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLA

Query:  AIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLR
         IPPDEKLCGLLINAYLMAG+S+KAQIAFDNMRRAGIEPSDKCIALVLSAYE ENRLNAALELLIDLEKDNL+V KEASEILAAWLKRLGVVEEVELVLR
Subjt:  AIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLR

Query:  EYTVKEASG
        EYT KE SG
Subjt:  EYTVKEASG

TrEMBL top hits (e value, %identity, alignment)
A0A1S3AWA7 pentatricopeptide repeat-containing protein At1g01970 | e value: 2.0e-191 | %identity: 85.54
Query:  LQSSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCK
        + +SNIL+QLH   PLVNG   +S S YW+ SI     VL SRRRCSQ+ATV AIV+E HKLESEREKPRFRWVEVG +ITE QK+AISQLP KMTK+CK
Subjt:  LQSSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCK

Query:  ALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACD
        A+MKQ+ICFSPQKG LSDMLAAWVRIMKPERADWLSVLKH+RI NHPLYI+VAEAALVEITFEANTRDYTKIIH++GK+NQLEDAEK+LL MRERGFACD
Subjt:  ALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACD

Query:  QITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLA
        QITLTTMIHIYSKADKL LAK+TFEELKLLEQ LDKRSYGAMIMAYVRAG+PEEGE ILKEMDAKDI AGSEVYKALLRAYSMAG+AEGAQRVFDAIQLA
Subjt:  QITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLA

Query:  AIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLR
        AIPPDEKLCGLL+NAYLMAGQS+KAQIAFDNMRRAGIEPSDKCIAL LSAYEKENRLNAALELLIDLEKDN+MVGKEAS+ILAAWLKRLGVVEE+E+VLR
Subjt:  AIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLR

Query:  EYTVKEAS
        EYT KE +
Subjt:  EYTVKEAS

A0A5A7U612 Pentatricopeptide repeat-containing protein | e value: 2.0e-191 | %identity: 85.54
Query:  LQSSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCK
        + +SNIL+QLH   PLVNG   +S S YW+ SI     VL SRRRCSQ+ATV AIV+E HKLESEREKPRFRWVEVG +ITE QK+AISQLP KMTK+CK
Subjt:  LQSSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCK

Query:  ALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACD
        A+MKQ+ICFSPQKG LSDMLAAWVRIMKPERADWLSVLKH+RI NHPLYI+VAEAALVEITFEANTRDYTKIIH++GK+NQLEDAEK+LL MRERGFACD
Subjt:  ALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACD

Query:  QITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLA
        QITLTTMIHIYSKADKL LAK+TFEELKLLEQ LDKRSYGAMIMAYVRAG+PEEGE ILKEMDAKDI AGSEVYKALLRAYSMAG+AEGAQRVFDAIQLA
Subjt:  QITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLA

Query:  AIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLR
        AIPPDEKLCGLL+NAYLMAGQS+KAQIAFDNMRRAGIEPSDKCIAL LSAYEKENRLNAALELLIDLEKDN+MVGKEAS+ILAAWLKRLGVVEE+E+VLR
Subjt:  AIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLR

Query:  EYTVKEAS
        EYT KE +
Subjt:  EYTVKEAS

A0A5D3D032 Pentatricopeptide repeat-containing protein | e value: 8.8e-192 | %identity: 85.78
Query:  LQSSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCK
        + +SNIL+QLH   PLVNG   +S S YW+ SI     VL SRRRCSQ+ATV AIV+E HKLESEREKPRFRWVEVG +ITE QK+AISQLP KMTK+CK
Subjt:  LQSSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCK

Query:  ALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACD
        A+MKQ+ICFSPQKG LSDMLAAWVRIMKPERADWLSVLKH+RI NHPLYI+VAEAALVEITFEANTRDYTKIIH++GK+NQLEDAEK+LL MRERGFACD
Subjt:  ALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACD

Query:  QITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLA
        QITLTTMIHIYSKADKL LAK+TFEELKLLEQ LDKRSYGAMIMAYVRAG+PEEGE ILKEMDAKDI AGSEVYKALLRAYSMAG+AEGAQRVFDAIQLA
Subjt:  QITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLA

Query:  AIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLR
        AIPPDEKLCGLL+NAYLMAGQS+KAQIAFDNMRRAGIEPSDKCIAL LSAYEKENRLNAALELLIDLEKDN+MVGKEAS+ILAAWLKRLGVVEE+E+VLR
Subjt:  AIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLR

Query:  EYTVKEAS
        EYT KE S
Subjt:  EYTVKEAS

A0A6J1EER0 pentatricopeptide repeat-containing protein At1g01970 | e value: 1.4e-194 | %identity: 86.67
Query:  NILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMK
        NIL+Q+HPKQPLVNG   SSYSCY RG   +   VL  RRRC QL  VAAIVEE HKLES REKPRFRW+EVGSDITEMQK+AISQLP KMTKRCKA+MK
Subjt:  NILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMK

Query:  QLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITL
        Q+ICFSPQ GNLSDMLAAWVRIMKP+RADWLSVLKH+RI +HPLYIEVAEAALVE TFEA+TRDYTKIIHY+GKRNQLEDAE+ILL MRERGFACDQITL
Subjt:  QLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITL

Query:  TTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP
        TTMIHIYSKAD+L+LAK+TFEELKLLE+PLD+RSY AMIMA++RAGMPEEGENILKEMD KDI AGSEVYKALLRAYSMA NAEGAQRVFDAIQLAAIPP
Subjt:  TTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP

Query:  DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTV
        D+KLCGLLINAYLMA QSQKAQIAFDNMRRAG+EPSDKCIALVLSAYEKENRLNAALELLIDLEK+ L+VGKEASE+LAAWLKRLGVVEEVELVLREY V
Subjt:  DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTV

Query:  KEASG
        KEASG
Subjt:  KEASG

A0A6J1IR15 pentatricopeptide repeat-containing protein At1g01970 | e value: 1.7e-195 | %identity: 87.41
Query:  NILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMK
        NIL+Q+HPKQPLVNG   SSYSCY RG   +   VL  RR C QLATVAAIVEE HKLES REKPRFRWVEVGSDITEMQK+AISQLP KMTKRCKA+MK
Subjt:  NILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMK

Query:  QLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITL
        Q+ICF PQ G+LSDMLAAWVRIMKP+RADWLSVLKH+RI +HPLYIEVAEAALVE TFEA+TRDYTKIIHY+GK+NQLEDAE+ILL MRERGFACDQITL
Subjt:  QLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITL

Query:  TTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP
        TTMIHIYSKAD+LNLAK+TFEELKLLE+PLD+RSY AMIMA+VRAGMPEEGENILKEMD KDI AGSEVYKALLRAYSMA NAEGAQRVFDAIQLAAIPP
Subjt:  TTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP

Query:  DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTV
        D+KLCGLLINAYLMAGQSQKAQIAFDNMRRAG+EPSDKCIALVLSAYEKENRLNAALELLIDLEK+ LMVGKEASE+LAAWLKRLGVVEEVELVLREY V
Subjt:  DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTV

Query:  KEASG
        KEASG
Subjt:  KEASG

SwissProt top hits (e value, %identity, alignment)
Q3E6P4 F-box protein At2g02240 | e value: 1.1e-18 | %identity: 35.4
Query:  DRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSKC
        D LP +CI  I++FTSP+D C  A VS  F SA  SD +W  FLPP+Y  ++++S             +K LYF L  +P+LI  G  S  LEK SG +C
Subjt:  DRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSKC

Query:  YMIGSRDIDAT--MRQPYWTWKFVLQS-----SNIL----HQLHPKQPLVNGNPGSSYSCY
         M+ S+++  T      YW W  + +S     + +L     ++  K      +PG+ YS Y
Subjt:  YMIGSRDIDAT--MRQPYWTWKFVLQS-----SNIL----HQLHPKQPLVNGNPGSSYSCY

Q6NPT8 F-box protein PP2-B1 | e value: 6.6e-19 | %identity: 37.01
Query:  DRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSKC
        D LP +CI  +++ TSP+D C +A VS + +SAA SDL+W  FLP +Y  ++ QS+        N L +K ++  L+D+ +L+  G  S  +EK SG KC
Subjt:  DRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSKC

Query:  YMIGSRDIDATM--RQPYWTWKFVLQS
        YM+ + ++         YW W  V +S
Subjt:  YMIGSRDIDATM--RQPYWTWKFVLQS

Q940Z1 Pentatricopeptide repeat-containing protein At1g19525 | e value: 9.8e-39 | %identity: 42.58
Query:  MRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQ
        M + G   D +T T ++H+YSK+     A   FE LK      D++ Y AMI+ YV AG P+ GE ++KEM AK++ A  EVY ALLRAY+  G+A GA 
Subjt:  MRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQ

Query:  RVFDAIQLAAIPP-DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLG
         +  ++Q A+  P   +   L + AY  AGQ  KA+  FD MR+ G +P DKCIA ++ AY+ EN L+ AL LL+ LEKD + +G     +L  W+  LG
Subjt:  RVFDAIQLAAIPP-DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLG

Query:  VVEEVELVL
        ++EE E +L
Subjt:  VVEEVELVL

Q9LPC4 Pentatricopeptide repeat-containing protein At1g01970 | e value: 5.8e-124 | %identity: 59.34
Query:  RRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRI
        R CS     +  + E  + E   +   F W +VG ++TE Q +AI+++P KM+KRC+ALM+Q+ICFSP+KG+  D+L AW+R M P RADWLS+LK ++ 
Subjt:  RRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRI

Query:  SNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMI
         + P YI+VAE +L++ +FEAN RDYTKIIHY+GK NQ+EDAE+ LL M+ RGF  DQ+TLT M+ +YSKA    LA+ TF E+KLL +PLD RSYG+MI
Subjt:  SNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMI

Query:  MAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKC
        MAY+RAG+PE+GE++L+EMD+++I AG EVYKALLR YSM G+AEGA+RVFDA+Q+A I PD KLCGLLINAY ++GQSQ A++AF+NMR+AGI+ +DKC
Subjt:  MAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKC

Query:  IALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVKEA
        +ALVL+AYEKE +LN AL  L++LEKD++M+GKEAS +LA W K+LGVVEEVEL+LRE++  ++
Subjt:  IALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVKEA

Q9ZVQ8 Putative F-box protein PP2-B8 | e value: 1.1e-18 | %identity: 38.21
Query:  MDRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSK
        +D LP EC+  I++FTSP+D C +A VS  F SA  SD++W  F+PP+Y  +I+QS +   F FL+   +K LYF L D  +LI  G  S+ +EK +  +
Subjt:  MDRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSK

Query:  CYMIGSRDIDATMRQPYWTWKFV
        C MI + ++         +W+++
Subjt:  CYMIGSRDIDATMRQPYWTWKFV

Arabidopsis top hits (e value, %identity, alignment)
AT1G01970.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 4.1e-125 | %identity: 59.34
Query:  RRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRI
        R CS     +  + E  + E   +   F W +VG ++TE Q +AI+++P KM+KRC+ALM+Q+ICFSP+KG+  D+L AW+R M P RADWLS+LK ++ 
Subjt:  RRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRI

Query:  SNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMI
         + P YI+VAE +L++ +FEAN RDYTKIIHY+GK NQ+EDAE+ LL M+ RGF  DQ+TLT M+ +YSKA    LA+ TF E+KLL +PLD RSYG+MI
Subjt:  SNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMI

Query:  MAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKC
        MAY+RAG+PE+GE++L+EMD+++I AG EVYKALLR YSM G+AEGA+RVFDA+Q+A I PD KLCGLLINAY ++GQSQ A++AF+NMR+AGI+ +DKC
Subjt:  MAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKC

Query:  IALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVKEA
        +ALVL+AYEKE +LN AL  L++LEKD++M+GKEAS +LA W K+LGVVEEVEL+LRE++  ++
Subjt:  IALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREYTVKEA

AT1G19520.1 pentatricopeptide (PPR) repeat-containing protein | e value: 3.3e-66 | %identity: 40.91
Query:  RWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKG-NLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYT
        +WVE+   I E +++A  + P  +T +CK +M++L   S Q+G + S +LA W  +++P R DW++++  +R  N   Y++VAE  L E +F A+  DY+
Subjt:  RWVEVGSDITEMQKKAISQLPAKMTKRCKALMKQLICFSPQKG-NLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYT

Query:  KIIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAG
        K+IH H K N +ED E+IL +M + G   D +T T ++H+YSK+     A   FE LK      D++ Y AMI+ YV AG P+ GE ++KEM AK++ A 
Subjt:  KIIHYHGKRNQLEDAEKILLRMRERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAG

Query:  SEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP-DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEK
         EVY ALLRAY+  G+A GA  +  ++Q A+  P   +   L + AY  AGQ  KA+  FD MR+ G +P DKCIA ++ AY+ EN L+ AL LL+ LEK
Subjt:  SEVYKALLRAYSMAGNAEGAQRVFDAIQLAAIPP-DEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEK

Query:  DNLMVGKEASEILAAWLKRLGVVEEVELVL
        D + +G     +L  W+  LG++EE E +L
Subjt:  DNLMVGKEASEILAAWLKRLGVVEEVELVL

AT2G02230.1 phloem protein 2-B1 | e value: 4.7e-20 | %identity: 37.01
Query:  DRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSKC
        D LP +CI  +++ TSP+D C +A VS + +SAA SDL+W  FLP +Y  ++ QS+        N L +K ++  L+D+ +L+  G  S  +EK SG KC
Subjt:  DRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSKC

Query:  YMIGSRDIDATM--RQPYWTWKFVLQS
        YM+ + ++         YW W  V +S
Subjt:  YMIGSRDIDATM--RQPYWTWKFVLQS

AT2G02240.1 F-box family protein | e value: 8.0e-20 | %identity: 35.4
Query:  DRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSKC
        D LP +CI  I++FTSP+D C  A VS  F SA  SD +W  FLPP+Y  ++++S             +K LYF L  +P+LI  G  S  LEK SG +C
Subjt:  DRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSKC

Query:  YMIGSRDIDAT--MRQPYWTWKFVLQS-----SNIL----HQLHPKQPLVNGNPGSSYSCY
         M+ S+++  T      YW W  + +S     + +L     ++  K      +PG+ YS Y
Subjt:  YMIGSRDIDAT--MRQPYWTWKFVLQS-----SNIL----HQLHPKQPLVNGNPGSSYSCY

AT2G02340.1 phloem protein 2-B8 | e value: 8.0e-20 | %identity: 38.21
Query:  MDRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSK
        +D LP EC+  I++FTSP+D C +A VS  F SA  SD++W  F+PP+Y  +I+QS +   F FL+   +K LYF L D  +LI  G  S+ +EK +  +
Subjt:  MDRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSK

Query:  CYMIGSRDIDATMRQPYWTWKFV
        C MI + ++         +W+++
Subjt:  CYMIGSRDIDATMRQPYWTWKFV


Sequences
CDS sequence
ATGGATAGACTACCGGTGGAGTGCATCTGTTGCATTTTGGCCTTCACGTCGCCGAAAGATGTTTGCAAAATCGCCGGAGTCTCAACGGCGTTCAGATCCGCCGCC
GATTCCGACCTTCTCTGGAGGACTTTTCTGCCGCCGGATTACCGGCAGATTATTGCTCAGTCGTCGTCCTCGCCTTTGTTTTGGTTTCTAAATTCCTTGCCGGAG
AAGGCTCTCTATTTCCATCTCTCCGATCATCCACTTCTTATAGGCACTGGAAATTCGAGTATTACATTGGAAAAAGATAGTGGAAGTAAATGTTATATGATTGGT
TCAAGGGACATAGATGCTACAATGAGACAACCATATTGGACCTGGAAATTTGTCCTCCAATCAAGTAACATTCTCCATCAACTTCATCCCAAACAGCCGCTAGTT
AATGGAAATCCTGGGAGTTCGTATTCTTGTTACTGGAGAGGCTCAATTGCTCAAACATTCGGAGTCTTAAGATCTCGCCGAAGATGCTCTCAATTGGCTACTGTT
GCTGCCATTGTTGAGGAATTTCACAAATTAGAGAGTGAAAGAGAGAAGCCAAGGTTTCGATGGGTCGAGGTGGGCTCTGATATTACTGAAATGCAGAAGAAAGCT
ATATCTCAGCTTCCTGCTAAGATGACTAAAAGATGTAAGGCTCTGATGAAACAACTTATATGTTTTTCGCCTCAGAAGGGTAATTTATCAGATATGTTGGCGGCT
TGGGTGAGGATTATGAAGCCTGAAAGAGCAGATTGGCTTTCAGTTCTTAAGCATATGAGGATCTCGAATCATCCTCTTTACATCGAGGTGGCAGAAGCTGCTCTT
GTAGAGATAACATTTGAGGCCAATACTCGAGACTACACAAAGATTATTCATTACCATGGGAAGCGAAACCAACTCGAGGATGCTGAAAAAATTCTCTTAAGGATG
AGAGAAAGGGGTTTTGCTTGTGATCAGATAACATTGACCACAATGATCCACATATATAGCAAGGCTGACAAACTTAATCTTGCCAAACGAACTTTTGAAGAGCTC
AAACTGCTCGAGCAACCATTGGATAAAAGATCGTACGGTGCAATGATTATGGCATATGTCAGGGCCGGGATGCCCGAGGAAGGAGAAAATATTCTGAAAGAAATG
GATGCGAAAGATATTAATGCAGGAAGTGAAGTTTACAAGGCTTTGTTAAGAGCATATTCCATGGCTGGCAATGCTGAAGGAGCCCAAAGGGTATTCGATGCAATT
CAATTGGCTGCTATTCCTCCTGATGAAAAGTTATGTGGTCTGCTGATCAATGCGTATCTGATGGCCGGTCAAAGCCAAAAGGCGCAAATTGCTTTCGACAATATG
AGGAGGGCAGGTATTGAACCTAGTGACAAATGCATAGCTTTGGTATTAAGTGCATATGAAAAGGAAAACAGGCTAAACGCAGCATTGGAACTTCTAATAGATTTA
GAGAAGGATAACCTCATGGTTGGGAAGGAAGCTTCAGAAATATTAGCAGCTTGGCTTAAAAGACTTGGGGTGGTAGAAGAGGTTGAACTTGTCTTGAGGGAATAC
ACTGTGAAAGAAGCAAGCGGATAA
mRNA sequence
ATGGATAGACTACCGGTGGAGTGCATCTGTTGCATTTTGGCCTTCACGTCGCCGAAAGATGTTTGCAAAATCGCCGGAGTCTCAACGGCGTTCAGATCCGCCGCC
GATTCCGACCTTCTCTGGAGGACTTTTCTGCCGCCGGATTACCGGCAGATTATTGCTCAGTCGTCGTCCTCGCCTTTGTTTTGGTTTCTAAATTCCTTGCCGGAG
AAGGCTCTCTATTTCCATCTCTCCGATCATCCACTTCTTATAGGCACTGGAAATTCGAGTATTACATTGGAAAAAGATAGTGGAAGTAAATGTTATATGATTGGT
TCAAGGGACATAGATGCTACAATGAGACAACCATATTGGACCTGGAAATTTGTCCTCCAATCAAGTAACATTCTCCATCAACTTCATCCCAAACAGCCGCTAGTT
AATGGAAATCCTGGGAGTTCGTATTCTTGTTACTGGAGAGGCTCAATTGCTCAAACATTCGGAGTCTTAAGATCTCGCCGAAGATGCTCTCAATTGGCTACTGTT
GCTGCCATTGTTGAGGAATTTCACAAATTAGAGAGTGAAAGAGAGAAGCCAAGGTTTCGATGGGTCGAGGTGGGCTCTGATATTACTGAAATGCAGAAGAAAGCT
ATATCTCAGCTTCCTGCTAAGATGACTAAAAGATGTAAGGCTCTGATGAAACAACTTATATGTTTTTCGCCTCAGAAGGGTAATTTATCAGATATGTTGGCGGCT
TGGGTGAGGATTATGAAGCCTGAAAGAGCAGATTGGCTTTCAGTTCTTAAGCATATGAGGATCTCGAATCATCCTCTTTACATCGAGGTGGCAGAAGCTGCTCTT
GTAGAGATAACATTTGAGGCCAATACTCGAGACTACACAAAGATTATTCATTACCATGGGAAGCGAAACCAACTCGAGGATGCTGAAAAAATTCTCTTAAGGATG
AGAGAAAGGGGTTTTGCTTGTGATCAGATAACATTGACCACAATGATCCACATATATAGCAAGGCTGACAAACTTAATCTTGCCAAACGAACTTTTGAAGAGCTC
AAACTGCTCGAGCAACCATTGGATAAAAGATCGTACGGTGCAATGATTATGGCATATGTCAGGGCCGGGATGCCCGAGGAAGGAGAAAATATTCTGAAAGAAATG
GATGCGAAAGATATTAATGCAGGAAGTGAAGTTTACAAGGCTTTGTTAAGAGCATATTCCATGGCTGGCAATGCTGAAGGAGCCCAAAGGGTATTCGATGCAATT
CAATTGGCTGCTATTCCTCCTGATGAAAAGTTATGTGGTCTGCTGATCAATGCGTATCTGATGGCCGGTCAAAGCCAAAAGGCGCAAATTGCTTTCGACAATATG
AGGAGGGCAGGTATTGAACCTAGTGACAAATGCATAGCTTTGGTATTAAGTGCATATGAAAAGGAAAACAGGCTAAACGCAGCATTGGAACTTCTAATAGATTTA
GAGAAGGATAACCTCATGGTTGGGAAGGAAGCTTCAGAAATATTAGCAGCTTGGCTTAAAAGACTTGGGGTGGTAGAAGAGGTTGAACTTGTCTTGAGGGAATAC
ACTGTGAAAGAAGCAAGCGGATAAGCAGGAGAAACCCTGAGATTAGTAGTTCAATCACCCGTTAGGCTCCTTTAGGGACGCACGGAAATAGCGAGGCACCAATGT
GTCGCATTGAGGCCAGTTGTTTTTGGACGATTTGTTGATCAATGCTGCTCATGGATGGGAAACTGTTGCTTCCAATTTGGGACAATTCCAAGGGAGATTGCAACT
TCGGCCGTGAAGCTTGCTCGATGGGATCGAGAAGTCGAGAAAGATCTGATTCAGCAGGAAAAGAAGGGCAATCTGATGGAAGGTTTACAATTGAGGATAGAGTTG
GCTTCTTGACTAGAGCTGGTGGCTGTGCCTGTAGATTGGGTGCGGGTCTTGGAAGCTCAGATGTTGCTAGGTGCGCTTGCAAATACGAGATCTCTACTTGCAACC
TTGCAGCCTAAAAATTAATGTATATCACTCAAATTTTGGCATTTATATATTGATATACAAATTATATGTGAAACTAAAATTGAAAACTAAATAATGTATACAAGT
CTGTTTCGCATATGAAGATGTATGATGATGTATGATGATGATCACAAGTCTCAAATGGATATTATAGAAGTAATTGAGGGCTCCATTTGAT
Protein sequence
MDRLPVECICCILAFTSPKDVCKIAGVSTAFRSAADSDLLWRTFLPPDYRQIIAQSSSSPLFWFLNSLPEKALYFHLSDHPLLIGTGNSSITLEKDSGSKCYMIG
SRDIDATMRQPYWTWKFVLQSSNILHQLHPKQPLVNGNPGSSYSCYWRGSIAQTFGVLRSRRRCSQLATVAAIVEEFHKLESEREKPRFRWVEVGSDITEMQKKA
ISQLPAKMTKRCKALMKQLICFSPQKGNLSDMLAAWVRIMKPERADWLSVLKHMRISNHPLYIEVAEAALVEITFEANTRDYTKIIHYHGKRNQLEDAEKILLRM
RERGFACDQITLTTMIHIYSKADKLNLAKRTFEELKLLEQPLDKRSYGAMIMAYVRAGMPEEGENILKEMDAKDINAGSEVYKALLRAYSMAGNAEGAQRVFDAI
QLAAIPPDEKLCGLLINAYLMAGQSQKAQIAFDNMRRAGIEPSDKCIALVLSAYEKENRLNAALELLIDLEKDNLMVGKEASEILAAWLKRLGVVEEVELVLREY
TVKEASG
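
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly. The names `CODON_TABLE` and `translate` are illustrative. Only the first 24 nt and the last 24 nt of the CDS are used so the example stays short.

```python
# Standard genetic code laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGATAGACTACCGGTGGAGTGC"  # first 24 nt of the CDS
cds_end = "ACTGTGAAAGAAGCAAGCGGATAA"    # last 24 nt of the CDS
print(translate(cds_start))  # MDRLPVEC
print(translate(cds_end))    # TVKEASG
```

Both fragments agree with the start (`MDRLPVEC...`) and end (`...TVKEASG`) of the protein sequence shown on this page.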