; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g0622 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g0622
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationMC09:5692620..5699400
RNA-Seq ExpressionMC09g0622
SyntenyMC09g0622
Gene Ontology termsGO:0006749 - glutathione metabolic process (biological process)
GO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR004045 - Glutathione S-transferase, N-terminal
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR036249 - Thioredoxin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575842.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]9.07e-28184.25Show/hide
Query:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH
        M     Q+WA+SNRILSL A SLSPIHIPQIQ+QLI+QNLHS+  IAHHFIN CH LRLLDSAL FFT  P PHVFV NSLIRAFSHSKIP TPLSIYAH
Subjt:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH

Query:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFE
        MNRNSILPNNYTFPFLLKSL+DFN LV GQSVH HVVK G+VSDVYVQNSLMDVYASCGRMGLC+KVFDEMP RDVVSWTVLIMGYR  LMFDDAL+AFE
Subjt:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFE

Query:  HMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWF
         MQYAGVEPN VTMVNALAACA +GAIEMGVWIHEFVKRRGWE+DVILGTSLIDMYGKCGRI+EGLVVFQAMK+KNV+TWNALI GLALAKSGEEAIAWF
Subjt:  HMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWF

Query:  KRMDEEG-VEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEV
        KRM+EEG +EADEVTLVAVLCACSHSGLV+ GR+IF +++DG +GFSPGI+H+SCMVDLLARSGCIEE+F LIK+MPFDATKAMWGSLLAG RA GSLE+
Subjt:  KRMDEEG-VEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEV

Query:  SEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVE
        SEFAARKLVEMEPENGAYY VLSNI AEMG+W EVE+VR+IM+  GLKKD GSSSVE
Subjt:  SEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVE

XP_022149919.1 pentatricopeptide repeat-containing protein At1g09190-like isoform X1 [Momordica charantia]0.0100Show/hide
Query:  MASSIKVHGIPISTATCRVLACLYEKELQFELVNVKMHEEEHKKEPFLSLNPFGQIPGFQNAPVGLVLTFKKDFFVYVESSSGAMAMKPSAAQLWAQSNR
        MASSIKVHGIPISTATCRVLACLYEKELQFELVNVKMHEEEHKKEPFLSLNPFGQIPGFQNAPVGLVLTFKKDFFVYVESSSGAMAMKPSAAQLWAQSNR
Subjt:  MASSIKVHGIPISTATCRVLACLYEKELQFELVNVKMHEEEHKKEPFLSLNPFGQIPGFQNAPVGLVLTFKKDFFVYVESSSGAMAMKPSAAQLWAQSNR

Query:  ILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFP
        ILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFP
Subjt:  ILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFP

Query:  FLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTM
        FLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTM
Subjt:  FLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTM

Query:  VNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVT
        VNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVT
Subjt:  VNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVT

Query:  LVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPEN
        LVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPEN
Subjt:  LVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPEN

Query:  GAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE
        GAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE
Subjt:  GAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE

XP_022149927.1 pentatricopeptide repeat-containing protein At5g56310-like isoform X2 [Momordica charantia]0.0100Show/hide
Query:  MAMKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIY
        MAMKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIY
Subjt:  MAMKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIY

Query:  AHMNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVA
        AHMNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVA
Subjt:  AHMNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVA

Query:  FEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIA
        FEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIA
Subjt:  FEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIA

Query:  WFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLE
        WFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLE
Subjt:  WFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLE

Query:  VSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE
        VSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE
Subjt:  VSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE

XP_023548659.1 pentatricopeptide repeat-containing protein At5g56310-like [Cucurbita pepo subsp. pepo]3.88e-28284.46Show/hide
Query:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH
        M     Q+WA+SNRILSL A SLSPIHIPQIQ+QLI+QNLHS+  IAHHFIN CH LRLLDSALLFFT  P PHVFV NSLIRAFSHSKIPHTPLSIYAH
Subjt:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH

Query:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFE
        MNRNSILPNNYTFPFLLKSL+DFN LV GQSVH HVVK G+VSDVYVQNSLMDVYASCGRMGLC+KVFDEMP RDVVSWTVLIMGYR  LMFDDAL+AFE
Subjt:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFE

Query:  HMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWF
         MQYAGVEPN VTMVNALAACA +GAIEMG+WIHEFVKRRGWE+DVILGTSLIDMYGKCGRI+EGLVVFQAMK+KNV+TWNALI GLALAKSGEEAIAWF
Subjt:  HMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWF

Query:  KRMDEEG-VEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEV
        KRMDE+G V+ADEVTLVAVLCACSHSGLV+ GR+IF +++DG +GFSPGI+H+SCMVDLLARSGCIEE+F LIK+MPFDATKAMWGSLLAG RA GSLE+
Subjt:  KRMDEEG-VEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEV

Query:  SEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVE
        SEFAARKLVEMEPENGAYY VLSNI AEMG+W EVE+VR IM+  GLKKD GSSSVE
Subjt:  SEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVE

XP_038898992.1 pentatricopeptide repeat-containing protein At5g56310-like [Benincasa hispida]3.25e-28584.53Show/hide
Query:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH
        M P  AQ  A+SNRILS+   SLSPIHIPQIQ+QLILQNLHS+  IAHHFIN CH L+LLDSALLFF H P PHVFV NSLIRAFSHSKIPHTPLSIY H
Subjt:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH

Query:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFE
        MNRNSI PNNYTFPFLLKSL+DFNDLV GQSVHTHV+K G+V+DVYVQNSLMDVYASCG+MGLC+KVFDEMPQRDVVSWTVLIMGYR  LMFDDAL+AFE
Subjt:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFE

Query:  HMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWF
         MQYAGVEPN VTMVNALAACA +GAIEMGVWIHEFVKR+GWE+DVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNV+TWNALIKGLALAKSGEEAIAWF
Subjt:  HMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWF

Query:  KRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVS
         RMDEEGVE DEVTLVAVLCACSHSGLV+ G++IF++L D  +GFSPGIKH+SCMVDLLAR GCIE AFVLIKDMPF+ATKAMWGSLLAG RA+G LEVS
Subjt:  KRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVS

Query:  EFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE
        E AA+KLVEMEPENGAYYVVLSNI AEM +W EVE+VR++M+ERGLKKD GSSSVELQE
Subjt:  EFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE

TrEMBL top hitse value%identityAlignment
A0A1S3BRY1 pentatricopeptide repeat-containing protein At5g56310-like1.91e-27480.61Show/hide
Query:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH
        M+P   Q  A+S RILS+   SLSP+HIPQIQ+QLIL+NLHS   IAHHFIN CH L LLDSA LFFTH P PHVF+ NSLIRAF+HS IPHTPLSIY H
Subjt:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH

Query:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFE
        MNRNSI PNNYTFPF+LKSL+DF DLV GQSVHTHVVK G  SD+YVQN+LMDVYASCG+MGLC+KVFDEM QRDVVSWT+LIMGYR  LM DDAL+ FE
Subjt:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFE

Query:  HMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWF
         MQYAGVEPN VT+VNALAACA +GAIEMGVWIHEFVK + WEVDV+LGT+LIDMYGKCGRIKE L VFQAMKEKNV+TWN LI GLALAKSGEEAIAWF
Subjt:  HMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWF

Query:  KRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVS
        KRMDEEGVEAD+VTLVAVLCACSHSGLVN GR+IFR+L+ G +GFSP IKH+SCMVD+LAR+GCIEEAFV+IKDMPF+ATKAMWGSLL G RA+G+LEVS
Subjt:  KRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVS

Query:  EFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE
        E AARKLVEMEPENGAYYVVLSNI AEMG+W EVE+VR+IM+ERGLKKD GSSSVELQE
Subjt:  EFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE

A0A6J1D8G0 pentatricopeptide repeat-containing protein At1g09190-like isoform X10.0100Show/hide
Query:  MASSIKVHGIPISTATCRVLACLYEKELQFELVNVKMHEEEHKKEPFLSLNPFGQIPGFQNAPVGLVLTFKKDFFVYVESSSGAMAMKPSAAQLWAQSNR
        MASSIKVHGIPISTATCRVLACLYEKELQFELVNVKMHEEEHKKEPFLSLNPFGQIPGFQNAPVGLVLTFKKDFFVYVESSSGAMAMKPSAAQLWAQSNR
Subjt:  MASSIKVHGIPISTATCRVLACLYEKELQFELVNVKMHEEEHKKEPFLSLNPFGQIPGFQNAPVGLVLTFKKDFFVYVESSSGAMAMKPSAAQLWAQSNR

Query:  ILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFP
        ILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFP
Subjt:  ILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFP

Query:  FLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTM
        FLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTM
Subjt:  FLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTM

Query:  VNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVT
        VNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVT
Subjt:  VNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVT

Query:  LVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPEN
        LVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPEN
Subjt:  LVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPEN

Query:  GAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE
        GAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE
Subjt:  GAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE

A0A6J1D9C2 pentatricopeptide repeat-containing protein At5g56310-like isoform X20.0100Show/hide
Query:  MAMKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIY
        MAMKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIY
Subjt:  MAMKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIY

Query:  AHMNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVA
        AHMNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVA
Subjt:  AHMNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVA

Query:  FEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIA
        FEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIA
Subjt:  FEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIA

Query:  WFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLE
        WFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLE
Subjt:  WFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLE

Query:  VSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE
        VSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE
Subjt:  VSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQE

A0A6J1GP72 pentatricopeptide repeat-containing protein At5g56310-like4.44e-27883.81Show/hide
Query:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH
        M     Q+WA+SNRILSL A SLSPIHIPQIQ+QLI+QNLHS+  IAHHFIN CH LRLLDSAL FFT  P PHVFV NSLIRAFSHSKIP TPLSIYAH
Subjt:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH

Query:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFE
        MNRNSILPNNYTFPFLLKSL+DFN LV GQSVH HVVK G+VSDVYVQNSLMDVYASCGRMGLC+KVFDEMP RDVVSWTVLIMGYR  LMFDDAL+AFE
Subjt:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFE

Query:  HMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWF
         MQYAGV PN VTMVNALAACA +GAIEMGVWIHEFVKRRGWE+DVILGTSLIDMYGKCGRI+EGLVVFQAMK+KNV+TWNALI GLALAKSGEEAIAWF
Subjt:  HMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWF

Query:  KRMDEEG-VEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEV
        KRM+EEG +EADEVTLVAVLCACSHSGLV+ GR+IF +++DG +GFSPGI+H+SCMVDLLARSG IEE+F LIK+MPFDATKAMWGSLLAG RA GSLE+
Subjt:  KRMDEEG-VEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEV

Query:  SEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVE
        SEFAARKLVEMEPENGAYY VLSNI AEMG+W EVE+VR+IM+  GLKKD GSSSVE
Subjt:  SEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVE

A0A6J1JWL5 pentatricopeptide repeat-containing protein At5g56310-like2.53e-28084.03Show/hide
Query:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH
        M     Q+WA+SNRILSL A SLSPIHIPQIQ+QLI+QNLHS+  IAHHFIN CH LRLLDSAL FFT  P PHVFV NSLIRAFSHSKIPHTPLSIYAH
Subjt:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH

Query:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFE
        MNR SILPNNYTFPFLLKSL+DFN LV GQSVH HVVK G+VSDVYVQNSLMDVYASCGRMGLC+KVFDEMP RDVVSWTVLIMGYR  LMFDDAL+AFE
Subjt:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFE

Query:  HMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWF
         MQYAGVEPN VTMVNALAACA +GAIEMGVWIH FVKRRGWE+DVILGTSLIDMYGKCGRI+EGLVVFQAMK+KNV+TWNALI GLALAKSGEEAIAWF
Subjt:  HMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWF

Query:  KRMDEEG-VEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEV
        KRMDE G VEADEVTLV VLCACSHSGLV+ GR+IF +++DG +GFSPGI+H+SCMVDLLARSGCIEE+F LIK+MPFDATKAMWGSLLAG RA GSLE+
Subjt:  KRMDEEG-VEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEV

Query:  SEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVE
        SEFAARKLVEMEPENGAYY VLSNI AEMG+W EVE+VR+IM+  GLKKD GSSSVE
Subjt:  SEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVE

SwissProt top hitse value%identityAlignment
O80488 Pentatricopeptide repeat-containing protein At1g091901.8e-7836.21Show/hide
Query:  RILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTF
        ++L LL    +   +P+I + L+   LH S  +  HFI+ C  L   D A   F+H  NP+V VFN++I+ +S    P   LS ++ M    I  + YT+
Subjt:  RILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTF

Query:  PFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGY-------RGGLMFD-------------
          LLKS S  +DL  G+ VH  +++ GF     ++  ++++Y S GRMG  +KVFDEM +R+VV W ++I G+       RG  +F              
Subjt:  PFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGY-------RGGLMFD-------------

Query:  -----------DALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVI-LGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWN
                   +AL  F  M   G +P+  T+V  L   A  G ++ G WIH   +  G   D I +G +L+D Y K G ++    +F+ M+ +NV +WN
Subjt:  -----------DALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVI-LGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWN

Query:  ALIKGLALAKSGEEAIAWFKRMDEEG-VEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDAT
         LI G A+   GE  I  F  M EEG V  +E T + VL  CS++G V +G E+F  +++  +      +H+  MVDL++RSG I EAF  +K+MP +A 
Subjt:  ALIKGLALAKSGEEAIAWFKRMDEEG-VEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDAT

Query:  KAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSV
         AMWGSLL+  R++G ++++E AA +LV++EP N   YV+LSN+ AE GRW +VE+VR +M++  L+K +G S++
Subjt:  KAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSV

P93011 Pentatricopeptide repeat-containing protein At2g337601.2e-7934.81Show/hide
Query:  IPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFPFLLKSLSDFNDLV
        + Q+ + LI+     S ++    I      R +    L F   P P  F+FNS+I++ S  ++P   ++ Y  M  +++ P+NYTF  ++KS +D + L 
Subjt:  IPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFPFLLKSLSDFNDLV

Query:  GGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAI
         G+ VH H V  GF  D YVQ +L+  Y+ CG M   R+VFD MP++ +V+W  L+ G+    + D+A+  F  M+ +G EP+  T V+ L+ACA  GA+
Subjt:  GGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAI

Query:  EMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEE-GVEADEVTLVAVLCACSHSG
         +G W+H+++   G +++V LGT+LI++Y +CG + +   VF  MKE NV  W A+I        G++A+  F +M+++ G   + VT VAVL AC+H+G
Subjt:  EMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEE-GVEADEVTLVAVLCACSHSG

Query:  LVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDAT-----KAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVL
        LV +GR +++ +   SY   PG++H  CMVD+L R+G ++EA+  I  +  DAT      A+W ++L   + + + ++    A++L+ +EP+N  ++V+L
Subjt:  LVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDAT-----KAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVL

Query:  SNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEKAIQTTTKHQSPQ
        SNI A  G+  EV  +RD M    L+K  G S +E++ K    +   +S Q
Subjt:  SNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEKAIQTTTKHQSPQ

Q56X05 Pentatricopeptide repeat-containing protein At1g061433.1e-7831.55Show/hide
Query:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH
        ++  +A L      +  ++    +P  +    + +I  +L+    + + FI AC   + LD A+   T    P+VFV+N+L + F     P   L +Y  
Subjt:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH

Query:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQR--------------------------
        M R+S+ P++YT+  L+K+ S  +    G+S+  H+ K+GF   V +Q +L+D Y++ GR+   RKVFDEMP+R                          
Subjt:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQR--------------------------

Query:  ------------------------------------DVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVK
                                            D++SWT +I GY     + +A+  F  M   G+ P+ VTM   ++ACA  G +E+G  +H +  
Subjt:  ------------------------------------DVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVK

Query:  RRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRAL
        + G+ +DV +G++L+DMY KCG ++  L+VF  + +KN+F WN++I+GLA     +EA+  F +M+ E V+ + VT V+V  AC+H+GLV++GR I+R++
Subjt:  RRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRAL

Query:  VDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVR
        +D  Y     ++H+  MV L +++G I EA  LI +M F+    +WG+LL G R + +L ++E A  KL+ +EP N  YY +L ++ AE  RW +V E+R
Subjt:  VDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVR

Query:  DIMRERGLKK-DSGSSSVELQEK
          MRE G++K   G+SS+ + ++
Subjt:  DIMRERGLKK-DSGSSSVELQEK

Q9FG16 Pentatricopeptide repeat-containing protein At5g065403.1e-7834.78Show/hide
Query:  LSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINAC-------HFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILP
        L+LL +  S   +  I   L+  +L S   +A   +  C           LL  A   F+   NP++FVFN LIR FS    P      Y  M ++ I P
Subjt:  LSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINAC-------HFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILP

Query:  NNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASC----------GRMGL---------------------CRKVFDEMPQRDVV
        +N TFPFL+K+ S+   ++ G+  H+ +V++GF +DVYV+NSL+ +YA+C          G+MG                       R++FDEMP R++ 
Subjt:  NNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASC----------GRMGL---------------------CRKVFDEMPQRDVV

Query:  SWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNV
        +W+++I GY     F+ A+  FE M+  GV  N   MV+ +++CA  GA+E G   +E+V +    V++ILGT+L+DM+ +CG I++ + VF+ + E + 
Subjt:  SWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNV

Query:  FTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPF
         +W+++IKGLA+     +A+ +F +M   G    +VT  AVL ACSH GLV KG EI+  +    +G  P ++H+ C+VD+L R+G + EA   I  M  
Subjt:  FTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPF

Query:  DATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEK
             + G+LL   +   + EV+E     L++++PE+  YYV+LSNI A  G+W ++E +RD+M+E+ +KK  G S +E+  K
Subjt:  DATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEK

Q9FMA1 Pentatricopeptide repeat-containing protein At5g563103.5e-8235.54Show/hide
Query:  IPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHT---PLSIYAHMNRNSILPNNYTFPFLLKSLSDFN
        + Q    +I+  L+        FI AC     L  A   FTH P P+ ++ N++IRA S    P+     +++Y  +      P+ +TFPF+LK     +
Subjt:  IPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHT---PLSIYAHMNRNSILPNNYTFPFLLKSLSDFN

Query:  DLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDV---------------------------------VSWTVLIMGYRGGLM
        D+  G+ +H  VV +GF S V+V   L+ +Y SCG +G  RK+FDEM  +DV                                 VSWT +I GY     
Subjt:  DLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDV---------------------------------VSWTVLIMGYRGGLM

Query:  FDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAK
          +A+  F+ M    VEP+ VT++  L+ACA  G++E+G  I  +V  RG    V L  ++IDMY K G I + L VF+ + E+NV TW  +I GLA   
Subjt:  FDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAK

Query:  SGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGG
         G EA+A F RM + GV  ++VT +A+L ACSH G V+ G+ +F ++    YG  P I+H+ CM+DLL R+G + EA  +IK MPF A  A+WGSLLA  
Subjt:  SGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGG

Query:  RANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEKAIQTTTKHQS-PQNPHFHSHLQGSSLSVR
          +  LE+ E A  +L+++EP N   Y++L+N+ + +GRW E   +R++M+  G+KK +G SS+E++ +  +  +   + PQ    H  LQ   L ++
Subjt:  RANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEKAIQTTTKHQS-PQNPHFHSHLQGSSLSVR

Arabidopsis top hitse value%identityAlignment
AT1G06150.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.2e-7931.55Show/hide
Query:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH
        ++  +A L      +  ++    +P  +    + +I  +L+    + + FI AC   + LD A+   T    P+VFV+N+L + F     P   L +Y  
Subjt:  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAH

Query:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQR--------------------------
        M R+S+ P++YT+  L+K+ S  +    G+S+  H+ K+GF   V +Q +L+D Y++ GR+   RKVFDEMP+R                          
Subjt:  MNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQR--------------------------

Query:  ------------------------------------DVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVK
                                            D++SWT +I GY     + +A+  F  M   G+ P+ VTM   ++ACA  G +E+G  +H +  
Subjt:  ------------------------------------DVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVK

Query:  RRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRAL
        + G+ +DV +G++L+DMY KCG ++  L+VF  + +KN+F WN++I+GLA     +EA+  F +M+ E V+ + VT V+V  AC+H+GLV++GR I+R++
Subjt:  RRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRAL

Query:  VDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVR
        +D  Y     ++H+  MV L +++G I EA  LI +M F+    +WG+LL G R + +L ++E A  KL+ +EP N  YY +L ++ AE  RW +V E+R
Subjt:  VDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVR

Query:  DIMRERGLKK-DSGSSSVELQEK
          MRE G++K   G+SS+ + ++
Subjt:  DIMRERGLKK-DSGSSSVELQEK

AT1G09190.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-7936.21Show/hide
Query:  RILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTF
        ++L LL    +   +P+I + L+   LH S  +  HFI+ C  L   D A   F+H  NP+V VFN++I+ +S    P   LS ++ M    I  + YT+
Subjt:  RILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTF

Query:  PFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGY-------RGGLMFD-------------
          LLKS S  +DL  G+ VH  +++ GF     ++  ++++Y S GRMG  +KVFDEM +R+VV W ++I G+       RG  +F              
Subjt:  PFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGY-------RGGLMFD-------------

Query:  -----------DALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVI-LGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWN
                   +AL  F  M   G +P+  T+V  L   A  G ++ G WIH   +  G   D I +G +L+D Y K G ++    +F+ M+ +NV +WN
Subjt:  -----------DALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVI-LGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWN

Query:  ALIKGLALAKSGEEAIAWFKRMDEEG-VEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDAT
         LI G A+   GE  I  F  M EEG V  +E T + VL  CS++G V +G E+F  +++  +      +H+  MVDL++RSG I EAF  +K+MP +A 
Subjt:  ALIKGLALAKSGEEAIAWFKRMDEEG-VEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDAT

Query:  KAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSV
         AMWGSLL+  R++G ++++E AA +LV++EP N   YV+LSN+ AE GRW +VE+VR +M++  L+K +G S++
Subjt:  KAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSV

AT2G33760.1 Pentatricopeptide repeat (PPR) superfamily protein8.9e-8134.81Show/hide
Query:  IPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFPFLLKSLSDFNDLV
        + Q+ + LI+     S ++    I      R +    L F   P P  F+FNS+I++ S  ++P   ++ Y  M  +++ P+NYTF  ++KS +D + L 
Subjt:  IPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFPFLLKSLSDFNDLV

Query:  GGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAI
         G+ VH H V  GF  D YVQ +L+  Y+ CG M   R+VFD MP++ +V+W  L+ G+    + D+A+  F  M+ +G EP+  T V+ L+ACA  GA+
Subjt:  GGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAI

Query:  EMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEE-GVEADEVTLVAVLCACSHSG
         +G W+H+++   G +++V LGT+LI++Y +CG + +   VF  MKE NV  W A+I        G++A+  F +M+++ G   + VT VAVL AC+H+G
Subjt:  EMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEE-GVEADEVTLVAVLCACSHSG

Query:  LVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDAT-----KAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVL
        LV +GR +++ +   SY   PG++H  CMVD+L R+G ++EA+  I  +  DAT      A+W ++L   + + + ++    A++L+ +EP+N  ++V+L
Subjt:  LVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDAT-----KAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVL

Query:  SNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEKAIQTTTKHQSPQ
        SNI A  G+  EV  +RD M    L+K  G S +E++ K    +   +S Q
Subjt:  SNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEKAIQTTTKHQSPQ

AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-7934.78Show/hide
Query:  LSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINAC-------HFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILP
        L+LL +  S   +  I   L+  +L S   +A   +  C           LL  A   F+   NP++FVFN LIR FS    P      Y  M ++ I P
Subjt:  LSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINAC-------HFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILP

Query:  NNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASC----------GRMGL---------------------CRKVFDEMPQRDVV
        +N TFPFL+K+ S+   ++ G+  H+ +V++GF +DVYV+NSL+ +YA+C          G+MG                       R++FDEMP R++ 
Subjt:  NNYTFPFLLKSLSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASC----------GRMGL---------------------CRKVFDEMPQRDVV

Query:  SWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNV
        +W+++I GY     F+ A+  FE M+  GV  N   MV+ +++CA  GA+E G   +E+V +    V++ILGT+L+DM+ +CG I++ + VF+ + E + 
Subjt:  SWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNV

Query:  FTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPF
         +W+++IKGLA+     +A+ +F +M   G    +VT  AVL ACSH GLV KG EI+  +    +G  P ++H+ C+VD+L R+G + EA   I  M  
Subjt:  FTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPF

Query:  DATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEK
             + G+LL   +   + EV+E     L++++PE+  YYV+LSNI A  G+W ++E +RD+M+E+ +KK  G S +E+  K
Subjt:  DATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEK

AT5G56310.1 Pentatricopeptide repeat (PPR) superfamily protein2.5e-8335.54Show/hide
Query:  IPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHT---PLSIYAHMNRNSILPNNYTFPFLLKSLSDFN
        + Q    +I+  L+        FI AC     L  A   FTH P P+ ++ N++IRA S    P+     +++Y  +      P+ +TFPF+LK     +
Subjt:  IPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHT---PLSIYAHMNRNSILPNNYTFPFLLKSLSDFN

Query:  DLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDV---------------------------------VSWTVLIMGYRGGLM
        D+  G+ +H  VV +GF S V+V   L+ +Y SCG +G  RK+FDEM  +DV                                 VSWT +I GY     
Subjt:  DLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDV---------------------------------VSWTVLIMGYRGGLM

Query:  FDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAK
          +A+  F+ M    VEP+ VT++  L+ACA  G++E+G  I  +V  RG    V L  ++IDMY K G I + L VF+ + E+NV TW  +I GLA   
Subjt:  FDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAK

Query:  SGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGG
         G EA+A F RM + GV  ++VT +A+L ACSH G V+ G+ +F ++    YG  P I+H+ CM+DLL R+G + EA  +IK MPF A  A+WGSLLA  
Subjt:  SGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGG

Query:  RANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEKAIQTTTKHQS-PQNPHFHSHLQGSSLSVR
          +  LE+ E A  +L+++EP N   Y++L+N+ + +GRW E   +R++M+  G+KK +G SS+E++ +  +  +   + PQ    H  LQ   L ++
Subjt:  RANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEKAIQTTTKHQS-PQNPHFHSHLQGSSLSVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGCAGCATCAAAGTCCACGGAATCCCCATTTCCACAGCCACCTGTAGGGTTCTAGCTTGTCTGTACGAGAAGGAGCTTCAATTCGAGCTCGTTAATGTCAAAAT
GCACGAAGAGGAACATAAGAAAGAGCCCTTCCTCTCACTCAATCCGTTTGGTCAAATTCCTGGGTTCCAAAATGCACCTGTAGGGTTGGTTTTAACTTTTAAGAAAGACT
TTTTCGTCTATGTAGAATCGAGCAGTGGAGCCATGGCCATGAAACCTTCTGCCGCACAATTGTGGGCTCAATCAAACAGAATCCTCTCTCTTCTTGCCGCTTCTCTTTCC
CCAATTCACATCCCCCAAATTCAATCGCAACTCATCCTTCAAAACCTCCATTCCAGCACCGCCATAGCCCACCACTTCATAAACGCCTGCCACTTCCTTCGACTTCTCGA
TTCGGCTCTCCTCTTCTTCACCCATTTCCCCAATCCCCACGTCTTCGTCTTTAATTCCCTGATCAGAGCTTTCTCTCACTCCAAAATCCCCCACACCCCTCTTTCAATAT
ACGCCCACATGAACAGAAACTCGATTCTTCCCAACAACTACACCTTCCCTTTCCTTCTCAAGTCCTTGTCTGACTTTAACGACCTTGTGGGTGGGCAATCTGTTCACACC
CATGTTGTGAAATGGGGTTTTGTTTCTGATGTTTATGTGCAGAACTCATTGATGGATGTTTATGCGTCGTGTGGGAGAATGGGGCTCTGTAGGAAGGTGTTCGACGAAAT
GCCCCAGAGAGATGTTGTGTCTTGGACTGTTCTGATTATGGGGTACCGAGGTGGGTTGATGTTTGATGACGCTTTGGTTGCGTTTGAGCACATGCAATATGCGGGTGTCG
AACCGAACTGCGTGACGATGGTGAATGCGTTGGCTGCGTGTGCAGGCTATGGAGCCATTGAAATGGGTGTTTGGATACACGAGTTTGTGAAGAGGAGAGGGTGGGAAGTG
GATGTGATTTTGGGGACTTCTCTGATTGATATGTATGGGAAGTGTGGGAGAATCAAGGAGGGGTTGGTTGTTTTTCAGGCCATGAAAGAGAAGAATGTATTTACTTGGAA
TGCACTCATTAAAGGGCTAGCTTTGGCCAAGAGTGGAGAAGAGGCCATTGCCTGGTTTAAGAGAATGGATGAAGAAGGAGTTGAGGCTGATGAAGTGACATTAGTGGCAG
TGCTTTGTGCTTGTAGCCACTCCGGATTGGTCAACAAGGGCAGGGAGATCTTTCGAGCCTTGGTCGATGGGAGCTATGGGTTTTCTCCCGGAATCAAACATTTTTCGTGT
ATGGTAGATCTCTTGGCGCGTTCAGGGTGTATTGAGGAGGCTTTTGTGTTGATAAAAGATATGCCCTTTGATGCCACAAAAGCAATGTGGGGGTCTTTGTTAGCTGGTGG
CAGGGCTAATGGAAGCTTGGAAGTGAGCGAGTTTGCAGCAAGGAAGCTTGTTGAAATGGAGCCAGAGAATGGTGCATATTATGTTGTGTTGTCTAATATTCTTGCAGAGA
TGGGGAGATGGGGTGAGGTTGAGGAAGTGAGAGATATTATGAGAGAGAGAGGACTGAAGAAGGATTCGGGGTCGAGTTCTGTTGAGCTTCAAGAGAAAGCTATACAGACA
ACAACAAAGCATCAAAGTCCACAGAATCCCCATTTCCACAGCCACCTGCAGGGTTCTAGCTTGTCTGTACGAGAAGGAGCTTCAATTCCAGTTCGTTAA
mRNA sequenceShow/hide mRNA sequence
TCAGCGCAATCTTATTTTGTTTAAACAAATTTCAATCCGCGACTCCACCCGTCGATTTTTCCCTCTCTTCCTTCATCCCACTGTGGTCGCTCCCCACCAAAGTCGCCGCC
GTCGCCGCATCCCGCAGAAACCACTGCCGCCGCCGCATCGTTTCTGGACGTTTCAGAGTCTCTGCAACGTTGCTCACCGTCCTCAGCCTCTTCGAACTTCTTTTACTTCT
TCCAAGTGATGTTGCCGACGTGACGAACCACCCATCCGAGCGCTCCCTAATTTTGATCATCGAACCAAGATGCGTCAAAATTCATTTCTAGTTTGACCGAGGACTTGCCT
TTCATTACTTCTTTCACTATGAGGGAACTGTGTTCGTATCTGTAAATACTCTCTTGTCGGAATCAATGGCGAGCAGCATCAAAGTCCACGGAATCCCCATTTCCACAGCC
ACCTGTAGGGTTCTAGCTTGTCTGTACGAGAAGGAGCTTCAATTCGAGCTCGTTAATGTCAAAATGCACGAAGAGGAACATAAGAAAGAGCCCTTCCTCTCACTCAATCC
GTTTGGTCAAATTCCTGGGTTCCAAAATGCACCTGTAGGGTTGGTTTTAACTTTTAAGAAAGACTTTTTCGTCTATGTAGAATCGAGCAGTGGAGCCATGGCCATGAAAC
CTTCTGCCGCACAATTGTGGGCTCAATCAAACAGAATCCTCTCTCTTCTTGCCGCTTCTCTTTCCCCAATTCACATCCCCCAAATTCAATCGCAACTCATCCTTCAAAAC
CTCCATTCCAGCACCGCCATAGCCCACCACTTCATAAACGCCTGCCACTTCCTTCGACTTCTCGATTCGGCTCTCCTCTTCTTCACCCATTTCCCCAATCCCCACGTCTT
CGTCTTTAATTCCCTGATCAGAGCTTTCTCTCACTCCAAAATCCCCCACACCCCTCTTTCAATATACGCCCACATGAACAGAAACTCGATTCTTCCCAACAACTACACCT
TCCCTTTCCTTCTCAAGTCCTTGTCTGACTTTAACGACCTTGTGGGTGGGCAATCTGTTCACACCCATGTTGTGAAATGGGGTTTTGTTTCTGATGTTTATGTGCAGAAC
TCATTGATGGATGTTTATGCGTCGTGTGGGAGAATGGGGCTCTGTAGGAAGGTGTTCGACGAAATGCCCCAGAGAGATGTTGTGTCTTGGACTGTTCTGATTATGGGGTA
CCGAGGTGGGTTGATGTTTGATGACGCTTTGGTTGCGTTTGAGCACATGCAATATGCGGGTGTCGAACCGAACTGCGTGACGATGGTGAATGCGTTGGCTGCGTGTGCAG
GCTATGGAGCCATTGAAATGGGTGTTTGGATACACGAGTTTGTGAAGAGGAGAGGGTGGGAAGTGGATGTGATTTTGGGGACTTCTCTGATTGATATGTATGGGAAGTGT
GGGAGAATCAAGGAGGGGTTGGTTGTTTTTCAGGCCATGAAAGAGAAGAATGTATTTACTTGGAATGCACTCATTAAAGGGCTAGCTTTGGCCAAGAGTGGAGAAGAGGC
CATTGCCTGGTTTAAGAGAATGGATGAAGAAGGAGTTGAGGCTGATGAAGTGACATTAGTGGCAGTGCTTTGTGCTTGTAGCCACTCCGGATTGGTCAACAAGGGCAGGG
AGATCTTTCGAGCCTTGGTCGATGGGAGCTATGGGTTTTCTCCCGGAATCAAACATTTTTCGTGTATGGTAGATCTCTTGGCGCGTTCAGGGTGTATTGAGGAGGCTTTT
GTGTTGATAAAAGATATGCCCTTTGATGCCACAAAAGCAATGTGGGGGTCTTTGTTAGCTGGTGGCAGGGCTAATGGAAGCTTGGAAGTGAGCGAGTTTGCAGCAAGGAA
GCTTGTTGAAATGGAGCCAGAGAATGGTGCATATTATGTTGTGTTGTCTAATATTCTTGCAGAGATGGGGAGATGGGGTGAGGTTGAGGAAGTGAGAGATATTATGAGAG
AGAGAGGACTGAAGAAGGATTCGGGGTCGAGTTCTGTTGAGCTTCAAGAGAAAGCTATACAGACAACAACAAAGCATCAAAGTCCACAGAATCCCCATTTCCACAGCCAC
CTGCAGGGTTCTAGCTTGTCTGTACGAGAAGGAGCTTCAATTCCAGTTCGTTAATGTCAAAATGCACGAAGAGGAACATAAGAAAGAGCCCTTCCTCTCACTCAATCCGT
TTGGTCAAATTCCTGGGTTCCAAGATGGAGATTTCTCGCTTTTTGAATCCAGGGCAATCACGCAGTACATCTCGACAACTTATGCTACTAATGGAACCCAATTGATTTCC
CAAGACCCCAAGAAAATGGTGGCCATATTAACGTGGGTTGAGGTGGAGAGCCACCATTTGGACCAAGCAGCCATGAAAGTGATTTGGGAGCTCTGCTTAAAGCCAATGTT
GGGTTTGGGTGAGGCAGATGCTGCTGTGGCTGAGAAAGGTGAAGCTGAGTTAGGCAAAGTTCTTGATATCTATGAAAGAAGCTGCCTCAGTCCAAGTACTTGGCTGGTGA
GTTTCTTCACTAGATGGGTGATCGCCATATCATCAGAACTTGAATAAAAGGCTGTGTTCTCTGGTCTGTCTTTTTTTTGTGTTGTGAGAGTTCTGTTTGTCACTTGCTGT
GTATCACATGTATCAAAAATTGGTGGTTGTGTGGTTCTGAATTTGATGATATGATTTGAGATTTTGTCTTTTAATTCACTGTTGTTTTTAGAACTGTCACCATCTTGCGT
AGAATTGGTTTGCAGGTAATTGAATCTGCTTGGAAATTCTTCATATTATGTTATGAACGTTAAAT
Protein sequenceShow/hide protein sequence
MASSIKVHGIPISTATCRVLACLYEKELQFELVNVKMHEEEHKKEPFLSLNPFGQIPGFQNAPVGLVLTFKKDFFVYVESSSGAMAMKPSAAQLWAQSNRILSLLAASLS
PIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLLDSALLFFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFPFLLKSLSDFNDLVGGQSVHT
HVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSWTVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKRRGWEV
DVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSC
MVDLLARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYVVLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVELQEKAIQT
TTKHQSPQNPHFHSHLQGSSLSVREGASIPVR