CuGenDBv2

Lsi11G013190 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi11G013190
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: AP complex subunit sigma
Genome location: chr11:21808263..21817603
RNA-Seq Expression: Lsi11G013190
Synteny: Lsi11G013190
Gene Ontology terms:
GO:0006886 - intracellular protein transport (biological process)
GO:0016192 - vesicle-mediated transport (biological process)
GO:0030121 - AP-1 adaptor complex (cellular component)
GO:0035615 - clathrin adaptor activity (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011012 - Longin-like domain superfamily
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR022775 - AP complex, mu/sigma subunit
IPR044733 - AP-1 complex subunit sigma
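The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the string can be split into its parts and the length of the locus computed. The function name `parse_location` is hypothetical, and the span calculation assumes both coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr11:21808263..21817603")
# Assumption: coordinates are 1-based and inclusive, so add one to the difference.
span = end - start + 1
print(chrom, start, end, span)
```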


Homology
GenBank top hits (e value, % identity, alignment)
KAG7011839.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.4e-166 | % identity: 85.88
Query:  MGA--FQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPED
        MGA   Q QSRLRVSKILSSTIR WER FPLL DPLRTPD SLSP +SRCFSSNSMDDDW FSKS+PR+SPPQ+RSP D REARRVPKF G+VS+PYPED
Subjt:  MGA--FQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPED

Query:  NRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTM
        NRNQRFNRHSEGSSSRFANEG +SR R+ SLSQKDFSFLEKFKLNTDNQS+S EK E  SSAQ SESMQ KQSQPQ PPEADEIF+KMKETGLIPNAV M
Subjt:  NRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTM

Query:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT
        LDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAEK DEAIRIFRKMQNNGI PNAFS+G+LIQGLYK K++EDAV+FC EMLESGHSPNLT
Subjt:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT

Query:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV
        TFVGLID LC EKGVDEAH+V+ETLKQKGFLINEK LREFLNKRAPFSPHIWE+
Subjt:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV

XP_022952706.1 pentatricopeptide repeat-containing protein At4g38150-like [Cucurbita moschata] | e value: 1.9e-166 | % identity: 85.59
Query:  MGA--FQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPED
        MGA   Q QSRLRVSKILSSTIR WER FPLL DPLRTPD SLSP +SRCFSSNSMDDDW FS+S+PR+SPPQ+RSP D REARRVPKF G+VS+PYPED
Subjt:  MGA--FQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPED

Query:  NRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTM
        NRNQRFNRHSEGSSSRFAN+G +SR R+ SLSQKDFSFLEKFKLNTDNQS+S EK E  SSAQ+SESMQ KQSQPQ PPEADEIF+KMKETGLIPNAV M
Subjt:  NRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTM

Query:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT
        LDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAEKLDEAIRIFRKMQNNGI PNAFS+G+LIQGLYK K++EDAV+FCIEMLESGHSPNLT
Subjt:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT

Query:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV
        TFVGLID LC EKGVDEAH+V+ETLKQKGFLINEK LR+FLNKRAPFSPHIWE+
Subjt:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV

XP_022969144.1 pentatricopeptide repeat-containing protein At4g38150-like [Cucurbita maxima] | e value: 6.4e-167 | % identity: 86.16
Query:  MGA--FQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPED
        MGA   Q QSRLRVSKIL STIR WER FPLL DPLRTPD SLSP +SRCFSSNSMDDDW FS+S+PR+SPPQ+RSP D REARRVPKF G+VS+PYPED
Subjt:  MGA--FQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPED

Query:  NRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTM
        NRNQRFNRH EGS SRFANEG +SR+RS SLSQKDFSFLEKFKLNTDN S+S EKTE  SSAQ SESMQ KQSQPQ PPEADEIF+KMKETGLIPNAV M
Subjt:  NRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTM

Query:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT
        LDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAE LDEAIRIFRKMQNNGI PNAFS+GVLIQGLYKCK++EDAV+FCIEMLESGHSPNLT
Subjt:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT

Query:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV
        TFVGLID LC EKGVDEAH+V+ETLKQKGFLINEKALREFLNKRAPFSPHIWE+
Subjt:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV

XP_023554525.1 pentatricopeptide repeat-containing protein At4g38150-like isoform X1 [Cucurbita pepo subsp. pepo] | e value: 7.1e-166 | % identity: 85.59
Query:  MGA--FQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPED
        MGA   Q QSRLRVSKILSSTIR WER FPLL DPLRTPD SLSP +SRCFSSNSMDDDW FS+S+PR+SPPQ+RSP D REARRVPKF G+ S+PYPED
Subjt:  MGA--FQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPED

Query:  NRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTM
        NRNQRFNRHSEGSSSRFANEG +SR+R+ SLSQKDFSFLEKFKLNTDNQS+S EK E  SSAQ SESMQ +QSQPQ PPEADEIF+KMKETGLIPNAV M
Subjt:  NRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTM

Query:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT
        LDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAEKLDEAIRIFRKMQNNGI PNAFS+G+LIQGLYK K++EDAV+FCIEMLESGHSPNLT
Subjt:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT

Query:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV
        TFVGLID LC EKG DEAH+V+ETLKQKGFLINEKALREFLNKRAPFSPHIWE+
Subjt:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV

XP_038888880.1 pentatricopeptide repeat-containing protein At4g38150 isoform X1 [Benincasa hispida] | e value: 1.7e-172 | % identity: 90.03
Query:  AFQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRS-PLDHREARRVPKFTGDVSSPYPEDNRN
        AFQFQSRLRV KILSSTIRKWERDFPLL D LR P+FSLSPAQSRCF+SNS+DDDWAFSKS+PRTSPPQ+RS P DHREARRVP F  +VS+P+P+DNRN
Subjt:  AFQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRS-PLDHREARRVPKFTGDVSSPYPEDNRN

Query:  QRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTMLDG
        QRFNR SEGSSSRFANE SISRQR+ SL+QKDFSFLE+FKLNTDNQSSS EKTEE SSA+LSESMQEKQSQPQRPPEADEIF+KMKETGLIPNAV MLDG
Subjt:  QRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTMLDG

Query:  LCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLTTFV
        LCKDGLIQEAMKLF LIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCK++EDAVAFCIEMLESGHSPNL TFV
Subjt:  LCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLTTFV

Query:  GLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV
        GLID LCNEKGVDEAHSVVETLKQKGFLINEKALRE LNKRAPFSPHIWEV
Subjt:  GLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6C0 Uncharacterized protein | e value: 6.5e-157 | % identity: 85.03
Query:  MGAFQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPEDNR
        MGAFQ QSRL+VSKILSSTIRKWE DFPLL +PLRTPDFSL PAQSR F+SNSMDDD AFS S+P  +PPQ+R P DHREARRVP+  G V++ Y  DNR
Subjt:  MGAFQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPEDNR

Query:  NQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEE-ISSAQLSESMQEK-QSQPQRPPEADEIFKKMKETGLIPNAVTM
        NQRFNRHSEGSSSRF NEGS S QRS SLSQKDFSFLEKFKLNTDNQSS KEKTEE  SSA +SESM EK QS+PQRPPEADEIF KMKETGLIPNAV M
Subjt:  NQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEE-ISSAQLSESMQEK-QSQPQRPPEADEIFKKMKETGLIPNAVTM

Query:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT
        LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEK DEAIRIFRKMQ+NGIPPNAFSFGVLIQGLYKCK+++DAVAFC EMLESGH PNLT
Subjt:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT

Query:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV
        TFVGLID LCNEKGVDEAHSVVET KQKGFLI+EKALRE LNKRAPFSP IW+V
Subjt:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV

A0A1S3CFW0 pentatricopeptide repeat-containing protein At4g38150 | e value: 3.6e-155 | % identity: 84.18
Query:  MGAFQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPEDNR
        M AFQFQSRL+VSKILSS+IRKWE DFPLL +PLRTPDFSLSPA SR FSSNSMDDD AFS+S+PR +PPQ+R P DHREARRVP+  G V+  Y  DNR
Subjt:  MGAFQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPEDNR

Query:  NQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEE-ISSAQLSESMQEK-QSQPQRPPEADEIFKKMKETGLIPNAVTM
        NQRFNRHSEGSSSRF NEGS SRQRS SLSQKDFSFLEKFKLNTDNQSS  EKTEE  SSA +SESMQEK QSQ QRPP ADEIF+KMKE+GLIPNAV M
Subjt:  NQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEE-ISSAQLSESMQEK-QSQPQRPPEADEIFKKMKETGLIPNAVTM

Query:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT
        LDGLCKDGLIQEAMKLF LIREKGTIPEVVIYTAVVDGFCKAEK DEAIRIFRKMQN GI PNAFSFGVLIQGLYKCK+++DAVAFC EMLESGH PNLT
Subjt:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT

Query:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV
        TFVGLID LC+EKGVDEAH+VVET KQKGFLI+EKALREFLNKRAPFSP IW+V
Subjt:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV

A0A5D3D5J5 Pentatricopeptide repeat-containing protein | e value: 3.6e-155 | % identity: 84.18
Query:  MGAFQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPEDNR
        M AFQFQSRL+VSKILSS+IRKWE DFPLL +PLRTPDFSLSPA SR FSSNSMDDD AFS+S+PR +PPQ+R P DHREARRVP+  G V+  Y  DNR
Subjt:  MGAFQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPEDNR

Query:  NQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEE-ISSAQLSESMQEK-QSQPQRPPEADEIFKKMKETGLIPNAVTM
        NQRFNRHSEGSSSRF NEGS SRQRS SLSQKDFSFLEKFKLNTDNQSS  EKTEE  SSA +SESMQEK QSQ QRPP ADEIF+KMKE+GLIPNAV M
Subjt:  NQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEE-ISSAQLSESMQEK-QSQPQRPPEADEIFKKMKETGLIPNAVTM

Query:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT
        LDGLCKDGLIQEAMKLF LIREKGTIPEVVIYTAVVDGFCKAEK DEAIRIFRKMQN GI PNAFSFGVLIQGLYKCK+++DAVAFC EMLESGH PNLT
Subjt:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT

Query:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV
        TFVGLID LC+EKGVDEAH+VVET KQKGFLI+EKALREFLNKRAPFSP IW+V
Subjt:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV

A0A6J1GKZ0 pentatricopeptide repeat-containing protein At4g38150-like | e value: 9.0e-167 | % identity: 85.59
Query:  MGA--FQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPED
        MGA   Q QSRLRVSKILSSTIR WER FPLL DPLRTPD SLSP +SRCFSSNSMDDDW FS+S+PR+SPPQ+RSP D REARRVPKF G+VS+PYPED
Subjt:  MGA--FQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPED

Query:  NRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTM
        NRNQRFNRHSEGSSSRFAN+G +SR R+ SLSQKDFSFLEKFKLNTDNQS+S EK E  SSAQ+SESMQ KQSQPQ PPEADEIF+KMKETGLIPNAV M
Subjt:  NRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTM

Query:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT
        LDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAEKLDEAIRIFRKMQNNGI PNAFS+G+LIQGLYK K++EDAV+FCIEMLESGHSPNLT
Subjt:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT

Query:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV
        TFVGLID LC EKGVDEAH+V+ETLKQKGFLINEK LR+FLNKRAPFSPHIWE+
Subjt:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV

A0A6J1I1Q2 pentatricopeptide repeat-containing protein At4g38150-like | e value: 3.1e-167 | % identity: 86.16
Query:  MGA--FQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPED
        MGA   Q QSRLRVSKIL STIR WER FPLL DPLRTPD SLSP +SRCFSSNSMDDDW FS+S+PR+SPPQ+RSP D REARRVPKF G+VS+PYPED
Subjt:  MGA--FQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPED

Query:  NRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTM
        NRNQRFNRH EGS SRFANEG +SR+RS SLSQKDFSFLEKFKLNTDN S+S EKTE  SSAQ SESMQ KQSQPQ PPEADEIF+KMKETGLIPNAV M
Subjt:  NRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTM

Query:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT
        LDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAE LDEAIRIFRKMQNNGI PNAFS+GVLIQGLYKCK++EDAV+FCIEMLESGHSPNLT
Subjt:  LDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLT

Query:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV
        TFVGLID LC EKGVDEAH+V+ETLKQKGFLINEKALREFLNKRAPFSPHIWE+
Subjt:  TFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWEV

SwissProt top hits (e value, % identity, alignment)
B0G185 AP-1 complex subunit sigma-2 | e value: 5.6e-57 | % identity: 72.34
Query:  HFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCELD
        HF+LL+SRQGK RLTKWYSP + KE+S+  RE+  M+LNR PKLCNF+EW+  K ++KRYASLYF +C D++DNEL +LEIIHH+VEILDRYFG+VCELD
Subjt:  HFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCELD

Query:  LIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVE
        LIFNFHKAYYILDEL++AGELQE+SKKTV RLI+ QD+L+E
Subjt:  LIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVE

O23685 AP-1 complex subunit sigma-2 | e value: 3.1e-79 | % identity: 91.25
Query:  LIHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCE
        +IHFVLL+SRQGKVRLTKWYSP++QKERSKVIRELSG+ILNRGPKLCNFVEWRG K VYKRYASLYFCMCIDQ+DNELE+LEIIHHYVEILDRYFGSVCE
Subjt:  LIHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCE

Query:  LDLIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVETAKEQASSISNIIAQAT
        LDLIFNFHKAYYILDELLIAGELQESSKKTVAR+I+AQD LVE AKE+ASSISNIIAQAT
Subjt:  LDLIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVETAKEQASSISNIIAQAT

Q3ZBS3 AP-1 complex subunit sigma-2 | e value: 3.7e-48 | % identity: 64.38
Query:  IHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCEL
        + F+LL SRQGK+RL KWY P S KE+ K+ REL   +L R PK+C+F+EWR LK VYKRYASLYFC  I+  DNEL  LEIIH YVE+LD+YFGSVCEL
Subjt:  IHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCEL

Query:  DLIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVETAKE
        D+IFNF KAY+ILDE L+ GE+QE+SKK V + I   D L E AKE
Subjt:  DLIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVETAKE

Q8LEZ8 AP-1 complex subunit sigma-1 | e value: 6.8e-79 | % identity: 90.06
Query:  LIHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCE
        +IHFVLL+SRQGKVRLTKWYSP++QKERSKVIRELSG+ILNRGPKLCNF+EWRG K VYKRYASLYFCMCID+ DNELE+LEIIHHYVEILDRYFGSVCE
Subjt:  LIHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCE

Query:  LDLIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVETAKEQASSISNIIAQATK
        LDLIFNFHKAYYILDELLIAGELQESSKKTVAR+I+AQD LVE AKE+ASSISNIIAQATK
Subjt:  LDLIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVETAKEQASSISNIIAQATK

Q9SZL5 Pentatricopeptide repeat-containing protein At4g38150 | e value: 6.6e-74 | % identity: 57.92
Query:  PYPEDNRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPE-ADEIFKKMKETGLI
        P P  NR  R  R S       A +     +   +LS  D  FLE+FKL  +  S    K E+               +P  PPE +DEIFKKMKE GLI
Subjt:  PYPEDNRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPE-ADEIFKKMKETGLI

Query:  PNAVTMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESG
        PNAV MLDGLCKDGL+QEAMKLFGL+R+KGTIPEVVIYTAVV+ FCKA K+++A RIFRKMQNNGI PNAFS+GVL+QGLY C  ++DAVAFC EMLESG
Subjt:  PNAVTMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESG

Query:  HSPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWE
        HSPN+ TFV L+D LC  KGV++A S ++TL QKGF +N KA++EF++KRAPF    WE
Subjt:  HSPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWE

Arabidopsis top hits (e value, % identity, alignment)
AT2G17380.1 associated protein 19 | e value: 4.9e-80 | % identity: 90.06
Query:  LIHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCE
        +IHFVLL+SRQGKVRLTKWYSP++QKERSKVIRELSG+ILNRGPKLCNF+EWRG K VYKRYASLYFCMCID+ DNELE+LEIIHHYVEILDRYFGSVCE
Subjt:  LIHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCE

Query:  LDLIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVETAKEQASSISNIIAQATK
        LDLIFNFHKAYYILDELLIAGELQESSKKTVAR+I+AQD LVE AKE+ASSISNIIAQATK
Subjt:  LDLIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVETAKEQASSISNIIAQATK

AT4G35410.1 Clathrin adaptor complex small chain family protein | e value: 2.2e-56 | % identity: 91.74
Query:  LIHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCE
        +IHFVLL+SRQGKVRLTKWYSP++QKERSKVIRELSG+ILNRGPKLCNFVEWRG K VYKRYASLYFCMCIDQ+DNELE+LEIIHHYVEILDRYFGSVCE
Subjt:  LIHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCE

Query:  LDLIFNFHK
        LDLIFNFHK
Subjt:  LDLIFNFHK

AT4G35410.2 Clathrin adaptor complex small chain family protein | e value: 2.2e-80 | % identity: 91.25
Query:  LIHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCE
        +IHFVLL+SRQGKVRLTKWYSP++QKERSKVIRELSG+ILNRGPKLCNFVEWRG K VYKRYASLYFCMCIDQ+DNELE+LEIIHHYVEILDRYFGSVCE
Subjt:  LIHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFCMCIDQDDNELEILEIIHHYVEILDRYFGSVCE

Query:  LDLIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVETAKEQASSISNIIAQAT
        LDLIFNFHKAYYILDELLIAGELQESSKKTVAR+I+AQD LVE AKE+ASSISNIIAQAT
Subjt:  LDLIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVETAKEQASSISNIIAQAT

AT4G38150.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 4.7e-75 | % identity: 57.92
Query:  PYPEDNRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPE-ADEIFKKMKETGLI
        P P  NR  R  R S       A +     +   +LS  D  FLE+FKL  +  S    K E+               +P  PPE +DEIFKKMKE GLI
Subjt:  PYPEDNRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPE-ADEIFKKMKETGLI

Query:  PNAVTMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESG
        PNAV MLDGLCKDGL+QEAMKLFGL+R+KGTIPEVVIYTAVV+ FCKA K+++A RIFRKMQNNGI PNAFS+GVL+QGLY C  ++DAVAFC EMLESG
Subjt:  PNAVTMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESG

Query:  HSPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWE
        HSPN+ TFV L+D LC  KGV++A S ++TL QKGF +N KA++EF++KRAPF    WE
Subjt:  HSPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWE

AT4G38150.2 Pentatricopeptide repeat (PPR) superfamily protein | e value: 4.7e-75 | % identity: 57.92
Query:  PYPEDNRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPE-ADEIFKKMKETGLI
        P P  NR  R  R S       A +     +   +LS  D  FLE+FKL  +  S    K E+               +P  PPE +DEIFKKMKE GLI
Subjt:  PYPEDNRNQRFNRHSEGSSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPE-ADEIFKKMKETGLI

Query:  PNAVTMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESG
        PNAV MLDGLCKDGL+QEAMKLFGL+R+KGTIPEVVIYTAVV+ FCKA K+++A RIFRKMQNNGI PNAFS+GVL+QGLY C  ++DAVAFC EMLESG
Subjt:  PNAVTMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESG

Query:  HSPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWE
        HSPN+ TFV L+D LC  KGV++A S ++TL QKGF +N KA++EF++KRAPF    WE
Subjt:  HSPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLINEKALREFLNKRAPFSPHIWE
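The unlabelled middle line of each alignment block above follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identities can be counted from that line to estimate a percent identity. The helper names `midline_stats` and `percent_identity` are hypothetical, and the result need not reproduce the % identity values listed here, since those may be computed over alignment regions not shown on the page.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, alignment_length) for one block.

    `query` is the text after 'Query:'; `midline` is the line below it with
    the same leading indent removed.
    """
    length = len(query)
    # Trailing spaces on the middle line are often stripped; pad them back.
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, length


def percent_identity(blocks: list[tuple[str, str]]) -> float:
    """Pool several (query, midline) blocks into a single percentage."""
    stats = [midline_stats(query, midline) for query, midline in blocks]
    identities = sum(s[0] for s in stats)
    length = sum(s[2] for s in stats)
    return 100.0 * identities / length


# Last block of the B0G185 alignment in this record.
query = "LIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVE"
midline = "LIFNFHKAYYILDEL++AGELQE+SKKTV RLI+ QD+L+E"
print(round(percent_identity([(query, midline)]), 2))
```

Gap characters (`-`) in the query line count toward the alignment length here, which matches how alignment columns are normally tallied.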


Sequences
CDS sequence
ATGGGGGCGTTTCAATTTCAATCTCGTCTTCGTGTTTCGAAGATTTTGTCTTCAACAATTCGGAAATGGGAAAGAGACTTTCCTCTGCTTCTTGATCCATTGCGAACGCC
GGATTTCTCACTCTCGCCGGCGCAATCTCGCTGTTTTAGCTCCAATTCCATGGATGACGATTGGGCTTTCTCAAAGTCCCAGCCGAGAACCAGCCCGCCGCAACAGCGTT
CTCCTCTGGATCATCGTGAAGCTCGTCGAGTTCCCAAATTCACTGGGGATGTATCTTCTCCGTACCCTGAGGATAATCGGAACCAACGCTTTAATAGACATTCGGAAGGT
TCCTCGTCTCGATTCGCAAATGAAGGGTCGATTTCGCGGCAAAGAAGTGCGTCTTTGTCTCAGAAGGATTTTAGCTTTCTTGAAAAATTCAAACTGAACACTGATAATCA
GAGTAGTAGTAAAGAGAAGACTGAGGAGATCTCTTCTGCTCAGCTTTCTGAATCAATGCAGGAGAAGCAGTCACAACCGCAGCGTCCTCCAGAAGCCGATGAGATATTCA
AGAAAATGAAGGAGACTGGTCTAATTCCCAACGCTGTCACTATGCTTGATGGGCTTTGTAAAGATGGACTTATTCAAGAAGCAATGAAACTATTTGGTTTGATCCGTGAG
AAGGGTACAATTCCAGAAGTTGTGATTTACACTGCTGTCGTTGATGGGTTTTGCAAGGCGGAGAAGCTTGATGAAGCAATTAGGATTTTCAGGAAAATGCAGAATAATGG
TATTCCTCCCAATGCCTTCAGTTTCGGCGTCTTGATACAGGGACTGTACAAATGCAAAAGAATAGAGGATGCTGTAGCATTTTGCATTGAGATGTTAGAATCTGGGCATT
CTCCAAATCTTACCACTTTTGTTGGCCTAATTGATGGGTTATGCAATGAAAAGGGCGTGGACGAAGCTCATAGTGTCGTAGAAACCTTGAAACAGAAGGGATTCTTGATT
AATGAGAAAGCTTTAAGAGAATTTTTGAATAAAAGAGCCCCATTTTCACCACATATCTGGGAAGTGAAGCTACCAATCATATCACTGAGAAGAAACTACAAGTTGAAAAT
ATTGGAACTCAGTAGCCGGAATGTGAGAAGAGTGAAGAATATGGAAGAGACAGACCATCTGGCTTTGGAGAGGAGAATGAGAAAGAAAGCCCACGTTGCAAGAGGTTCAA
TGTGCATAGCCACCATTGTCTTCTTAACTTGTGTATGGATCCCTCTGTATCAACTGAAGGCAGCCATCGCGAGCATGTTCGACATTCTGTCGGGCAGGACGTGTCAGATG
CTCGCAACAGCGTCGCCACTGTGTGAGGTAAAACTTATGGTTTGTAGGTTCAAAGAGCTGCAGCGTGTGGCCGGATCGGAGGAGGAGATTTGCTCCGTTTGCTTGGCGGA
GTTCACCGGAGAAGATTTGATTCACTTTGTGCTTCTTATAAGTAGACAAGGAAAAGTGAGATTGACAAAATGGTACTCTCCTCATTCTCAGAAGGAGAGATCTAAGGTTA
TCCGTGAGCTTAGTGGAATGATTCTAAATCGGGGGCCTAAGCTTTGTAACTTTGTGGAATGGAGGGGACTCAAAGCTGTCTATAAAAGATATGCTAGTCTTTACTTTTGC
ATGTGCATTGATCAGGATGACAATGAATTAGAGATCCTTGAAATTATTCATCATTACGTTGAGATTTTGGACCGTTACTTTGGCAGTGTCTGTGAGTTGGATTTGATCTT
CAACTTTCACAAGGCCTATTATATATTGGATGAGCTTCTAATTGCTGGCGAACTCCAAGAATCAAGCAAGAAAACAGTAGCTAGATTGATAGCCGCACAGGATTCGTTGG
TGGAGACTGCAAAGGAGCAAGCAAGTTCAATAAGTAACATAATTGCACAGGCCACCAAGTAG
mRNA sequence
ATGGGGGCGTTTCAATTTCAATCTCGTCTTCGTGTTTCGAAGATTTTGTCTTCAACAATTCGGAAATGGGAAAGAGACTTTCCTCTGCTTCTTGATCCATTGCGAACGCC
GGATTTCTCACTCTCGCCGGCGCAATCTCGCTGTTTTAGCTCCAATTCCATGGATGACGATTGGGCTTTCTCAAAGTCCCAGCCGAGAACCAGCCCGCCGCAACAGCGTT
CTCCTCTGGATCATCGTGAAGCTCGTCGAGTTCCCAAATTCACTGGGGATGTATCTTCTCCGTACCCTGAGGATAATCGGAACCAACGCTTTAATAGACATTCGGAAGGT
TCCTCGTCTCGATTCGCAAATGAAGGGTCGATTTCGCGGCAAAGAAGTGCGTCTTTGTCTCAGAAGGATTTTAGCTTTCTTGAAAAATTCAAACTGAACACTGATAATCA
GAGTAGTAGTAAAGAGAAGACTGAGGAGATCTCTTCTGCTCAGCTTTCTGAATCAATGCAGGAGAAGCAGTCACAACCGCAGCGTCCTCCAGAAGCCGATGAGATATTCA
AGAAAATGAAGGAGACTGGTCTAATTCCCAACGCTGTCACTATGCTTGATGGGCTTTGTAAAGATGGACTTATTCAAGAAGCAATGAAACTATTTGGTTTGATCCGTGAG
AAGGGTACAATTCCAGAAGTTGTGATTTACACTGCTGTCGTTGATGGGTTTTGCAAGGCGGAGAAGCTTGATGAAGCAATTAGGATTTTCAGGAAAATGCAGAATAATGG
TATTCCTCCCAATGCCTTCAGTTTCGGCGTCTTGATACAGGGACTGTACAAATGCAAAAGAATAGAGGATGCTGTAGCATTTTGCATTGAGATGTTAGAATCTGGGCATT
CTCCAAATCTTACCACTTTTGTTGGCCTAATTGATGGGTTATGCAATGAAAAGGGCGTGGACGAAGCTCATAGTGTCGTAGAAACCTTGAAACAGAAGGGATTCTTGATT
AATGAGAAAGCTTTAAGAGAATTTTTGAATAAAAGAGCCCCATTTTCACCACATATCTGGGAAGTGAAGCTACCAATCATATCACTGAGAAGAAACTACAAGTTGAAAAT
ATTGGAACTCAGTAGCCGGAATGTGAGAAGAGTGAAGAATATGGAAGAGACAGACCATCTGGCTTTGGAGAGGAGAATGAGAAAGAAAGCCCACGTTGCAAGAGGTTCAA
TGTGCATAGCCACCATTGTCTTCTTAACTTGTGTATGGATCCCTCTGTATCAACTGAAGGCAGCCATCGCGAGCATGTTCGACATTCTGTCGGGCAGGACGTGTCAGATG
CTCGCAACAGCGTCGCCACTGTGTGAGGTAAAACTTATGGTTTGTAGGTTCAAAGAGCTGCAGCGTGTGGCCGGATCGGAGGAGGAGATTTGCTCCGTTTGCTTGGCGGA
GTTCACCGGAGAAGATTTGATTCACTTTGTGCTTCTTATAAGTAGACAAGGAAAAGTGAGATTGACAAAATGGTACTCTCCTCATTCTCAGAAGGAGAGATCTAAGGTTA
TCCGTGAGCTTAGTGGAATGATTCTAAATCGGGGGCCTAAGCTTTGTAACTTTGTGGAATGGAGGGGACTCAAAGCTGTCTATAAAAGATATGCTAGTCTTTACTTTTGC
ATGTGCATTGATCAGGATGACAATGAATTAGAGATCCTTGAAATTATTCATCATTACGTTGAGATTTTGGACCGTTACTTTGGCAGTGTCTGTGAGTTGGATTTGATCTT
CAACTTTCACAAGGCCTATTATATATTGGATGAGCTTCTAATTGCTGGCGAACTCCAAGAATCAAGCAAGAAAACAGTAGCTAGATTGATAGCCGCACAGGATTCGTTGG
TGGAGACTGCAAAGGAGCAAGCAAGTTCAATAAGTAACATAATTGCACAGGCCACCAAGTAGGAATTCTTGAGTTTATGTTTCAGTTCTTGTGAAATTGATTTGTTCTGA
AGTATTGTAATTATATGTTATCTGAACTTGTATCAAACAATTGAATGCCTTATTTTTGCTTATTGCATTTGTAGTATTACACAAATTGTTTTTCTGATTTTCTAGTGATT
CCATCTACCTGGCTAAATATTTCCTAGATTGTTCTATTGTTAATCACAAATACAAACGTTATTGTTTGCTAGAAGTTTTTTCTCTCTTTT
Protein sequence
MGAFQFQSRLRVSKILSSTIRKWERDFPLLLDPLRTPDFSLSPAQSRCFSSNSMDDDWAFSKSQPRTSPPQQRSPLDHREARRVPKFTGDVSSPYPEDNRNQRFNRHSEG
SSSRFANEGSISRQRSASLSQKDFSFLEKFKLNTDNQSSSKEKTEEISSAQLSESMQEKQSQPQRPPEADEIFKKMKETGLIPNAVTMLDGLCKDGLIQEAMKLFGLIRE
KGTIPEVVIYTAVVDGFCKAEKLDEAIRIFRKMQNNGIPPNAFSFGVLIQGLYKCKRIEDAVAFCIEMLESGHSPNLTTFVGLIDGLCNEKGVDEAHSVVETLKQKGFLI
NEKALREFLNKRAPFSPHIWEVKLPIISLRRNYKLKILELSSRNVRRVKNMEETDHLALERRMRKKAHVARGSMCIATIVFLTCVWIPLYQLKAAIASMFDILSGRTCQM
LATASPLCEVKLMVCRFKELQRVAGSEEEICSVCLAEFTGEDLIHFVLLISRQGKVRLTKWYSPHSQKERSKVIRELSGMILNRGPKLCNFVEWRGLKAVYKRYASLYFC
MCIDQDDNELEILEIIHHYVEILDRYFGSVCELDLIFNFHKAYYILDELLIAGELQESSKKTVARLIAAQDSLVETAKEQASSISNIIAQATK
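The protein sequence above should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard genetic code; the `translate` function is hypothetical and is not part of CuGenDB. It strips the line breaks used in this record, reads codons in frame from the first base, and stops at the first stop codon.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = ambiguous codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 33 bases of the CDS above.
print(translate("ATGGGGGCGTTTCAATTTCAATCTCGTCTTCGT"))  # MGAFQFQSRLR
```

Passing the full CDS text (line breaks included) and comparing the result with the protein sequence is one way to confirm the two listings are consistent.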