; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0013130 (gene) of Chayote v1 genome

Gene IDSed0013130
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein isoform 2
Genome locationLG07:12131324..12140494
RNA-Seq ExpressionSed0013130
SyntenySed0013130
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577138.1 hypothetical protein SDJN03_24712, partial [Cucurbita argyrosperma subsp. sororia]3.2e-27186.53Show/hide
Query:  MILNLASPWLTLTRF-PPPKLIEPVLPPNNG-----------GVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRL
        MIL+L+SPWLT+TR  PPPKLIEP    +NG             LF FTSFSKS RVRAS N  +  GAAAFE+PVSELLD+ELIG VS ++DA E LRL
Subjt:  MILNLASPWLTLTRF-PPPKLIEPVLPPNNG-----------GVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRL

Query:  IADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVG
        IAD+SGRNGGTVSV DCRLIIAAAL+RNN ELALSVFYAMRSSFY   AWEGV++N SS+ERWKW+RPDV VYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGKVVQCPSCM+A+AVAQPQHGIQIVSCAKCRYQYELISGNIV+IESEEISM+TPAWEKALRFLN++K+K+PAAVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPR
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAK  S LL PS LFPLI LS AGDAASGV+DPSLPR
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPR

Query:  LLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
        LLLVAGFASLAAGATLNSFI PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR+GLEN
Subjt:  LLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN

Query:  SLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTE
        SLKQRIEL+ESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEERW++QAEANDEAERL  QSMPTE
Subjt:  SLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTE

XP_022136767.1 uncharacterized protein LOC111008391 isoform X1 [Momordica charantia]6.5e-27286.01Show/hide
Query:  MILNLASPWLTLT-----RFPPPKLIEPVLPPNNGGV----------LFAFTSFSKSVRVRASSN----GGDDGGAAAFESPVSELLDNELIGAVSAAQD
        MIL+  SPWLT+T       PPPKLIEP+  PNNG V          LFAFTSFSKS  V+ASSN    G   G AAA E+PVSELLD EL+G VS A+D
Subjt:  MILNLASPWLTLT-----RFPPPKLIEPVLPPNNGGV----------LFAFTSFSKSVRVRASSN----GGDDGGAAAFESPVSELLDNELIGAVSAAQD

Query:  AGEALRLIADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMI
        AGEAL +IAD+SGR+GGTV+V DCRLIIAAALERNNPELALSVFYAMRSSFY A AWEGV++N+SS+ERWKWSRPDV VYTLLIQGLAASLRVSDALRMI
Subjt:  AGEALRLIADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMI

Query:  EIICRVGVSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSG
        EIICRVGVSPAEEVPFGKV+QCP CM+AIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISM+TPAWEKALRFLNI+K++IPAA+HSIVVQTPSG
Subjt:  EIICRVGVSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSG

Query:  VARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGV
        VARTQKFATETADLPAREGERVTIAAAAPSNVFREVGP KFSPKDP  YSGEPMCLTNH+DGRESLLLRVPAKG SSLLNPS LFPLIALSAAGDAA+GV
Subjt:  VARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGV

Query:  IDPSLPRLLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKK
        +DPSLPRLLLV GFASLAAGATLNSF+ PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKK
Subjt:  IDPSLPRLLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKK

Query:  VREGLENSLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTE
        VREGLENSLKQRIEL+ESYARISSMIEIEVEMESDVIAA AASSVERV+EQIEQIM+LENLEE+WK+QAEANDEAERLL QSMPTE
Subjt:  VREGLENSLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTE

XP_022931517.1 uncharacterized protein LOC111437671 isoform X1 [Cucurbita moschata]2.2e-27286.57Show/hide
Query:  MILNLASPWLTLTRF-PPPKLIEPVLPPNNG-----------GVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRL
        MIL+L+SPWLT+TR  PPPKLIEP+   +NG             LF FTSFSKS RVRAS N  +  GAAAFE+PVSELLD+ELIG VS A+DA E LRL
Subjt:  MILNLASPWLTLTRF-PPPKLIEPVLPPNNG-----------GVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRL

Query:  IADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVG
        IAD+SGRNGGTVSV DCRLIIAAAL+RNN ELALSVFYAMRSSFY   AWEGV++N SS+ERWKW+RPDV VYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGKVVQCPSCM+A+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISM+TPAWEKALRFLN++K+K+PAAVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPR
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAK  S LL PS LFPLI LS AGD +SGV+DPSLPR
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPR

Query:  LLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
        LLLVAGFASLAAGATLNSFI PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
Subjt:  LLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN

Query:  SLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV
        SLKQRIEL+ESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEERW++QAEANDEAERL  QSMPTE V
Subjt:  SLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV

XP_038895173.1 uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida]1.0e-27286.38Show/hide
Query:  MILNLASPWLTLTRFPPPKLIEPVLPPNNGG-----------VLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLI
        MILNL SPWL +TR PPPKL EP+    NG             LFAFTSFSKS++VRAS +G D  GAAAFE+PVS+LL NELI AVS A+DA EALR+I
Subjt:  MILNLASPWLTLTRFPPPKLIEPVLPPNNGG-----------VLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLI

Query:  ADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGV
        AD+SGR+GGTVS  DC LIIAAAL+ NNPELALSVFYAMRS+FY   AWEGV+EN+S++ERWKWSRPDV VYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVQCPSCM+A+AVAQPQHGIQIVSCA+CRY+YELISGNIVNI+SEEISM+TPAWEKALRFLNI+K+KIPAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGEPMCLTNHSDGRESLLLRVPAKG SSLLNPS LFPLI LSAAGDAASGV+DPSLP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAG ASLAAGATLNS I PQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENS
Subjt:  LLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV
        LKQRIEL+ESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIM LENLEERWK+QAEANDEAERLL QSMPTE V
Subjt:  LKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV

XP_038895181.1 uncharacterized protein LOC120083467 isoform X3 [Benincasa hispida]3.2e-27186.38Show/hide
Query:  MILNLASPWLTLTRFPPPKLIEPVLPPNNGG-----------VLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLI
        MILNL SPWL +TR PPPKL EP+    NG             LFAFTSFSKS++VRAS +G D  GAAAFE+PVS+LL NELI AVS A+DA EALR+I
Subjt:  MILNLASPWLTLTRFPPPKLIEPVLPPNNGG-----------VLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLI

Query:  ADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGV
        AD+SGR+GGTVS  DC LIIAAAL+ NNPELALSVFYAMRS+FY   AWEGV+EN+S++ERWKWSRPDV VYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVQCPSCM+A+AVAQPQHGIQIVSCA+CRY+YELISGNIVNI+SEEISM+TPAWEKALRFLNI+K+KIPAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGEPMCLTNHSDGRESLLLRVPAKG SSLLNPS LFPLI LSAAGDAASGV+DPSLP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAG ASLAAGATLNS I PQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENS
Subjt:  LLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV
        LKQRIEL+ESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIM LENLEERWK+QAEANDEAERLL QSMPTE V
Subjt:  LKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV

TrEMBL top hitse value%identityAlignment
A0A6J1C6D2 uncharacterized protein LOC111008391 isoform X13.2e-27286.01Show/hide
Query:  MILNLASPWLTLT-----RFPPPKLIEPVLPPNNGGV----------LFAFTSFSKSVRVRASSN----GGDDGGAAAFESPVSELLDNELIGAVSAAQD
        MIL+  SPWLT+T       PPPKLIEP+  PNNG V          LFAFTSFSKS  V+ASSN    G   G AAA E+PVSELLD EL+G VS A+D
Subjt:  MILNLASPWLTLT-----RFPPPKLIEPVLPPNNGGV----------LFAFTSFSKSVRVRASSN----GGDDGGAAAFESPVSELLDNELIGAVSAAQD

Query:  AGEALRLIADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMI
        AGEAL +IAD+SGR+GGTV+V DCRLIIAAALERNNPELALSVFYAMRSSFY A AWEGV++N+SS+ERWKWSRPDV VYTLLIQGLAASLRVSDALRMI
Subjt:  AGEALRLIADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMI

Query:  EIICRVGVSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSG
        EIICRVGVSPAEEVPFGKV+QCP CM+AIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISM+TPAWEKALRFLNI+K++IPAA+HSIVVQTPSG
Subjt:  EIICRVGVSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSG

Query:  VARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGV
        VARTQKFATETADLPAREGERVTIAAAAPSNVFREVGP KFSPKDP  YSGEPMCLTNH+DGRESLLLRVPAKG SSLLNPS LFPLIALSAAGDAA+GV
Subjt:  VARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGV

Query:  IDPSLPRLLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKK
        +DPSLPRLLLV GFASLAAGATLNSF+ PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKK
Subjt:  IDPSLPRLLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKK

Query:  VREGLENSLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTE
        VREGLENSLKQRIEL+ESYARISSMIEIEVEMESDVIAA AASSVERV+EQIEQIM+LENLEE+WK+QAEANDEAERLL QSMPTE
Subjt:  VREGLENSLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTE

A0A6J1ETW5 uncharacterized protein LOC111437671 isoform X11.1e-27286.57Show/hide
Query:  MILNLASPWLTLTRF-PPPKLIEPVLPPNNG-----------GVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRL
        MIL+L+SPWLT+TR  PPPKLIEP+   +NG             LF FTSFSKS RVRAS N  +  GAAAFE+PVSELLD+ELIG VS A+DA E LRL
Subjt:  MILNLASPWLTLTRF-PPPKLIEPVLPPNNG-----------GVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRL

Query:  IADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVG
        IAD+SGRNGGTVSV DCRLIIAAAL+RNN ELALSVFYAMRSSFY   AWEGV++N SS+ERWKW+RPDV VYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGKVVQCPSCM+A+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISM+TPAWEKALRFLN++K+K+PAAVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPR
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAK  S LL PS LFPLI LS AGD +SGV+DPSLPR
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPR

Query:  LLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
        LLLVAGFASLAAGATLNSFI PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
Subjt:  LLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN

Query:  SLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV
        SLKQRIEL+ESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEERW++QAEANDEAERL  QSMPTE V
Subjt:  SLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV

A0A6J1EUG6 uncharacterized protein LOC111437671 isoform X37.8e-27187.39Show/hide
Query:  MILNLASPWLTLTRF-PPPKLIEPVLPPNNG-GVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLIADRSGRNGG
        MIL+L+SPWLT+TR  PPPKLIEP+   +NG  VL      SKS RVRAS N  +  GAAAFE+PVSELLD+ELIG VS A+DA E LRLIAD+SGRNGG
Subjt:  MILNLASPWLTLTRF-PPPKLIEPVLPPNNG-GVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLIADRSGRNGG

Query:  TVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFG
        TVSV DCRLIIAAAL+RNN ELALSVFYAMRSSFY   AWEGV++N SS+ERWKW+RPDV VYTLLIQGLAASLRVSDALR+IEIICRVGVSPAEEVPFG
Subjt:  TVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFG

Query:  KVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFATETADLPAR
        KVVQCPSCM+A+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISM+TPAWEKALRFLN++K+K+PAAVHSIVVQTPSGVARTQKFATETADLPAR
Subjt:  KVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFATETADLPAR

Query:  EGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRLLLVAGFASL
        EGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAK  S LL PS LFPLI LS AGD +SGV+DPSLPRLLLVAGFASL
Subjt:  EGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRLLLVAGFASL

Query:  AAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELVE
        AAGATLNSFI PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIEL+E
Subjt:  AAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELVE

Query:  SYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV
        SYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEERW++QAEANDEAERL  QSMPTE V
Subjt:  SYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV

A0A6J1EYW6 uncharacterized protein LOC111437671 isoform X23.5e-27186.57Show/hide
Query:  MILNLASPWLTLTRF-PPPKLIEPVLPPNNG-----------GVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRL
        MIL+L+SPWLT+TR  PPPKLIEP+   +NG             LF FTSFSKS RVRAS N  +  GAAAFE+PVSELLD+ELIG VS A+DA E LRL
Subjt:  MILNLASPWLTLTRF-PPPKLIEPVLPPNNG-----------GVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRL

Query:  IADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVG
        IAD+SGRNGGTVSV DCRLIIAAAL+RNN ELALSVFYAMRSSFY   AWEGV++N SS+ERWKW+RPDV VYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGKVVQCPSCM+A+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISM+TPAWEKALRFLN++K+K+PAAVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPR
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAK  S LL PS LFPLI LS AGD +SGV+DPSLPR
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPR

Query:  LLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
        LLLVAGFASLAAGATLNSFI PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
Subjt:  LLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN

Query:  SLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV
        SLKQRIEL+ESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEERW++QAEANDEAERL  QSMPTE V
Subjt:  SLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV

A0A6J1J4R1 uncharacterized protein LOC111483407 isoform X15.0e-27085.52Show/hide
Query:  MILNLASPWLTLTRFPPPKLIEPVLPPNNG-----------GVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLI
        MIL+L+SPWLT+TR P PKLIEP+   +NG              F FTSFS+S RVRAS N  +  GAAAFE+PVS+LLD+ELI  VS A+DA E LR+I
Subjt:  MILNLASPWLTLTRFPPPKLIEPVLPPNNG-----------GVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLI

Query:  ADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGV
        A++SGRNGGTVSV DCRLIIAAAL+RNN ELALSVFYAMRSSFY   AWEGV++N SS+ERWKW+RPDV VYTLLIQGLAASLRVSDALR+IEIICRVGV
Subjt:  ADRSGRNGGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVQCPSCM+A+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISM+TPAWEKALRFLN++K+K+PAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLL+RVPAK  S LL PS LFPLI LS AGDAASGV+DPSLPR+
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSFI PQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV
        LKQRIEL+ESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEERW++QAEANDEAERL  QSMPTE V
Subjt:  LKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAEANDEAERLLKQSMPTETV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64430.1 Pentatricopeptide repeat (PPR) superfamily protein6.6e-19067.98Show/hide
Query:  DDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLIADRSGRN-GGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERW
        D  G+AA  S  S +LD+EL+ +VSA +DA EAL +I+DR G N GG V + DCR II+AA+ R N +LALS+FY MR+SF             S  +RW
Subjt:  DDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLIADRSGRN-GGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERW

Query:  KWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETP
         WSRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC+IAIAVAQPQHG+QIVSCA CRYQYEL SG+I +I+SEE+  + P
Subjt:  KWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETP

Query:  AWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRV
         WEK LR + I K KI ++VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN Y GEPM LT H DGRES+LLR 
Subjt:  AWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRV

Query:  PAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRLLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEV
        P+K    +L PS L PL+A+ A GDAASGVIDPSLP+LL VA   SLA GAT+NSF+ P+ N+LP+R+VD++ IKQQLLSQY+VLQ RIRDLK A EKEV
Subjt:  PAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRLLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEV

Query:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAE
        WMLARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+L++SYARISSMIEIEVEM+SDV+AAEA ++ E +A+QIEQIM LENLEE+WKIQAE
Subjt:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAE

Query:  ANDEAERLL
        ANDEAERLL
Subjt:  ANDEAERLL

AT1G64430.2 Pentatricopeptide repeat (PPR) superfamily protein6.6e-19067.98Show/hide
Query:  DDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLIADRSGRN-GGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERW
        D  G+AA  S  S +LD+EL+ +VSA +DA EAL +I+DR G N GG V + DCR II+AA+ R N +LALS+FY MR+SF             S  +RW
Subjt:  DDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLIADRSGRN-GGTVSVLDCRLIIAAALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERW

Query:  KWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETP
         WSRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC+IAIAVAQPQHG+QIVSCA CRYQYEL SG+I +I+SEE+  + P
Subjt:  KWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMETP

Query:  AWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRV
         WEK LR + I K KI ++VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN Y GEPM LT H DGRES+LLR 
Subjt:  AWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHSDGRESLLLRV

Query:  PAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRLLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEV
        P+K    +L PS L PL+A+ A GDAASGVIDPSLP+LL VA   SLA GAT+NSF+ P+ N+LP+R+VD++ IKQQLLSQY+VLQ RIRDLK A EKEV
Subjt:  PAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRLLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEV

Query:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAE
        WMLARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+L++SYARISSMIEIEVEM+SDV+AAEA ++ E +A+QIEQIM LENLEE+WKIQAE
Subjt:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQAE

Query:  ANDEAERLL
        ANDEAERLL
Subjt:  ANDEAERLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTCAACTTGGCTTCGCCATGGCTCACCCTCACTCGCTTCCCTCCTCCCAAGCTCATCGAACCAGTCCTTCCTCCCAACAATGGCGGCGTACTCTTTGCGTTCAC
TTCCTTTTCCAAGTCGGTGCGAGTTAGAGCTTCTTCCAACGGCGGAGACGACGGCGGTGCTGCGGCTTTTGAGAGTCCTGTTTCGGAGTTGCTAGACAATGAGCTGATTG
GGGCTGTTTCGGCTGCTCAGGATGCCGGTGAAGCGTTGCGCTTGATTGCTGATAGGTCGGGGAGAAATGGAGGTACTGTATCGGTTTTGGATTGTCGGTTGATTATTGCG
GCTGCGCTTGAGCGTAACAATCCTGAGCTTGCTTTGTCTGTGTTCTACGCTATGCGCTCTAGTTTCTATCCAGCTGCTGCATGGGAAGGTGTTAGTGAAAATTCTTCCTC
TATTGAGAGATGGAAATGGTCCAGGCCAGATGTCCTTGTGTATACATTGCTGATTCAAGGTCTGGCAGCATCCTTGAGGGTTTCTGATGCTCTTAGAATGATTGAGATTA
TTTGCCGAGTTGGTGTTTCACCGGCAGAGGAGGTCCCATTTGGAAAGGTAGTGCAATGTCCCAGTTGTATGATAGCAATTGCAGTTGCACAACCCCAGCACGGTATTCAG
ATTGTATCCTGTGCAAAGTGCCGCTACCAGTATGAACTTATATCTGGAAACATAGTTAATATTGAGTCAGAGGAAATTAGCATGGAAACTCCAGCATGGGAAAAAGCACT
CCGATTCTTGAATATAGTGAAGAAAAAAATCCCAGCTGCTGTCCACTCCATTGTGGTACAAACTCCTTCTGGAGTGGCTCGAACCCAGAAGTTTGCTACTGAAACAGCAG
ATCTCCCAGCCCGAGAAGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCGTCAAATGTATTCAGAGAAGTTGGTCCAATTAAATTTAGTCCAAAGGATCCGAATTTGTAC
TCTGGAGAGCCTATGTGCCTGACAAATCATAGTGATGGCCGGGAATCACTATTATTAAGAGTGCCAGCAAAGGGAGCCTCATCCTTACTAAACCCATCGATTCTCTTTCC
ACTCATAGCTTTGTCTGCCGCTGGAGATGCTGCTTCTGGAGTTATTGACCCCAGCTTGCCTCGGTTGCTGTTAGTTGCTGGGTTTGCTTCTCTGGCTGCAGGAGCTACTT
TGAATTCATTTATTTTTCCTCAATTCAATCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAGCTTTTATCTCAATATAATGTGCTTCAATCTCGTATC
AGGGATTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCCCGGATGTGCCAATTAGAGAACAAAATTTTTGCCGTAGGAGAACCTTCTTACCGCGCACGTAGAAG
TAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAGTAGAAAGCTATGCAAGGATTTCCTCAATGATTGAAATCGAAGTTGAAATGG
AGTCTGATGTTATTGCTGCCGAAGCAGCCAGCAGTGTGGAAAGGGTTGCTGAACAGATTGAGCAAATCATGGTACTGGAAAATCTAGAAGAGAGATGGAAAATACAAGCA
GAAGCCAACGATGAAGCCGAAAGACTTCTCAAACAATCAATGCCAACCGAAACAGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATTCTCAACTTGGCTTCGCCATGGCTCACCCTCACTCGCTTCCCTCCTCCCAAGCTCATCGAACCAGTCCTTCCTCCCAACAATGGCGGCGTACTCTTTGCGTTCAC
TTCCTTTTCCAAGTCGGTGCGAGTTAGAGCTTCTTCCAACGGCGGAGACGACGGCGGTGCTGCGGCTTTTGAGAGTCCTGTTTCGGAGTTGCTAGACAATGAGCTGATTG
GGGCTGTTTCGGCTGCTCAGGATGCCGGTGAAGCGTTGCGCTTGATTGCTGATAGGTCGGGGAGAAATGGAGGTACTGTATCGGTTTTGGATTGTCGGTTGATTATTGCG
GCTGCGCTTGAGCGTAACAATCCTGAGCTTGCTTTGTCTGTGTTCTACGCTATGCGCTCTAGTTTCTATCCAGCTGCTGCATGGGAAGGTGTTAGTGAAAATTCTTCCTC
TATTGAGAGATGGAAATGGTCCAGGCCAGATGTCCTTGTGTATACATTGCTGATTCAAGGTCTGGCAGCATCCTTGAGGGTTTCTGATGCTCTTAGAATGATTGAGATTA
TTTGCCGAGTTGGTGTTTCACCGGCAGAGGAGGTCCCATTTGGAAAGGTAGTGCAATGTCCCAGTTGTATGATAGCAATTGCAGTTGCACAACCCCAGCACGGTATTCAG
ATTGTATCCTGTGCAAAGTGCCGCTACCAGTATGAACTTATATCTGGAAACATAGTTAATATTGAGTCAGAGGAAATTAGCATGGAAACTCCAGCATGGGAAAAAGCACT
CCGATTCTTGAATATAGTGAAGAAAAAAATCCCAGCTGCTGTCCACTCCATTGTGGTACAAACTCCTTCTGGAGTGGCTCGAACCCAGAAGTTTGCTACTGAAACAGCAG
ATCTCCCAGCCCGAGAAGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCGTCAAATGTATTCAGAGAAGTTGGTCCAATTAAATTTAGTCCAAAGGATCCGAATTTGTAC
TCTGGAGAGCCTATGTGCCTGACAAATCATAGTGATGGCCGGGAATCACTATTATTAAGAGTGCCAGCAAAGGGAGCCTCATCCTTACTAAACCCATCGATTCTCTTTCC
ACTCATAGCTTTGTCTGCCGCTGGAGATGCTGCTTCTGGAGTTATTGACCCCAGCTTGCCTCGGTTGCTGTTAGTTGCTGGGTTTGCTTCTCTGGCTGCAGGAGCTACTT
TGAATTCATTTATTTTTCCTCAATTCAATCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAGCTTTTATCTCAATATAATGTGCTTCAATCTCGTATC
AGGGATTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCCCGGATGTGCCAATTAGAGAACAAAATTTTTGCCGTAGGAGAACCTTCTTACCGCGCACGTAGAAG
TAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAGTAGAAAGCTATGCAAGGATTTCCTCAATGATTGAAATCGAAGTTGAAATGG
AGTCTGATGTTATTGCTGCCGAAGCAGCCAGCAGTGTGGAAAGGGTTGCTGAACAGATTGAGCAAATCATGGTACTGGAAAATCTAGAAGAGAGATGGAAAATACAAGCA
GAAGCCAACGATGAAGCCGAAAGACTTCTCAAACAATCAATGCCAACCGAAACAGTTTAG
Protein sequenceShow/hide protein sequence
MILNLASPWLTLTRFPPPKLIEPVLPPNNGGVLFAFTSFSKSVRVRASSNGGDDGGAAAFESPVSELLDNELIGAVSAAQDAGEALRLIADRSGRNGGTVSVLDCRLIIA
AALERNNPELALSVFYAMRSSFYPAAAWEGVSENSSSIERWKWSRPDVLVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMIAIAVAQPQHGIQ
IVSCAKCRYQYELISGNIVNIESEEISMETPAWEKALRFLNIVKKKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLY
SGEPMCLTNHSDGRESLLLRVPAKGASSLLNPSILFPLIALSAAGDAASGVIDPSLPRLLLVAGFASLAAGATLNSFIFPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRI
RDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELVESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEERWKIQA
EANDEAERLLKQSMPTETV