; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012838 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012838
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationscaffold63:3870001..3875837
RNA-Seq ExpressionMS012838
SyntenyMS012838
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136767.1 uncharacterized protein LOC111008391 isoform X1 [Momordica charantia]4.1e-28795.21Show/hide
Query:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNGGVLMPLLLCSHALFAFTSFSKCTRSVKASSNDR----GSGSAAASENPVSELLDEEFLGDVSGAK
        MILSSTSPWLT+TLTRLPLS PPKLIEPL S NNG VLMPLLLCS ALFAFTSFSK +RSVKASSNDR    GSGSAAASENPVSELLDEE LGDVSGAK
Subjt:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNGGVLMPLLLCSHALFAFTSFSKCTRSVKASSNDR----GSGSAAASENPVSELLDEEFLGDVSGAK

Query:  DAGEALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRM
        DAGEALLVIAD+SGRSGGTV+VSDC LIIAAALERNNPELALSVFYAMRSSFYQATA  GVNQNASSVERWKWSRPDVHVYTLLI GLAASLRVSDALRM
Subjt:  DAGEALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRM

Query:  IEIICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPS
        IEIICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPS
Subjt:  IEIICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPS

Query:  GVARTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASG
        GVARTQKFATETADLPA+EGERVTIAAAAPSNVFREVGPFKFSPKDP FYSGE MCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAA+G
Subjt:  GVARTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASG

Query:  VIDPSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIK
        V+DPSLPRLLLV GFAS+AAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIK
Subjt:  VIDPSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIK

Query:  KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE
        KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAA AASSVERVSEQIEQIMMLENLEE
Subjt:  KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE

XP_022931517.1 uncharacterized protein LOC111437671 isoform X1 [Cucurbita moschata]8.1e-25986.99Show/hide
Query:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNG-GVLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG
        MIL  +SPW  LT+TRLP   PPKLIEPLASA+NG  VLMPLLLCSHALF FTSFSK TR V+AS N      AAA ENPVSELLD+E +G VSGAKDA 
Subjt:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNG-GVLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG

Query:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI
        E L +IAD+SGR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+ TA  GVN N SSVERWKW+RPDVHVYTLLI GLAASLRVSDALR+IEI
Subjt:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI

Query:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA
        ICRVGVSPAEEVPFGKV+QCP CMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLN+MK+++PAA+HSIVVQTPSGVA
Subjt:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA

Query:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID
        RTQKFATETADLPA+EGERVTIAAAAPSNV+REVGP KFSPKDPN YSGE MCLTNH+DGRESLLLRVPAK TS LL PS LFPLI LS AGD +SGV+D
Subjt:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID

Query:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR
        PSLPRLLLVAGFAS+AAGATLNSF+LPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR
Subjt:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR

Query:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE
        EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM+LENLEE
Subjt:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE

XP_038895166.1 uncharacterized protein LOC120083467 isoform X1 [Benincasa hispida]5.3e-25885.99Show/hide
Query:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNGG-VLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG
        MIL+ TSPW  L +TRLP   PPKL EPLASA NG  VLMPLLLCSHALFAFTSFSK +  V+AS +      AAA ENPVS+LL  E +  VSGAKDA 
Subjt:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNGG-VLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG

Query:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQ----------ATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLR
        EAL +IAD+SGRSGGTVS SDCCLIIAAAL+ NNPELALSVFYAMRS+FYQ           TA  GVN+NAS+VERWKWSRPDVHVYTLLI GLAASLR
Subjt:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQ----------ATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLR

Query:  VSDALRMIEIICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHS
        VSDALRMIEIICRVGVSPAEEVPFGKV+QCP CMVA+AVAQPQHGIQIVSCA+CRY+YELISGNIVNI+SEEISMDTPAWEKALRFLNIMKR+IPAA+HS
Subjt:  VSDALRMIEIICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHS

Query:  IVVQTPSGVARTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSA
        IVVQTPSGVARTQKFATETADLPA+EGERVTIAAAAPSNVFREVGP KFSPKDPNFYSGE MCLTNH+DGRESLLLRVPAKGTSSLLNPSTLFPLI LSA
Subjt:  IVVQTPSGVARTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSA

Query:  AGDAASGVIDPSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYR
        AGDAASGV+DPSLP+LLLVAG AS+AAGATLNS +LPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYR
Subjt:  AGDAASGVIDPSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYR

Query:  ARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE
        ARRSRI+KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE
Subjt:  ARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE

XP_038895173.1 uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida]1.9e-26087.52Show/hide
Query:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNGG-VLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG
        MIL+ TSPW  L +TRLP   PPKL EPLASA NG  VLMPLLLCSHALFAFTSFSK +  V+AS +      AAA ENPVS+LL  E +  VSGAKDA 
Subjt:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNGG-VLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG

Query:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI
        EAL +IAD+SGRSGGTVS SDCCLIIAAAL+ NNPELALSVFYAMRS+FYQ TA  GVN+NAS+VERWKWSRPDVHVYTLLI GLAASLRVSDALRMIEI
Subjt:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI

Query:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA
        ICRVGVSPAEEVPFGKV+QCP CMVA+AVAQPQHGIQIVSCA+CRY+YELISGNIVNI+SEEISMDTPAWEKALRFLNIMKR+IPAA+HSIVVQTPSGVA
Subjt:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA

Query:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID
        RTQKFATETADLPA+EGERVTIAAAAPSNVFREVGP KFSPKDPNFYSGE MCLTNH+DGRESLLLRVPAKGTSSLLNPSTLFPLI LSAAGDAASGV+D
Subjt:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID

Query:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR
        PSLP+LLLVAG AS+AAGATLNS +LPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVR
Subjt:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR

Query:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE
        EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE
Subjt:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE

XP_038895181.1 uncharacterized protein LOC120083467 isoform X3 [Benincasa hispida]1.4e-25887.34Show/hide
Query:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNGG-VLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG
        MIL+ TSPW  L +TRLP   PPKL EPLASA NG  VLMPLLLCSHALFAFTSFSK +  V+AS +      AAA ENPVS+LL  E +  VSGAKDA 
Subjt:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNGG-VLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG

Query:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI
        EAL +IAD+SGRSGGTVS SDCCLIIAAAL+ NNPELALSVFYAMRS+FYQA    GVN+NAS+VERWKWSRPDVHVYTLLI GLAASLRVSDALRMIEI
Subjt:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI

Query:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA
        ICRVGVSPAEEVPFGKV+QCP CMVA+AVAQPQHGIQIVSCA+CRY+YELISGNIVNI+SEEISMDTPAWEKALRFLNIMKR+IPAA+HSIVVQTPSGVA
Subjt:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA

Query:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID
        RTQKFATETADLPA+EGERVTIAAAAPSNVFREVGP KFSPKDPNFYSGE MCLTNH+DGRESLLLRVPAKGTSSLLNPSTLFPLI LSAAGDAASGV+D
Subjt:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID

Query:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR
        PSLP+LLLVAG AS+AAGATLNS +LPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVR
Subjt:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR

Query:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE
        EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE
Subjt:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE

TrEMBL top hitse value%identityAlignment
A0A6J1C6D2 uncharacterized protein LOC111008391 isoform X12.0e-28795.21Show/hide
Query:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNGGVLMPLLLCSHALFAFTSFSKCTRSVKASSNDR----GSGSAAASENPVSELLDEEFLGDVSGAK
        MILSSTSPWLT+TLTRLPLS PPKLIEPL S NNG VLMPLLLCS ALFAFTSFSK +RSVKASSNDR    GSGSAAASENPVSELLDEE LGDVSGAK
Subjt:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNGGVLMPLLLCSHALFAFTSFSKCTRSVKASSNDR----GSGSAAASENPVSELLDEEFLGDVSGAK

Query:  DAGEALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRM
        DAGEALLVIAD+SGRSGGTV+VSDC LIIAAALERNNPELALSVFYAMRSSFYQATA  GVNQNASSVERWKWSRPDVHVYTLLI GLAASLRVSDALRM
Subjt:  DAGEALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRM

Query:  IEIICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPS
        IEIICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPS
Subjt:  IEIICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPS

Query:  GVARTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASG
        GVARTQKFATETADLPA+EGERVTIAAAAPSNVFREVGPFKFSPKDP FYSGE MCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAA+G
Subjt:  GVARTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASG

Query:  VIDPSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIK
        V+DPSLPRLLLV GFAS+AAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIK
Subjt:  VIDPSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIK

Query:  KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE
        KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAA AASSVERVSEQIEQIMMLENLEE
Subjt:  KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE

A0A6J1ETW5 uncharacterized protein LOC111437671 isoform X13.9e-25986.99Show/hide
Query:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNG-GVLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG
        MIL  +SPW  LT+TRLP   PPKLIEPLASA+NG  VLMPLLLCSHALF FTSFSK TR V+AS N      AAA ENPVSELLD+E +G VSGAKDA 
Subjt:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNG-GVLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG

Query:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI
        E L +IAD+SGR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+ TA  GVN N SSVERWKW+RPDVHVYTLLI GLAASLRVSDALR+IEI
Subjt:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI

Query:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA
        ICRVGVSPAEEVPFGKV+QCP CMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLN+MK+++PAA+HSIVVQTPSGVA
Subjt:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA

Query:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID
        RTQKFATETADLPA+EGERVTIAAAAPSNV+REVGP KFSPKDPN YSGE MCLTNH+DGRESLLLRVPAK TS LL PS LFPLI LS AGD +SGV+D
Subjt:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID

Query:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR
        PSLPRLLLVAGFAS+AAGATLNSF+LPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR
Subjt:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR

Query:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE
        EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM+LENLEE
Subjt:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE

A0A6J1EYW6 uncharacterized protein LOC111437671 isoform X22.8e-25786.81Show/hide
Query:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNG-GVLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG
        MIL  +SPW  LT+TRLP   PPKLIEPLASA+NG  VLMPLLLCSHALF FTSFSK TR V+AS N      AAA ENPVSELLD+E +G VSGAKDA 
Subjt:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNG-GVLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG

Query:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI
        E L +IAD+SGR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+A    GVN N SSVERWKW+RPDVHVYTLLI GLAASLRVSDALR+IEI
Subjt:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI

Query:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA
        ICRVGVSPAEEVPFGKV+QCP CMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLN+MK+++PAA+HSIVVQTPSGVA
Subjt:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA

Query:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID
        RTQKFATETADLPA+EGERVTIAAAAPSNV+REVGP KFSPKDPN YSGE MCLTNH+DGRESLLLRVPAK TS LL PS LFPLI LS AGD +SGV+D
Subjt:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID

Query:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR
        PSLPRLLLVAGFAS+AAGATLNSF+LPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR
Subjt:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR

Query:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE
        EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM+LENLEE
Subjt:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE

A0A6J1J4R1 uncharacterized protein LOC111483407 isoform X11.5e-25585.92Show/hide
Query:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNG-GVLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG
        MIL  +SPW  LT+TRLP    PKLIEPLASA+NG  VLMPLLLCSHA F FTSFS+ TR V+AS N      AAA ENPVS+LLD+E +  VSGAKDA 
Subjt:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNG-GVLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG

Query:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI
        E L +IA++SGR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+ TA  GVN N SSVERWKW+RPDVHVYTLLI GLAASLRVSDALR+IEI
Subjt:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI

Query:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA
        ICRVGVSPAEEVPFGKV+QCP CMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLN+MK+++PAA+HSIVVQTPSGVA
Subjt:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA

Query:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID
        RTQKFATETADLPA+EGERVTIAAAAPSNV+REVGP KFSPKDPN YSGE MCLTNH+DGRESLL+RVPAK TS LL PS LFPLI LS AGDAASGV+D
Subjt:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID

Query:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR
        PSLPR+LLVAGFAS+AAGATLNSF+LPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR
Subjt:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR

Query:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE
        EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM+LENLEE
Subjt:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE

A0A6J1JDG3 uncharacterized protein LOC111483407 isoform X28.5e-25485.74Show/hide
Query:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNG-GVLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG
        MIL  +SPW  LT+TRLP    PKLIEPLASA+NG  VLMPLLLCSHA F FTSFS+ TR V+AS N      AAA ENPVS+LLD+E +  VSGAKDA 
Subjt:  MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNG-GVLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAG

Query:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI
        E L +IA++SGR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+A    GVN N SSVERWKW+RPDVHVYTLLI GLAASLRVSDALR+IEI
Subjt:  EALLVIADRSGRSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEI

Query:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA
        ICRVGVSPAEEVPFGKV+QCP CMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLN+MK+++PAA+HSIVVQTPSGVA
Subjt:  ICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVA

Query:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID
        RTQKFATETADLPA+EGERVTIAAAAPSNV+REVGP KFSPKDPN YSGE MCLTNH+DGRESLL+RVPAK TS LL PS LFPLI LS AGDAASGV+D
Subjt:  RTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVID

Query:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR
        PSLPR+LLVAGFAS+AAGATLNSF+LPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR
Subjt:  PSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR

Query:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE
        EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM+LENLEE
Subjt:  EGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64430.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-17965.26Show/hide
Query:  SSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAGEALLVIADRSGRS-GGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNAS
        S  DR   S  ++ +  S +LD+E L  VS  +DA EAL +I+DR G + GG V + DC  II+AA+ R N +LALS+FY MR+SF         +   S
Subjt:  SSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAGEALLVIADRSGRS-GGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNAS

Query:  SVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEI
          +RW WSRPDV VYT+L++GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+++CP C++AIAVAQPQHG+QIVSCA CRYQYEL SG+I +I+SEE+
Subjt:  SVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEI

Query:  SMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVARTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRES
          D P WEK LR + I K +I +++HSIVVQTPSG ART +FATETA+LPAQEGERVTIA+AAPSNV+R+VGPFKF  K PNFY GE M LT H DGRES
Subjt:  SMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVARTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRES

Query:  LLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVIDPSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLA
        +LLR P+K    +L PS L PL+A+ A GDAASGVIDPSLP+LL VA   S+A GAT+NSF+LP+ N+LP+R+VD++ IKQQLLSQY+VLQ RIRDLK A
Subjt:  LLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVIDPSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLA

Query:  AEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE
         EKEVWMLARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE
Subjt:  AEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE

AT1G64430.2 Pentatricopeptide repeat (PPR) superfamily protein1.4e-17965.26Show/hide
Query:  SSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAGEALLVIADRSGRS-GGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNAS
        S  DR   S  ++ +  S +LD+E L  VS  +DA EAL +I+DR G + GG V + DC  II+AA+ R N +LALS+FY MR+SF         +   S
Subjt:  SSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAGEALLVIADRSGRS-GGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNAS

Query:  SVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEI
          +RW WSRPDV VYT+L++GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+++CP C++AIAVAQPQHG+QIVSCA CRYQYEL SG+I +I+SEE+
Subjt:  SVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVIQCPCCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEI

Query:  SMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVARTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRES
          D P WEK LR + I K +I +++HSIVVQTPSG ART +FATETA+LPAQEGERVTIA+AAPSNV+R+VGPFKF  K PNFY GE M LT H DGRES
Subjt:  SMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVARTQKFATETADLPAQEGERVTIAAAAPSNVFREVGPFKFSPKDPNFYSGESMCLTNHTDGRES

Query:  LLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVIDPSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLA
        +LLR P+K    +L PS L PL+A+ A GDAASGVIDPSLP+LL VA   S+A GAT+NSF+LP+ N+LP+R+VD++ IKQQLLSQY+VLQ RIRDLK A
Subjt:  LLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVIDPSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLA

Query:  AEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE
         EKEVWMLARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE
Subjt:  AEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMMLENLEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTGAGCTCGACTTCGCCATGGCTCACTCTCACTCTCACTCGTCTCCCTCTCTCCAGTCCTCCCAAACTTATAGAGCCACTCGCCTCTGCAAACAATGGCGGCGT
TCTTATGCCTCTCCTTCTGTGTTCCCACGCTCTCTTTGCTTTCACTTCCTTCTCCAAGTGCACCAGATCAGTTAAAGCTTCTTCAAATGACCGCGGCAGCGGCAGCGCTG
CGGCTTCTGAGAATCCTGTTTCGGAATTACTCGACGAAGAGTTTCTCGGAGATGTTTCGGGTGCCAAGGATGCCGGCGAGGCGTTGCTGGTGATTGCTGATAGGTCCGGG
AGAAGTGGAGGCACTGTGTCTGTTTCGGACTGTTGTTTGATTATCGCGGCTGCACTTGAGCGTAACAATCCTGAGCTTGCCTTATCCGTGTTCTACGCAATGCGCTCCAG
TTTCTATCAAGCTACTGCATCGGTAGGTGTTAATCAAAATGCTTCCTCTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTCCATGTATACACATTGCTGATTCATGGTC
TTGCAGCGTCCTTGAGGGTTTCCGATGCTCTTAGGATGATCGAGATTATTTGCCGAGTTGGTGTATCACCTGCTGAAGAGGTCCCATTTGGAAAGGTAATACAGTGTCCC
TGTTGTATGGTAGCAATTGCAGTTGCACAACCCCAGCACGGTATTCAGATTGTATCTTGTGCAAAGTGCCGCTACCAGTATGAACTTATTTCAGGAAATATAGTTAATAT
TGAGTCAGAAGAAATCAGCATGGATACTCCAGCTTGGGAAAAAGCACTCCGATTCTTGAATATAATGAAGCGAGAAATCCCTGCTGCTATTCACTCCATTGTGGTACAAA
CTCCTTCTGGAGTGGCACGAACCCAAAAGTTTGCTACTGAAACTGCAGATCTCCCAGCACAAGAGGGAGAAAGGGTCACAATTGCTGCTGCAGCTCCATCAAATGTATTT
AGAGAAGTTGGTCCTTTTAAATTTAGTCCAAAGGATCCCAATTTTTACTCTGGGGAGTCTATGTGCCTGACAAATCATACGGATGGCCGGGAATCACTATTATTAAGAGT
GCCAGCAAAGGGAACCTCATCCTTACTTAACCCATCGACCCTCTTCCCACTCATAGCGTTGTCTGCTGCTGGAGATGCTGCCTCCGGAGTTATTGACCCCAGCTTGCCTC
GGTTGCTTTTAGTTGCTGGATTTGCTTCTGTAGCTGCAGGAGCTACTTTGAACTCATTTCTTTTGCCTCAATTTAACCGGCTTCCTCAACGATCAGTTGATATCATTGCT
ATCAAGCAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTATCAGGGATTTAAAGCTAGCTGCTGAGAAGGAGGTATGGATGTTGGCTCGGATGTGTCAATTAGA
GAACAAAATTTTTGCTGTAGGAGAACCTTCTTATCGCGCACGTAGAAGTAGGATAAAGAAGGTGCGAGAAGGCCTGGAGAATTCCCTTAAGCAACGGATTGAACTAATAG
AAAGCTATGCAAGAATTTCCTCAATGATTGAGATAGAAGTTGAAATGGAATCTGATGTTATCGCTGCTGAAGCAGCCAGTAGCGTGGAAAGGGTTTCTGAACAGATTGAG
CAAATAATGATGCTGGAAAATCTAGAAGAG
mRNA sequenceShow/hide mRNA sequence
ATGATTCTGAGCTCGACTTCGCCATGGCTCACTCTCACTCTCACTCGTCTCCCTCTCTCCAGTCCTCCCAAACTTATAGAGCCACTCGCCTCTGCAAACAATGGCGGCGT
TCTTATGCCTCTCCTTCTGTGTTCCCACGCTCTCTTTGCTTTCACTTCCTTCTCCAAGTGCACCAGATCAGTTAAAGCTTCTTCAAATGACCGCGGCAGCGGCAGCGCTG
CGGCTTCTGAGAATCCTGTTTCGGAATTACTCGACGAAGAGTTTCTCGGAGATGTTTCGGGTGCCAAGGATGCCGGCGAGGCGTTGCTGGTGATTGCTGATAGGTCCGGG
AGAAGTGGAGGCACTGTGTCTGTTTCGGACTGTTGTTTGATTATCGCGGCTGCACTTGAGCGTAACAATCCTGAGCTTGCCTTATCCGTGTTCTACGCAATGCGCTCCAG
TTTCTATCAAGCTACTGCATCGGTAGGTGTTAATCAAAATGCTTCCTCTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTCCATGTATACACATTGCTGATTCATGGTC
TTGCAGCGTCCTTGAGGGTTTCCGATGCTCTTAGGATGATCGAGATTATTTGCCGAGTTGGTGTATCACCTGCTGAAGAGGTCCCATTTGGAAAGGTAATACAGTGTCCC
TGTTGTATGGTAGCAATTGCAGTTGCACAACCCCAGCACGGTATTCAGATTGTATCTTGTGCAAAGTGCCGCTACCAGTATGAACTTATTTCAGGAAATATAGTTAATAT
TGAGTCAGAAGAAATCAGCATGGATACTCCAGCTTGGGAAAAAGCACTCCGATTCTTGAATATAATGAAGCGAGAAATCCCTGCTGCTATTCACTCCATTGTGGTACAAA
CTCCTTCTGGAGTGGCACGAACCCAAAAGTTTGCTACTGAAACTGCAGATCTCCCAGCACAAGAGGGAGAAAGGGTCACAATTGCTGCTGCAGCTCCATCAAATGTATTT
AGAGAAGTTGGTCCTTTTAAATTTAGTCCAAAGGATCCCAATTTTTACTCTGGGGAGTCTATGTGCCTGACAAATCATACGGATGGCCGGGAATCACTATTATTAAGAGT
GCCAGCAAAGGGAACCTCATCCTTACTTAACCCATCGACCCTCTTCCCACTCATAGCGTTGTCTGCTGCTGGAGATGCTGCCTCCGGAGTTATTGACCCCAGCTTGCCTC
GGTTGCTTTTAGTTGCTGGATTTGCTTCTGTAGCTGCAGGAGCTACTTTGAACTCATTTCTTTTGCCTCAATTTAACCGGCTTCCTCAACGATCAGTTGATATCATTGCT
ATCAAGCAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTATCAGGGATTTAAAGCTAGCTGCTGAGAAGGAGGTATGGATGTTGGCTCGGATGTGTCAATTAGA
GAACAAAATTTTTGCTGTAGGAGAACCTTCTTATCGCGCACGTAGAAGTAGGATAAAGAAGGTGCGAGAAGGCCTGGAGAATTCCCTTAAGCAACGGATTGAACTAATAG
AAAGCTATGCAAGAATTTCCTCAATGATTGAGATAGAAGTTGAAATGGAATCTGATGTTATCGCTGCTGAAGCAGCCAGTAGCGTGGAAAGGGTTTCTGAACAGATTGAG
CAAATAATGATGCTGGAAAATCTAGAAGAG
Protein sequenceShow/hide protein sequence
MILSSTSPWLTLTLTRLPLSSPPKLIEPLASANNGGVLMPLLLCSHALFAFTSFSKCTRSVKASSNDRGSGSAAASENPVSELLDEEFLGDVSGAKDAGEALLVIADRSG
RSGGTVSVSDCCLIIAAALERNNPELALSVFYAMRSSFYQATASVGVNQNASSVERWKWSRPDVHVYTLLIHGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVIQCP
CCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEEISMDTPAWEKALRFLNIMKREIPAAIHSIVVQTPSGVARTQKFATETADLPAQEGERVTIAAAAPSNVF
REVGPFKFSPKDPNFYSGESMCLTNHTDGRESLLLRVPAKGTSSLLNPSTLFPLIALSAAGDAASGVIDPSLPRLLLVAGFASVAAGATLNSFLLPQFNRLPQRSVDIIA
IKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIE
QIMMLENLEE