; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004925 (gene) of Snake gourd v1 genome

Gene IDTan0004925
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein isoform 2
Genome locationLG01:10569495..10576780
RNA-Seq ExpressionTan0004925
SyntenyTan0004925
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577138.1 hypothetical protein SDJN03_24712, partial [Cucurbita argyrosperma subsp. sororia]6.8e-28589.97Show/hide
Query:  MILNLSSPWLSLTRL-PPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRM
        MIL+LSSPWL++TRL PPPKLIEP TSA+NGT  LMPLLLCSHALF FTSFSKSTRVRAS N  + DGAAAFENPVSELLD EL+GVVS +KDADE LR+
Subjt:  MILNLSSPWLSLTRL-PPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRM

Query:  IADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG
        IAD+ GR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+ TA EGV++N SSVERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGKVVQCPSCMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIV+IESEE SMDTPAWEKAL+FLN+MKQK+PAAVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPR
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNH+DGRESLLLRVPAK  SFLL PS LFPLI+LS  GDAASGV+DPSLPR
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPR

Query:  LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLES
        LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQ RIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVR+GLE+
Subjt:  LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLES

Query:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT
        SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEE+W+LQAEANDEAERL NQSMPT
Subjt:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT

XP_022931517.1 uncharacterized protein LOC111437671 isoform X1 [Cucurbita moschata]2.8e-28690.31Show/hide
Query:  MILNLSSPWLSLTRL-PPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRM
        MIL+LSSPWL++TRL PPPKLIEPL SA+NGT  LMPLLLCSHALF FTSFSKSTRVRAS N  + DGAAAFENPVSELLD EL+GVVSGAKDADE LR+
Subjt:  MILNLSSPWLSLTRL-PPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRM

Query:  IADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG
        IAD+ GR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+ TA EGV++N SSVERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGKVVQCPSCMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEE SMDTPAWEKAL+FLN+MKQK+PAAVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPR
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNH+DGRESLLLRVPAK  SFLL PS LFPLI+LS  GD +SGV+DPSLPR
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPR

Query:  LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLES
        LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQ RIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLE+
Subjt:  LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLES

Query:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT
        SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEE+W+LQAEANDEAERL NQSMPT
Subjt:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT

XP_022931518.1 uncharacterized protein LOC111437671 isoform X2 [Cucurbita moschata]3.4e-28490.14Show/hide
Query:  MILNLSSPWLSLTRL-PPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRM
        MIL+LSSPWL++TRL PPPKLIEPL SA+NGT  LMPLLLCSHALF FTSFSKSTRVRAS N  + DGAAAFENPVSELLD EL+GVVSGAKDADE LR+
Subjt:  MILNLSSPWLSLTRL-PPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRM

Query:  IADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG
        IAD+ GR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+  A EGV++N SSVERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGKVVQCPSCMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEE SMDTPAWEKAL+FLN+MKQK+PAAVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPR
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNH+DGRESLLLRVPAK  SFLL PS LFPLI+LS  GD +SGV+DPSLPR
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPR

Query:  LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLES
        LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQ RIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLE+
Subjt:  LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLES

Query:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT
        SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEE+W+LQAEANDEAERL NQSMPT
Subjt:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT

XP_022985382.1 uncharacterized protein LOC111483407 isoform X1 [Cucurbita maxima]3.4e-28489.6Show/hide
Query:  MILNLSSPWLSLTRLPPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMI
        MIL+LSSPWL++TRLP PKLIEPL SA+NGT  LMPLLLCSHA F FTSFS+STRVRAS N  + DGAAAFENPVS+LLD EL+ VVSGAKDADE LRMI
Subjt:  MILNLSSPWLSLTRLPPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMI

Query:  ADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        A++ GR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+ TA EGV++N SSVERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVGV
Subjt:  ADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVQCPSCMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEE SMDTPAWEKAL+FLN+MKQK+PAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNH+DGRESLL+RVPAK  SFLL PS LFPLI+LS  GDAASGV+DPSLPR+
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESS
        LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQ RIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLE+S
Subjt:  LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEE+W+LQAEANDEAERL NQSMPT
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT

XP_038895173.1 uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida]5.2e-28590.29Show/hide
Query:  MILNLSSPWLSLTRLPPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMI
        MILNL+SPWL++TRLPPPKL EPL SA NG   LMPLLLCSHALFAFTSFSKS +VRAS +G D DGAAAFENPVS+LL  EL+  VSGAKDADEALRMI
Subjt:  MILNLSSPWLSLTRLPPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMI

Query:  ADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        AD+ GRSGGTVS SDC LIIAAAL+ NNPELALSVFYAMRS+FYQ TA EGV+ENAS+VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVQCPSCMVA+AVAQPQHGIQIVSCA+CRY+YELISGNIVNI+SEE SMDTPAWEKAL+FLNIMK+KIPAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGEPMCLTNH+DGRESLLLRVPAKG S LLNPS LFPLIVLSA GDAASGV+DPSLP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESS
        LLVAG ASLAAGATLNS ILPQ NRLPQRSVDIIAIKQQLLSQYNVLQ RIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLE+S
Subjt:  LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIM LENLEE+WKLQAEANDEAERLLNQSMPT
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT

TrEMBL top hitse value%identityAlignment
A0A1S3BTU3 uncharacterized protein LOC1034931031.1e-28089.77Show/hide
Query:  MILNLSSPWLSLTRLPPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMI
        MILN +SP+L+LTRLPPPKL+EPL S+ NG    +PLLLCSHALFAFTSFSKS RVR S +G D DG+AAFENP SELLD EL+ VVSGAKDADEAL MI
Subjt:  MILNLSSPWLSLTRLPPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMI

Query:  ADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         D+ GRSGGTVSVSDC LIIAAAL+RNNPELALSVFYAMRS+FYQ TA E V+ENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVV+CPSCMVA+AVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEE SMDTPAWEKAL+FLNIMK+KIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGE MCLTNH+DGRESLLLRVPAK  S LLNPSILFPLIVLSA GDAASGVIDPSLP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESS
        LLVAGFASLAAGATLNS ILPQF+RLPQRSVDIIAIKQQLLSQYNVLQ RI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLE+S
Subjt:  LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIM LENLEE+WKLQAEANDEAERLLNQSMPT
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT

A0A6J1ETW5 uncharacterized protein LOC111437671 isoform X11.3e-28690.31Show/hide
Query:  MILNLSSPWLSLTRL-PPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRM
        MIL+LSSPWL++TRL PPPKLIEPL SA+NGT  LMPLLLCSHALF FTSFSKSTRVRAS N  + DGAAAFENPVSELLD EL+GVVSGAKDADE LR+
Subjt:  MILNLSSPWLSLTRL-PPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRM

Query:  IADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG
        IAD+ GR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+ TA EGV++N SSVERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGKVVQCPSCMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEE SMDTPAWEKAL+FLN+MKQK+PAAVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPR
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNH+DGRESLLLRVPAK  SFLL PS LFPLI+LS  GD +SGV+DPSLPR
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPR

Query:  LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLES
        LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQ RIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLE+
Subjt:  LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLES

Query:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT
        SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEE+W+LQAEANDEAERL NQSMPT
Subjt:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT

A0A6J1EYW6 uncharacterized protein LOC111437671 isoform X21.6e-28490.14Show/hide
Query:  MILNLSSPWLSLTRL-PPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRM
        MIL+LSSPWL++TRL PPPKLIEPL SA+NGT  LMPLLLCSHALF FTSFSKSTRVRAS N  + DGAAAFENPVSELLD EL+GVVSGAKDADE LR+
Subjt:  MILNLSSPWLSLTRL-PPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRM

Query:  IADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG
        IAD+ GR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+  A EGV++N SSVERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGKVVQCPSCMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEE SMDTPAWEKAL+FLN+MKQK+PAAVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPR
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNH+DGRESLLLRVPAK  SFLL PS LFPLI+LS  GD +SGV+DPSLPR
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPR

Query:  LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLES
        LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQ RIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLE+
Subjt:  LLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLES

Query:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT
        SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEE+W+LQAEANDEAERL NQSMPT
Subjt:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT

A0A6J1J4R1 uncharacterized protein LOC111483407 isoform X11.6e-28489.6Show/hide
Query:  MILNLSSPWLSLTRLPPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMI
        MIL+LSSPWL++TRLP PKLIEPL SA+NGT  LMPLLLCSHA F FTSFS+STRVRAS N  + DGAAAFENPVS+LLD EL+ VVSGAKDADE LRMI
Subjt:  MILNLSSPWLSLTRLPPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMI

Query:  ADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        A++ GR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+ TA EGV++N SSVERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVGV
Subjt:  ADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVQCPSCMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEE SMDTPAWEKAL+FLN+MKQK+PAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNH+DGRESLL+RVPAK  SFLL PS LFPLI+LS  GDAASGV+DPSLPR+
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESS
        LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQ RIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLE+S
Subjt:  LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEE+W+LQAEANDEAERL NQSMPT
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT

A0A6J1JDG3 uncharacterized protein LOC111483407 isoform X22.0e-28289.43Show/hide
Query:  MILNLSSPWLSLTRLPPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMI
        MIL+LSSPWL++TRLP PKLIEPL SA+NGT  LMPLLLCSHA F FTSFS+STRVRAS N  + DGAAAFENPVS+LLD EL+ VVSGAKDADE LRMI
Subjt:  MILNLSSPWLSLTRLPPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMI

Query:  ADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        A++ GR+GGTVSV DC LIIAAAL+RNN ELALSVFYAMRSSFY+  A EGV++N SSVERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVGV
Subjt:  ADELGRSGGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVQCPSCMVA+AVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEE SMDTPAWEKAL+FLN+MKQK+PAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGEPMCLTNH+DGRESLL+RVPAK  SFLL PS LFPLI+LS  GDAASGV+DPSLPR+
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESS
        LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQ RIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLE+S
Subjt:  LLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERV+EQIEQIMVLENLEE+W+LQAEANDEAERL NQSMPT
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAEANDEAERLLNQSMPT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64430.1 Pentatricopeptide repeat (PPR) superfamily protein7.5e-18967.84Show/hide
Query:  DSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMIADELGRS-GGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERW
        DS G+AA  +  S +LD ELL  VS  +DADEAL MI+D  G + GG V + DC  II+AA+ R N +LALS+FY MR+SF       G S+N    +RW
Subjt:  DSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMIADELGRS-GGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERW

Query:  KWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTP
         WSRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC++AIAVAQPQHG+QIVSCA CRYQYEL SG+I +I+SEE   D P
Subjt:  KWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTP

Query:  AWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRV
         WEK L+ + I K KI ++VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN Y GEPM LT H DGRES+LLR 
Subjt:  AWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRV

Query:  PAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEV
        P+K    +L PS L PL+ + A GDAASGVIDPSLP+LL VA   SLA GAT+NSF+LP+ N+LP+R+VD++ IKQQLLSQY+VLQ RIRDLK A EKEV
Subjt:  PAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEV

Query:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAE
        WMLARMCQLENKI AVGEP+YR RR+R+KKVRE LE+S+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +A+QIEQIM LENLEEKWK+QAE
Subjt:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAE

Query:  ANDEAERLLN
        ANDEAERLL+
Subjt:  ANDEAERLLN

AT1G64430.2 Pentatricopeptide repeat (PPR) superfamily protein7.5e-18967.84Show/hide
Query:  DSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMIADELGRS-GGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERW
        DS G+AA  +  S +LD ELL  VS  +DADEAL MI+D  G + GG V + DC  II+AA+ R N +LALS+FY MR+SF       G S+N    +RW
Subjt:  DSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMIADELGRS-GGTVSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERW

Query:  KWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTP
         WSRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC++AIAVAQPQHG+QIVSCA CRYQYEL SG+I +I+SEE   D P
Subjt:  KWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMVAIAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTP

Query:  AWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRV
         WEK L+ + I K KI ++VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN Y GEPM LT H DGRES+LLR 
Subjt:  AWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEPMCLTNHTDGRESLLLRV

Query:  PAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEV
        P+K    +L PS L PL+ + A GDAASGVIDPSLP+LL VA   SLA GAT+NSF+LP+ N+LP+R+VD++ IKQQLLSQY+VLQ RIRDLK A EKEV
Subjt:  PAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQLLSQYNVLQFRIRDLKLAAEKEV

Query:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAE
        WMLARMCQLENKI AVGEP+YR RR+R+KKVRE LE+S+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +A+QIEQIM LENLEEKWK+QAE
Subjt:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVLENLEEKWKLQAE

Query:  ANDEAERLLN
        ANDEAERLL+
Subjt:  ANDEAERLLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTGAACTTGAGTTCGCCATGGCTCTCTCTCACTCGCCTCCCTCCTCCCAAACTCATCGAACCACTCACCTCTGCAAACAATGGCACTGGCGCACTCATGCCTCT
TCTACTATGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCCAAGTCGACGCGAGTTAGAGCTTCTTTCAATGGCTGCGACAGCGACGGCGCCGCGGCTTTTGAGAATC
CTGTTTCGGAGTTACTCGACGGGGAGCTTCTTGGGGTTGTTTCGGGTGCTAAGGATGCAGATGAAGCGTTACGGATGATTGCTGATGAGTTAGGGAGAAGTGGAGGCACT
GTGTCGGTTTCGGACTGTGGTTTGATTATTGCGGCTGCACTTGAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCTAGTTTCTATCAAGCTAC
TGCATTGGAAGGTGTTAGTGAAAATGCTTCCTCTGTTGAGAGATGGAAATGGTCAAGGCCGGATGTCCATGTGTATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGA
GGGTTTCTGATGCTCTTAGGATGATTGAGATTATTTGCCGAGTTGGTGTATCACCCGCTGAAGAGGTCCCATTTGGAAAGGTAGTGCAGTGTCCCAGTTGTATGGTAGCA
ATTGCAGTTGCACAACCCCAACACGGTATTCAGATTGTATCCTGTGCAAAGTGCCGGTACCAGTATGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAACG
TAGCATGGATACTCCAGCATGGGAAAAAGCACTCCAATTCTTGAATATAATGAAGCAAAAAATCCCTGCTGCTGTGCACTCCATTGTGGTTCAAACTCCTTCTGGAGTGG
CACGAACCCAGAAGTTTGCTACTGAAACAGCAGATCTCCCAGCACGAGAAGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCGTCAAATGTATTCAGAGAAGTTGGTCCT
ATTAAATTTAGTCCAAAGGATCCCAATTTGTACTCTGGGGAGCCTATGTGCCTGACAAATCATACAGATGGCCGGGAATCACTATTATTAAGAGTACCAGCAAAGGGAGC
CTCATTCTTACTTAACCCATCGATCCTCTTTCCACTCATAGTTTTATCTGCCGTTGGAGATGCTGCCTCTGGAGTTATTGACCCCAGCTTGCCTCGGCTGCTTTTAGTTG
CTGGATTTGCTTCTCTAGCTGCAGGAGCTACTTTGAATTCATTTATTTTGCCTCAATTCAATCGGCTTCCTCAACGATCAGTTGATATTATTGCTATCAAGCAGCAGCTT
TTATCTCAATATAATGTGCTTCAGTTTCGTATTAGGGACTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTAGAGAACAAAATTTTTGC
CGTAGGAGAACCTTCTTACCGTGCACGTAGAAGTCGGATAAAAAAGGTGCGAGAAGGTTTGGAAAGTTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGA
TTTCCTCAATGATTGAGATAGAAGTTGAAATGGAGTCTGATGTTATTGCTGCTGAAGCAGCCAGCAGTGTGGAAAGGGTTGCTGAACAGATTGAGCAAATCATGGTGCTG
GAAAATCTAGAAGAGAAATGGAAATTACAAGCAGAAGCCAACGATGAAGCCGAAAGACTTCTCAACCAATCAATGCCAACATAA
mRNA sequenceShow/hide mRNA sequence
ATGATTCTGAACTTGAGTTCGCCATGGCTCTCTCTCACTCGCCTCCCTCCTCCCAAACTCATCGAACCACTCACCTCTGCAAACAATGGCACTGGCGCACTCATGCCTCT
TCTACTATGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCCAAGTCGACGCGAGTTAGAGCTTCTTTCAATGGCTGCGACAGCGACGGCGCCGCGGCTTTTGAGAATC
CTGTTTCGGAGTTACTCGACGGGGAGCTTCTTGGGGTTGTTTCGGGTGCTAAGGATGCAGATGAAGCGTTACGGATGATTGCTGATGAGTTAGGGAGAAGTGGAGGCACT
GTGTCGGTTTCGGACTGTGGTTTGATTATTGCGGCTGCACTTGAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCTAGTTTCTATCAAGCTAC
TGCATTGGAAGGTGTTAGTGAAAATGCTTCCTCTGTTGAGAGATGGAAATGGTCAAGGCCGGATGTCCATGTGTATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGA
GGGTTTCTGATGCTCTTAGGATGATTGAGATTATTTGCCGAGTTGGTGTATCACCCGCTGAAGAGGTCCCATTTGGAAAGGTAGTGCAGTGTCCCAGTTGTATGGTAGCA
ATTGCAGTTGCACAACCCCAACACGGTATTCAGATTGTATCCTGTGCAAAGTGCCGGTACCAGTATGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAACG
TAGCATGGATACTCCAGCATGGGAAAAAGCACTCCAATTCTTGAATATAATGAAGCAAAAAATCCCTGCTGCTGTGCACTCCATTGTGGTTCAAACTCCTTCTGGAGTGG
CACGAACCCAGAAGTTTGCTACTGAAACAGCAGATCTCCCAGCACGAGAAGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCGTCAAATGTATTCAGAGAAGTTGGTCCT
ATTAAATTTAGTCCAAAGGATCCCAATTTGTACTCTGGGGAGCCTATGTGCCTGACAAATCATACAGATGGCCGGGAATCACTATTATTAAGAGTACCAGCAAAGGGAGC
CTCATTCTTACTTAACCCATCGATCCTCTTTCCACTCATAGTTTTATCTGCCGTTGGAGATGCTGCCTCTGGAGTTATTGACCCCAGCTTGCCTCGGCTGCTTTTAGTTG
CTGGATTTGCTTCTCTAGCTGCAGGAGCTACTTTGAATTCATTTATTTTGCCTCAATTCAATCGGCTTCCTCAACGATCAGTTGATATTATTGCTATCAAGCAGCAGCTT
TTATCTCAATATAATGTGCTTCAGTTTCGTATTAGGGACTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTAGAGAACAAAATTTTTGC
CGTAGGAGAACCTTCTTACCGTGCACGTAGAAGTCGGATAAAAAAGGTGCGAGAAGGTTTGGAAAGTTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGA
TTTCCTCAATGATTGAGATAGAAGTTGAAATGGAGTCTGATGTTATTGCTGCTGAAGCAGCCAGCAGTGTGGAAAGGGTTGCTGAACAGATTGAGCAAATCATGGTGCTG
GAAAATCTAGAAGAGAAATGGAAATTACAAGCAGAAGCCAACGATGAAGCCGAAAGACTTCTCAACCAATCAATGCCAACATAA
Protein sequenceShow/hide protein sequence
MILNLSSPWLSLTRLPPPKLIEPLTSANNGTGALMPLLLCSHALFAFTSFSKSTRVRASFNGCDSDGAAAFENPVSELLDGELLGVVSGAKDADEALRMIADELGRSGGT
VSVSDCGLIIAAALERNNPELALSVFYAMRSSFYQATALEGVSENASSVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMVA
IAVAQPQHGIQIVSCAKCRYQYELISGNIVNIESEERSMDTPAWEKALQFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGP
IKFSPKDPNLYSGEPMCLTNHTDGRESLLLRVPAKGASFLLNPSILFPLIVLSAVGDAASGVIDPSLPRLLLVAGFASLAAGATLNSFILPQFNRLPQRSVDIIAIKQQL
LSQYNVLQFRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLESSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVAEQIEQIMVL
ENLEEKWKLQAEANDEAERLLNQSMPT