; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G06430 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G06430
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationChr4:4464154..4470602
RNA-Seq ExpressionCSPI04G06430
SyntenyCSPI04G06430
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148995.1 uncharacterized protein LOC101209802 [Cucumis sativus]0.0e+0099.65Show/hide
Query:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI
        MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLS SDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI
Subjt:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI

Query:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTP
         DKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTP
Subjt:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTP

Query:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE
        AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLI
        TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLI
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLI

Query:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
        VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV

XP_008451955.1 PREDICTED: uncharacterized protein LOC103493103 [Cucumis melo]2.5e-30396.38Show/hide
Query:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI
        MILNFTSP LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHA FAFTSFSKS+RVRTSLS SDIDGSAAFENPASELLDDELI+VVSGAKDADEALGMI
Subjt:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI

Query:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQ  AWE VNENASIVERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV
Subjt:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGV

Query:  TPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        +PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI MDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  TPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVP KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        L+VAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV

XP_038895166.1 uncharacterized protein LOC120083467 isoform X1 [Benincasa hispida]2.1e-28690Show/hide
Query:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI
        MILN TSP L +TRLPPPKL EPLAS+TNGATV MPLLLCSHA FAFTSFSKS++VR SLS SDIDG+AAFENP S+LL +ELI  VSGAKDADEAL MI
Subjt:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI

Query:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQ------------AWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALR
        +DKSGRSGGTVS SDC LII+AALK NNPELALSVFYAMRSTFYQ            AWEGVNENAS VERWKWSRPDVHVYTLLI+GLAASLRVSDALR
Subjt:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQ------------AWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALR

Query:  MIEIICRVGVTPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTP
        MIEIICRVGV+PAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+C YKYELISGNIVNI+SEEI MDTPAWEKALRFLNIMKRKIP AVHSIVVQTP
Subjt:  MIEIICRVGVTPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTP

Query:  SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAAS
        SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGE MCLTNHSDGRESLLLRVP K  SSLLNPS LFPLIVLSAAGDAAS
Subjt:  SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAAS

Query:  GVIDPSLPQLLIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI
        GV+DPSLPQLL+VAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI
Subjt:  GVIDPSLPQLLIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI

Query:  KKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV
        +KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  KKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV

XP_038895173.1 uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida]1.5e-28791.55Show/hide
Query:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI
        MILN TSP L +TRLPPPKL EPLAS+TNGATV MPLLLCSHA FAFTSFSKS++VR SLS SDIDG+AAFENP S+LL +ELI  VSGAKDADEAL MI
Subjt:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI

Query:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGV
        +DKSGRSGGTVS SDC LII+AALK NNPELALSVFYAMRSTFYQ  AWEGVNENAS VERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV
Subjt:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGV

Query:  TPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        +PAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+C YKYELISGNIVNI+SEEI MDTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  TPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGE MCLTNHSDGRESLLLRVP K  SSLLNPS LFPLIVLSAAGDAASGV+DPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        L+VAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENS
Subjt:  LIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV

XP_038895181.1 uncharacterized protein LOC120083467 isoform X3 [Benincasa hispida]4.6e-28991.87Show/hide
Query:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI
        MILN TSP L +TRLPPPKL EPLAS+TNGATV MPLLLCSHA FAFTSFSKS++VR SLS SDIDG+AAFENP S+LL +ELI  VSGAKDADEAL MI
Subjt:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI

Query:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTP
        +DKSGRSGGTVS SDC LII+AALK NNPELALSVFYAMRSTFYQAWEGVNENAS VERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV+P
Subjt:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTP

Query:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE
        AEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+C YKYELISGNIVNI+SEEI MDTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLI
        TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGE MCLTNHSDGRESLLLRVP K  SSLLNPS LFPLIVLSAAGDAASGV+DPSLPQLL+
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLI

Query:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
        VAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV

TrEMBL top hitse value%identityAlignment
A0A0A0KZV4 Uncharacterized protein0.0e+0099.65Show/hide
Query:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI
        MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLS SDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI
Subjt:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI

Query:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTP
         DKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTP
Subjt:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTP

Query:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE
        AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLI
        TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLI
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLI

Query:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
        VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV

A0A1S3BTU3 uncharacterized protein LOC1034931031.2e-30396.38Show/hide
Query:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI
        MILNFTSP LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHA FAFTSFSKS+RVRTSLS SDIDGSAAFENPASELLDDELI+VVSGAKDADEALGMI
Subjt:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI

Query:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQ  AWE VNENASIVERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV
Subjt:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGV

Query:  TPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        +PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI MDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  TPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVP KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        L+VAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV

A0A5A7TPS5 Pentatricopeptide repeat (PPR) superfamily protein isoform 26.4e-28196.1Show/hide
Query:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI
        MILNFTSP LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHA FAFTSFSKS+RVRTSLS SDIDGSAAFENPASELLDDELI+VVSGAKDADEALGMI
Subjt:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI

Query:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQ  AWE VNENASIVERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV
Subjt:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGV

Query:  TPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        +PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI MDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  TPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVP KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        L+VAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER

A0A6J1EYW6 uncharacterized protein LOC111437671 isoform X29.9e-28288.6Show/hide
Query:  MILNFTSPCLTLTRL-PPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGM
        MIL+ +SP LT+TRL PPPKL+EPLAS++NG +V MPLLLCSHA F FTSFSKS RVR SL++S+IDG+AAFENP SELLDDELI VVSGAKDADE L +
Subjt:  MILNFTSPCLTLTRL-PPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGM

Query:  ISDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVT
        I+DKSGR+GGTVSV DCRLII+AALKRNN ELALSVFYAMRS+FY+AWEGVN+N S VERWKW+RPDVHVYTLLI+GLAASLRVSDALR+IEIICRVGV+
Subjt:  ISDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVT

Query:  PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFAT
        PAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKC Y+YELISGNIVNIESEEI MDTPAWEKALRFLN+MK+K+P AVHSIVVQTPSGVARTQKFAT
Subjt:  PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFAT

Query:  ETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLL
        ETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGE MCLTNHSDGRESLLLRVP KE S LL PS LFPLI+LS AGD +SGV+DPSLP+LL
Subjt:  ETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLL

Query:  IVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSL
        +VAGFASLAAGATLNS ILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSL
Subjt:  IVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSL

Query:  KQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV
        KQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERW+LQAEANDEAERL NQSMPTE+V
Subjt:  KQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV

A0A6J1JDG3 uncharacterized protein LOC111483407 isoform X24.5e-28288.41Show/hide
Query:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI
        MIL+ +SP LT+TRLP PKL+EPLAS++NG +V MPLLLCSHAFF FTSFS+S RVR SL+ S+IDG+AAFENP S+LLDDELI VVSGAKDADE L MI
Subjt:  MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMI

Query:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTP
        ++KSGR+GGTVSV DCRLII+AALKRNN ELALSVFYAMRS+FY+AWEGVN+N S VERWKW+RPDVHVYTLLI+GLAASLRVSDALR+IEIICRVGV+P
Subjt:  SDKSGRSGGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTP

Query:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE
        AEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKC Y+YELISGNIVNIESEEI MDTPAWEKALRFLN+MK+K+P AVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLI
        TADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGE MCLTNHSDGRESLL+RVP KE S LL PS LFPLI+LS AGDAASGV+DPSLP++L+
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLI

Query:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
        VAGFASLAAGATLNS ILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERW+LQAEANDEAERL NQSMPTE+V
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAEANDEAERLLNQSMPTEKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64430.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-18866.86Show/hide
Query:  SSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMISDKSGRS-GGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERW
        S D  GSAA  + +S +LDDEL+  VS  +DADEAL MISD+ G + GG V + DCR IISAA+ R N +LALS+FY MR++F       +   S  +RW
Subjt:  SSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMISDKSGRS-GGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERW

Query:  KWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTP
         WSRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG++PAEEVPFGK+V+CPSC++A+AVAQPQHG+QIVSCA C Y+YEL SG+I +I+SEE+  D P
Subjt:  KWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTP

Query:  AWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRV
         WEK LR + I K KI  +VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN Y GE M LT H DGRES+LLR 
Subjt:  AWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRV

Query:  PGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEV
        P K+   +L PS L PL+ + A GDAASGVIDPSLPQLL VA   SLA GAT+NS +LP+ N+LP+R+VD++ IKQQLLSQY+VLQ RI DLK A EKEV
Subjt:  PGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEV

Query:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAE
        WMLARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE+WK+QAE
Subjt:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAE

Query:  ANDEAERLLN
        ANDEAERLL+
Subjt:  ANDEAERLLN

AT1G64430.2 Pentatricopeptide repeat (PPR) superfamily protein2.2e-18866.86Show/hide
Query:  SSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMISDKSGRS-GGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERW
        S D  GSAA  + +S +LDDEL+  VS  +DADEAL MISD+ G + GG V + DCR IISAA+ R N +LALS+FY MR++F       +   S  +RW
Subjt:  SSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMISDKSGRS-GGTVSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERW

Query:  KWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTP
         WSRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG++PAEEVPFGK+V+CPSC++A+AVAQPQHG+QIVSCA C Y+YEL SG+I +I+SEE+  D P
Subjt:  KWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTP

Query:  AWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRV
         WEK LR + I K KI  +VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN Y GE M LT H DGRES+LLR 
Subjt:  AWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRV

Query:  PGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEV
        P K+   +L PS L PL+ + A GDAASGVIDPSLPQLL VA   SLA GAT+NS +LP+ N+LP+R+VD++ IKQQLLSQY+VLQ RI DLK A EKEV
Subjt:  PGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEV

Query:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAE
        WMLARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE+WK+QAE
Subjt:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEERWKLQAE

Query:  ANDEAERLLN
        ANDEAERLL+
Subjt:  ANDEAERLLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTGAACTTCACTTCACCATGCCTCACTCTCACTCGTCTCCCTCCTCCTAAACTCCTCGAACCACTCGCCTCTTCAACCAATGGCGCTACCGTTTTCATGCCTCT
CCTTCTTTGTTCTCACGCTTTCTTTGCTTTCACCTCCTTCTCTAAGTCGTTGCGAGTTAGAACTTCTTTAAGTAGCAGCGACATCGACGGCTCTGCGGCTTTTGAGAATC
CTGCTTCGGAGTTACTCGACGACGAGCTGATTGTAGTTGTTTCGGGTGCTAAGGATGCTGATGAAGCGCTAGGGATGATCAGTGATAAGTCAGGGAGAAGTGGAGGTACT
GTGTCTGTTTCCGACTGTCGTTTGATTATTTCGGCTGCACTTAAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCCACTTTCTATCAAGCATG
GGAAGGTGTTAATGAAAATGCTTCCATCGTTGAGAGATGGAAATGGTCAAGGCCAGATGTTCATGTATATACATTGCTGATTGAAGGTCTTGCAGCATCCTTGAGGGTAT
CTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTAACACCTGCTGAGGAGGTACCATTTGGAAAAGTGGTGAAGTGTCCCAGTTGTATGGTAGCAGTTGCA
GTTGCACAACCGCAGCATGGTATTCAGATTGTATCCTGTGCAAAGTGCTGCTACAAGTATGAACTTATTTCAGGAAACATAGTTAATATCGAGTCAGAAGAAATTCGCAT
GGATACTCCTGCATGGGAAAAAGCACTTCGGTTCTTGAACATAATGAAGCGAAAAATTCCTGTTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTGGCACGAA
CCCAGAAGTTTGCTACTGAAACAGCAGATCTCCCGGCTCGAGAGGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGAAGTTGGTCCTATTAAA
TTTAGTCCAAAGGATCCCAATTTGTACTCTGGAGAGGCCATGTGCCTGACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGGAAAGGAAAACTCATC
CTTACTTAACCCATCGATCCTCTTTCCACTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTGCTTATAGTTGCTGGAT
TTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAATCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAACTTTTATCT
CAATATAATGTGCTTCAGTCTCGTATCGGGGATTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTGGAGAACAAAATTTTTGCCGTAGG
AGAACCTTCTTACCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATCGAAAGCTATGCAAGGATTTCCT
CGATGATTGAGATCGAAGTTGAAATGGAGTCTGATGTTATTGCAGCTGAAGCAGCTAGCAGTGTGGAAAGAGTGTCTGAACAGATAGAGCAAATCATGGTGCTGGAAAAT
CTAGAAGAGAGATGGAAATTACAAGCAGAAGCCAACGATGAAGCGGAAAGACTTCTCAACCAATCAATGCCAACAGAAAAGGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATTCTGAACTTCACTTCACCATGCCTCACTCTCACTCGTCTCCCTCCTCCTAAACTCCTCGAACCACTCGCCTCTTCAACCAATGGCGCTACCGTTTTCATGCCTCT
CCTTCTTTGTTCTCACGCTTTCTTTGCTTTCACCTCCTTCTCTAAGTCGTTGCGAGTTAGAACTTCTTTAAGTAGCAGCGACATCGACGGCTCTGCGGCTTTTGAGAATC
CTGCTTCGGAGTTACTCGACGACGAGCTGATTGTAGTTGTTTCGGGTGCTAAGGATGCTGATGAAGCGCTAGGGATGATCAGTGATAAGTCAGGGAGAAGTGGAGGTACT
GTGTCTGTTTCCGACTGTCGTTTGATTATTTCGGCTGCACTTAAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCCACTTTCTATCAAGCATG
GGAAGGTGTTAATGAAAATGCTTCCATCGTTGAGAGATGGAAATGGTCAAGGCCAGATGTTCATGTATATACATTGCTGATTGAAGGTCTTGCAGCATCCTTGAGGGTAT
CTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTAACACCTGCTGAGGAGGTACCATTTGGAAAAGTGGTGAAGTGTCCCAGTTGTATGGTAGCAGTTGCA
GTTGCACAACCGCAGCATGGTATTCAGATTGTATCCTGTGCAAAGTGCTGCTACAAGTATGAACTTATTTCAGGAAACATAGTTAATATCGAGTCAGAAGAAATTCGCAT
GGATACTCCTGCATGGGAAAAAGCACTTCGGTTCTTGAACATAATGAAGCGAAAAATTCCTGTTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTGGCACGAA
CCCAGAAGTTTGCTACTGAAACAGCAGATCTCCCGGCTCGAGAGGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGAAGTTGGTCCTATTAAA
TTTAGTCCAAAGGATCCCAATTTGTACTCTGGAGAGGCCATGTGCCTGACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGGAAAGGAAAACTCATC
CTTACTTAACCCATCGATCCTCTTTCCACTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTGCTTATAGTTGCTGGAT
TTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAATCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAACTTTTATCT
CAATATAATGTGCTTCAGTCTCGTATCGGGGATTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTGGAGAACAAAATTTTTGCCGTAGG
AGAACCTTCTTACCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATCGAAAGCTATGCAAGGATTTCCT
CGATGATTGAGATCGAAGTTGAAATGGAGTCTGATGTTATTGCAGCTGAAGCAGCTAGCAGTGTGGAAAGAGTGTCTGAACAGATAGAGCAAATCATGGTGCTGGAAAAT
CTAGAAGAGAGATGGAAATTACAAGCAGAAGCCAACGATGAAGCGGAAAGACTTCTCAACCAATCAATGCCAACAGAAAAGGTTTAG
Protein sequenceShow/hide protein sequence
MILNFTSPCLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHAFFAFTSFSKSLRVRTSLSSSDIDGSAAFENPASELLDDELIVVVSGAKDADEALGMISDKSGRSGGT
VSVSDCRLIISAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIEGLAASLRVSDALRMIEIICRVGVTPAEEVPFGKVVKCPSCMVAVA
VAQPQHGIQIVSCAKCCYKYELISGNIVNIESEEIRMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIK
FSPKDPNLYSGEAMCLTNHSDGRESLLLRVPGKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLIVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLS
QYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLEN
LEERWKLQAEANDEAERLLNQSMPTEKV