; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0003454 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0003454
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationchr07:5433501..5440835
RNA-Seq ExpressionPI0003454
SyntenyPI0003454
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044898.1 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. makuwa]1.6e-28296.85Show/hide
Query:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI
        MILNFTSP+LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHALFAFTSFSKSMRVR SLSGSDIDG+AAFENP SELLDDELI VVSGAKDADEALGMI
Subjt:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ  AWE VNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEIS DTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFP+IVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIG+LKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER

XP_004148995.1 uncharacterized protein LOC101209802 [Cucumis sativus]6.5e-29296.75Show/hide
Query:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI
        MILNFTSP LTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHA FAFTSFSKS+RVR SLSGSDIDG+AAFENP SELLDDELI VVSGAKDADEALGMI
Subjt:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
        GDKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV+P
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP

Query:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE
        AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI  DTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLL
        TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVP KENSSLLNPSILFP+IVLSAAGDAASGVIDPSLPQLL+
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLL

Query:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
        VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIG+LKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE

XP_008451955.1 PREDICTED: uncharacterized protein LOC103493103 [Cucumis melo]5.5e-29196.94Show/hide
Query:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI
        MILNFTSP+LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHALFAFTSFSKSMRVR SLSGSDIDG+AAFENP SELLDDELI VVSGAKDADEALGMI
Subjt:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ  AWE VNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEIS DTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFP+IVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIG+LKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE

XP_038895173.1 uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida]8.8e-28193.51Show/hide
Query:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI
        MILN TSPWL +TRLPPPKL EPLAS+TNGATV MPLLLCSHALFAFTSFSKSM+VRASLSGSDIDGAAAFENPVS+LL +ELIR VSGAKDADEAL MI
Subjt:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQ  AWEGVNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEIS DTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGE MCLTNHSDGRESLLLRVPAK  SSLLNPS LFP+IVLSAAGDAASGV+DPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRI +LKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE

XP_038895181.1 uncharacterized protein LOC120083467 isoform X3 [Benincasa hispida]2.7e-28293.85Show/hide
Query:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI
        MILN TSPWL +TRLPPPKL EPLAS+TNGATV MPLLLCSHALFAFTSFSKSM+VRASLSGSDIDGAAAFENPVS+LL +ELIR VSGAKDADEAL MI
Subjt:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
         DKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQAWEGVNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP

Query:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE
        AEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEIS DTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLL
        TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGE MCLTNHSDGRESLLLRVPAK  SSLLNPS LFP+IVLSAAGDAASGV+DPSLPQLLL
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLL

Query:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
        VAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRI +LKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE

TrEMBL top hitse value%identityAlignment
A0A0A0KZV4 Uncharacterized protein3.1e-29296.75Show/hide
Query:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI
        MILNFTSP LTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHA FAFTSFSKS+RVR SLSGSDIDG+AAFENP SELLDDELI VVSGAKDADEALGMI
Subjt:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
        GDKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV+P
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP

Query:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE
        AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI  DTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLL
        TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVP KENSSLLNPSILFP+IVLSAAGDAASGVIDPSLPQLL+
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLL

Query:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
        VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIG+LKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE

A0A1S3BTU3 uncharacterized protein LOC1034931032.6e-29196.94Show/hide
Query:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI
        MILNFTSP+LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHALFAFTSFSKSMRVR SLSGSDIDG+AAFENP SELLDDELI VVSGAKDADEALGMI
Subjt:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ  AWE VNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEIS DTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFP+IVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIG+LKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE

A0A5A7TPS5 Pentatricopeptide repeat (PPR) superfamily protein isoform 27.7e-28396.85Show/hide
Query:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI
        MILNFTSP+LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHALFAFTSFSKSMRVR SLSGSDIDG+AAFENP SELLDDELI VVSGAKDADEALGMI
Subjt:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ  AWE VNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEIS DTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFP+IVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIG+LKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER

A0A6J1EYW6 uncharacterized protein LOC111437671 isoform X22.9e-27490.43Show/hide
Query:  MILNFTSPWLTLTRL-PPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGM
        MIL+ +SPWLT+TRL PPPKL+EPLAS++NG +V MPLLLCSHALF FTSFSKS RVRASL+ S+IDGAAAFENPVSELLDDELI VVSGAKDADE L +
Subjt:  MILNFTSPWLTLTRL-PPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGM

Query:  IGDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVS
        I DKSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+AWEGVN+N S VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVGVS
Subjt:  IGDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVS

Query:  PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFAT
        PAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEIS DTPAWEKALRFLN+MK+K+PAAVHSIVVQTPSGVARTQKFAT
Subjt:  PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFAT

Query:  ETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLL
        ETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGE MCLTNHSDGRESLLLRVPAKE S LL PS LFP+I+LS AGD +SGV+DPSLP+LL
Subjt:  ETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLL

Query:  LVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSL
        LVAGFASLAAGATLNS ILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRI +LKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSL
Subjt:  LVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSL

Query:  KQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
        KQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
Subjt:  KQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE

A0A6J1JDG3 uncharacterized protein LOC111483407 isoform X24.2e-27389.87Show/hide
Query:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI
        MIL+ +SPWLT+TRLP PKL+EPLAS++NG +V MPLLLCSHA F FTSFS+S RVRASL+ S+IDGAAAFENPVS+LLDDELI VVSGAKDADE L MI
Subjt:  MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
         +KSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+AWEGVN+N S VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVGVSP
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP

Query:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE
        AEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEIS DTPAWEKALRFLN+MK+K+PAAVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLL
        TADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNLYSGE MCLTNHSDGRESLL+RVPAKE S LL PS LFP+I+LS AGDAASGV+DPSLP++LL
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLL

Query:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
        VAGFASLAAGATLNS ILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRI +LKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64430.1 Pentatricopeptide repeat (PPR) superfamily protein3.2e-18065.64Show/hide
Query:  DGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSR
        D   +  +  S +LDDEL+  VS  +DADEAL MI D+ G + GG V + DCR II+AA+ R N +LALS+FY MR++F       +   S  +RW WSR
Subjt:  DGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSR

Query:  PDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEK
        PDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL SG+I +I+SEE+ KD P WEK
Subjt:  PDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEK

Query:  ALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKE
         LR + I K KI ++VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN Y GE M LT H DGRES+LLR P+K+
Subjt:  ALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKE

Query:  NSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLA
           +L PS L P++ + A GDAASGVIDPSLPQLL VA   SLA GAT+NS +LP+ N+LP+R+VD++ IKQQLLSQY+VLQ RI +LK A EKEVWMLA
Subjt:  NSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLA

Query:  RMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
        RMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE
Subjt:  RMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE

AT1G64430.2 Pentatricopeptide repeat (PPR) superfamily protein3.2e-18065.64Show/hide
Query:  DGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSR
        D   +  +  S +LDDEL+  VS  +DADEAL MI D+ G + GG V + DCR II+AA+ R N +LALS+FY MR++F       +   S  +RW WSR
Subjt:  DGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSR

Query:  PDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEK
        PDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL SG+I +I+SEE+ KD P WEK
Subjt:  PDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEK

Query:  ALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKE
         LR + I K KI ++VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN Y GE M LT H DGRES+LLR P+K+
Subjt:  ALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKE

Query:  NSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLA
           +L PS L P++ + A GDAASGVIDPSLPQLL VA   SLA GAT+NS +LP+ N+LP+R+VD++ IKQQLLSQY+VLQ RI +LK A EKEVWMLA
Subjt:  NSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIGNLKLAAEKEVWMLA

Query:  RMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE
        RMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE
Subjt:  RMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLENLEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTGAACTTCACTTCACCATGGCTCACTCTCACTCGTCTCCCTCCTCCTAAACTCCTCGAACCACTCGCCTCTTCAACCAATGGCGCTACCGTTTTCATGCCTCT
CCTTCTTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCTAAGTCGATGCGAGTTAGAGCTTCTTTAAGTGGCAGCGACATCGATGGCGCTGCGGCTTTTGAGAATC
CTGTTTCGGAGTTACTCGACGACGAGCTGATTAGAGTTGTTTCGGGTGCTAAGGATGCTGATGAAGCGCTAGGGATGATCGGTGATAAGTCAGGGAGAAGTGGTGGTACT
GTGTCTGTTTCGGACTGTCGTTTGATTATTGCGGCTGCACTTAAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCCACTTTCTATCAAGCTTG
GGAAGGTGTTAATGAAAATGCTTCCATTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTTCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTAT
CTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTATCACCTGCTGAGGAGGTACCGTTTGGAAAGGTAGTGAAGTGTCCCAGTTGTATGGTAGCAGTTGCA
GTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCAAAGTGCCGCTACAAGTATGAACTGATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAATTAGCAA
GGATACTCCTGCATGGGAAAAAGCACTCCGATTCTTGAATATAATGAAGCGAAAAATTCCAGCTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTGGCACGAA
CCCAGAAGTTTGCTACTGAAACAGCAGATCTCCCAGCTCGAGAGGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGAAGTTGGTCCTATTAAA
TTTAGTCCAAAGGATCCCAATTTGTACTCTGGGGAGGCTATGTGCCTGACAAATCATTCAGATGGACGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGAAAATTCATC
CTTACTTAACCCATCGATCCTCTTTCCAGTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTGCTTTTAGTTGCTGGAT
TTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAACCGGCTTCCTCAACGATCAGTTGATATTATTGCTATCAAGCAGCAGCTTTTATCT
CAATATAATGTGCTTCAGTCTCGTATCGGGAATTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTAGAGAACAAAATTTTTGCCGTAGG
GGAACCTTCTTATCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATCGAAAGCTATGCAAGGATTTCCT
CGATGATTGAGATCGAAGTTGAAATGGAGTCTGATGTTATTGCAGCTGAAGCAGCTAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGTGCTGGAAAAT
CTAGAAGAGGCATGTCTTTATCACTGTGAACTAATTTTTTGTCTTCATGCCTTACTATGGATGTTGATGTCTAATTTTTTGTTCACGTTAACTGTTGGAGATCATGACTT
TCTCTTACCTCATCCATACTACCATCGGTCTCTGAGTATTTTACTCGTCAACATTTTGAGAAACTTGCTCTGGTTTACTATGGTTCATTCACTATTACTGCCAGGGCTCT
CCGTAGCATACCCACTGGACTTATTTTTAGTCACTTCCATCCATACGGTACTACACGAAGCACTGGACTTACCTTTTTGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAAAAAAAAAAAAAAGAACAAAAACTCGCGGAAGAGAAAGAAACGAAAGCATTCGACAAATTCTATGCCATTTCTCTACACAAATTCTCCTTCGTTTCAATCCA
AATCTCCATTGACACAATGATTCTGAACTTCACTTCACCATGGCTCACTCTCACTCGTCTCCCTCCTCCTAAACTCCTCGAACCACTCGCCTCTTCAACCAATGGCGCTA
CCGTTTTCATGCCTCTCCTTCTTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCTAAGTCGATGCGAGTTAGAGCTTCTTTAAGTGGCAGCGACATCGATGGCGCT
GCGGCTTTTGAGAATCCTGTTTCGGAGTTACTCGACGACGAGCTGATTAGAGTTGTTTCGGGTGCTAAGGATGCTGATGAAGCGCTAGGGATGATCGGTGATAAGTCAGG
GAGAAGTGGTGGTACTGTGTCTGTTTCGGACTGTCGTTTGATTATTGCGGCTGCACTTAAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCCA
CTTTCTATCAAGCTTGGGAAGGTGTTAATGAAAATGCTTCCATTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTTCATGTATATACATTGCTGATTCAAGGTCTTGCA
GCATCCTTGAGGGTATCTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTATCACCTGCTGAGGAGGTACCGTTTGGAAAGGTAGTGAAGTGTCCCAGTTG
TATGGTAGCAGTTGCAGTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCAAAGTGCCGCTACAAGTATGAACTGATTTCAGGAAACATAGTTAATATTGAGT
CAGAAGAAATTAGCAAGGATACTCCTGCATGGGAAAAAGCACTCCGATTCTTGAATATAATGAAGCGAAAAATTCCAGCTGCTGTGCACTCCATTGTGGTACAAACTCCT
TCTGGAGTGGCACGAACCCAGAAGTTTGCTACTGAAACAGCAGATCTCCCAGCTCGAGAGGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGA
AGTTGGTCCTATTAAATTTAGTCCAAAGGATCCCAATTTGTACTCTGGGGAGGCTATGTGCCTGACAAATCATTCAGATGGACGGGAGTCACTATTATTAAGAGTGCCAG
CAAAGGAAAATTCATCCTTACTTAACCCATCGATCCTCTTTCCAGTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTG
CTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAACCGGCTTCCTCAACGATCAGTTGATATTATTGCTATCAA
GCAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTATCGGGAATTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTAGAGAACA
AAATTTTTGCCGTAGGGGAACCTTCTTATCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATCGAAAGC
TATGCAAGGATTTCCTCGATGATTGAGATCGAAGTTGAAATGGAGTCTGATGTTATTGCAGCTGAAGCAGCTAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAAT
CATGGTGCTGGAAAATCTAGAAGAGGCATGTCTTTATCACTGTGAACTAATTTTTTGTCTTCATGCCTTACTATGGATGTTGATGTCTAATTTTTTGTTCACGTTAACTG
TTGGAGATCATGACTTTCTCTTACCTCATCCATACTACCATCGGTCTCTGAGTATTTTACTCGTCAACATTTTGAGAAACTTGCTCTGGTTTACTATGGTTCATTCACTA
TTACTGCCAGGGCTCTCCGTAGCATACCCACTGGACTTATTTTTAGTCACTTCCATCCATACGGTACTACACGAAGCACTGGACTTACCTTTTTGATCTCTCGTTCCTCA
TGAACTACCGTTTCATGAAGCAGTGTGAGAATACCAATAAACTGCTAAAATATTTTCTTGTTTTCACCTTGAGAATAGGTGAAACTTTGGGTGGAGGTAATGAAAGTTAT
ATTTTGCAAGTTACATGTACGAGGAGATGGTGGAAACATTGACAATTTCTGAGGCGGTTTGAGGGAGGGGGTATTTCGGTTTTTGCGTGCATGAAATGTGGAAATGTATC
AGAGGGAGGATGGTGCCGAGAATGTCGGAAAAATTAGTTATGTAGGGAGTTAGGAGAGAGTGAAAATATTATATTGAGAAGCTTTTGTTCAGGAGGGGTGTAAGTTCTTG
CATTATCCTACTTAGCGACTTTCTGTCGAGTTTTTTCTGATTTCATTCAACATTTACATGATAGAAATCCATTCTAATTCCATCAAGTCAGTCAAGCTGCCATGGGCCTA
GCCATAGAGATGGACCATATAGTGGTGCCTTCTACCACCACCACACTTAGAGCCAGCCTTACAAACAGCCGTGGTTATATGTCCCTTCGTCGTATTCAAGTCCTTGACTA
GCAACAATGAACTTTTCAAACTCAATTAAGCTGGTATATCGGAACTCAGAAGAATTTCCGTGAGAGTGCACCGAAGGGTCTTCAAGCTAAGGTCTCAATCACCTATATAC
ATTTGAATGAAATTACTACCCATATGTTGCCTTCCAAACGAATTTAGCGCATGATAGTTACTATCAAAGATTGAACCTACTATCAGCTGATGCTGGGTGTGGGTGTGACA
CAGAGCTATTTTTAAGTCAGATTGCATTGATCTTCAATTACATGAAAAATATATGACTTTTGTGTCATCAAAGTAGAGTTGTCTGCTGAGTGTTTAGTTGTTCACAGAGT
TAGTTCGGCTACTTTTTTTAATATATGATTCGCATCGGTATTTTTACTGAACAATAAAATCCTCACTCAACACCCTTAAATTCAACGAAATACATGGCTTTGGTTTCTGT
TGTCTTCTTAGTTCCTCATATTTTCATCTGAATTAGAATGGTTTCCTGATCTTTGGAGCCAATCAGCAGACACTATTTCCATCTTTGGCTAATTAATTATCTTTTTCTTC
TAACCTTTTATGTCAATGCCAATTCAGAGATGGAAATTACAAGCAGAAGCCAATGATGAAGCCGAACGACTTCTCAACCAATCAATGCCAACAGAAAAGGTTTAAGGGTG
TTCTCGTGTTTTCAGTCCAGGTAACTGTTAGGCCCCAATTCTTCATACATCCATATCTCGATTTCCACATCACCTTGTACACGCACTACCTTCCCAGAATTACTATTTGT
CAAACAAATCTTGGAGCATCAATAACAAAACCAAATACTTCGTAAAGATTCTTACTTCTCAACGAAAATGCCAACCAATGTCTACTTGGTCCATGTACATGAAGTTTTGA
AGCAGCATCAGGTCAGTGCTTCGTGATCAGATATCTTATTACTGAAGATCTCATTTGTTAAATTGTATTATTGTTACGAAAAAACATTGAAATCTAAGTTTCATTTGGAA
CTTCTCCCTCCTGAGTACTGAAATGTGATTTATCACGTAAATATAACACGTTAATGATTCTATGAATTAATCGCCATCTTTGCATTTCA
Protein sequenceShow/hide protein sequence
MILNFTSPWLTLTRLPPPKLLEPLASSTNGATVFMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDDELIRVVSGAKDADEALGMIGDKSGRSGGT
VSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVA
VAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISKDTPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIK
FSPKDPNLYSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPVIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLS
QYNVLQSRIGNLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMVLEN
LEEACLYHCELIFCLHALLWMLMSNFLFTLTVGDHDFLLPHPYYHRSLSILLVNILRNLLWFTMVHSLLLPGLSVAYPLDLFLVTSIHTVLHEALDLPF