; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0024946 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0024946
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationchr07:19828871..19836179
RNA-Seq ExpressionIVF0024946
SyntenyIVF0024946
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044898.1 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. makuwa]0.097.77Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
        MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ  AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKI         ARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER

XP_004148995.1 uncharacterized protein LOC101209802 [Cucumis sativus]0.095.5Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
        MILNFTSP LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHA FAFTSFSKS+RVRTSLSGSDIDGSAAFENPASELLDDELI+VVSGAKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
        GDKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQAWE VNENASIVERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV+P
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP

Query:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE
        AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI MDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLL
        TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVP KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLL+
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLL

Query:  VAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENSLK
        VAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKI         ARRSRIKKVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

XP_008451955.1 PREDICTED: uncharacterized protein LOC103493103 [Cucumis melo]0.098.1Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
        MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ  AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKI         ARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

XP_038895173.1 uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida]0.091.03Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
        MILN TSP+L +TRLPPPKL EPL S+TNGATV +PLLLCSHALFAFTSFSKSM+VR SLSGSDIDG+AAFENP S+LL +ELI  VSGAKDADEAL MI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQ  AWE VNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEISMDTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE MCLTNHSDGRESLLLRVPAK  SSLLNPS LFPLIVLSAAGDAASGV+DPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENS
        LLVAG ASLAAGATLNSLILPQ +RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKI         ARRSRI+KVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

XP_038895181.1 uncharacterized protein LOC120083467 isoform X3 [Benincasa hispida]0.091.35Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
        MILN TSP+L +TRLPPPKL EPL S+TNGATV +PLLLCSHALFAFTSFSKSM+VR SLSGSDIDG+AAFENP S+LL +ELI  VSGAKDADEAL MI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
         DKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQAWE VNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP

Query:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE
        AEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEISMDTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLL
        TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE MCLTNHSDGRESLLLRVPAK  SSLLNPS LFPLIVLSAAGDAASGV+DPSLPQLLL
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLL

Query:  VAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENSLK
        VAG ASLAAGATLNSLILPQ +RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKI         ARRSRI+KVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

TrEMBL top hitse value%identityAlignment
A0A0A0KZV4 Uncharacterized protein0.0e+0095.54Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
        MILNFTSP LTLTRLPPPKLLEPL SSTNGATVF+PLLLCSHA FAFTSFSKS+RVRTSLSGSDIDGSAAFENPASELLDDELI+VVSGAKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
        GDKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQAWE VNENASIVERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV+P
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP

Query:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE
        AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI MDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLL
        TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVP KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLL+
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLL

Query:  VAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENSLK
        VAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKI         ARRSRIKKVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKVTVRTPILHTSISRFPHHLVPAL
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERWKLQAEANDEAERLLNQSMPTEKVTVR PILH SISRFPHHLVPAL
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKVTVRTPILHTSISRFPHHLVPAL

Query:  PTQNYYLSNKSWSINNKTKYFTNILTSQ
        PTQNYYLSNKSWSINNKTKYFTNILTSQ
Subjt:  PTQNYYLSNKSWSINNKTKYFTNILTSQ

A0A1S3BTU3 uncharacterized protein LOC1034931036.9e-30598.1Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
        MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ  AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKI         ARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

A0A5A7TPS5 Pentatricopeptide repeat (PPR) superfamily protein isoform 24.8e-28297.77Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
        MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ  AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKI         ARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER

A0A6J1EYW6 uncharacterized protein LOC111437671 isoform X26.9e-27387.39Show/hide
Query:  MILNFTSPFLTLTRL-PPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGM
        MIL+ +SP+LT+TRL PPPKL+EPL S++NG +V +PLLLCSHALF FTSFSKS RVR SL+ S+IDG+AAFENP SELLDDELI VVSGAKDADE L +
Subjt:  MILNFTSPFLTLTRL-PPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGM

Query:  IGDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVS
        I DKSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+AWE VN+N S VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVGVS
Subjt:  IGDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVS

Query:  PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFAT
        PAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEISMDTPAWEKALRFLN+MK+K+P AVHSIVVQTPSGVARTQKFAT
Subjt:  PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFAT

Query:  ETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLL
        ETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNL SGE MCLTNHSDGRESLLLRVPAKE S LL PS LFPLI+LS AGD +SGV+DPSLP+LL
Subjt:  ETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLL

Query:  LVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENSL
        LVAGFASLAAGATLNS ILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKI         ARRSRIKKVREGLENSL
Subjt:  LVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENSL

Query:  KQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        KQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERW+LQAEANDEAERL NQSMPTE+V
Subjt:  KQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

A0A6J1JDG3 uncharacterized protein LOC111483407 isoform X23.4e-27286.85Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI
        MIL+ +SP+LT+TRLP PKL+EPL S++NG +V +PLLLCSHA F FTSFS+S RVR SL+ S+IDG+AAFENP S+LLDDELI VVSGAKDADE L MI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
         +KSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+AWE VN+N S VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVGVSP
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP

Query:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE
        AEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEISMDTPAWEKALRFLN+MK+K+P AVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLL
        TADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNL SGE MCLTNHSDGRESLL+RVPAKE S LL PS LFPLI+LS AGDAASGV+DPSLP++LL
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLL

Query:  VAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENSLK
        VAGFASLAAGATLNS ILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKI         ARRSRIKKVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILP-------ARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERW+LQAEANDEAERL NQSMPTE+V
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64430.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-18065.75Show/hide
Query:  DIDGSAAFENPASELLDDELIIVVSGAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKW
        D  GSAA  + +S +LDDEL+  VS  +DADEAL MI D+ G + GG V + DCR II+AA+ R N +LALS+FY MR++F       +   S  +RW W
Subjt:  DIDGSAAFENPASELLDDELIIVVSGAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKW

Query:  SRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAW
        SRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL SG+I +I+SEE+  D P W
Subjt:  SRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAW

Query:  EKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPA
        EK LR + I K KI  +VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN   GE M LT H DGRES+LLR P+
Subjt:  EKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPA

Query:  KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWM
        K+   +L PS L PL+ + A GDAASGVIDPSLPQLL VA   SLA GAT+NS +LP+ ++LP+R+VD++ IKQQLLSQY+VLQ RI DLK A EKEVWM
Subjt:  KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWM

Query:  LARMCQLENKIL----PA---RRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEAN
        LARMCQLENKIL    PA   RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE+WK+QAEAN
Subjt:  LARMCQLENKIL----PA---RRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEAN

Query:  DEAERLLN
        DEAERLL+
Subjt:  DEAERLLN

AT1G64430.2 Pentatricopeptide repeat (PPR) superfamily protein1.8e-18065.75Show/hide
Query:  DIDGSAAFENPASELLDDELIIVVSGAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKW
        D  GSAA  + +S +LDDEL+  VS  +DADEAL MI D+ G + GG V + DCR II+AA+ R N +LALS+FY MR++F       +   S  +RW W
Subjt:  DIDGSAAFENPASELLDDELIIVVSGAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKW

Query:  SRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAW
        SRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL SG+I +I+SEE+  D P W
Subjt:  SRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAW

Query:  EKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPA
        EK LR + I K KI  +VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN   GE M LT H DGRES+LLR P+
Subjt:  EKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPA

Query:  KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWM
        K+   +L PS L PL+ + A GDAASGVIDPSLPQLL VA   SLA GAT+NS +LP+ ++LP+R+VD++ IKQQLLSQY+VLQ RI DLK A EKEVWM
Subjt:  KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWM

Query:  LARMCQLENKIL----PA---RRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEAN
        LARMCQLENKIL    PA   RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE+WK+QAEAN
Subjt:  LARMCQLENKIL----PA---RRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEAN

Query:  DEAERLLN
        DEAERLL+
Subjt:  DEAERLLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTTAACTTCACTTCTCCATTCCTCACTCTCACTCGTCTCCCTCCTCCTAAACTCCTCGAACCACTCCCCTCTTCAACCAATGGCGCTACCGTTTTCGTGCCTCT
CCTTCTTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCTAAGTCGATGCGAGTTAGAACTTCTTTAAGTGGCAGCGACATCGACGGCTCTGCGGCTTTTGAGAATC
CTGCTTCGGAGTTACTCGACGACGAGCTGATAATAGTTGTTTCGGGTGCTAAGGATGCTGATGAAGCGCTAGGGATGATCGGTGATAAGTCTGGGAGAAGTGGAGGTACT
GTGTCTGTTTCCGACTGTCGTTTGATTATTGCGGCTGCACTTAAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCCACTTTCTATCAAGCATG
GGAAGCTGTTAATGAAAATGCTTCCATTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTTCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTAT
CTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTATCACCTGCTGAGGAGGTGCCATTTGGAAAAGTAGTGAAGTGTCCCAGTTGTATGGTAGCAGTTGCA
GTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCGAAGTGCCGCTACAAGTATGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAATTAGCAT
GGATACTCCTGCATGGGAAAAAGCACTCCGGTTCTTGAATATAATGAAGCGAAAGATTCCTGTTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTGGCACGAA
CTCAGAAGTTTGCTACTGAAACGGCAGATCTCCCAGCTCGAGAGGGAGAAAGAGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGAAGTTGGTCCTATTAAA
TTTAGTCCAAAGGATCCCAATTTAAACTCTGGGGAGGCTATGTGCCTGACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGAAAACTCATC
TTTACTTAACCCATCGATCCTCTTTCCACTCATAGTTTTATCTGCCGCTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTGCTTTTAGTTGCTGGAT
TTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAGTCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAGCTTTTATCT
CAATATAATGTGCTTCAGTCTCGTATTGGGGATTTAAAACTTGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTGGAGAACAAAATTTTGCCCGCACG
TAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGATTTCCTCGATGATTGAGATTGAAGTTG
AAATGGAGTCTGATGTTATTGCGGCTGAAGCAGCTAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGCGCTGGAAAATCTAGAAGAGAGATGGAAATTA
CAAGCAGAAGCCAACGATGAAGCCGAAAGACTTCTCAACCAATCAATGCCAACAGAAAAGGTAACTGTTAGGACCCCAATTCTTCATACATCCATATCTCGATTTCCACA
TCACCTTGTACCCGCACTACCTACCCAGAATTACTATTTGTCAAACAAATCTTGGAGCATCAATAACAAAACCAAATACTTCACAAATATTCTTACTTCTCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGATTCTTAACTTCACTTCTCCATTCCTCACTCTCACTCGTCTCCCTCCTCCTAAACTCCTCGAACCACTCCCCTCTTCAACCAATGGCGCTACCGTTTTCGTGCCTCT
CCTTCTTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCTAAGTCGATGCGAGTTAGAACTTCTTTAAGTGGCAGCGACATCGACGGCTCTGCGGCTTTTGAGAATC
CTGCTTCGGAGTTACTCGACGACGAGCTGATAATAGTTGTTTCGGGTGCTAAGGATGCTGATGAAGCGCTAGGGATGATCGGTGATAAGTCTGGGAGAAGTGGAGGTACT
GTGTCTGTTTCCGACTGTCGTTTGATTATTGCGGCTGCACTTAAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCCACTTTCTATCAAGCATG
GGAAGCTGTTAATGAAAATGCTTCCATTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTTCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTAT
CTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTATCACCTGCTGAGGAGGTGCCATTTGGAAAAGTAGTGAAGTGTCCCAGTTGTATGGTAGCAGTTGCA
GTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCGAAGTGCCGCTACAAGTATGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAATTAGCAT
GGATACTCCTGCATGGGAAAAAGCACTCCGGTTCTTGAATATAATGAAGCGAAAGATTCCTGTTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTGGCACGAA
CTCAGAAGTTTGCTACTGAAACGGCAGATCTCCCAGCTCGAGAGGGAGAAAGAGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGAAGTTGGTCCTATTAAA
TTTAGTCCAAAGGATCCCAATTTAAACTCTGGGGAGGCTATGTGCCTGACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGAAAACTCATC
TTTACTTAACCCATCGATCCTCTTTCCACTCATAGTTTTATCTGCCGCTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTGCTTTTAGTTGCTGGAT
TTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAGTCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAGCTTTTATCT
CAATATAATGTGCTTCAGTCTCGTATTGGGGATTTAAAACTTGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTGGAGAACAAAATTTTGCCCGCACG
TAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGATTTCCTCGATGATTGAGATTGAAGTTG
AAATGGAGTCTGATGTTATTGCGGCTGAAGCAGCTAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGCGCTGGAAAATCTAGAAGAGAGATGGAAATTA
CAAGCAGAAGCCAACGATGAAGCCGAAAGACTTCTCAACCAATCAATGCCAACAGAAAAGGTAACTGTTAGGACCCCAATTCTTCATACATCCATATCTCGATTTCCACA
TCACCTTGTACCCGCACTACCTACCCAGAATTACTATTTGTCAAACAAATCTTGGAGCATCAATAACAAAACCAAATACTTCACAAATATTCTTACTTCTCAATGA
Protein sequenceShow/hide protein sequence
MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLLCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSGAKDADEALGMIGDKSGRSGGT
VSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVA
VAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIK
FSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLS
QYNVLQSRIGDLKLAAEKEVWMLARMCQLENKILPARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKL
QAEANDEAERLLNQSMPTEKVTVRTPILHTSISRFPHHLVPALPTQNYYLSNKSWSINNKTKYFTNILTSQ