; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0198521 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0198521
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationCMiso1.1chr07:21700473..21708269
RNA-Seq ExpressionCmc07g0198521
SyntenyCmc07g0198521
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044898.1 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. makuwa]1.1e-29099.44Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLL CSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVS AKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER

XP_004148995.1 uncharacterized protein LOC101209802 [Cucumis sativus]2.1e-30296.38Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILNFTSP LTLTRLPPPKLLEPL SSTNGATVF+PLL CSHA FAFTSFSKS+RVRTSLSGSDIDGSAAFENPASELLDDELI+VVS AKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQ  AWE VNENASIVERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        +PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI MDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVP KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        L+VAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

XP_008451955.1 PREDICTED: uncharacterized protein LOC103493103 [Cucumis melo]0.0e+0099.66Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLL CSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVS AKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

XP_038895166.1 uncharacterized protein LOC120083467 isoform X1 [Benincasa hispida]1.9e-28791.02Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILN TSP+L +TRLPPPKL EPL S+TNGATV +PLL CSHALFAFTSFSKSM+VR SLSGSDIDG+AAFENP S+LL +ELI  VS AKDADEAL MI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ----------VTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALR
         DKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQ          VTAWE VNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALR
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ----------VTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALR

Query:  MIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTP
        MIEIICRVGVSPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEISMDTPAWEKALRFLNIMKRKIP AVHSIVVQTP
Subjt:  MIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTP

Query:  SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAAS
        SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE MCLTNHSDGRESLLLRVPAK  SSLLNPS LFPLIVLSAAGDAAS
Subjt:  SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAAS

Query:  GVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI
        GV+DPSLPQLLLVAG ASLAAGATLNSLILPQ +RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI
Subjt:  GVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI

Query:  KKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        +KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  KKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

XP_038895173.1 uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida]7.1e-29092.59Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILN TSP+L +TRLPPPKL EPL S+TNGATV +PLL CSHALFAFTSFSKSM+VR SLSGSDIDG+AAFENP S+LL +ELI  VS AKDADEAL MI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQVTAWE VNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEISMDTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE MCLTNHSDGRESLLLRVPAK  SSLLNPS LFPLIVLSAAGDAASGV+DPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAG ASLAAGATLNSLILPQ +RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

TrEMBL top hitse value%identityAlignment
A0A0A0KZV4 Uncharacterized protein1.0e-30296.38Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILNFTSP LTLTRLPPPKLLEPL SSTNGATVF+PLL CSHA FAFTSFSKS+RVRTSLSGSDIDGSAAFENPASELLDDELI+VVS AKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQ  AWE VNENASIVERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        +PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI MDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVP KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        L+VAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

A0A1S3BTU3 uncharacterized protein LOC1034931030.0e+0099.66Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLL CSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVS AKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

A0A5A7TPS5 Pentatricopeptide repeat (PPR) superfamily protein isoform 25.3e-29199.44Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLL CSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVS AKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER

A0A6J1ETW5 uncharacterized protein LOC111437671 isoform X12.5e-28088.64Show/hide
Query:  MILNFTSPFLTLTRL-PPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGM
        MIL+ +SP+LT+TRL PPPKL+EPL S++NG +V +PLL CSHALF FTSFSKS RVR SL+ S+IDG+AAFENP SELLDDELI VVS AKDADE L +
Subjt:  MILNFTSPFLTLTRL-PPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGM

Query:  IGDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG
        I DKSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+VTAWE VN+N S VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IGDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEISMDTPAWEKALRFLN+MK+K+P AVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQ
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNL SGE MCLTNHSDGRESLLLRVPAKE S LL PS LFPLI+LS AGD +SGV+DPSLP+
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQ

Query:  LLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
        LLLVAGFASLAAGATLNS ILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
Subjt:  LLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN

Query:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERW+LQAEANDEAERL NQSMPTE+V
Subjt:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

A0A6J1J4R1 uncharacterized protein LOC111483407 isoform X11.2e-27988.1Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MIL+ +SP+LT+TRLP PKL+EPL S++NG +V +PLL CSHA F FTSFS+S RVR SL+ S+IDG+AAFENP S+LLDDELI VVS AKDADE L MI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         +KSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+VTAWE VN+N S VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEISMDTPAWEKALRFLN+MK+K+P AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNL SGE MCLTNHSDGRESLL+RVPAKE S LL PS LFPLI+LS AGDAASGV+DPSLP++
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNS ILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEERW+LQAEANDEAERL NQSMPTE+V
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAEANDEAERLLNQSMPTEKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64430.1 Pentatricopeptide repeat (PPR) superfamily protein1.6e-18666.27Show/hide
Query:  DIDGSAAFENPASELLDDELIIVVSSAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERW
        D  GSAA  + +S +LDDEL+  VS+ +DADEAL MI D+ G + GG V + DCR II+AA+ R N +LALS+FY MR++F         +   S  +RW
Subjt:  DIDGSAAFENPASELLDDELIIVVSSAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERW

Query:  KWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTP
         WSRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL SG+I +I+SEE+  D P
Subjt:  KWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTP

Query:  AWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRV
         WEK LR + I K KI  +VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN   GE M LT H DGRES+LLR 
Subjt:  AWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRV

Query:  PAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEV
        P+K+   +L PS L PL+ + A GDAASGVIDPSLPQLL VA   SLA GAT+NS +LP+ ++LP+R+VD++ IKQQLLSQY+VLQ RI DLK A EKEV
Subjt:  PAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEV

Query:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAE
        WMLARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE+WK+QAE
Subjt:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAE

Query:  ANDEAERLLN
        ANDEAERLL+
Subjt:  ANDEAERLLN

AT1G64430.2 Pentatricopeptide repeat (PPR) superfamily protein1.6e-18666.27Show/hide
Query:  DIDGSAAFENPASELLDDELIIVVSSAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERW
        D  GSAA  + +S +LDDEL+  VS+ +DADEAL MI D+ G + GG V + DCR II+AA+ R N +LALS+FY MR++F         +   S  +RW
Subjt:  DIDGSAAFENPASELLDDELIIVVSSAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERW

Query:  KWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTP
         WSRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL SG+I +I+SEE+  D P
Subjt:  KWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTP

Query:  AWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRV
         WEK LR + I K KI  +VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN   GE M LT H DGRES+LLR 
Subjt:  AWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRV

Query:  PAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEV
        P+K+   +L PS L PL+ + A GDAASGVIDPSLPQLL VA   SLA GAT+NS +LP+ ++LP+R+VD++ IKQQLLSQY+VLQ RI DLK A EKEV
Subjt:  PAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEV

Query:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAE
        WMLARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE+WK+QAE
Subjt:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEERWKLQAE

Query:  ANDEAERLLN
        ANDEAERLL+
Subjt:  ANDEAERLLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTTAACTTCACTTCTCCATTCCTCACTCTCACTCGTCTCCCTCCTCCTAAACTCCTCGAACCACTCCCCTCTTCAACCAATGGCGCTACCGTTTTCGTGCCTCT
CCTTCCTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCTAAGTCGATGCGAGTTAGAACTTCTTTAAGTGGCAGCGACATCGACGGCTCTGCGGCTTTTGAGAATC
CTGCTTCGGAGTTACTCGACGACGAGCTGATAATAGTTGTTTCGAGTGCTAAGGATGCTGATGAAGCGCTAGGGATGATCGGTGATAAGTCTGGGAGAAGTGGAGGTACT
GTGTCTGTTTCCGACTGTCGTTTGATTATTGCGGCTGCACTTAAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCAACTTTTTATCAAGTTAC
AGCATGGGAAGCTGTTAATGAAAATGCTTCCATTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTTCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGA
GGGTATCTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTATCACCTGCTGAGGAGGTGCCATTTGGAAAAGTAGTGAAGTGTCCCAGTTGTATGGTAGCA
GTTGCAGTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCGAAGTGCCGCTACAAGTATGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAAT
TAGCATGGATACTCCTGCATGGGAAAAAGCACTCCGGTTCTTGAATATAATGAAGCGAAAGATTCCTGTTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTGG
CACGAACTCAGAAGTTTGCTACTGAAACGGCAGATCTCCCAGCTCGAGAGGGAGAAAGAGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGAAGTTGGTCCT
ATTAAATTTAGTCCAAAGGATCCCAATTTAAACTCTGGGGAGGCTATGTGCCTGACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGAAAA
CTCATCTTTACTTAACCCATCGATCCTCTTTCCACTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTGCTTTTAGTTG
CTGGATTTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAGTCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAGCTT
TTATCTCAATATAATGTGCTTCAGTCTCGTATTGGGGATTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTGGAGAACAAAATTTTTGC
CGTAGGAGAACCTTCTTACCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGA
TTTCCTCGATGATTGAGATTGAAGTTGAAATGGAGTCTGATGTTATTGCGGCTGAAGCAGCTAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGCGCTG
GAAAATCTAGAAGAGAGATGGAAATTACAAGCAGAAGCCAACGATGAAGCCGAAAGACTTCTCAACCAATCAATGCCAACAGAAAAGGTTTAG
mRNA sequenceShow/hide mRNA sequence
GCACACAACCACTCCGCCTTCGTCTTCTCCTAACGGCCTCTCATTCTTTATCCGCCAAAATCCTTCACTCCAGCAGTCCAGTGAGAGAAACCAAAGCATTCGACAAATTC
TATGCCATTTCTCTACACGGATTCTCCTTCGTTTCTATCCAAATCTCCATTCACAAAATGATTCTTAACTTCACTTCTCCATTCCTCACTCTCACTCGTCTCCCTCCTCC
TAAACTCCTCGAACCACTCCCCTCTTCAACCAATGGCGCTACCGTTTTCGTGCCTCTCCTTCCTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCTAAGTCGATGC
GAGTTAGAACTTCTTTAAGTGGCAGCGACATCGACGGCTCTGCGGCTTTTGAGAATCCTGCTTCGGAGTTACTCGACGACGAGCTGATAATAGTTGTTTCGAGTGCTAAG
GATGCTGATGAAGCGCTAGGGATGATCGGTGATAAGTCTGGGAGAAGTGGAGGTACTGTGTCTGTTTCCGACTGTCGTTTGATTATTGCGGCTGCACTTAAGCGTAACAA
TCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCAACTTTTTATCAAGTTACAGCATGGGAAGCTGTTAATGAAAATGCTTCCATTGTTGAGAGATGGAAATGGT
CAAGGCCAGATGTTCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTATCTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTATCA
CCTGCTGAGGAGGTGCCATTTGGAAAAGTAGTGAAGTGTCCCAGTTGTATGGTAGCAGTTGCAGTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCGAAGTG
CCGCTACAAGTATGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAATTAGCATGGATACTCCTGCATGGGAAAAAGCACTCCGGTTCTTGAATATAATGA
AGCGAAAGATTCCTGTTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTGGCACGAACTCAGAAGTTTGCTACTGAAACGGCAGATCTCCCAGCTCGAGAGGGA
GAAAGAGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGAAGTTGGTCCTATTAAATTTAGTCCAAAGGATCCCAATTTAAACTCTGGGGAGGCTATGTGCCT
GACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGAAAACTCATCTTTACTTAACCCATCGATCCTCTTTCCACTCATAGTTTTGTCTGCCG
CTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTGCTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCT
CAATTCAGTCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTATTGGGGATTTAAAACTAGCTGC
TGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTGGAGAACAAAATTTTTGCCGTAGGAGAACCTTCTTACCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAG
AAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGATTTCCTCGATGATTGAGATTGAAGTTGAAATGGAGTCTGATGTTATTGCGGCT
GAAGCAGCTAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGCGCTGGAAAATCTAGAAGAGAGATGGAAATTACAAGCAGAAGCCAACGATGAAGCCGA
AAGACTTCTCAACCAATCAATGCCAACAGAAAAGGTTTAGACAAGCGTGTTCTCGTGTTTTCAATCCAGGTAACTGTTAGGACCCCAATTCTTCATACATCCATATCTCG
ATTTCCACATCACCTTGTACCTGCACTACCTACCCAGAATTACTATTTGTCAAACAAATCTTGGAGCATCAATAACAAAACCAAATACTTCACAAATATTCTTACTTCTC
AATGAAAATGTCAATCAATGGCTACTTGGTCCATGTACATCAAGTTTTGAAGTAGCATCAGGTCAGTGCTTCGTGATCAGATATCTTGTTACTGAAGATCTCATTTGTTA
AATTGTATTATTGTTACGAAAAAGTACATTGAAATCTAAGTTTCATTTGGAACTTTTCCCTCCTGAATACTGAAATGTGATTTATCACGTAAATACAACACGTTAATGAT
TCTATGCCTTAATAGCCATCTTTGCATATCAATTATGTTCTTAGTCGGATCTGATACTTTCTTCTTATT
Protein sequenceShow/hide protein sequence
MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMIGDKSGRSGGT
VSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVA
VAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGP
IKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQL
LSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMAL
ENLEERWKLQAEANDEAERLLNQSMPTEKV