; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0012433 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0012433
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationchr07:24412444..24420177
RNA-Seq ExpressionPay0012433
SyntenyPay0012433
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044898.1 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. makuwa]1.1e-29099.44Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLL CSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVS AKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER

XP_004148995.1 uncharacterized protein LOC101209802 [Cucumis sativus]1.8e-28896.04Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILNFTSP LTLTRLPPPKLLEPL SSTNGATVF+PLL CSHA FAFTSFSKS+RVRTSLSGSDIDGSAAFENPASELLDDELI+VVS AKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQ  AWE VNENASIVERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        +PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI MDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVP KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        L+VAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE+
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK

XP_008451955.1 PREDICTED: uncharacterized protein LOC103493103 [Cucumis melo]1.3e-29999.46Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLL CSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVS AKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE+
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK

XP_038895166.1 uncharacterized protein LOC120083467 isoform X1 [Benincasa hispida]1.6e-27390.46Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILN TSP+L +TRLPPPKL EPL S+TNGATV +PLL CSHALFAFTSFSKSM+VR SLSGSDIDG+AAFENP S+LL +ELI  VS AKDADEAL MI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ----------VTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALR
         DKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQ          VTAWE VNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALR
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ----------VTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALR

Query:  MIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTP
        MIEIICRVGVSPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEISMDTPAWEKALRFLNIMKRKIP AVHSIVVQTP
Subjt:  MIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTP

Query:  SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAAS
        SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE MCLTNHSDGRESLLLRVPAK  SSLLNPS LFPLIVLSAAGDAAS
Subjt:  SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAAS

Query:  GVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI
        GV+DPSLPQLLLVAG ASLAAGATLNSLILPQ +RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI
Subjt:  GVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI

Query:  KKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK
        +KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE+
Subjt:  KKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK

XP_038895173.1 uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida]5.8e-27692.09Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILN TSP+L +TRLPPPKL EPL S+TNGATV +PLL CSHALFAFTSFSKSM+VR SLSGSDIDG+AAFENP S+LL +ELI  VS AKDADEAL MI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQVTAWE VNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEISMDTPAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE MCLTNHSDGRESLLLRVPAK  SSLLNPS LFPLIVLSAAGDAASGV+DPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAG ASLAAGATLNSLILPQ +RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE+
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK

TrEMBL top hitse value%identityAlignment
A0A0A0KZV4 Uncharacterized protein8.5e-28996.04Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILNFTSP LTLTRLPPPKLLEPL SSTNGATVF+PLL CSHA FAFTSFSKS+RVRTSLSGSDIDGSAAFENPASELLDDELI+VVS AKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQ  AWE VNENASIVERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        +PAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI MDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNL SGEAMCLTNHSDGRESLLLRVP KENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        L+VAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE+
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK

A0A1S3BTU3 uncharacterized protein LOC1034931036.3e-30099.46Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLL CSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVS AKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE+
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK

A0A5A7TPS5 Pentatricopeptide repeat (PPR) superfamily protein isoform 25.3e-29199.44Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLL CSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVS AKDADEALGMI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVER

A0A6J1ETW5 uncharacterized protein LOC111437671 isoform X11.1e-26788.51Show/hide
Query:  MILNFTSPFLTLTRL-PPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGM
        MIL+ +SP+LT+TRL PPPKL+EPL S++NG +V +PLL CSHALF FTSFSKS RVR SL+ S+IDG+AAFENP SELLDDELI VVS AKDADE L +
Subjt:  MILNFTSPFLTLTRL-PPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGM

Query:  IGDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG
        I DKSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+VTAWE VN+N S VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IGDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEISMDTPAWEKALRFLN+MK+K+P AVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQ
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNL SGE MCLTNHSDGRESLLLRVPAKE S LL PS LFPLI+LS AGD +SGV+DPSLP+
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQ

Query:  LLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
        LLLVAGFASLAAGATLNS ILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
Subjt:  LLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN

Query:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK
        SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE+
Subjt:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK

A0A6J1J4R1 uncharacterized protein LOC111483407 isoform X15.3e-26787.95Show/hide
Query:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI
        MIL+ +SP+LT+TRLP PKL+EPL S++NG +V +PLL CSHA F FTSFS+S RVR SL+ S+IDG+AAFENP S+LLDDELI VVS AKDADE L MI
Subjt:  MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMI

Query:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         +KSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+VTAWE VN+N S VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVGV
Subjt:  GDKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEISMDTPAWEKALRFLN+MK+K+P AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL
        TETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPNL SGE MCLTNHSDGRESLL+RVPAKE S LL PS LFPLI+LS AGDAASGV+DPSLP++
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQL

Query:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNS ILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE+
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64430.1 Pentatricopeptide repeat (PPR) superfamily protein1.2e-17865.79Show/hide
Query:  DIDGSAAFENPASELLDDELIIVVSSAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERW
        D  GSAA  + +S +LDDEL+  VS+ +DADEAL MI D+ G + GG V + DCR II+AA+ R N +LALS+FY MR++F         +   S  +RW
Subjt:  DIDGSAAFENPASELLDDELIIVVSSAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERW

Query:  KWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTP
         WSRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL SG+I +I+SEE+  D P
Subjt:  KWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTP

Query:  AWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRV
         WEK LR + I K KI  +VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN   GE M LT H DGRES+LLR 
Subjt:  AWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRV

Query:  PAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEV
        P+K+   +L PS L PL+ + A GDAASGVIDPSLPQLL VA   SLA GAT+NS +LP+ ++LP+R+VD++ IKQQLLSQY+VLQ RI DLK A EKEV
Subjt:  PAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEV

Query:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK
        WMLARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEEK
Subjt:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK

AT1G64430.2 Pentatricopeptide repeat (PPR) superfamily protein1.2e-17865.79Show/hide
Query:  DIDGSAAFENPASELLDDELIIVVSSAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERW
        D  GSAA  + +S +LDDEL+  VS+ +DADEAL MI D+ G + GG V + DCR II+AA+ R N +LALS+FY MR++F         +   S  +RW
Subjt:  DIDGSAAFENPASELLDDELIIVVSSAKDADEALGMIGDKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERW

Query:  KWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTP
         WSRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL SG+I +I+SEE+  D P
Subjt:  KWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTP

Query:  AWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRV
         WEK LR + I K KI  +VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PN   GE M LT H DGRES+LLR 
Subjt:  AWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNLNSGEAMCLTNHSDGRESLLLRV

Query:  PAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEV
        P+K+   +L PS L PL+ + A GDAASGVIDPSLPQLL VA   SLA GAT+NS +LP+ ++LP+R+VD++ IKQQLLSQY+VLQ RI DLK A EKEV
Subjt:  PAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQLLSQYNVLQSRIGDLKLAAEKEV

Query:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK
        WMLARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEEK
Subjt:  WMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTTAACTTCACTTCTCCATTCCTCACTCTCACTCGTCTCCCTCCTCCTAAACTCCTCGAACCACTCCCCTCTTCAACCAATGGCGCTACCGTTTTCGTGCCTCT
CCTTCCTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCTAAGTCGATGCGAGTTAGAACTTCTTTAAGTGGCAGCGACATCGACGGCTCTGCGGCTTTTGAGAATC
CTGCTTCGGAGTTACTCGACGACGAGCTGATAATAGTTGTTTCGAGTGCTAAGGATGCTGATGAAGCGCTAGGGATGATCGGTGATAAGTCTGGGAGAAGTGGAGGTACT
GTGTCTGTTTCCGACTGTCGTTTGATTATTGCGGCTGCACTTAAGCGTAACAATCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCAACTTTTTATCAAGTTAC
AGCATGGGAAGCTGTTAATGAAAATGCTTCCATTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTTCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGA
GGGTATCTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTATCACCTGCTGAGGAGGTGCCATTTGGAAAAGTAGTGAAGTGTCCCAGTTGTATGGTAGCA
GTTGCAGTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCGAAGTGCCGCTACAAGTATGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAAT
TAGCATGGATACTCCTGCATGGGAAAAAGCACTCCGGTTCTTGAATATAATGAAGCGAAAGATTCCTGTTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTGG
CACGAACTCAGAAGTTTGCTACTGAAACGGCAGATCTCCCAGCTCGAGAGGGAGAAAGAGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGAAGTTGGTCCT
ATTAAATTTAGTCCAAAGGATCCCAATTTAAACTCTGGGGAGGCTATGTGCCTGACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGAAAA
CTCATCTTTACTTAACCCATCGATCCTCTTTCCACTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTGCTTTTAGTTG
CTGGATTTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAGTCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAGCTT
TTATCTCAATATAATGTGCTTCAGTCTCGTATTGGGGATTTAAAACTAGCTGCTGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTGGAGAACAAAATTTTTGC
CGTAGGAGAACCTTCTTACCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGA
TTTCCTCGATGATTGAGATTGAAGTTGAAATGGAGTCTGATGTTATTGCGGCTGAAGCAGCTAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGCGCTG
GAAAATCTAGAAGAGAAATCCATTCTAATTCCATCAAGTCAGTCAAGCTGCCATGTGCTTAGCCATAGAGATGGACGTGGTGCCCTCTGCCACCACCACACTTAG
mRNA sequenceShow/hide mRNA sequence
GCACACAACCACTCCGCCTTCGTCTTCTCCTAACGGCCTCTCATTCTTTATCCGCCAAAATCCTTCACTCCAGCAGTCCAGTGAGAGAAACCAAAGCATTCGACAAATTC
TATGCCATTTCTCTACACGGATTCTCCTTCGTTTCTATCCAAATCTCCATTCACAAAATGATTCTTAACTTCACTTCTCCATTCCTCACTCTCACTCGTCTCCCTCCTCC
TAAACTCCTCGAACCACTCCCCTCTTCAACCAATGGCGCTACCGTTTTCGTGCCTCTCCTTCCTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCTAAGTCGATGC
GAGTTAGAACTTCTTTAAGTGGCAGCGACATCGACGGCTCTGCGGCTTTTGAGAATCCTGCTTCGGAGTTACTCGACGACGAGCTGATAATAGTTGTTTCGAGTGCTAAG
GATGCTGATGAAGCGCTAGGGATGATCGGTGATAAGTCTGGGAGAAGTGGAGGTACTGTGTCTGTTTCCGACTGTCGTTTGATTATTGCGGCTGCACTTAAGCGTAACAA
TCCCGAGCTTGCTTTGTCTGTGTTCTACGCAATGCGTTCAACTTTTTATCAAGTTACAGCATGGGAAGCTGTTAATGAAAATGCTTCCATTGTTGAGAGATGGAAATGGT
CAAGGCCAGATGTTCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTATCTGATGCACTTAGAATGATCGAAATTATTTGCCGTGTTGGTGTATCA
CCTGCTGAGGAGGTGCCATTTGGAAAAGTAGTGAAGTGTCCCAGTTGTATGGTAGCAGTTGCAGTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCGAAGTG
CCGCTACAAGTATGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAATTAGCATGGATACTCCTGCATGGGAAAAAGCACTCCGGTTCTTGAATATAATGA
AGCGAAAGATTCCTGTTGCTGTGCACTCCATTGTGGTACAAACTCCTTCTGGAGTGGCACGAACTCAGAAGTTTGCTACTGAAACGGCAGATCTCCCAGCTCGAGAGGGA
GAAAGAGTGACAATTGCTGCTGCAGCTCCATCAAATGTATTCAGAGAAGTTGGTCCTATTAAATTTAGTCCAAAGGATCCCAATTTAAACTCTGGGGAGGCTATGTGCCT
GACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGAAAACTCATCTTTACTTAACCCATCGATCCTCTTTCCACTCATAGTTTTGTCTGCCG
CTGGAGATGCTGCCTCTGGAGTAATTGACCCCAGCTTGCCTCAGTTGCTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCT
CAATTCAGTCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTATTGGGGATTTAAAACTAGCTGC
TGAAAAGGAGGTATGGATGTTGGCTCGGATGTGCCAATTGGAGAACAAAATTTTTGCCGTAGGAGAACCTTCTTACCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAG
AAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGATTTCCTCGATGATTGAGATTGAAGTTGAAATGGAGTCTGATGTTATTGCGGCT
GAAGCAGCTAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGCGCTGGAAAATCTAGAAGAGAAATCCATTCTAATTCCATCAAGTCAGTCAAGCTGCCA
TGTGCTTAGCCATAGAGATGGACGTGGTGCCCTCTGCCACCACCACACTTAGCCTTACAAACAGCCGGGGTTATGTGTCCCTTCATCATATTCAAGTCCTTGACTAGCAA
CAATAAGCTTTTAGGGCCTTCAAGCTAAGGTCTCAATCACCTAATATACATTTGAATGAAACTACAACTACCCATACGTTGCCTTCAAAACGAATTTAGCACATCATAGT
TACTATCAAAGATTGAACCTACTATCAGCTGATGCTGGGTGTGGGTGTGACAGAAAGCTATTTATAAGTCAGATTGCATTGATTTTCAATTACATGAAAAATATATGACT
TCTGTGTCATCAAAATAGAGTTGTCTGCTGAGTTTTTAATTGTTCACAAGAGTTAGTTCGGCTACTTTTTTTAATATATGATTCGCATTGATATTTTTACTGAACAATAA
AATCCTCGTTCAACACCCTTAAATTCAGAACGAAATACATGGCTTTGGTTTCTGTTGTCTTCTTAGTTTCTCATATTTTCATCTGGACTATAGTGGTTTCCTGATCTTTG
GAGCCAATCAGCAGACACTATTTCCATCTTTGGCTAGTTAATTATCCTTTTCTTCTAACCTTTTATGTCAAT
Protein sequenceShow/hide protein sequence
MILNFTSPFLTLTRLPPPKLLEPLPSSTNGATVFVPLLPCSHALFAFTSFSKSMRVRTSLSGSDIDGSAAFENPASELLDDELIIVVSSAKDADEALGMIGDKSGRSGGT
VSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEAVNENASIVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVKCPSCMVA
VAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDTPAWEKALRFLNIMKRKIPVAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGP
IKFSPKDPNLNSGEAMCLTNHSDGRESLLLRVPAKENSSLLNPSILFPLIVLSAAGDAASGVIDPSLPQLLLVAGFASLAAGATLNSLILPQFSRLPQRSVDIIAIKQQL
LSQYNVLQSRIGDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMAL
ENLEEKSILIPSSQSSCHVLSHRDGRGALCHHHT