; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G11250 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G11250
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationClcChr07:25895843..25903103
RNA-Seq ExpressionClc07G11250
SyntenyClc07G11250
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044898.1 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. makuwa]5.3e-27292.1Show/hide
Query:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI
        MILN +SP+L +TRLPPPKL EPL S+TNGATV +PLLLCSHALFAFTSFS SMRVR SLSGSDIDG+AAFENP SELLD+ELI VVSGAKDADE LGMI
Subjt:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI

Query:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV
         DK+GRSGGTVSV DCRLIIAAALKRNNPELALSV YAMRSTFYQVTAWE+VNENA  VERW+WSRPDVH+YTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEIS+D PAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE M LTNHSDGRESLLLRVPAK  SSLLNPS LFPLIV SAAGDAASGVIDPSLP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLK+AAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISAL
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAI+AL
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISAL

XP_008451955.1 PREDICTED: uncharacterized protein LOC103493103 [Cucumis melo]1.9e-26991.01Show/hide
Query:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI
        MILN +SP+L +TRLPPPKL EPL S+TNGATV +PLLLCSHALFAFTSFS SMRVR SLSGSDIDG+AAFENP SELLD+ELI VVSGAKDADE LGMI
Subjt:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI

Query:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV
         DK+GRSGGTVSV DCRLIIAAALKRNNPELALSV YAMRSTFYQVTAWE+VNENA  VERW+WSRPDVH+YTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEIS+D PAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE M LTNHSDGRESLLLRVPAK  SSLLNPS LFPLIV SAAGDAASGVIDPSLP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLK+AAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R    ++
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD

XP_038895166.1 uncharacterized protein LOC120083467 isoform X1 [Benincasa hispida]3.3e-27491.35Show/hide
Query:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI
        MILNL+SPWL ITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFS SM+VRASLSGSDIDGAAAFENPVS+LL NELIR VSGAKDADE L MI
Subjt:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI

Query:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQ----------VTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALR
        ADK+GRSGGTVS  DC LIIAAALK NNPELALSV YAMRSTFYQ          VTAWE VNENA TVERW+WSRPDVH+YTLLIQGLAASLRVSDALR
Subjt:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQ----------VTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALR

Query:  MIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTP
        MIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEIS+D PAWEKALRFLNIMKRKIPAAVHSIVVQTP
Subjt:  MIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTP

Query:  SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAAS
        SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPM LTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIV SAAGDAAS
Subjt:  SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAAS

Query:  GVIDPSLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI
        GV+DPSLP+LLLVAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLK+AAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI
Subjt:  GVIDPSLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI

Query:  KKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD
        +KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R    ++
Subjt:  KKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD

XP_038895173.1 uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida]1.2e-27693.03Show/hide
Query:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI
        MILNL+SPWL ITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFS SM+VRASLSGSDIDGAAAFENPVS+LL NELIR VSGAKDADE L MI
Subjt:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI

Query:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV
        ADK+GRSGGTVS  DC LIIAAALK NNPELALSV YAMRSTFYQVTAWE VNENA TVERW+WSRPDVH+YTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEIS+D PAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPM LTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIV SAAGDAASGV+DPSLP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLK+AAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R    ++
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD

XP_038895181.1 uncharacterized protein LOC120083467 isoform X3 [Benincasa hispida]4.3e-27492.66Show/hide
Query:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI
        MILNL+SPWL ITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFS SM+VRASLSGSDIDGAAAFENPVS+LL NELIR VSGAKDADE L MI
Subjt:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI

Query:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV
        ADK+GRSGGTVS  DC LIIAAALK NNPELALSV YAMRSTFYQ  AWE VNENA TVERW+WSRPDVH+YTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEIS+D PAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPM LTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIV SAAGDAASGV+DPSLP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLK+AAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R    ++
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD

TrEMBL top hitse value%identityAlignment
A0A0A0KZV4 Uncharacterized protein2.3e-26589.72Show/hide
Query:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI
        MILN +SP L +TRLPPPKL EPLAS+TNGATV MPLLLCSHA FAFTSFS S+RVR SLSGSDIDG+AAFENP SELLD+ELI VVSGAKDADE LGMI
Subjt:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI

Query:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV
         DK+GRSGGTVSV DCRLII+AALKRNNPELALSV YAMRSTFYQ  AWE VNENA  VERW+WSRPDVH+YTLLI+GLAASLRVSDALRMIEIICRVGV
Subjt:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        +PAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI +D PAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGE M LTNHSDGRESLLLRVP K  SSLLNPS LFPLIV SAAGDAASGVIDPSLP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        L+VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRI DLK+AAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R    ++
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD

A0A1S3BTU3 uncharacterized protein LOC1034931039.1e-27091.01Show/hide
Query:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI
        MILN +SP+L +TRLPPPKL EPL S+TNGATV +PLLLCSHALFAFTSFS SMRVR SLSGSDIDG+AAFENP SELLD+ELI VVSGAKDADE LGMI
Subjt:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI

Query:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV
         DK+GRSGGTVSV DCRLIIAAALKRNNPELALSV YAMRSTFYQVTAWE+VNENA  VERW+WSRPDVH+YTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEIS+D PAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE M LTNHSDGRESLLLRVPAK  SSLLNPS LFPLIV SAAGDAASGVIDPSLP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLK+AAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R    ++
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD

A0A5A7TPS5 Pentatricopeptide repeat (PPR) superfamily protein isoform 22.6e-27292.1Show/hide
Query:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI
        MILN +SP+L +TRLPPPKL EPL S+TNGATV +PLLLCSHALFAFTSFS SMRVR SLSGSDIDG+AAFENP SELLD+ELI VVSGAKDADE LGMI
Subjt:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI

Query:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV
         DK+GRSGGTVSV DCRLIIAAALKRNNPELALSV YAMRSTFYQVTAWE+VNENA  VERW+WSRPDVH+YTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVV+CPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEIS+D PAWEKALRFLNIMKRKIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE M LTNHSDGRESLLLRVPAK  SSLLNPS LFPLIV SAAGDAASGVIDPSLP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLK+AAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISAL
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAI+AL
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISAL

A0A6J1ETW5 uncharacterized protein LOC111437671 isoform X15.2e-26588.83Show/hide
Query:  MILNLSSPWLAITRL-PPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGM
        MIL+LSSPWL ITRL PPPKL EPLASA+NG +VLMPLLLCSHALF FTSFS S RVRASL+ S+IDGAAAFENPVSELLD+ELI VVSGAKDADEVL +
Subjt:  MILNLSSPWLAITRL-PPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGM

Query:  IADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVG
        IADK+GR+GGTVSV DCRLIIAAALKRNN ELALSV YAMRS+FY+VTAWE VN+N  +VERW+W+RPDVH+YTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEIS+D PAWEKALRFLN+MK+K+PAAVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPR
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPN YSGEPM LTNHSDGRESLLLRVPAK TS LL PS LFPLI+ S AGD +SGV+DPSLPR
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPR

Query:  LLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
        LLLVAGFASLAAGATLNS ILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLK+AAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
Subjt:  LLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN

Query:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD
        SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R    ++
Subjt:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD

A0A6J1J4R1 uncharacterized protein LOC111483407 isoform X12.6e-26488.44Show/hide
Query:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI
        MIL+LSSPWL ITRLP PKL EPLASA+NG +VLMPLLLCSHA F FTSFS S RVRASL+ S+IDGAAAFENPVS+LLD+ELI VVSGAKDADEVL MI
Subjt:  MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMI

Query:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV
        A+K+GR+GGTVSV DCRLIIAAALKRNN ELALSV YAMRS+FY+VTAWE VN+N  +VERW+W+RPDVH+YTLLIQGLAASLRVSDALR+IEIICRVGV
Subjt:  ADKAGRSGGTVSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEIS+D PAWEKALRFLN+MK+K+PAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL
        TETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPN YSGEPM LTNHSDGRESLL+RVPAK TS LL PS LFPLI+ S AGDAASGV+DPSLPR+
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNS ILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLK+AAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSV R    ++
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64430.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-17162.38Show/hide
Query:  LLCSHALFAFTSFSNSMRVR-----ASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMIADKAGRS-GGTVSVLDCRLIIAAALKRNNPEL
        LL SH    F   +N   +R      S      D   +  +  S +LD+EL+  VS  +DADE L MI+D+ G + GG V + DCR II+AA+ R N +L
Subjt:  LLCSHALFAFTSFSNSMRVR-----ASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMIADKAGRS-GGTVSVLDCRLIIAAALKRNNPEL

Query:  ALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ
        ALS+ Y MR++F         +      +RW WSRPDV +YT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+Q
Subjt:  ALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ

Query:  IVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPI
        IVSCA CRY+YEL SG+I +I+SEE+  D P WEK LR + I K KI ++VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP 
Subjt:  IVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPI

Query:  KFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSV
        KF  K PNFY GEPMSLT H DGRES+LLR P+K    +L PS L PL+   A GDAASGVIDPSLP+LL VA   SLA GAT+NS +LP+ N+LP+R+V
Subjt:  KFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSV

Query:  DIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAA
        D++ IKQQLLSQY+VLQ RIRDLK A EKEVWMLARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AA
Subjt:  DIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAA

Query:  EAASS
        EA ++
Subjt:  EAASS

AT1G64430.2 Pentatricopeptide repeat (PPR) superfamily protein1.1e-17162.38Show/hide
Query:  LLCSHALFAFTSFSNSMRVR-----ASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMIADKAGRS-GGTVSVLDCRLIIAAALKRNNPEL
        LL SH    F   +N   +R      S      D   +  +  S +LD+EL+  VS  +DADE L MI+D+ G + GG V + DCR II+AA+ R N +L
Subjt:  LLCSHALFAFTSFSNSMRVR-----ASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMIADKAGRS-GGTVSVLDCRLIIAAALKRNNPEL

Query:  ALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ
        ALS+ Y MR++F         +      +RW WSRPDV +YT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGK+V+CPSC++A+AVAQPQHG+Q
Subjt:  ALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMVAVAVAQPQHGIQ

Query:  IVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPI
        IVSCA CRY+YEL SG+I +I+SEE+  D P WEK LR + I K KI ++VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP 
Subjt:  IVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPI

Query:  KFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSV
        KF  K PNFY GEPMSLT H DGRES+LLR P+K    +L PS L PL+   A GDAASGVIDPSLP+LL VA   SLA GAT+NS +LP+ N+LP+R+V
Subjt:  KFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSV

Query:  DIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAA
        D++ IKQQLLSQY+VLQ RIRDLK A EKEVWMLARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AA
Subjt:  DIIAIKQQLLSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAA

Query:  EAASS
        EA ++
Subjt:  EAASS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTGAACTTGAGTTCACCATGGCTTGCTATCACTCGTCTCCCTCCTCCTAAACTCTTCGAACCACTCGCCTCTGCAACCAATGGCGCTACCGTTCTCATGCCTCT
CCTTCTATGTTCCCACGCTCTCTTCGCTTTCACCTCCTTCTCCAACTCGATGCGAGTTAGAGCTTCTTTAAGTGGCAGCGACATCGACGGTGCTGCGGCTTTTGAGAATC
CTGTTTCGGAGTTACTCGACAACGAGCTGATTAGGGTTGTTTCGGGTGCTAAGGATGCCGATGAAGTGCTAGGGATGATCGCTGATAAGGCAGGGAGAAGTGGAGGTACT
GTGTCTGTTTTGGACTGTCGTTTGATTATTGCAGCTGCTCTTAAGCGTAACAATCCCGAGCTTGCTTTGTCCGTGTTATACGCAATGCGCTCCACTTTCTATCAAGTTAC
AGCATGGGAAAGTGTTAATGAAAATGCTCCCACTGTTGAGAGATGGGAATGGTCAAGGCCAGATGTCCATATATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGA
GGGTCTCTGATGCACTTAGGATGATTGAGATTATTTGCCGTGTTGGTGTATCACCTGCTGAGGAGGTACCATTTGGAAAGGTAGTGCAGTGTCCCAGTTGTATGGTAGCA
GTAGCAGTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCAAAGTGCCGTTACAAGTACGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAAT
TAGCGTGGATGCTCCTGCATGGGAAAAAGCACTCCGATTCTTGAATATAATGAAGCGAAAAATTCCTGCTGCTGTGCACTCCATTGTGGTACAAACTCCATCTGGAGTGG
CACGAACCCAGAAGTTTGCTACTGAAACAGCAGATCTCCCAGCACGAGAGGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCGTCAAATGTATTCAGAGAAGTTGGTCCT
ATTAAATTTAGCCCAAAGGATCCCAATTTTTACTCTGGGGAGCCTATGAGCCTGACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGGAAC
CTCATCCTTACTTAACCCATCGACCCTCTTTCCACTTATAGTTTTCTCTGCCGCTGGAGATGCTGCCTCTGGAGTTATTGACCCCAGCTTGCCTCGGTTGCTTTTAGTTG
CTGGATTTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAATCGGCTTCCTCAACGATCTGTTGATATCATTGCTATCAAGCAGCAACTT
TTGTCTCAATATAATGTGCTTCAGTCTCGTATCAGGGATTTAAAAGTAGCTGCTGAAAAGGAGGTTTGGATGTTGGCTCGGATGTGCCAATTAGAGAACAAAATTTTTGC
CGTAGGAGAACCTTCTTACCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGA
TTTCCTCGATGATTGAGATTGAAGTTGAAATGGAGTCTGATGTTATTGCAGCTGAAGCAGCCAGCAGTGTGGTGCGTGCAATTTCTGCCTTAGATGGAAATTACAAGCAG
AAGCCAATGATGAAGCCGAAAGACTTCTCAACCAATCAATGCCAACAGAAACGGTTTAGACAAGCTTGTACTCATGTTTTCAATCCGAGTAACATCCGTACATCGCTTTC
CAGATCGCCTTGTACCCGCACTACTTACCCAGACTTACTATTTGTCAAACCAATCTAG
mRNA sequenceShow/hide mRNA sequence
CCGCAGGCGACCATTCCCCTCTTCGTCTTCTCCTAACAACCTCTCATTCTTCATCCTTTATCCGGCAAAATCCTCCACTCTAGTGAGAGAAAAAGAAGCCATAGAAGAGA
GAGAAAGAAAGCATTCGACGGATTCTATGCCCATTTTTCTACACCAATTCTCCTCCGTTTCAATCCAAATCTCCACTCCACAATGATTCTGAACTTGAGTTCACCATGGC
TTGCTATCACTCGTCTCCCTCCTCCTAAACTCTTCGAACCACTCGCCTCTGCAACCAATGGCGCTACCGTTCTCATGCCTCTCCTTCTATGTTCCCACGCTCTCTTCGCT
TTCACCTCCTTCTCCAACTCGATGCGAGTTAGAGCTTCTTTAAGTGGCAGCGACATCGACGGTGCTGCGGCTTTTGAGAATCCTGTTTCGGAGTTACTCGACAACGAGCT
GATTAGGGTTGTTTCGGGTGCTAAGGATGCCGATGAAGTGCTAGGGATGATCGCTGATAAGGCAGGGAGAAGTGGAGGTACTGTGTCTGTTTTGGACTGTCGTTTGATTA
TTGCAGCTGCTCTTAAGCGTAACAATCCCGAGCTTGCTTTGTCCGTGTTATACGCAATGCGCTCCACTTTCTATCAAGTTACAGCATGGGAAAGTGTTAATGAAAATGCT
CCCACTGTTGAGAGATGGGAATGGTCAAGGCCAGATGTCCATATATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTCTCTGATGCACTTAGGATGATTGA
GATTATTTGCCGTGTTGGTGTATCACCTGCTGAGGAGGTACCATTTGGAAAGGTAGTGCAGTGTCCCAGTTGTATGGTAGCAGTAGCAGTTGCACAACCCCAGCACGGTA
TTCAGATTGTATCCTGTGCAAAGTGCCGTTACAAGTACGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAATTAGCGTGGATGCTCCTGCATGGGAAAAA
GCACTCCGATTCTTGAATATAATGAAGCGAAAAATTCCTGCTGCTGTGCACTCCATTGTGGTACAAACTCCATCTGGAGTGGCACGAACCCAGAAGTTTGCTACTGAAAC
AGCAGATCTCCCAGCACGAGAGGGAGAAAGGGTGACAATTGCTGCTGCAGCTCCGTCAAATGTATTCAGAGAAGTTGGTCCTATTAAATTTAGCCCAAAGGATCCCAATT
TTTACTCTGGGGAGCCTATGAGCCTGACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGGAACCTCATCCTTACTTAACCCATCGACCCTC
TTTCCACTTATAGTTTTCTCTGCCGCTGGAGATGCTGCCTCTGGAGTTATTGACCCCAGCTTGCCTCGGTTGCTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGC
TACCTTGAATTCATTAATTTTGCCTCAATTCAATCGGCTTCCTCAACGATCTGTTGATATCATTGCTATCAAGCAGCAACTTTTGTCTCAATATAATGTGCTTCAGTCTC
GTATCAGGGATTTAAAAGTAGCTGCTGAAAAGGAGGTTTGGATGTTGGCTCGGATGTGCCAATTAGAGAACAAAATTTTTGCCGTAGGAGAACCTTCTTACCGCGCACGT
AGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGATTTCCTCGATGATTGAGATTGAAGTTGA
AATGGAGTCTGATGTTATTGCAGCTGAAGCAGCCAGCAGTGTGGTGCGTGCAATTTCTGCCTTAGATGGAAATTACAAGCAGAAGCCAATGATGAAGCCGAAAGACTTCT
CAACCAATCAATGCCAACAGAAACGGTTTAGACAAGCTTGTACTCATGTTTTCAATCCGAGTAACATCCGTACATCGCTTTCCAGATCGCCTTGTACCCGCACTACTTAC
CCAGACTTACTATTTGTCAAACCAATCTAGGAGCATCAATAACAAGACCAAATACTTCCCAAATCTTCTTACTTCGCAGTCCATCCTCAATGAAAATGTCAATTAATGTC
TACTTGGTGCATGTACATGAAGTTTTGAAGCAGCATCAGGTCAGTGCTTCGTGATTAGATATCTCATCACTGTAGATCTCATTTGTTAAATTGGTATTATTGTTACGAAA
AGTACATTGAAATCTAAGTTTATTTAAAACTTTTCCCTCCTGAGTGCTGAAATGTGGTTTATCACGTAAATATAACACATTAATGATTCGATTGAATTAATAGCCGTCTT
TGTATTTCAATTATGTCCTTAGTTGGGTCT
Protein sequenceShow/hide protein sequence
MILNLSSPWLAITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSNSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAKDADEVLGMIADKAGRSGGT
VSVLDCRLIIAAALKRNNPELALSVLYAMRSTFYQVTAWESVNENAPTVERWEWSRPDVHIYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKVVQCPSCMVA
VAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISVDAPAWEKALRFLNIMKRKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGP
IKFSPKDPNFYSGEPMSLTNHSDGRESLLLRVPAKGTSSLLNPSTLFPLIVFSAAGDAASGVIDPSLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQL
LSQYNVLQSRIRDLKVAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVVRAISALDGNYKQ
KPMMKPKDFSTNQCQQKRFRQACTHVFNPSNIRTSLSRSPCTRTTYPDLLFVKPI