; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G007230 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G007230
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationchr07:7833016..7844779
RNA-Seq ExpressionLsi07G007230
SyntenyLsi07G007230
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148995.1 uncharacterized protein LOC101209802 [Cucumis sativus]6.0e-28092.97Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILN +SP LT+TRLPPPKL EPLAS+TNGATV MPLLLCSHA FAFTSFSKS+RVR SLSGSDIDG+AAFENP SELLD+ELI VVSGA+DADEALGMI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQ  AWEGVNENAS VERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        +PAEEVPFGK+V+CPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI MD PAWEKALRFLNIMK+KIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGE MCLTNHSDGRESLLLRVP K  SSLLNPSILFPLIVLSAAGDAASGVIDP LP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        L+VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE

XP_008451955.1 PREDICTED: uncharacterized protein LOC103493103 [Cucumis melo]5.3e-28494.23Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILN +SP+LT+TRLPPPKL EPL S+TNGATV +PLLLCSHALFAFTSFSKSMRVR SLSGSDIDG+AAFENP SELLD+ELI VVSGA+DADEALGMI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWE VNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGK+V+CPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMD PAWEKALRFLNIMK+KIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE MCLTNHSDGRESLLLRVPAK  SSLLNPSILFPLIVLSAAGDAASGVIDP LP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE

XP_038895166.1 uncharacterized protein LOC120083467 isoform X1 [Benincasa hispida]6.0e-28894.16Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILNL+SPWL ITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSM+VRASLSGSDIDGAAAFENPVS+LL NELIR VSGA+DADEAL MI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ----------VTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALR
        ADKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQ          VTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALR
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ----------VTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALR

Query:  MIEIICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTP
        MIEIICRVGVSPAEEVPFGK+VQCPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEISMD PAWEKALRFLNIMK+KIPAAVHSIVVQTP
Subjt:  MIEIICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTP

Query:  SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAAS
        SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPS LFPLIVLSAAGDAAS
Subjt:  SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAAS

Query:  GVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI
        GV+DP LP+LLLVAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI
Subjt:  GVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI

Query:  KKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
        +KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
Subjt:  KKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE

XP_038895173.1 uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida]2.2e-29095.86Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILNL+SPWL ITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSM+VRASLSGSDIDGAAAFENPVS+LL NELIR VSGA+DADEAL MI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        ADKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGK+VQCPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEISMD PAWEKALRFLNIMK+KIPAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPS LFPLIVLSAAGDAASGV+DP LP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE

XP_038895181.1 uncharacterized protein LOC120083467 isoform X3 [Benincasa hispida]7.8e-28895.5Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILNL+SPWL ITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSM+VRASLSGSDIDGAAAFENPVS+LL NELIR VSGA+DADEAL MI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        ADKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQ  AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGK+VQCPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEISMD PAWEKALRFLNIMK+KIPAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPS LFPLIVLSAAGDAASGV+DP LP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE

TrEMBL top hitse value%identityAlignment
A0A0A0KZV4 Uncharacterized protein2.9e-28092.97Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILN +SP LT+TRLPPPKL EPLAS+TNGATV MPLLLCSHA FAFTSFSKS+RVR SLSGSDIDG+AAFENP SELLD+ELI VVSGA+DADEALGMI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQ  AWEGVNENAS VERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        +PAEEVPFGK+V+CPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI MD PAWEKALRFLNIMK+KIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGE MCLTNHSDGRESLLLRVP K  SSLLNPSILFPLIVLSAAGDAASGVIDP LP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        L+VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE

A0A1S3BTU3 uncharacterized protein LOC1034931032.5e-28494.23Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILN +SP+LT+TRLPPPKL EPL S+TNGATV +PLLLCSHALFAFTSFSKSMRVR SLSGSDIDG+AAFENP SELLD+ELI VVSGA+DADEALGMI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWE VNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGK+V+CPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMD PAWEKALRFLNIMK+KIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE MCLTNHSDGRESLLLRVPAK  SSLLNPSILFPLIVLSAAGDAASGVIDP LP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE

A0A6J1ETW5 uncharacterized protein LOC111437671 isoform X19.4e-27991.73Show/hide
Query:  MILNLSSPWLTITRL-PPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGM
        MIL+LSSPWLTITRL PPPKL EPLASA+NG +VLMPLLLCSHALF FTSFSKS RVRASL+ S+IDGAAAFENPVSELLD+ELI VVSGA+DADE L +
Subjt:  MILNLSSPWLTITRL-PPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGM

Query:  IADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG
        IADKSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+VTAWEGVN+N S+VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGK+VQCPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEISMD PAWEKALRFLN+MKQK+PAAVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPR
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPN YSGEPMCLTNHSDGRESLLLRVPAK TS LL PS LFPLI+LS AGD +SGV+DP LPR
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPR

Query:  LLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
        LLLVAGFASLAAGATLNS ILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
Subjt:  LLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN

Query:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
        SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE
Subjt:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE

A0A6J1EYW6 uncharacterized protein LOC111437671 isoform X23.3e-27691.37Show/hide
Query:  MILNLSSPWLTITRL-PPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGM
        MIL+LSSPWLTITRL PPPKL EPLASA+NG +VLMPLLLCSHALF FTSFSKS RVRASL+ S+IDGAAAFENPVSELLD+ELI VVSGA+DADE L +
Subjt:  MILNLSSPWLTITRL-PPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGM

Query:  IADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG
        IADKSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+  AWEGVN+N S+VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVG
Subjt:  IADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVG

Query:  VSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKF
        VSPAEEVPFGK+VQCPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEISMD PAWEKALRFLN+MKQK+PAAVHSIVVQTPSGVARTQKF
Subjt:  VSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKF

Query:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPR
        ATETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPN YSGEPMCLTNHSDGRESLLLRVPAK TS LL PS LFPLI+LS AGD +SGV+DP LPR
Subjt:  ATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPR

Query:  LLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
        LLLVAGFASLAAGATLNS ILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN
Subjt:  LLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLEN

Query:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
        SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE
Subjt:  SLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE

A0A6J1J4R1 uncharacterized protein LOC111483407 isoform X11.4e-27791.17Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MIL+LSSPWLTITRLP PKL EPLASA+NG +VLMPLLLCSHA F FTSFS+S RVRASL+ S+IDGAAAFENPVS+LLD+ELI VVSGA+DADE L MI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        A+KSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+VTAWEGVN+N S+VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVGV
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGK+VQCPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEISMD PAWEKALRFLN+MKQK+PAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL
        TETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPN YSGEPMCLTNHSDGRESLL+RVPAK TS LL PS LFPLI+LS AGDAASGV+DP LPR+
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNS ILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64430.1 Pentatricopeptide repeat (PPR) superfamily protein4.4e-18064.19Show/hide
Query:  DGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKW
        D   +  +  S +LD+EL+  VS   DADEAL MI+D+ G + GG V + DCR II+AA+ R N +LALS+FY MR++F         +   S  +RW W
Subjt:  DGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKW

Query:  SRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAW
        SRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGKIV+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL SG+I +I+SEE+  D P W
Subjt:  SRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAW

Query:  EKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPA
        EK LR + I K KI ++VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PNFY GEPM LT H DGRES+LLR P+
Subjt:  EKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPA

Query:  KGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWM
        K    +L PS L PL+ + A GDAASGVIDP LP+LL VA   SLA GAT+NS +LP+ N+LP+R+VD++ IKQQLLSQY+VLQ RIRDLK A EKEVWM
Subjt:  KGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWM

Query:  LARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELF
        LARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE      E  
Subjt:  LARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELF

Query:  FCLHALLFSPP
             LL S P
Subjt:  FCLHALLFSPP

AT1G64430.2 Pentatricopeptide repeat (PPR) superfamily protein4.4e-18064.19Show/hide
Query:  DGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKW
        D   +  +  S +LD+EL+  VS   DADEAL MI+D+ G + GG V + DCR II+AA+ R N +LALS+FY MR++F         +   S  +RW W
Subjt:  DGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKW

Query:  SRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAW
        SRPDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGKIV+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL SG+I +I+SEE+  D P W
Subjt:  SRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAW

Query:  EKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPA
        EK LR + I K KI ++VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PNFY GEPM LT H DGRES+LLR P+
Subjt:  EKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPA

Query:  KGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWM
        K    +L PS L PL+ + A GDAASGVIDP LP+LL VA   SLA GAT+NS +LP+ N+LP+R+VD++ IKQQLLSQY+VLQ RIRDLK A EKEVWM
Subjt:  KGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWM

Query:  LARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELF
        LARMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE      E  
Subjt:  LARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELF

Query:  FCLHALLFSPP
             LL S P
Subjt:  FCLHALLFSPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTGAACTTGAGTTCACCATGGCTCACTATCACTCGTCTTCCTCCTCCCAAACTCTTCGAACCACTCGCCTCTGCAACCAATGGCGCTACCGTTCTCATGCCTCT
CCTTCTTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCCAAGTCGATGCGAGTTAGAGCTTCTTTAAGTGGCAGCGACATCGACGGCGCTGCGGCTTTTGAGAATC
CTGTTTCGGAGTTACTCGACAACGAGCTGATTAGGGTTGTTTCGGGTGCTGAGGATGCCGATGAAGCACTAGGGATGATTGCTGATAAGTCAGGGAGAAGTGGAGGTACT
GTGTCTGTTTCGGACTGTCGTTTGATTATTGCGGCTGCACTTAAACGTAACAATCCCGAGCTTGCTTTGTCCGTGTTCTACGCAATGCGCTCCACTTTCTATCAAGTCAC
AGCATGGGAAGGTGTTAATGAAAATGCTTCCACTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTCCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGA
GGGTCTCTGATGCACTTAGGATGATCGAGATCATTTGCCGTGTCGGTGTATCACCTGCTGAGGAGGTACCATTTGGAAAGATAGTGCAGTGTCCCAGTTGTATGGTAGCA
GTAGCAGTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCAAAGTGCCGCTACAAATATGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAAT
TAGCATGGATGCTCCTGCATGGGAAAAAGCACTCCGATTCTTGAATATAATGAAGCAAAAAATTCCTGCTGCTGTGCACTCCATTGTGGTACAAACTCCCTCTGGAGTGG
CACGAACCCAGAAGTTTGCTACTGAAACAGCAGATCTCCCAGCACGAGAGGGAGAAAGAGTGACAATTGCTGCTGCAGCTCCGTCAAATGTATTCAGAGAAGTTGGTCCT
ATTAAATTTAGTCCAAAGGATCCCAATTTTTACTCTGGGGAGCCTATGTGCCTGACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGGAAC
CTCATCCTTACTTAACCCATCGATCCTCTTTCCACTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTTATTGACCCCGGCTTGCCTCGGTTGCTTTTAGTTG
CTGGATTTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAATCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAGCTT
TTATCTCAATATAATGTGCTTCAGTCTCGTATCAGGGATTTAAAACTAGCTGCTGAAAAGGAGGTTTGGATGTTGGCTCGGATGTGCCAATTAGAGAACAAAATTTTTGC
TGTAGGAGAACCTTCTTACCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGA
TTTCCTCGATGATTGAGATTGAAGTTGAAATGGAGTCTGATGTTATTGCAGCTGAAGCAGCCAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGCGCTG
GAAAATCTAGAAGAGGCATGTCTTTATCACTGTGAACTATTTTTTTGTTTGCATGCCTTACTATTTTCTCCCCCTATTCTTCTATTTTGGGCAGCTGGGACAGCTCAGCT
TTATCTTACTCGTCAACATTTTGAGAAACTTGCTCTTGTTTACTATGTTCGTTCACTATTACTGCCAGGGTTGGCCCGGTGCCATACCCACTGGACTTACCTTCAATCAC
TTCCATCCATACGAGATGGAAAATACAAGCAGAAGCCAACGATGAAGCCGAAAGACTTCTCAACCAATCAATGCCAACAGAAACGGTTTAGACAAGCTTGTACGCATGTT
TTCAATCTGGGTAATGCCCGTACCTCGCTTTCCACATCGCCTTGTACCTGCACTACTTACCCAGACTTACTATTTGTCAAAATCTAG
mRNA sequenceShow/hide mRNA sequence
CCAAAATCCTCCACTCTAGTGAGAGAGGGAAAAAAAACCAGAGAAGAGAGAGAAAGCAAGCATTCGACAAATTCTATGCCATTTTTCTACACCAATTCTCCTTCGTTTCA
ATCCAAAACCTCAACTCTACAATGATTCTGAACTTGAGTTCACCATGGCTCACTATCACTCGTCTTCCTCCTCCCAAACTCTTCGAACCACTCGCCTCTGCAACCAATGG
CGCTACCGTTCTCATGCCTCTCCTTCTTTGTTCCCACGCTCTCTTTGCTTTCACCTCCTTCTCCAAGTCGATGCGAGTTAGAGCTTCTTTAAGTGGCAGCGACATCGACG
GCGCTGCGGCTTTTGAGAATCCTGTTTCGGAGTTACTCGACAACGAGCTGATTAGGGTTGTTTCGGGTGCTGAGGATGCCGATGAAGCACTAGGGATGATTGCTGATAAG
TCAGGGAGAAGTGGAGGTACTGTGTCTGTTTCGGACTGTCGTTTGATTATTGCGGCTGCACTTAAACGTAACAATCCCGAGCTTGCTTTGTCCGTGTTCTACGCAATGCG
CTCCACTTTCTATCAAGTCACAGCATGGGAAGGTGTTAATGAAAATGCTTCCACTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTCCATGTATATACATTGCTGATTC
AAGGTCTTGCAGCATCCTTGAGGGTCTCTGATGCACTTAGGATGATCGAGATCATTTGCCGTGTCGGTGTATCACCTGCTGAGGAGGTACCATTTGGAAAGATAGTGCAG
TGTCCCAGTTGTATGGTAGCAGTAGCAGTTGCACAACCCCAGCACGGTATTCAGATTGTATCCTGTGCAAAGTGCCGCTACAAATATGAACTTATTTCAGGAAACATAGT
TAATATTGAGTCAGAAGAAATTAGCATGGATGCTCCTGCATGGGAAAAAGCACTCCGATTCTTGAATATAATGAAGCAAAAAATTCCTGCTGCTGTGCACTCCATTGTGG
TACAAACTCCCTCTGGAGTGGCACGAACCCAGAAGTTTGCTACTGAAACAGCAGATCTCCCAGCACGAGAGGGAGAAAGAGTGACAATTGCTGCTGCAGCTCCGTCAAAT
GTATTCAGAGAAGTTGGTCCTATTAAATTTAGTCCAAAGGATCCCAATTTTTACTCTGGGGAGCCTATGTGCCTGACAAATCATTCAGATGGCCGGGAGTCACTATTATT
AAGAGTGCCAGCAAAGGGAACCTCATCCTTACTTAACCCATCGATCCTCTTTCCACTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTTATTGACCCCGGCT
TGCCTCGGTTGCTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGCTACCTTGAATTCATTAATTTTGCCTCAATTCAATCGGCTTCCTCAACGATCAGTTGATATC
ATTGCTATCAAGCAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTATCAGGGATTTAAAACTAGCTGCTGAAAAGGAGGTTTGGATGTTGGCTCGGATGTGCCA
ATTAGAGAACAAAATTTTTGCTGTAGGAGAACCTTCTTACCGCGCACGTAGAAGTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAAC
TAATAGAAAGCTATGCAAGGATTTCCTCGATGATTGAGATTGAAGTTGAAATGGAGTCTGATGTTATTGCAGCTGAAGCAGCCAGCAGTGTGGAAAGGGTTTCTGAACAG
ATAGAGCAAATCATGGCGCTGGAAAATCTAGAAGAGGCATGTCTTTATCACTGTGAACTATTTTTTTGTTTGCATGCCTTACTATTTTCTCCCCCTATTCTTCTATTTTG
GGCAGCTGGGACAGCTCAGCTTTATCTTACTCGTCAACATTTTGAGAAACTTGCTCTTGTTTACTATGTTCGTTCACTATTACTGCCAGGGTTGGCCCGGTGCCATACCC
ACTGGACTTACCTTCAATCACTTCCATCCATACGAGATGGAAAATACAAGCAGAAGCCAACGATGAAGCCGAAAGACTTCTCAACCAATCAATGCCAACAGAAACGGTTT
AGACAAGCTTGTACGCATGTTTTCAATCTGGGTAATGCCCGTACCTCGCTTTCCACATCGCCTTGTACCTGCACTACTTACCCAGACTTACTATTTGTCAAAATCTAGGA
GCATCAATAACAAGACCAAATACTTCCCAAATCTTCTTACTTCGCAATCCATCCTCAATGAAAATGTCAATCAATGTCTACTTGGTGCATGTACATGAAGTTTTGAAGCA
GCATCAGAGCGTGAGTGAGGATTGGGAAACATGATGAGGAGCTAGAATGACTTCGAGTGGCCGTGCATTTGGAGTCAAGTTGGTAACTTCACACATAAGGCTGACATACA
TACGCATAAAGTCATTTATCATTTGCTTTTTAATTTTGGGTCATTTTTCATTTGGCCCCTCAAATACAGCCATCCATCCATTAACTGGACATATTAACTCATATGAGGAG
AGA
Protein sequenceShow/hide protein sequence
MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRSGGT
VSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQVTAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKIVQCPSCMVA
VAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGP
IKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQL
LSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMAL
ENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLYLTRQHFEKLALVYYVRSLLLPGLARCHTHWTYLQSLPSIRDGKYKQKPTMKPKDFSTNQCQQKRFRQACTHV
FNLGNARTSLSTSPCTCTTYPDLLFVKI