; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10002499 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10002499
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationChr11:7423932..7433239
RNA-Seq ExpressionHG10002499
SyntenyHG10002499
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044898.1 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. makuwa]0.0e+0082.19Show/hide
Query:  RVDA-PPAPRKPFTNGTFVPKVSAPAQPQTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGVQQQQQRNVGRQQRPPWGKRPFCERGKKNQRRER--
        RVDA PP P+K FTNG FVPKVSAPAQ QTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG   QQQRN GRQQRPPWGKRPFC  G+      R  
Subjt:  RVDA-PPAPRKPFTNGTFVPKVSAPAQPQTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGVQQQQQRNVGRQQRPPWGKRPFCERGKKNQRRER--

Query:  -----------------------------------------------------------------KQAFDKFYAIFLHQFSFVSIQNLNSTMILNLSSPW
                                                                          +AFDKFYAI LH+FSFVSIQ     MILN +SP+
Subjt:  -----------------------------------------------------------------KQAFDKFYAIFLHQFSFVSIQNLNSTMILNLSSPW

Query:  LTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRSGG
        LT+TRLPPPKL EPL S+TNGATV +PLLLCSHALFAFTSFSKSMRVR SLSGSDIDG+AAFENP SELLD+ELI VVSGA+DADEALGMI DKSGRSGG
Subjt:  LTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRSGG

Query:  TVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFG
        TVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ  AWE VNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFG
Subjt:  TVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFG

Query:  KIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAR
        K+V+CPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMD PAWEKALRFLNIMK+KIP AVHSIVVQTPSGVARTQKFATETADLPAR
Subjt:  KIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAR

Query:  EGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASL
        EGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE MCLTNHSDGRESLLLRVPAK  SSLLNPSILFPLIVLSAAGDAASGVIDP LP+LLLVAGFASL
Subjt:  EGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASL

Query:  AAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIE
        AAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIE
Subjt:  AAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIE

Query:  SYARISSMIEIEVEMESDVIAAEAASSVER
        SYARISSMIEIEVEMESDVIAAEAASSV R
Subjt:  SYARISSMIEIEVEMESDVIAAEAASSVER

XP_008451955.1 PREDICTED: uncharacterized protein LOC103493103 [Cucumis melo]3.1e-29088.89Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILN +SP+LT+TRLPPPKL EPL S+TNGATV +PLLLCSHALFAFTSFSKSMRVR SLSGSDIDG+AAFENP SELLD+ELI VVSGA+DADEALGMI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ  AWE VNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGK+V+CPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMD PAWEKALRFLNIMK+KIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE MCLTNHSDGRESLLLRVPAK  SSLLNPSILFPLIVLSAAGDAASGVIDP LP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE                                RWK+QAEANDEAE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAE

Query:  RLLNQSMPTETV
        RLLNQSMPTE V
Subjt:  RLLNQSMPTETV

XP_038895166.1 uncharacterized protein LOC120083467 isoform X1 [Benincasa hispida]1.9e-29588.91Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILNL+SPWL ITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSM+VRASLSGSDIDGAAAFENPVS+LL NELIR VSGA+DADEAL MI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ------------AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALR
        ADKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQ            AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALR
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ------------AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALR

Query:  MIEIICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTP
        MIEIICRVGVSPAEEVPFGK+VQCPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEISMD PAWEKALRFLNIMK+KIPAAVHSIVVQTP
Subjt:  MIEIICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTP

Query:  SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAAS
        SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPS LFPLIVLSAAGDAAS
Subjt:  SGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAAS

Query:  GVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI
        GV+DP LP+LLLVAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI
Subjt:  GVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI

Query:  KKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWK
        +KVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE                                RWK
Subjt:  KKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWK

Query:  IQAEANDEAERLLNQSMPTETV
        +QAEANDEAERLLNQSMPTE V
Subjt:  IQAEANDEAERLLNQSMPTETV

XP_038895173.1 uncharacterized protein LOC120083467 isoform X2 [Benincasa hispida]1.3e-29690.36Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILNL+SPWL ITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSM+VRASLSGSDIDGAAAFENPVS+LL NELIR VSGA+DADEAL MI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
        ADKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQ  AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGK+VQCPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEISMD PAWEKALRFLNIMK+KIPAAVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPS LFPLIVLSAAGDAASGV+DP LP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE                                RWK+QAEANDEAE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAE

Query:  RLLNQSMPTETV
        RLLNQSMPTE V
Subjt:  RLLNQSMPTETV

XP_038895181.1 uncharacterized protein LOC120083467 isoform X3 [Benincasa hispida]4.1e-29890.66Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILNL+SPWL ITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSM+VRASLSGSDIDGAAAFENPVS+LL NELIR VSGA+DADEAL MI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
        ADKSGRSGGTVS SDC LIIAAALK NNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP

Query:  AEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATE
        AEEVPFGK+VQCPSCMVAVAVAQPQHGIQIVSCA+CRYKYELISGNIVNI+SEEISMD PAWEKALRFLNIMK+KIPAAVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLL
        TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPS LFPLIVLSAAGDAASGV+DP LP+LLL
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLL

Query:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
        VAG ASLAAGATLNSLILPQ NRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRI+KVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAERL
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE                                RWK+QAEANDEAERL
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAERL

Query:  LNQSMPTETV
        LNQSMPTE V
Subjt:  LNQSMPTETV

TrEMBL top hitse value%identityAlignment
A0A0A0KZV4 Uncharacterized protein1.5e-29088.36Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILN +SP LT+TRLPPPKL EPLAS+TNGATV MPLLLCSHA FAFTSFSKS+RVR SLSGSDIDG+AAFENP SELLD+ELI VVSGA+DADEALGMI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
         DKSGRSGGTVSVSDCRLII+AALKRNNPELALSVFYAMRSTFYQAWEGVNENAS VERWKWSRPDVHVYTLLI+GLAASLRVSDALRMIEIICRVGV+P
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP

Query:  AEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATE
        AEEVPFGK+V+CPSCMVAVAVAQPQHGIQIVSCAKC YKYELISGNIVNIESEEI MD PAWEKALRFLNIMK+KIP AVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLL
        TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN YSGE MCLTNHSDGRESLLLRVP K  SSLLNPSILFPLIVLSAAGDAASGVIDP LP+LL+
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLL

Query:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
        VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAERL
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE                                RWK+QAEANDEAERL
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAERL

Query:  LNQSMPTETV
        LNQSMPTE V
Subjt:  LNQSMPTETV

A0A1S3BTU3 uncharacterized protein LOC1034931031.5e-29088.89Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MILN +SP+LT+TRLPPPKL EPL S+TNGATV +PLLLCSHALFAFTSFSKSMRVR SLSGSDIDG+AAFENP SELLD+ELI VVSGA+DADEALGMI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
         DKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ  AWE VNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGV

Query:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA
        SPAEEVPFGK+V+CPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMD PAWEKALRFLNIMK+KIP AVHSIVVQTPSGVARTQKFA
Subjt:  SPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFA

Query:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL
        TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE MCLTNHSDGRESLLLRVPAK  SSLLNPSILFPLIVLSAAGDAASGVIDP LP+L
Subjt:  TETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRL

Query:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
        LLVAGFASLAAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS
Subjt:  LLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENS

Query:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAE
        LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEE                                RWK+QAEANDEAE
Subjt:  LKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAE

Query:  RLLNQSMPTETV
        RLLNQSMPTE V
Subjt:  RLLNQSMPTETV

A0A5A7TPS5 Pentatricopeptide repeat (PPR) superfamily protein isoform 20.0e+0082.19Show/hide
Query:  RVDA-PPAPRKPFTNGTFVPKVSAPAQPQTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGVQQQQQRNVGRQQRPPWGKRPFCERGKKNQRRER--
        RVDA PP P+K FTNG FVPKVSAPAQ QTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNGGG   QQQRN GRQQRPPWGKRPFC  G+      R  
Subjt:  RVDA-PPAPRKPFTNGTFVPKVSAPAQPQTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGVQQQQQRNVGRQQRPPWGKRPFCERGKKNQRRER--

Query:  -----------------------------------------------------------------KQAFDKFYAIFLHQFSFVSIQNLNSTMILNLSSPW
                                                                          +AFDKFYAI LH+FSFVSIQ     MILN +SP+
Subjt:  -----------------------------------------------------------------KQAFDKFYAIFLHQFSFVSIQNLNSTMILNLSSPW

Query:  LTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRSGG
        LT+TRLPPPKL EPL S+TNGATV +PLLLCSHALFAFTSFSKSMRVR SLSGSDIDG+AAFENP SELLD+ELI VVSGA+DADEALGMI DKSGRSGG
Subjt:  LTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRSGG

Query:  TVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFG
        TVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ  AWE VNENAS VERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFG
Subjt:  TVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQ--AWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFG

Query:  KIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAR
        K+V+CPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMD PAWEKALRFLNIMK+KIP AVHSIVVQTPSGVARTQKFATETADLPAR
Subjt:  KIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAR

Query:  EGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASL
        EGERVTIAAAAPSNVFREVGPIKFSPKDPN  SGE MCLTNHSDGRESLLLRVPAK  SSLLNPSILFPLIVLSAAGDAASGVIDP LP+LLLVAGFASL
Subjt:  EGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASL

Query:  AAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIE
        AAGATLNSLILPQF+RLPQRSVDIIAIKQQLLSQYNVLQSRI DLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIE
Subjt:  AAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIE

Query:  SYARISSMIEIEVEMESDVIAAEAASSVER
        SYARISSMIEIEVEMESDVIAAEAASSV R
Subjt:  SYARISSMIEIEVEMESDVIAAEAASSVER

A0A6J1EYW6 uncharacterized protein LOC111437671 isoform X21.1e-28586.58Show/hide
Query:  MILNLSSPWLTITRL-PPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGM
        MIL+LSSPWLTITRL PPPKL EPLASA+NG +VLMPLLLCSHALF FTSFSKS RVRASL+ S+IDGAAAFENPVSELLD+ELI VVSGA+DADE L +
Subjt:  MILNLSSPWLTITRL-PPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGM

Query:  IADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVS
        IADKSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+AWEGVN+N S+VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVGVS
Subjt:  IADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVS

Query:  PAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFAT
        PAEEVPFGK+VQCPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEISMD PAWEKALRFLN+MKQK+PAAVHSIVVQTPSGVARTQKFAT
Subjt:  PAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFAT

Query:  ETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLL
        ETADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPN YSGEPMCLTNHSDGRESLLLRVPAK TS LL PS LFPLI+LS AGD +SGV+DP LPRLL
Subjt:  ETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLL

Query:  LVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSL
        LVAGFASLAAGATLNS ILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSL
Subjt:  LVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSL

Query:  KQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAER
        KQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE                                RW++QAEANDEAER
Subjt:  KQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAER

Query:  LLNQSMPTETV
        L NQSMPTE V
Subjt:  LLNQSMPTETV

A0A6J1JDG3 uncharacterized protein LOC111483407 isoform X21.6e-28486.07Show/hide
Query:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI
        MIL+LSSPWLTITRLP PKL EPLASA+NG +VLMPLLLCSHA F FTSFS+S RVRASL+ S+IDGAAAFENPVS+LLD+ELI VVSGA+DADE L MI
Subjt:  MILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNELIRVVSGAEDADEALGMI

Query:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP
        A+KSGR+GGTVSV DCRLIIAAALKRNN ELALSVFYAMRS+FY+AWEGVN+N S+VERWKW+RPDVHVYTLLIQGLAASLRVSDALR+IEIICRVGVSP
Subjt:  ADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSP

Query:  AEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATE
        AEEVPFGK+VQCPSCMVAVAVAQPQHGIQIVSCAKCRY+YELISGNIVNIESEEISMD PAWEKALRFLN+MKQK+PAAVHSIVVQTPSGVARTQKFATE
Subjt:  AEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATE

Query:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLL
        TADLPAREGERVTIAAAAPSNV+REVGPIKFSPKDPN YSGEPMCLTNHSDGRESLL+RVPAK TS LL PS LFPLI+LS AGDAASGV+DP LPR+LL
Subjt:  TADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLL

Query:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
        VAGFASLAAGATLNS ILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK
Subjt:  VAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLK

Query:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAERL
        QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIM LENLEE                                RW++QAEANDEAERL
Subjt:  QRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAERL

Query:  LNQSMPTETV
         NQSMPTE V
Subjt:  LNQSMPTETV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64430.1 Pentatricopeptide repeat (PPR) superfamily protein5.0e-18562.83Show/hide
Query:  DGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSR
        D   +  +  S +LD+EL+  VS   DADEAL MI+D+ G + GG V + DCR II+AA+ R N +LALS+FY MR++F       +   S  +RW WSR
Subjt:  DGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSR

Query:  PDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEK
        PDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGKIV+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL SG+I +I+SEE+  D P WEK
Subjt:  PDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEK

Query:  ALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKG
         LR + I K KI ++VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PNFY GEPM LT H DGRES+LLR P+K 
Subjt:  ALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKG

Query:  TSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLA
           +L PS L PL+ + A GDAASGVIDP LP+LL VA   SLA GAT+NS +LP+ N+LP+R+VD++ IKQQLLSQY+VLQ RIRDLK A EKEVWMLA
Subjt:  TSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLA

Query:  RMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFC
        RMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE           
Subjt:  RMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFC

Query:  LHALLFSPPILLFWAAGTAQLRWKIQAEANDEAERLLN
                             +WKIQAEANDEAERLL+
Subjt:  LHALLFSPPILLFWAAGTAQLRWKIQAEANDEAERLLN

AT1G64430.2 Pentatricopeptide repeat (PPR) superfamily protein5.0e-18562.83Show/hide
Query:  DGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSR
        D   +  +  S +LD+EL+  VS   DADEAL MI+D+ G + GG V + DCR II+AA+ R N +LALS+FY MR++F       +   S  +RW WSR
Subjt:  DGAAAFENPVSELLDNELIRVVSGAEDADEALGMIADKSGRS-GGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSR

Query:  PDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEK
        PDV VYT+L+ GLAASLRVSD+LR+I  ICRVG+SPAEEVPFGKIV+CPSC++A+AVAQPQHG+QIVSCA CRY+YEL SG+I +I+SEE+  D P WEK
Subjt:  PDVHVYTLLIQGLAASLRVSDALRMIEIICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEK

Query:  ALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKG
         LR + I K KI ++VHSIVVQTPSG ART +FATETA+LPA+EGERVTIA+AAPSNV+R+VGP KF  K PNFY GEPM LT H DGRES+LLR P+K 
Subjt:  ALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETADLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKG

Query:  TSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLA
           +L PS L PL+ + A GDAASGVIDP LP+LL VA   SLA GAT+NS +LP+ N+LP+R+VD++ IKQQLLSQY+VLQ RIRDLK A EKEVWMLA
Subjt:  TSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASLAAGATLNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLA

Query:  RMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFC
        RMCQLENKI AVGEP+YR RR+R+KKVRE LENS+K +I+LI+SYARISSMIEIEVEM+SDV+AAEA ++ E +++QIEQIM LENLEE           
Subjt:  RMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEMESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFC

Query:  LHALLFSPPILLFWAAGTAQLRWKIQAEANDEAERLLN
                             +WKIQAEANDEAERLL+
Subjt:  LHALLFSPPILLFWAAGTAQLRWKIQAEANDEAERLLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTTTCTTATGTTCATCAATCATTTCAACAATTTTTGGTACAGGGTTGATGCTCCACCAGCTCCGAGGAAGCCTTTCACCAATGGAACCTTTGTTCCCAAGGTATC
TGCACCGGCCCAGCCGCAAACAAATGCCACGCCGAGACAGAGGCCACAAACTCTTGACTCACTGTTTGCCAACATGAAGGAACAGAGGTTGAGGGTGTTGTCACAGCGAC
AAAATGGCGGCGGGGTACAACAACAACAACAACGGAATGTTGGTCGACAACAAAGACCTCCATGGGGGAAGAGGCCGTTTTGTGAGAGAGGGAAAAAAAACCAGAGAAGA
GAGAGAAAGCAAGCATTCGACAAATTCTATGCCATTTTTCTACACCAATTCTCCTTCGTTTCAATCCAAAACCTCAACTCTACAATGATTCTGAACTTGAGTTCACCATG
GCTCACTATCACTCGTCTTCCTCCTCCCAAACTCTTCGAACCACTCGCCTCTGCAACCAATGGCGCTACCGTTCTCATGCCTCTCCTTCTTTGTTCCCACGCTCTCTTTG
CTTTCACCTCCTTCTCCAAGTCGATGCGAGTTAGAGCTTCTTTAAGTGGCAGCGACATCGACGGCGCTGCGGCTTTTGAGAATCCTGTTTCGGAGTTACTCGACAACGAG
CTGATTAGGGTTGTTTCGGGTGCTGAGGATGCCGATGAAGCACTAGGGATGATTGCTGATAAGTCAGGGAGAAGTGGAGGTACTGTGTCTGTTTCGGACTGTCGTTTGAT
TATTGCGGCTGCACTTAAACGTAACAATCCCGAGCTTGCTTTGTCCGTGTTCTACGCAATGCGCTCCACTTTCTATCAAGCATGGGAAGGTGTTAATGAAAATGCTTCCA
CTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTCCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTCTCTGATGCACTTAGGATGATCGAGATC
ATTTGCCGTGTCGGTGTATCACCTGCTGAGGAGGTACCATTTGGAAAGATAGTGCAGTGTCCCAGTTGTATGGTAGCAGTAGCAGTTGCACAACCCCAGCACGGTATTCA
GATTGTATCCTGTGCAAAGTGCCGCTACAAATATGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAATTAGCATGGATGCTCCTGCATGGGAAAAAGCAC
TCCGATTCTTGAATATAATGAAGCAAAAAATTCCTGCTGCTGTGCACTCCATTGTGGTACAAACTCCCTCTGGAGTGGCACGAACCCAGAAGTTTGCTACTGAAACAGCA
GATCTCCCAGCACGAGAGGGAGAAAGAGTGACAATTGCTGCTGCAGCTCCGTCAAATGTATTCAGAGAAGTTGGTCCTATTAAATTTAGTCCAAAGGATCCCAATTTTTA
CTCTGGGGAGCCTATGTGCCTGACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGGAACCTCATCCTTACTTAACCCATCGATCCTCTTTC
CACTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTTATTGACCCCGGCTTGCCTCGGTTGCTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGCTACC
TTGAATTCATTAATTTTGCCTCAATTCAATCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTAT
CAGGGATTTAAAACTAGCTGCTGAAAAGGAGGTTTGGATGTTGGCTCGGATGTGCCAATTAGAGAACAAAATTTTTGCTGTAGGAGAACCTTCTTACCGCGCACGTAGAA
GTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGATTTCCTCGATGATTGAGATTGAAGTTGAAATG
GAGTCTGATGTTATTGCAGCTGAAGCAGCCAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGCGCTGGAAAATCTAGAAGAGGCATGTCTTTATCACTG
TGAACTATTTTTTTGTTTGCATGCCTTACTATTTTCTCCCCCTATTCTTCTATTTTGGGCAGCTGGGACAGCTCAGCTTAGATGGAAAATACAAGCAGAAGCCAACGATG
AAGCCGAAAGACTTCTCAACCAATCAATGCCAACAGAAACGGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCTTTCTTATGTTCATCAATCATTTCAACAATTTTTGGTACAGGGTTGATGCTCCACCAGCTCCGAGGAAGCCTTTCACCAATGGAACCTTTGTTCCCAAGGTATC
TGCACCGGCCCAGCCGCAAACAAATGCCACGCCGAGACAGAGGCCACAAACTCTTGACTCACTGTTTGCCAACATGAAGGAACAGAGGTTGAGGGTGTTGTCACAGCGAC
AAAATGGCGGCGGGGTACAACAACAACAACAACGGAATGTTGGTCGACAACAAAGACCTCCATGGGGGAAGAGGCCGTTTTGTGAGAGAGGGAAAAAAAACCAGAGAAGA
GAGAGAAAGCAAGCATTCGACAAATTCTATGCCATTTTTCTACACCAATTCTCCTTCGTTTCAATCCAAAACCTCAACTCTACAATGATTCTGAACTTGAGTTCACCATG
GCTCACTATCACTCGTCTTCCTCCTCCCAAACTCTTCGAACCACTCGCCTCTGCAACCAATGGCGCTACCGTTCTCATGCCTCTCCTTCTTTGTTCCCACGCTCTCTTTG
CTTTCACCTCCTTCTCCAAGTCGATGCGAGTTAGAGCTTCTTTAAGTGGCAGCGACATCGACGGCGCTGCGGCTTTTGAGAATCCTGTTTCGGAGTTACTCGACAACGAG
CTGATTAGGGTTGTTTCGGGTGCTGAGGATGCCGATGAAGCACTAGGGATGATTGCTGATAAGTCAGGGAGAAGTGGAGGTACTGTGTCTGTTTCGGACTGTCGTTTGAT
TATTGCGGCTGCACTTAAACGTAACAATCCCGAGCTTGCTTTGTCCGTGTTCTACGCAATGCGCTCCACTTTCTATCAAGCATGGGAAGGTGTTAATGAAAATGCTTCCA
CTGTTGAGAGATGGAAATGGTCAAGGCCAGATGTCCATGTATATACATTGCTGATTCAAGGTCTTGCAGCATCCTTGAGGGTCTCTGATGCACTTAGGATGATCGAGATC
ATTTGCCGTGTCGGTGTATCACCTGCTGAGGAGGTACCATTTGGAAAGATAGTGCAGTGTCCCAGTTGTATGGTAGCAGTAGCAGTTGCACAACCCCAGCACGGTATTCA
GATTGTATCCTGTGCAAAGTGCCGCTACAAATATGAACTTATTTCAGGAAACATAGTTAATATTGAGTCAGAAGAAATTAGCATGGATGCTCCTGCATGGGAAAAAGCAC
TCCGATTCTTGAATATAATGAAGCAAAAAATTCCTGCTGCTGTGCACTCCATTGTGGTACAAACTCCCTCTGGAGTGGCACGAACCCAGAAGTTTGCTACTGAAACAGCA
GATCTCCCAGCACGAGAGGGAGAAAGAGTGACAATTGCTGCTGCAGCTCCGTCAAATGTATTCAGAGAAGTTGGTCCTATTAAATTTAGTCCAAAGGATCCCAATTTTTA
CTCTGGGGAGCCTATGTGCCTGACAAATCATTCAGATGGCCGGGAGTCACTATTATTAAGAGTGCCAGCAAAGGGAACCTCATCCTTACTTAACCCATCGATCCTCTTTC
CACTCATAGTTTTGTCTGCCGCTGGAGATGCTGCCTCTGGAGTTATTGACCCCGGCTTGCCTCGGTTGCTTTTAGTTGCTGGATTTGCTTCTCTAGCTGCAGGAGCTACC
TTGAATTCATTAATTTTGCCTCAATTCAATCGGCTTCCTCAACGATCAGTTGATATCATTGCTATCAAGCAGCAGCTTTTATCTCAATATAATGTGCTTCAGTCTCGTAT
CAGGGATTTAAAACTAGCTGCTGAAAAGGAGGTTTGGATGTTGGCTCGGATGTGCCAATTAGAGAACAAAATTTTTGCTGTAGGAGAACCTTCTTACCGCGCACGTAGAA
GTAGGATAAAAAAGGTGCGAGAAGGCTTGGAAAATTCCCTTAAGCAACGGATTGAACTAATAGAAAGCTATGCAAGGATTTCCTCGATGATTGAGATTGAAGTTGAAATG
GAGTCTGATGTTATTGCAGCTGAAGCAGCCAGCAGTGTGGAAAGGGTTTCTGAACAGATAGAGCAAATCATGGCGCTGGAAAATCTAGAAGAGGCATGTCTTTATCACTG
TGAACTATTTTTTTGTTTGCATGCCTTACTATTTTCTCCCCCTATTCTTCTATTTTGGGCAGCTGGGACAGCTCAGCTTAGATGGAAAATACAAGCAGAAGCCAACGATG
AAGCCGAAAGACTTCTCAACCAATCAATGCCAACAGAAACGGTTTAG
Protein sequenceShow/hide protein sequence
MFFLMFINHFNNFWYRVDAPPAPRKPFTNGTFVPKVSAPAQPQTNATPRQRPQTLDSLFANMKEQRLRVLSQRQNGGGVQQQQQRNVGRQQRPPWGKRPFCERGKKNQRR
ERKQAFDKFYAIFLHQFSFVSIQNLNSTMILNLSSPWLTITRLPPPKLFEPLASATNGATVLMPLLLCSHALFAFTSFSKSMRVRASLSGSDIDGAAAFENPVSELLDNE
LIRVVSGAEDADEALGMIADKSGRSGGTVSVSDCRLIIAAALKRNNPELALSVFYAMRSTFYQAWEGVNENASTVERWKWSRPDVHVYTLLIQGLAASLRVSDALRMIEI
ICRVGVSPAEEVPFGKIVQCPSCMVAVAVAQPQHGIQIVSCAKCRYKYELISGNIVNIESEEISMDAPAWEKALRFLNIMKQKIPAAVHSIVVQTPSGVARTQKFATETA
DLPAREGERVTIAAAAPSNVFREVGPIKFSPKDPNFYSGEPMCLTNHSDGRESLLLRVPAKGTSSLLNPSILFPLIVLSAAGDAASGVIDPGLPRLLLVAGFASLAAGAT
LNSLILPQFNRLPQRSVDIIAIKQQLLSQYNVLQSRIRDLKLAAEKEVWMLARMCQLENKIFAVGEPSYRARRSRIKKVREGLENSLKQRIELIESYARISSMIEIEVEM
ESDVIAAEAASSVERVSEQIEQIMALENLEEACLYHCELFFCLHALLFSPPILLFWAAGTAQLRWKIQAEANDEAERLLNQSMPTETV