CuGenDBv2

HG10018897 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10018897
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Genome location: Chr04:10766971..10780571
RNA-Seq Expression: HG10018897
Synteny: HG10018897
Gene Ontology terms: NA
InterPro domains: IPR040306 - Os02g0753200-like


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031490.1 hypothetical protein E6C27_scaffold139G002510 [Cucumis melo var. makuwa]; e value: 4.1e-116; % identity: 70.03
Query:  IHGLKIKPVRSPSTESAMSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSF
        IHGLKI+P+RSPSTESAMSLALLQGYSSAEEEAE NS FNHTSSDD+DEDLAA AASTVTVNLSIRDKSLFELPQPSS PGLPSAFDAFS          
Subjt:  IHGLKIKPVRSPSTESAMSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSF

Query:  PDFVFRGLPEVSGPPEFLNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMR
                 EVSGPPEFLNNSVEEYAAP+D DQPRGGHG RRNRKEKKDLPT                                                
Subjt:  PDFVFRGLPEVSGPPEFLNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMR

Query:  ALAGVRDLLPRRCHGGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLC
                             GAVLEAKAQLVGIHERVRSDVESN ASN SIS AT EGKRV TAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLC
Subjt:  ALAGVRDLLPRRCHGGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLC

Query:  GDRPPEPDSETKKKGSTVKDKEKIKRIRGQSSHATWKSETEMQLRQQ
        GDRPPEPDSETKKKGSTVKDKEKIKR+RGQSSHATWKSETEMQLRQQ
Subjt:  GDRPPEPDSETKKKGSTVKDKEKIKRIRGQSSHATWKSETEMQLRQQ

KAG6571683.1 putative CRM domain-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]; e value: 7.3e-113; % identity: 67.14
Query:  KSLKSVIHGLKIKPVRSPSTESAMSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSE
        K  +S IH LKI+PV SPSTESAMSLALLQGYSSAEEEAEDNS FNHTSSDD+DEDLAA AAS+VT NLSIRDKSLFELPQPSSHPGLPSAFDAFS    
Subjt:  KSLKSVIHGLKIKPVRSPSTESAMSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSE

Query:  FGDCSFPDFVFRGLPEVSGPPEFLNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLA
                       EVSGPPEFLNNSVEEYAAP+D+DQPRGGHGSRRNRKEKKD PT                                          
Subjt:  FGDCSFPDFVFRGLPEVSGPPEFLNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLA

Query:  LIGKMRALAGVRDLLPRRCHGGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARG
                                   GAV+EAKAQLVGIHERVRSD ESNQ SNPS+S  T +GKR+ TAANPNAEDAAELLRMCLHCGIPKTFSNARG
Subjt:  LIGKMRALAGVRDLLPRRCHGGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARG

Query:  MFCPLCGDRPPEPDSETKKKGSTVKDKEKIKRIRGQSSHATWKSETEMQLRQQ
        MFCPLCGDRPPEPDSETKKKGSTVKDKEKIKR+RGQSSHA+WKSETEM LRQQ
Subjt:  MFCPLCGDRPPEPDSETKKKGSTVKDKEKIKRIRGQSSHATWKSETEMQLRQQ

XP_004136870.2 uncharacterized protein LOC101218280 [Cucumis sativus]; e value: 4.1e-108; % identity: 67.15
Query:  IKPVRSPSTESAMSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLA---ATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPD
        ++P+RSPSTESAMSL+LLQGYSSAEEEA+ NS FNHTSSDD+DEDLA   A AASTVTVNLSIRDKSLFELPQPSS PGLPSAFDAFS            
Subjt:  IKPVRSPSTESAMSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLA---ATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPD

Query:  FVFRGLPEVSGPPEFLNNSVEEYAAPKDIDQPRG--GHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMR
               EVSGPPEFLNNSVEEYAAP+D DQPRG  GHG RRNRKEKKDLPT                                                
Subjt:  FVFRGLPEVSGPPEFLNNSVEEYAAPKDIDQPRG--GHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMR

Query:  ALAGVRDLLPRRCHGGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLC
                             GAVLEAKAQLVGIHERVRSDVESN +SN SIS ATPE KRV TAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLC
Subjt:  ALAGVRDLLPRRCHGGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLC

Query:  GDRPPEPDSETKKKGSTVKDKEKIKRIRGQSSHATWKSETEMQLRQQ
        GDRPPEPDSE+KKKGSTVKDKEKIKR+RGQSSHATWKSETEMQLRQQ
Subjt:  GDRPPEPDSETKKKGSTVKDKEKIKRIRGQSSHATWKSETEMQLRQQ

XP_008455215.2 PREDICTED: uncharacterized protein LOC103495434 [Cucumis melo]; e value: 2.7e-107; % identity: 69.09
Query:  MSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPDFVFRGLPEVSGPPEF
        MSLALLQGYSSAEEEAE NS FNHTSSDD+DEDLAA AASTVTVNLSIRDKSLFELPQPSS PGLPSAFDAFS                   EVSGPPEF
Subjt:  MSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPDFVFRGLPEVSGPPEF

Query:  LNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRALAGVRDLLPRRCHGGW
        LNNSVEEYAAP+D DQPRGGHG RRNRKEKKDLPT                                                                 
Subjt:  LNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRALAGVRDLLPRRCHGGW

Query:  VGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKGST
            GAVLEAKAQLVGIHERVRSDVESN ASN SIS AT EGKRV TAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKGST
Subjt:  VGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKGST

Query:  VKDKEKIKRIRGQSSHATWKSETEMQLRQQ
        VKDKEKIKR+RGQSSHATWKSETEMQLRQQ
Subjt:  VKDKEKIKRIRGQSSHATWKSETEMQLRQQ

XP_038888650.1 uncharacterized protein LOC120078451 [Benincasa hispida]; e value: 4.6e-107; % identity: 68.37
Query:  MSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAAT--AASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPDFVFRGLPEVSGPP
        MSLALLQGYSSAEEEAEDNS FNHTSSDD+DEDLAAT  AASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFS                   EVSGPP
Subjt:  MSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAAT--AASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPDFVFRGLPEVSGPP

Query:  EFLNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRALAGVRDLLPRRCHG
        EFLNNSVEEYAAP+++DQPRGGHG RRNRKEKKDLPT                                                               
Subjt:  EFLNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRALAGVRDLLPRRCHG

Query:  GWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKG
              GAVLEAK QLVGIHERVRSDVE+N ASNPSIS ATPE KRV T ANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEP+SETKKKG
Subjt:  GWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKG

Query:  STVKDKEKIKRIRGQSSHATWKSETEMQLRQQ
        STVKDKEKIKR+RGQSSHATWKSETEMQLRQQ
Subjt:  STVKDKEKIKRIRGQSSHATWKSETEMQLRQQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K7B5 Uncharacterized protein; e value: 2.0e-108; % identity: 67.15
Query:  IKPVRSPSTESAMSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLA---ATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPD
        ++P+RSPSTESAMSL+LLQGYSSAEEEA+ NS FNHTSSDD+DEDLA   A AASTVTVNLSIRDKSLFELPQPSS PGLPSAFDAFS            
Subjt:  IKPVRSPSTESAMSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLA---ATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPD

Query:  FVFRGLPEVSGPPEFLNNSVEEYAAPKDIDQPRG--GHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMR
               EVSGPPEFLNNSVEEYAAP+D DQPRG  GHG RRNRKEKKDLPT                                                
Subjt:  FVFRGLPEVSGPPEFLNNSVEEYAAPKDIDQPRG--GHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMR

Query:  ALAGVRDLLPRRCHGGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLC
                             GAVLEAKAQLVGIHERVRSDVESN +SN SIS ATPE KRV TAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLC
Subjt:  ALAGVRDLLPRRCHGGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLC

Query:  GDRPPEPDSETKKKGSTVKDKEKIKRIRGQSSHATWKSETEMQLRQQ
        GDRPPEPDSE+KKKGSTVKDKEKIKR+RGQSSHATWKSETEMQLRQQ
Subjt:  GDRPPEPDSETKKKGSTVKDKEKIKRIRGQSSHATWKSETEMQLRQQ

A0A1S3C0J1 uncharacterized protein LOC103495434; e value: 1.3e-107; % identity: 69.09
Query:  MSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPDFVFRGLPEVSGPPEF
        MSLALLQGYSSAEEEAE NS FNHTSSDD+DEDLAA AASTVTVNLSIRDKSLFELPQPSS PGLPSAFDAFS                   EVSGPPEF
Subjt:  MSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPDFVFRGLPEVSGPPEF

Query:  LNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRALAGVRDLLPRRCHGGW
        LNNSVEEYAAP+D DQPRGGHG RRNRKEKKDLPT                                                                 
Subjt:  LNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRALAGVRDLLPRRCHGGW

Query:  VGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKGST
            GAVLEAKAQLVGIHERVRSDVESN ASN SIS AT EGKRV TAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKGST
Subjt:  VGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKGST

Query:  VKDKEKIKRIRGQSSHATWKSETEMQLRQQ
        VKDKEKIKR+RGQSSHATWKSETEMQLRQQ
Subjt:  VKDKEKIKRIRGQSSHATWKSETEMQLRQQ

A0A5A7SL87 Uncharacterized protein; e value: 2.0e-116; % identity: 70.03
Query:  IHGLKIKPVRSPSTESAMSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSF
        IHGLKI+P+RSPSTESAMSLALLQGYSSAEEEAE NS FNHTSSDD+DEDLAA AASTVTVNLSIRDKSLFELPQPSS PGLPSAFDAFS          
Subjt:  IHGLKIKPVRSPSTESAMSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSF

Query:  PDFVFRGLPEVSGPPEFLNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMR
                 EVSGPPEFLNNSVEEYAAP+D DQPRGGHG RRNRKEKKDLPT                                                
Subjt:  PDFVFRGLPEVSGPPEFLNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMR

Query:  ALAGVRDLLPRRCHGGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLC
                             GAVLEAKAQLVGIHERVRSDVESN ASN SIS AT EGKRV TAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLC
Subjt:  ALAGVRDLLPRRCHGGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLC

Query:  GDRPPEPDSETKKKGSTVKDKEKIKRIRGQSSHATWKSETEMQLRQQ
        GDRPPEPDSETKKKGSTVKDKEKIKR+RGQSSHATWKSETEMQLRQQ
Subjt:  GDRPPEPDSETKKKGSTVKDKEKIKRIRGQSSHATWKSETEMQLRQQ

A0A6J1HK72 uncharacterized protein LOC111463799; e value: 1.0e-104; % identity: 66.67
Query:  MSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPDFVFRGLPEVSGPPEF
        MSLALLQGYSSAEEEAEDNS FNHTSSDD+DEDLAA AAS+VT NLSIRDKSLFELPQPSSHPGLPSAFDAFS                   EVSGPPEF
Subjt:  MSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPDFVFRGLPEVSGPPEF

Query:  LNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRALAGVRDLLPRRCHGGW
        LNNSVEEYAAP+D+DQPRGGHGSRRNRKEK+D PT                                                                 
Subjt:  LNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRALAGVRDLLPRRCHGGW

Query:  VGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKGST
            GAV+EAKAQLVGIHERVRSD ESNQ SNPS+S  T +GKR+ TAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKGST
Subjt:  VGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKGST

Query:  VKDKEKIKRIRGQSSHATWKSETEMQLRQQ
        VKDKEKIKR+RGQSSHA+WKSETEM LRQQ
Subjt:  VKDKEKIKRIRGQSSHATWKSETEMQLRQQ

A0A6J1HRB5 uncharacterized protein LOC111467068; e value: 1.1e-101; % identity: 65.45
Query:  MSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPDFVFRGLPEVSGPPEF
        MSLALLQGYSSAEEEAEDN+ FNHTSSDD+DEDLA  AAS+VT NLSIRDKSLFELPQPSSHPGLPSAFDAFS                   EVSGPPEF
Subjt:  MSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSELSEFGDCSFPDFVFRGLPEVSGPPEF

Query:  LNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRALAGVRDLLPRRCHGGW
        LNNSVEEYAAP+D+DQPRGGHGS RN KEKKDLPT                                                                 
Subjt:  LNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRALAGVRDLLPRRCHGGW

Query:  VGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKGST
            GAV+EAKAQLVGIHERVRSD ESNQ SN S+S  T +GKR+ TAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPD ETKKKGST
Subjt:  VGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKKGST

Query:  VKDKEKIKRIRGQSSHATWKSETEMQLRQQ
        VKDKEKIKR+RGQSSHA+WKSETEM LRQQ
Subjt:  VKDKEKIKRIRGQSSHATWKSETEMQLRQQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G64160.1 unknown protein; e value: 6.3e-46; % identity: 39.16
Query:  MSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSS---HPGLPSAFDAFSELSEFGDCSFPDFVFRGLPEVSGP
        MSL LLQGYSSAEEE  +  AF    + DED D                  S+F+    +S   + GLPSA D FS                   ++SGP
Subjt:  MSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSS---HPGLPSAFDAFSELSEFGDCSFPDFVFRGLPEVSGP

Query:  PEFLNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRALAGVRDLLPRRCH
        PEFLNN  E   A  +       H +R +RK+KK  P                                                               
Subjt:  PEFLNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRALAGVRDLLPRRCH

Query:  GGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKK
               G V+EAK QLVGIHERVR+D+++  +S           KR+ TA NPNAE++A+LLRMC+ CG+PKT+++ARGM CP+CGDR P PD + KKK
Subjt:  GGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSETKKK

Query:  GSTVKDKEKIKRIRGQSSHATWKSETEMQLRQ
        GST+KDKEK KR+RGQSSHA+WKSETEMQLRQ
Subjt:  GSTVKDKEKIKRIRGQSSHATWKSETEMQLRQ


Sequences
CDS sequence
ATGGATACATTGGGGCGGGCTTTCGGATCTGTATCGGACAAAAGTTTGAAATCTGTCATCCATGGACTCAAAATCAAACCAGTTCGATCTCCATCAACTGAATCCGCCAT
GAGTCTGGCACTTCTCCAAGGCTATTCTTCAGCCGAAGAAGAAGCTGAAGACAACTCTGCCTTCAACCACACCTCTTCCGACGATGAGGACGAAGATCTTGCCGCCACCG
CCGCCTCTACGGTTACCGTCAATCTTTCCATACGTGACAAGTCACTCTTCGAACTTCCGCAGCCCTCCTCTCATCCCGGCCTGCCTTCTGCATTCGACGCTTTCTCCGAA
TTGAGCGAATTCGGAGACTGTTCGTTTCCTGATTTTGTTTTCCGTGGGCTACCGGAGGTTTCAGGACCGCCGGAGTTTCTGAATAATTCGGTTGAGGAGTACGCTGCACC
GAAAGATATCGATCAGCCGCGAGGGGGCCATGGGAGCCGTAGGAATCGTAAGGAGAAGAAAGATTTGCCTACTGTTGCCAAACTTGCAAGGAACATGTGTTCATTGCATA
CAAACTGCAATCATTGTGAACAGTGGAGACAGCAACGATTGAACTTGCCTGGGAGGGAGAGAGGAGGTATGAGTGAGGGAAGATTAGCATTGATTGGGAAAATGAGAGCC
CTTGCAGGCGTGAGGGATCTCCTTCCCCGCCGTTGTCATGGAGGGTGGGTTGGTGAAATAGGTGCTGTATTGGAAGCAAAAGCTCAATTAGTTGGGATTCATGAGCGAGT
GAGGAGTGATGTTGAGAGTAATCAAGCATCAAATCCATCCATTTCAATTGCAACACCGGAAGGCAAGCGCGTGGTAACTGCAGCCAATCCAAATGCTGAAGATGCTGCAG
AGCTACTAAGAATGTGCCTGCATTGTGGCATTCCCAAGACGTTTTCAAATGCGCGAGGGATGTTTTGCCCTTTATGTGGTGATCGTCCCCCAGAGCCAGACAGTGAGACA
AAAAAGAAGGGATCTACTGTTAAAGATAAGGAAAAGATAAAGAGAATAAGGGGACAGTCATCTCATGCTACTTGGAAGAGTGAAACAGAGATGCAGCTGAGACAACAGGA
CGGCGTCGAGATGAATGTCATCTGTAGTCTGTGCCAAACAGGAAAATTAGCAGCGCGTATCGAAGATTCGAAGTTGGTTCCAAACGACCTAGGTGCTGAGGTCGAGACTG
AGGCTAAAGCCGAGATAATCTTGGTCGAGGCCAAGAAAACCAAATTGAAGCGAAGAGTCAAGGCTAAGACAATCTTGGCCGAGACCGAGGAGACCAAGTTGAAACGAAGA
GCCAAGGCCAAGGCGACCAATGTTGAGACCGAGGATGCCAAGCTGAAGCGAAGAGCCAAGGTCGAGACAATTCTGGCCCAGCGAAAAGCCAAGGTCGAGACAATCTTAGT
TGAGGCCGAAGAGATCAAGTTGAAGTTAAGTGCCAAGGTCGAAGCAACCAAGGCTGAGGTCAAGGAGATCAAGTTGAAAAGTATAGTCAAGGTCAATGCTGAGGCTCCAA
ATTATTAA
mRNA sequence
ATGGATACATTGGGGCGGGCTTTCGGATCTGTATCGGACAAAAGTTTGAAATCTGTCATCCATGGACTCAAAATCAAACCAGTTCGATCTCCATCAACTGAATCCGCCAT
GAGTCTGGCACTTCTCCAAGGCTATTCTTCAGCCGAAGAAGAAGCTGAAGACAACTCTGCCTTCAACCACACCTCTTCCGACGATGAGGACGAAGATCTTGCCGCCACCG
CCGCCTCTACGGTTACCGTCAATCTTTCCATACGTGACAAGTCACTCTTCGAACTTCCGCAGCCCTCCTCTCATCCCGGCCTGCCTTCTGCATTCGACGCTTTCTCCGAA
TTGAGCGAATTCGGAGACTGTTCGTTTCCTGATTTTGTTTTCCGTGGGCTACCGGAGGTTTCAGGACCGCCGGAGTTTCTGAATAATTCGGTTGAGGAGTACGCTGCACC
GAAAGATATCGATCAGCCGCGAGGGGGCCATGGGAGCCGTAGGAATCGTAAGGAGAAGAAAGATTTGCCTACTGTTGCCAAACTTGCAAGGAACATGTGTTCATTGCATA
CAAACTGCAATCATTGTGAACAGTGGAGACAGCAACGATTGAACTTGCCTGGGAGGGAGAGAGGAGGTATGAGTGAGGGAAGATTAGCATTGATTGGGAAAATGAGAGCC
CTTGCAGGCGTGAGGGATCTCCTTCCCCGCCGTTGTCATGGAGGGTGGGTTGGTGAAATAGGTGCTGTATTGGAAGCAAAAGCTCAATTAGTTGGGATTCATGAGCGAGT
GAGGAGTGATGTTGAGAGTAATCAAGCATCAAATCCATCCATTTCAATTGCAACACCGGAAGGCAAGCGCGTGGTAACTGCAGCCAATCCAAATGCTGAAGATGCTGCAG
AGCTACTAAGAATGTGCCTGCATTGTGGCATTCCCAAGACGTTTTCAAATGCGCGAGGGATGTTTTGCCCTTTATGTGGTGATCGTCCCCCAGAGCCAGACAGTGAGACA
AAAAAGAAGGGATCTACTGTTAAAGATAAGGAAAAGATAAAGAGAATAAGGGGACAGTCATCTCATGCTACTTGGAAGAGTGAAACAGAGATGCAGCTGAGACAACAGGA
CGGCGTCGAGATGAATGTCATCTGTAGTCTGTGCCAAACAGGAAAATTAGCAGCGCGTATCGAAGATTCGAAGTTGGTTCCAAACGACCTAGGTGCTGAGGTCGAGACTG
AGGCTAAAGCCGAGATAATCTTGGTCGAGGCCAAGAAAACCAAATTGAAGCGAAGAGTCAAGGCTAAGACAATCTTGGCCGAGACCGAGGAGACCAAGTTGAAACGAAGA
GCCAAGGCCAAGGCGACCAATGTTGAGACCGAGGATGCCAAGCTGAAGCGAAGAGCCAAGGTCGAGACAATTCTGGCCCAGCGAAAAGCCAAGGTCGAGACAATCTTAGT
TGAGGCCGAAGAGATCAAGTTGAAGTTAAGTGCCAAGGTCGAAGCAACCAAGGCTGAGGTCAAGGAGATCAAGTTGAAAAGTATAGTCAAGGTCAATGCTGAGGCTCCAA
ATTATTAA
Protein sequence
MDTLGRAFGSVSDKSLKSVIHGLKIKPVRSPSTESAMSLALLQGYSSAEEEAEDNSAFNHTSSDDEDEDLAATAASTVTVNLSIRDKSLFELPQPSSHPGLPSAFDAFSE
LSEFGDCSFPDFVFRGLPEVSGPPEFLNNSVEEYAAPKDIDQPRGGHGSRRNRKEKKDLPTVAKLARNMCSLHTNCNHCEQWRQQRLNLPGRERGGMSEGRLALIGKMRA
LAGVRDLLPRRCHGGWVGEIGAVLEAKAQLVGIHERVRSDVESNQASNPSISIATPEGKRVVTAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPDSET
KKKGSTVKDKEKIKRIRGQSSHATWKSETEMQLRQQDGVEMNVICSLCQTGKLAARIEDSKLVPNDLGAEVETEAKAEIILVEAKKTKLKRRVKAKTILAETEETKLKRR
AKAKATNVETEDAKLKRRAKVETILAQRKAKVETILVEAEEIKLKLSAKVEATKAEVKEIKLKSIVKVNAEAPNY