CuGenDBv2

Clc01G06250 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G06250
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: CTP_transf_like domain-containing protein
Genome location: ClcChr01:6068165..6075726
RNA-Seq Expression: Clc01G06250
Synteny: Clc01G06250
Gene Ontology terms: GO:0009058 - biosynthetic process (biological process)
                     GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold
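
The genome location in this record follows a `chromosome:start..end` pattern. A minimal sketch of how such a string could be split and the locus span computed is given below. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper for illustration; not part of any database API.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return chrom, start, end


chrom, start, end = parse_location("ClcChr01:6068165..6075726")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus covers 7562 bp, which is longer than the mRNA listed further down in this record, as would be expected if the gene contains introns.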


Homology
GenBank top hits: e value, % identity, alignment
KAG7016735.1 hypothetical protein SDJN02_21845 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 7.8e-188; identity: 78.79%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        M DT  +AAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEA+VPYSR SMIQLLGK                        VPSQ CSR+TAEE
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        MALLAYNRALKLSRPGYPVLGVGFTGSLAT HPKFGDH               RMHMSTR +NR WVSTITLSKGLRTREQEEILS HLLLK       A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        IA ACKVPGTFVSDLT+SDL EECETLF+EDEELEQLI GEVCFKVYPFL ETFT+DAERKIILSGSFNPLHDGH+KLLEVAT                 
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV
        +ICG GYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGD KKMLE LLRIKNTG  FLV
Subjt:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV

Query:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI
        GGRN+NG+FKVLED++IPQELRDMFISIPAD+FRMDISSTQIRKQLGI
Subjt:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI

XP_004147117.1 uncharacterized protein LOC101217608 isoform X1 [Cucumis sativus] (e value: 1.1e-189; identity: 79.02%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        MADTWARAA DAIHL+PTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSR+SM+QLLGK                        VPSQFCS RTAEE
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        +ALLAYNRALKLSRPGYPVLGVGFTGSLAT HPK G+H               RMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLK       A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        IA ACKVPGTFVSDLTQSDL EECETLFTEDEELEQLIKG+VCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLE AT                 
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV
        ++C DGYPCFELSAVNADKPPLSVSQIKDRVEQF+KVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGD KKMLE L++IKN GC FLV
Subjt:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV

Query:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI
         GR+++GVFKVLEDIDIPQELRD FI IPAD+FRMDISSTQIR+QLGI
Subjt:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI

XP_008467184.1 PREDICTED: uncharacterized protein LOC103504595 [Cucumis melo] (e value: 3.1e-192; identity: 80.58%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        MADTWARAA DAIHL+PTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSR+SMIQLLGK                        VPSQFCSRRTAEE
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        MALLAYNRALKLSRPGYPVLGV FTGSLAT HPK GDH               RMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLK       A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        IA+ACKVPGTFVSDLTQSDL EECETLFTEDEELEQLIKGEVCFKVYPFLSET TSDAE+KIILSGSFNPLHDGHIKLLEVAT                 
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV
        +IC DGYPCFELSAVNADKPPLSVSQIKDRVEQFKK+GKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGD KKMLE L+RIKNT   FLV
Subjt:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV

Query:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI
         GR++NGVFKVLEDIDIPQELRDMFI IPAD+FRMDISSTQIRKQLGI
Subjt:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI

XP_023550267.1 uncharacterized protein LOC111808493 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 7.8e-188; identity: 79.02%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        M DT  +AAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEA+VPYSR SMIQLLGK                        VPSQ CSR+TAEE
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        MALLAYNRALKLSRPGYPVLGVGFTGSLAT HPKFGDH               RMHMSTR SNR WVSTITLSKGLRTREQEEILS HLLL+       A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        IA ACKVPGTFVSDLT+SDL EECETLF+EDEELEQLI GEVCFKVYPFL ETFT+DAERKIILSGSFNPLHDGH+KLLEVAT                 
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV
        +ICG GYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGD KKMLE LLRIKNTG  FLV
Subjt:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV

Query:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI
        GGRN+NGVFKVLED++IPQELRDMFISIPAD+FRMDISSTQIRKQLGI
Subjt:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI

XP_038906633.1 uncharacterized protein LOC120092580 isoform X1 [Benincasa hispida] (e value: 1.0e-195; identity: 81.25%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGK                        VPSQFCSRRT EE
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        +ALLAYNRALKLSRPGYPVLGVGFTGSLAT HPK GDH               RMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLL+K       A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        IADACKVPGTFVSDLT+SDL E+ ETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGH+KLLEVAT                 
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV
        +ICGDGYPCFE+SAVNADKPPLSVSQIKDR+EQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGD KKMLE LLRIK+TG  FLV
Subjt:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV

Query:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI
        GGRNVNG+FKVLEDIDIPQEL+DMFISIPAD+FRMDISSTQIRKQLGI
Subjt:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI

TrEMBL top hits: e value, % identity, alignment
A0A1S3CSY7 uncharacterized protein LOC103504595 (e value: 1.5e-192; identity: 80.58%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        MADTWARAA DAIHL+PTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSR+SMIQLLGK                        VPSQFCSRRTAEE
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        MALLAYNRALKLSRPGYPVLGV FTGSLAT HPK GDH               RMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLK       A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        IA+ACKVPGTFVSDLTQSDL EECETLFTEDEELEQLIKGEVCFKVYPFLSET TSDAE+KIILSGSFNPLHDGHIKLLEVAT                 
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV
        +IC DGYPCFELSAVNADKPPLSVSQIKDRVEQFKK+GKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGD KKMLE L+RIKNT   FLV
Subjt:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV

Query:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI
         GR++NGVFKVLEDIDIPQELRDMFI IPAD+FRMDISSTQIRKQLGI
Subjt:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI

A0A5D3BLW5 CTP_transf_2 domain-containing protein (e value: 1.5e-192; identity: 80.58%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        MADTWARAA DAIHL+PTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSR+SMIQLLGK                        VPSQFCSRRTAEE
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        MALLAYNRALKLSRPGYPVLGV FTGSLAT HPK GDH               RMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLK       A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        IA+ACKVPGTFVSDLTQSDL EECETLFTEDEELEQLIKGEVCFKVYPFLSET TSDAE+KIILSGSFNPLHDGHIKLLEVAT                 
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV
        +IC DGYPCFELSAVNADKPPLSVSQIKDRVEQFKK+GKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGD KKMLE L+RIKNT   FLV
Subjt:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV

Query:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI
         GR++NGVFKVLEDIDIPQELRDMFI IPAD+FRMDISSTQIRKQLGI
Subjt:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI

A0A6J1DHD5 uncharacterized protein LOC111020072 (e value: 9.7e-184; identity: 75.89%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        MADTW RAAVDA+H +PTQAVLYLSGGASQAIGWLLSVPGASGTVLEA+VPYSRHSMIQLLGK                        VPSQFCS +T EE
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        MALLAYNRALKLS PGYPVLGVGFTGSLAT HPKFGDH               RMHMSTRSSNRHWVST+TLSKGLRTR+QEEILSGHLLLK       A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        IA+ACKVPGTFV DLTQSDL +ECETLFTED+ELEQ+I+GEVCFKVYPFLSE + S+AERKIILSGSFNPLHDGH+KLLEVAT                 
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICG-DGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFL
        +ICG DGYPCFE+SAVNADKPPLSVSQIKDRVE+FK VGKSVIISNQPYFYKKAELFPGSAFVIGADTA RLIDPKYYDGD KKML+ L+R K+TGC FL
Subjt:  NICG-DGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFL

Query:  VGGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLG
        VGGRN++GVFKVLED  IP+ELRDMFI IP D+FRMDISSTQIRKQLG
Subjt:  VGGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLG

A0A6J1FHZ1 uncharacterized protein LOC111445506 isoform X1 (e value: 1.9e-187; identity: 78.79%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        M DT  +AAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEA+VPYSR SMIQLLGK                        VPSQ CSR+TAEE
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        MALLAYNRALKLSRPGYPVLGVGFTGSLAT HPKFGDH               RMHMSTR +NR WVSTITLSKGLRTREQEEILS HLLLK       A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        IA ACKVPGTFVSDLT+SDL EECETLF+EDEELEQL  GEVCFKVYPFL ETFT+DAERKIILSGSFNPLHDGH+KLLEVAT                 
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV
        +ICG GYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGD KKMLE LLRIKNTG  FLV
Subjt:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV

Query:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI
        GGRN+NGVFKVLED++IPQELRDMFISIPAD+FRMDISSTQIRKQLGI
Subjt:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI

A0A6J1JW61 uncharacterized protein LOC111489421 isoform X1 (e value: 1.4e-187; identity: 78.57%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        M DT  +AAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEA+VPYSR SMIQLLGK                        VPSQ CSR+TAEE
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        MALLAYNRALKLSRPGYPVLGVGFTGSLAT HPKFGDH               RMHMSTR +NR WVSTITLSKGLRTREQEEILS HLLL+       A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        IA ACKVPGTFVSDLT+SDL EECETLF+EDEELEQLI GEVCFKVYPFL ETFT+DAERK+ILSGSFNPLHDGH+KLLEVAT                 
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV
         ICG GYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGD KKMLE LLRIKNTG  FLV
Subjt:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV

Query:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI
        GGRN+NGVFKVLED++IPQELRDMFISIPAD+FRMDISSTQIRKQLGI
Subjt:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLGI

SwissProt top hits: e value, % identity, alignment
No hits found
Arabidopsis top hits: e value, % identity, alignment
AT2G01220.1 Nucleotidylyl transferase superfamily protein (e value: 1.7e-140; identity: 59.06%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        M D   R  V+AIH SPTQAV+YL GGAS A+GWL+SVPGAS T+LE++VPYSR SM+QLLG+                        VPSQ CS+  A+E
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        MALLAYNRALKLS+PGYPVLGVGFTGSL T  PK GDH               R  +S R+S+R   S++TL+K LR+RE+E+ +S       C+ IQ A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        +A AC+V GT  S LT+S++  E ET FTE++ELEQLI G +C K+YPF S    SD +RKIIL GSFNPLH+GH+KLLEVA                 +
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV
        ++CG GYPCFE+SA+NADKPPL+++QIKDRV+QF+ VGK++I+SNQPYFYKKAELFPGS+FVIGADTA RL++PKYY+G  K+MLE L   K TGC FLV
Subjt:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV

Query:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLG
        GGRNV+GVFKVLED+DIP+E+ DMFISIPAD FRMDISST+IRK+ G
Subjt:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLG

AT2G01220.2 Nucleotidylyl transferase superfamily protein (e value: 5.3e-142; identity: 59.06%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        M D   R  V+AIH SPTQAV+YL GGAS A+GWL+SVPGAS T+LE++VPYSR SM+QLLG+                        VPSQ CS+  A+E
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        MALLAYNRALKLS+PGYPVLGVGFTGSL T  PK GDH               R  +S R+S+R   S++TL+K LR+RE+E+ +S       C+ IQ A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        +A AC+V GT  S LT+S++  E ET FTE++ELEQLI G +C K+YPF  E   SD +RKIIL GSFNPLH+GH+KLLEVA                 +
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV
        ++CG GYPCFE+SA+NADKPPL+++QIKDRV+QF+ VGK++I+SNQPYFYKKAELFPGS+FVIGADTA RL++PKYY+G  K+MLE L   K TGC FLV
Subjt:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV

Query:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLG
        GGRNV+GVFKVLED+DIP+E+ DMFISIPAD FRMDISST+IRK+ G
Subjt:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLG

AT3G27610.1 Nucleotidylyl transferase superfamily protein (e value: 9.7e-128; identity: 54.14%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        MA++  R  V++IH SPTQAV+YLSGGASQ++GWL+SVPGAS T+LEA+VPYS  SM+QLLG+                        VP+Q CS+  A E
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        MALLAYNRALKLS+PG  VLGVGFTG+LAT  PK GDH               R  +S R+SNR W +++TL+KG R+RE+E+ ++  +L++       A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL
        +A AC+V  T  S LT S++  E    F+E+EELEQLI G++C K+YPF  E++ SD +RKIIL GSFNPLHDG +KLLE A                  
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQIL

Query:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV
             GYPCFE+SA+NADKP L+V++IKDRV+QF+ + K+VI+SNQP+FYKKAELFPGS+FVIGADTA R+++PKYY+G  K+MLE L   K TGC FLV
Subjt:  NICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLV

Query:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLG
        GGRNV+ VFKVL+D +IP+E+  MF SI AD FRMDISST++RK  G
Subjt:  GGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLG

AT3G27610.2 Nucleotidylyl transferase superfamily protein (e value: 1.8e-126; identity: 54.02%)
Query:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE
        MA++  R  V++IH SPTQAV+YLSGGASQ++GWL+SVPGAS T+LEA+VPYS  SM+QLLG+                        VP+Q CS+  A E
Subjt:  MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEE

Query:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA
        MALLAYNRALKLS+PG  VLGVGFTG+LAT  PK GDH               R  +S R+SNR W +++TL+KG R+RE+E+ ++  +L++       A
Subjt:  MALLAYNRALKLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMA

Query:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPF-LSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQI
        +A AC+V  T  S LT S++  E    F+E+EELEQLI G++C K+YPF  +E++ SD +RKIIL GSFNPLHDG +KLLE A                 
Subjt:  IADACKVPGTFVSDLTQSDLSEECETLFTEDEELEQLIKGEVCFKVYPF-LSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQI

Query:  LNICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFL
              GYPCFE+SA+NADKP L+V++IKDRV+QF+ + K+VI+SNQP+FYKKAELFPGS+FVIGADTA R+++PKYY+G  K+MLE L   K TGC FL
Subjt:  LNICGDGYPCFELSAVNADKPPLSVSQIKDRVEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFL

Query:  VGGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLG
        VGGRNV+ VFKVL+D +IP+E+  MF SI AD FRMDISST++RK  G
Subjt:  VGGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISSTQIRKQLG


Sequences
CDS sequence
ATGGCGGATACGTGGGCCAGAGCTGCCGTCGATGCCATTCACTTGAGTCCTACACAGGCCGTTCTCTACCTTTCCGGCGGCGCTTCTCAGGCTATAGGATGGTTGCTCTC
AGTTCCGGGAGCTTCCGGCACAGTCCTCGAAGCTCTTGTGCCTTACTCCCGGCATTCCATGATCCAGTTACTCGGCAAGCTGCACCGGAGTTTAGGACAGCTGTGGAATT
TGATTCTCATGGATGAAGTGTTCAATATAAGGGTGGTCCTGGTCCCCTCTCAATTCTGCAGCCGTAGAACTGCTGAAGAAATGGCTTTGCTGGCTTATAATCGTGCTCTG
AAGCTATCTAGGCCAGGTTATCCTGTTCTTGGTGTCGGTTTTACTGGTTCTTTGGCTACCATTCATCCAAAATTTGGTGACCACAGTTTTTGTTCACTAGCCCTCACCTC
TCTCTCTTGTATTATGCGTAGGATGCATATGTCAACAAGATCGTCTAACCGACATTGGGTTTCCACAATCACACTATCGAAGGGTTTACGTACTCGGGAGCAAGAGGAGA
TACTCTCTGGTCATCTTTTGCTTAAGGTATGTTCGAAGATCCAAATGGCAATTGCAGACGCTTGTAAAGTTCCTGGAACCTTTGTTTCAGATTTGACTCAATCTGATTTA
TCAGAAGAATGTGAAACACTATTCACCGAGGATGAAGAACTAGAGCAACTTATAAAGGGGGAAGTTTGCTTTAAGGTCTATCCATTTTTAAGTGAGACCTTTACATCAGA
TGCAGAAAGAAAGATAATACTTTCTGGTTCTTTTAATCCATTACATGATGGTCACATCAAGCTTTTAGAGGTTGCAACCAGGTACTATTCTGGGAAGCTTCATCAACCAC
ATGAAATCTACCAGATATTGAACATTTGTGGTGATGGGTATCCTTGTTTTGAATTATCAGCCGTGAATGCTGATAAACCACCCCTGTCAGTATCACAGATCAAAGATCGT
GTTGAGCAATTTAAAAAAGTTGGAAAGTCAGTAATCATTTCTAATCAGCCGTACTTTTACAAGAAAGCCGAACTCTTTCCTGGTAGTGCTTTTGTAATTGGTGCTGACAC
TGCAGTAAGGTTGATAGATCCCAAATACTATGATGGGGATTGCAAGAAGATGCTGGAGACTTTGCTCCGAATAAAAAACACTGGGTGCATGTTTCTTGTTGGTGGTCGAA
ACGTAAATGGTGTTTTCAAAGTTCTTGAAGATATTGATATTCCACAAGAGCTAAGAGACATGTTTATCTCAATACCTGCAGACAGATTTCGTATGGACATTTCCTCCACC
CAAATAAGAAAACAACTAGGAATTTAA
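
A coding sequence like the one above can be sanity-checked before translation. The sketch below tests three common properties: an ATG start codon, a length divisible by three, and a terminal stop codon. The helper name `cds_checks` is hypothetical, and the toy input is only the first four and last three codons of this CDS joined together for illustration, not the full sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_checks(cds: str) -> dict:
    """Return basic completeness checks for a coding sequence.

    Hypothetical helper for illustration; whitespace and line breaks
    in the input are ignored.
    """
    seq = "".join(cds.split()).upper()
    return {
        "length": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
    }


# Toy input: first four codons (ATG GCG GAT ACG) and last three codons
# (GGA ATT TAA) of the CDS above, concatenated. Not a real gene model.
toy = "ATGGCGGATACG" + "GGAATTTAA"
result = cds_checks(toy)
print(result)
```

To check the real record, the full CDS text could be passed in as a single string; line breaks are stripped inside the helper.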
mRNA sequence
TGGAAGAAGGAGAAGCGATGGCGGATACGTGGGCCAGAGCTGCCGTCGATGCCATTCACTTGAGTCCTACACAGGCCGTTCTCTACCTTTCCGGCGGCGCTTCTCAGGCT
ATAGGATGGTTGCTCTCAGTTCCGGGAGCTTCCGGCACAGTCCTCGAAGCTCTTGTGCCTTACTCCCGGCATTCCATGATCCAGTTACTCGGCAAGCTGCACCGGAGTTT
AGGACAGCTGTGGAATTTGATTCTCATGGATGAAGTGTTCAATATAAGGGTGGTCCTGGTCCCCTCTCAATTCTGCAGCCGTAGAACTGCTGAAGAAATGGCTTTGCTGG
CTTATAATCGTGCTCTGAAGCTATCTAGGCCAGGTTATCCTGTTCTTGGTGTCGGTTTTACTGGTTCTTTGGCTACCATTCATCCAAAATTTGGTGACCACAGTTTTTGT
TCACTAGCCCTCACCTCTCTCTCTTGTATTATGCGTAGGATGCATATGTCAACAAGATCGTCTAACCGACATTGGGTTTCCACAATCACACTATCGAAGGGTTTACGTAC
TCGGGAGCAAGAGGAGATACTCTCTGGTCATCTTTTGCTTAAGGTATGTTCGAAGATCCAAATGGCAATTGCAGACGCTTGTAAAGTTCCTGGAACCTTTGTTTCAGATT
TGACTCAATCTGATTTATCAGAAGAATGTGAAACACTATTCACCGAGGATGAAGAACTAGAGCAACTTATAAAGGGGGAAGTTTGCTTTAAGGTCTATCCATTTTTAAGT
GAGACCTTTACATCAGATGCAGAAAGAAAGATAATACTTTCTGGTTCTTTTAATCCATTACATGATGGTCACATCAAGCTTTTAGAGGTTGCAACCAGGTACTATTCTGG
GAAGCTTCATCAACCACATGAAATCTACCAGATATTGAACATTTGTGGTGATGGGTATCCTTGTTTTGAATTATCAGCCGTGAATGCTGATAAACCACCCCTGTCAGTAT
CACAGATCAAAGATCGTGTTGAGCAATTTAAAAAAGTTGGAAAGTCAGTAATCATTTCTAATCAGCCGTACTTTTACAAGAAAGCCGAACTCTTTCCTGGTAGTGCTTTT
GTAATTGGTGCTGACACTGCAGTAAGGTTGATAGATCCCAAATACTATGATGGGGATTGCAAGAAGATGCTGGAGACTTTGCTCCGAATAAAAAACACTGGGTGCATGTT
TCTTGTTGGTGGTCGAAACGTAAATGGTGTTTTCAAAGTTCTTGAAGATATTGATATTCCACAAGAGCTAAGAGACATGTTTATCTCAATACCTGCAGACAGATTTCGTA
TGGACATTTCCTCCACCCAAATAAGAAAACAACTAGGAATTTAAACTTCTAGGTGACTTATTCTTGCTGCCCGATCTTTCAAGTGACAGGCAATGCTGCATTTTGATTAG
GGGTAAATATTGGTTCATTTTTTGCATTCAAGGATTTGAATTGCTCTTTTTGGAGGACTATCAGCATGCTCAAAATCCTATTTTCAGTGGTGACTAATGAATGGTTTAAA
TTTATTTTGATGCTTATTTTTATTTTATTATTATTGGAGTCTGGAGTGTCCTATTCTTG
Protein sequence
MADTWARAAVDAIHLSPTQAVLYLSGGASQAIGWLLSVPGASGTVLEALVPYSRHSMIQLLGKLHRSLGQLWNLILMDEVFNIRVVLVPSQFCSRRTAEEMALLAYNRAL
KLSRPGYPVLGVGFTGSLATIHPKFGDHSFCSLALTSLSCIMRRMHMSTRSSNRHWVSTITLSKGLRTREQEEILSGHLLLKVCSKIQMAIADACKVPGTFVSDLTQSDL
SEECETLFTEDEELEQLIKGEVCFKVYPFLSETFTSDAERKIILSGSFNPLHDGHIKLLEVATRYYSGKLHQPHEIYQILNICGDGYPCFELSAVNADKPPLSVSQIKDR
VEQFKKVGKSVIISNQPYFYKKAELFPGSAFVIGADTAVRLIDPKYYDGDCKKMLETLLRIKNTGCMFLVGGRNVNGVFKVLEDIDIPQELRDMFISIPADRFRMDISST
QIRKQLGI