CuGenDBv2

MC06g0434 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC06g0434
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Unknown protein
Genome location: MC06:3578825..3579604
RNA-Seq Expression: MC06g0434
Synteny: MC06g0434
Gene Ontology terms: NA
InterPro domains: NA
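
The genome location string can be split into a chromosome name and a coordinate pair. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state explicitly); `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re

def parse_location(loc):
    """Parse 'chrom:start..end' into (chrom, start, end, length).

    Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("MC06:3578825..3579604")
print(chrom, start, end, length)  # MC06 3578825 3579604 780
```

Under that assumption the locus spans 780 bp on MC06.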


Homology
GenBank top hits | e value | %identity | Alignment
KAG6589708.1 hypothetical protein SDJN03_15131, partial [Cucurbita argyrosperma subsp. sororia] | 1.23e-125 | 73.72
Query:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTP
        MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND + 
Subjt:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTP

Query:  LTRAT----EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY
        LT +T    EK+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N I SPTAE DRN  +K+ ++PD LNKKASSKPRWY
Subjt:  LTRAT----EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY

Query:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCN-RSSGEAAWRLLRALSCKNHSSVDVTASLTA
        L+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+E KGK+ CN RSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCN-RSSGEAAWRLLRALSCKNHSSVDVTASLTA

KAG7023388.1 hypothetical protein SDJN02_14413 [Cucurbita argyrosperma subsp. argyrosperma] | 1.32e-125 | 73.72
Query:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTP
        MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND + 
Subjt:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTP

Query:  LTRAT----EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY
        LT +T    EK+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N I SPTAE DRN  +K+ ++PD LNKKASSKPRWY
Subjt:  LTRAT----EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY

Query:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCN-RSSGEAAWRLLRALSCKNHSSVDVTASLTA
        L+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+E KGK+ CN RSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCN-RSSGEAAWRLLRALSCKNHSSVDVTASLTA

XP_022135307.1 uncharacterized protein LOC111007302, partial [Momordica charantia] | 1.03e-150 | 99.55
Query:  MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRATEKNF
        MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRATEKNF
Subjt:  MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRATEKNF

Query:  KKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEM
        KKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPI SPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEM
Subjt:  KKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEM

Query:  ELSDIKSRQVRRSSSALFPAHEDK
        ELSDIKSRQVRRSSSALFPAHEDK
Subjt:  ELSDIKSRQVRRSSSALFPAHEDK

XP_022921809.1 uncharacterized protein LOC111429953 [Cucurbita moschata] | 1.32e-125 | 73.72
Query:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTP
        MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND + 
Subjt:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTP

Query:  LTRAT----EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY
        LT +T    EK+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N I SPTAE DRN  +K+ ++PD LNKKASSKPRWY
Subjt:  LTRAT----EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY

Query:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCN-RSSGEAAWRLLRALSCKNHSSVDVTASLTA
        L+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+E KGK+ CN RSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCN-RSSGEAAWRLLRALSCKNHSSVDVTASLTA

XP_038879756.1 uncharacterized protein LOC120071507 [Benincasa hispida] | 1.17e-126 | 76.23
Query:  MDDTNESNPLTLNHT--GEDEEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLT-RAT
        MDD+ ESNPL   H    +DE  ESLS SDLPLD +KSD HT  +  RKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND +PLT R  
Subjt:  MDDTNESNPLTLNHT--GEDEEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLT-RAT

Query:  EKNFKKEE-SRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVK
        EK+F KEE +RKQ  FRKRSESLSGLQSSVSRSNS+K NLKR+S+SLDYRKLYRQ N I SPTAE DRN  + + ++PD+LNKKASSKPRWYL+MFGMVK
Subjt:  EKNFKKEE-SRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVK

Query:  FPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        FPAEM+L DIKSRQVRRSSS LFPA+E+KGKF CNRSSGEAAWR+LRALSCKNH+SVDVTASLTA
Subjt:  FPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA

TrEMBL top hits | e value | %identity | Alignment
A0A1S3BW36 uncharacterized protein LOC103494265 | 4.06e-122 | 73.31
Query:  MDDTNESNPLTLN---HTGED--EEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTR
        MDDT+ESNPLT     H  +D  E  ESLS SDLP+D + SD HT  D FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND +    
Subjt:  MDDTNESNPLTLN---HTGED--EEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTR

Query:  ATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMV
        A +  +K+E SRKQ  FRKRSESLSGLQSSVSRSNS+K NLKR+S+SLDYR+LYRQ N I SPTAE DRN  +K+ ++PD+LNKK SSKPRWYL+MFGMV
Subjt:  ATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMV

Query:  KFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        KFPAEMELSDIKSRQVRRSSS LFP++E K KF C RSSGEA WR+LRALSCKNH+SVDVTASLTA
Subjt:  KFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA

A0A5A7UVS6 Uncharacterized protein | 4.06e-122 | 73.31
Query:  MDDTNESNPLTLN---HTGED--EEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTR
        MDDT+ESNPLT     H  +D  E  ESLS SDLP+D + SD HT  D FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND +    
Subjt:  MDDTNESNPLTLN---HTGED--EEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTR

Query:  ATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMV
        A +  +K+E SRKQ  FRKRSESLSGLQSSVSRSNS+K NLKR+S+SLDYR+LYRQ N I SPTAE DRN  +K+ ++PD+LNKK SSKPRWYL+MFGMV
Subjt:  ATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMV

Query:  KFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        KFPAEMELSDIKSRQVRRSSS LFP++E K KF C RSSGEA WR+LRALSCKNH+SVDVTASLTA
Subjt:  KFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA

A0A6J1C2C2 uncharacterized protein LOC111007302 | 5.00e-151 | 99.55
Query:  MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRATEKNF
        MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRATEKNF
Subjt:  MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRATEKNF

Query:  KKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEM
        KKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPI SPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEM
Subjt:  KKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEM

Query:  ELSDIKSRQVRRSSSALFPAHEDK
        ELSDIKSRQVRRSSSALFPAHEDK
Subjt:  ELSDIKSRQVRRSSSALFPAHEDK

A0A6J1E1L2 uncharacterized protein LOC111429953 | 6.37e-126 | 73.72
Query:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTP
        MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND + 
Subjt:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTP

Query:  LTRAT----EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY
        LT +T    EK+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N I SPTAE DRN  +K+ ++PD LNKKASSKPRWY
Subjt:  LTRAT----EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY

Query:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCN-RSSGEAAWRLLRALSCKNHSSVDVTASLTA
        L+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+E KGK+ CN RSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCN-RSSGEAAWRLLRALSCKNHSSVDVTASLTA

A0A6J1JAL3 uncharacterized protein LOC111485063 | 5.21e-125 | 73.36
Query:  MDDTNESNPLTLNHTG---------EDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTP
        MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND + 
Subjt:  MDDTNESNPLTLNHTG---------EDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTP

Query:  LTRAT----EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY
        LT +T    +K+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N I SPTAE DRN  +K+ ++PD LNKKASSKPRWY
Subjt:  LTRAT----EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY

Query:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCN-RSSGEAAWRLLRALSCKNHSSVDVTASLTA
        L+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA E KGK+ CN RSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCN-RSSGEAAWRLLRALSCKNHSSVDVTASLTA

SwissProt top hits | e value | %identity | Alignment
No hits found
Arabidopsis top hits | e value | %identity | Alignment
AT4G30230.1 unknown protein | 3.5e-29 | 36.63
Query:  DTNESNPLTL----NHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSE-----PLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLT
        +TN  NP  L     H  E+E+ ++LSL DLPL              KNP  +++E       +LFEF T+   S +++PAE++IF G+L+PLN Q    
Subjt:  DTNESNPLTL----NHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSE-----PLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLT

Query:  RATEKNFKKEESRKQAAFRKRSESLSGLQS-SVSRSNSSKINLK------RSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKA--SSKP
           E   ++         R RSESLS +Q   ++R  S  +  +      R+S+SLDYRKL R    + SP    + +   K+  +P+  +  +  S +P
Subjt:  RATEKNFKKEESRKQAAFRKRSESLSGLQS-SVSRSNSSKINLK------RSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKA--SSKP

Query:  RWYLMMFGMVKFPAEMELSDIKSRQVRRS-SSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTA
        RWY++MFGMVKFP E+EL DIKSRQ+RR+    +FP+  ++        S   +WR L ALSCK  +SV  TA
Subjt:  RWYLMMFGMVKFPAEMELSDIKSRQVRRS-SSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTA
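
The %identity values in the homology tables can be related to the alignment midlines (the unlabeled line between Query and Subjt), where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below is an illustration only, assuming identity is counted as identical columns divided by alignment length, gaps included; `percent_identity` is a hypothetical helper, not a CuGenDB or BLAST function.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline character is a letter.

    The midline is right-padded to the query length because trailing
    spaces (mismatches at the end of a block) are often stripped.
    """
    if not query:
        raise ValueError("empty alignment block")
    midline = midline.ljust(len(query))
    matches = sum(ch.isalpha() for ch in midline)
    return 100.0 * matches / len(query)

# A 16-column fragment from the XP_022135307.1 alignment: one mismatch (L vs. blank).
print(percent_identity("LYRQGNPILSPTAEGD", "LYRQGNPI SPTAEGD"))  # 93.75

# The full XP_022135307.1 alignment has 224 columns with that single mismatch.
print(round(100.0 * 223 / 224, 2))  # 99.55
```

The second figure agrees with the 99.55 reported for XP_022135307.1 and A0A6J1C2C2, which suggests the column is computed this way, though the page itself does not say so.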


Sequences
CDS sequence
ATGGACGACACCAACGAATCCAATCCTCTGACCTTGAATCACACAGGAGAAGACGAAGAACACGAATCGCTCTCCCTTTCCGATCTTCCACTCGATAACGACAAATCCGA
CGACCACACTTTCGACGGCTTCCGCAAGAATCCGCGAAGATCCTCATCCGAGCCTCTCGATCTCTTCGAATTCTTCACCACCGGACTCAGAAGCTCTGAGATATCTCCGG
CCGAGGATTTGATCTTCTGCGGCAGATTGCTTCCTCTCAACGATCAGACTCCGCTTACGCGCGCCACGGAGAAGAATTTCAAAAAGGAAGAGAGCCGAAAGCAGGCTGCC
TTTCGAAAACGCTCCGAGTCGTTGTCCGGATTACAGAGCTCTGTTTCTCGATCGAACAGTTCAAAAATCAACCTCAAGCGGAGCAGTAAATCGCTCGATTACCGGAAACT
CTACCGCCAAGGGAATCCGATTCTCTCGCCCACGGCGGAAGGCGATCGTAATTATCCGATGAAGAGCGCCGTGAGACCTGATGCGCTGAACAAAAAGGCGTCGTCGAAGC
CGCGGTGGTACTTGATGATGTTTGGAATGGTGAAATTTCCGGCGGAGATGGAGCTCAGCGACATTAAGAGCAGACAAGTCCGCCGCAGTTCGTCGGCGCTTTTCCCGGCG
CACGAGGATAAAGGTAAGTTTCCGTGCAATCGGAGCTCCGGCGAGGCGGCCTGGAGACTCCTCCGGGCGCTAAGCTGCAAGAACCACTCTAGCGTAGATGTAACGGCGTC
GTTAACTGCC
mRNA sequence
ATGGACGACACCAACGAATCCAATCCTCTGACCTTGAATCACACAGGAGAAGACGAAGAACACGAATCGCTCTCCCTTTCCGATCTTCCACTCGATAACGACAAATCCGA
CGACCACACTTTCGACGGCTTCCGCAAGAATCCGCGAAGATCCTCATCCGAGCCTCTCGATCTCTTCGAATTCTTCACCACCGGACTCAGAAGCTCTGAGATATCTCCGG
CCGAGGATTTGATCTTCTGCGGCAGATTGCTTCCTCTCAACGATCAGACTCCGCTTACGCGCGCCACGGAGAAGAATTTCAAAAAGGAAGAGAGCCGAAAGCAGGCTGCC
TTTCGAAAACGCTCCGAGTCGTTGTCCGGATTACAGAGCTCTGTTTCTCGATCGAACAGTTCAAAAATCAACCTCAAGCGGAGCAGTAAATCGCTCGATTACCGGAAACT
CTACCGCCAAGGGAATCCGATTCTCTCGCCCACGGCGGAAGGCGATCGTAATTATCCGATGAAGAGCGCCGTGAGACCTGATGCGCTGAACAAAAAGGCGTCGTCGAAGC
CGCGGTGGTACTTGATGATGTTTGGAATGGTGAAATTTCCGGCGGAGATGGAGCTCAGCGACATTAAGAGCAGACAAGTCCGCCGCAGTTCGTCGGCGCTTTTCCCGGCG
CACGAGGATAAAGGTAAGTTTCCGTGCAATCGGAGCTCCGGCGAGGCGGCCTGGAGACTCCTCCGGGCGCTAAGCTGCAAGAACCACTCTAGCGTAGATGTAACGGCGTC
GTTAACTGCC
Protein sequence
MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRATEKNFKKEESRKQAA
FRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPILSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEMELSDIKSRQVRRSSSALFPA
HEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA
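
The protein sequence can be checked against the CDS by translating the nucleotides codon by codon. The sketch below is a minimal, dependency-free illustration, assuming the standard nuclear genetic code and a CDS that starts in frame; `translate` and `CODON_TABLE` are hypothetical names introduced here, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# with the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame CDS; whitespace and line breaks are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing ambiguous bases (e.g. N) are rendered as 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )

# First seven codons of the CDS listed above.
print(translate("ATGGACGACACCAACGAATCC"))  # MDDTNES
```

Pasting the full CDS, line breaks included, into `translate` gives a sequence to compare with the protein listed above.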