CuGenDBv2

MS002243 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS002243
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Unknown protein
Genome location: scaffold30:3575274..3576053
RNA-Seq Expression: MS002243
Synteny: MS002243
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits: e value, % identity, alignment
KAG6589708.1 hypothetical protein SDJN03_15131, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.4e-99; % identity: 74.45)
Query:  MDDTNESNPLTLNH---------TGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTP
        MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G I+SEISPAEDLIFCGRLLPLND + 
Subjt:  MDDTNESNPLTLNH---------TGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTP

Query:  LT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY
        LT    R  EK+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N IFSPTAE DRN  +K+ ++PD LNKKASSKPRWY
Subjt:  LT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY

Query:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        L+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+E KGK+ C NRSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA

KAG7023388.1 hypothetical protein SDJN02_14413 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.4e-99; % identity: 74.45)
Query:  MDDTNESNPLTLNH---------TGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTP
        MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G I+SEISPAEDLIFCGRLLPLND + 
Subjt:  MDDTNESNPLTLNH---------TGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTP

Query:  LT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY
        LT    R  EK+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N IFSPTAE DRN  +K+ ++PD LNKKASSKPRWY
Subjt:  LT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY

Query:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        L+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+E KGK+ C NRSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA

XP_022135307.1 uncharacterized protein LOC111007302, partial [Momordica charantia] (e value: 1.6e-116; % identity: 99.55)
Query:  MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPLTRATEKNF
        MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGL SSEISPAEDLIFCGRLLPLNDQTPLTRATEKNF
Subjt:  MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPLTRATEKNF

Query:  KKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEM
        KKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEM
Subjt:  KKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEM

Query:  ELSDIKSRQVRRSSSALFPAHEDK
        ELSDIKSRQVRRSSSALFPAHEDK
Subjt:  ELSDIKSRQVRRSSSALFPAHEDK

XP_022921809.1 uncharacterized protein LOC111429953 [Cucurbita moschata] (e value: 1.4e-99; % identity: 74.45)
Query:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTP
        MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G I+SEISPAEDLIFCGRLLPLND + 
Subjt:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTP

Query:  LT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY
        LT    R  EK+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N IFSPTAE DRN  +K+ ++PD LNKKASSKPRWY
Subjt:  LT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY

Query:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        L+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+E KGK+ C NRSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA

XP_038879756.1 uncharacterized protein LOC120071507 [Benincasa hispida] (e value: 2.7e-100; % identity: 76.98)
Query:  MDDTNESNPLTLNH--TGEDEEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPL-TRAT
        MDD+ ESNPL   H    +DE  ESLS SDLPLD +KSD HT  +  RKNPRRSSSEPLDLFEFF+ G I+SEISPAEDLIFCGRLLPLND +PL TR  
Subjt:  MDDTNESNPLTLNH--TGEDEEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPL-TRAT

Query:  EKNFKKEE-SRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVK
        EK+F KEE +RKQ  FRKRSESLSGLQSSVSRSNS+K NLKR+S+SLDYRKLYRQ N IFSPTAE DRN  + + ++PD+LNKKASSKPRWYL+MFGMVK
Subjt:  EKNFKKEE-SRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVK

Query:  FPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        FPAEM+L DIKSRQVRRSSS LFPA+E+KGKF CNRSSGEAAWR+LRALSCKNH+SVDVTASLTA
Subjt:  FPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA

TrEMBL top hits: e value, % identity, alignment
A0A1S3BW36 uncharacterized protein LOC103494265 (e value: 6.8e-97; % identity: 74.06)
Query:  MDDTNESNPLTL---NHTGED--EEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPLTR
        MDDT+ESNPLT     H  +D  E  ESLS SDLP+D + SD HT  D FRKNPRRSSSEPLDLFEFF+ G I+SEISPAEDLIFCGRLLPLND +    
Subjt:  MDDTNESNPLTL---NHTGED--EEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPLTR

Query:  ATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMV
        A +  +K+E SRKQ  FRKRSESLSGLQSSVSRSNS+K NLKR+S+SLDYR+LYRQ N IFSPTAE DRN  +K+ ++PD+LNKK SSKPRWYL+MFGMV
Subjt:  ATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMV

Query:  KFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        KFPAEMELSDIKSRQVRRSSS LFP++E K KF C RSSGEA WR+LRALSCKNH+SVDVTASLTA
Subjt:  KFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA

A0A5A7UVS6 Uncharacterized protein (e value: 6.8e-97; % identity: 74.06)
Query:  MDDTNESNPLTL---NHTGED--EEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPLTR
        MDDT+ESNPLT     H  +D  E  ESLS SDLP+D + SD HT  D FRKNPRRSSSEPLDLFEFF+ G I+SEISPAEDLIFCGRLLPLND +    
Subjt:  MDDTNESNPLTL---NHTGED--EEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPLTR

Query:  ATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMV
        A +  +K+E SRKQ  FRKRSESLSGLQSSVSRSNS+K NLKR+S+SLDYR+LYRQ N IFSPTAE DRN  +K+ ++PD+LNKK SSKPRWYL+MFGMV
Subjt:  ATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMV

Query:  KFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        KFPAEMELSDIKSRQVRRSSS LFP++E K KF C RSSGEA WR+LRALSCKNH+SVDVTASLTA
Subjt:  KFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA

A0A6J1C2C2 uncharacterized protein LOC111007302 (e value: 7.7e-117; % identity: 99.55)
Query:  MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPLTRATEKNF
        MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGL SSEISPAEDLIFCGRLLPLNDQTPLTRATEKNF
Subjt:  MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPLTRATEKNF

Query:  KKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEM
        KKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEM
Subjt:  KKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEM

Query:  ELSDIKSRQVRRSSSALFPAHEDK
        ELSDIKSRQVRRSSSALFPAHEDK
Subjt:  ELSDIKSRQVRRSSSALFPAHEDK

A0A6J1E1L2 uncharacterized protein LOC111429953 (e value: 6.6e-100; % identity: 74.45)
Query:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTP
        MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G I+SEISPAEDLIFCGRLLPLND + 
Subjt:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTP

Query:  LT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY
        LT    R  EK+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N IFSPTAE DRN  +K+ ++PD LNKKASSKPRWY
Subjt:  LT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY

Query:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        L+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+E KGK+ C NRSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA

A0A6J1JAL3 uncharacterized protein LOC111485063 (e value: 3.3e-99; % identity: 74.09)
Query:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTP
        MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G I+SEISPAEDLIFCGRLLPLND + 
Subjt:  MDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTP

Query:  LT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY
        LT    R  +K+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N IFSPTAE DRN  +K+ ++PD LNKKASSKPRWY
Subjt:  LT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY

Query:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        L+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA E KGK+ C NRSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA

SwissProt top hits: e value, % identity, alignment
No hits found
Arabidopsis top hits: e value, % identity, alignment
AT4G30230.1 unknown protein (e value: 1.6e-29; % identity: 36.63)
Query:  DTNESNPLTL----NHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSE-----PLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPLT
        +TN  NP  L     H  E+E+ ++LSL DLPL              KNP  +++E       +LFEF T+   S +++PAE++IF G+L+PLN Q    
Subjt:  DTNESNPLTL----NHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSE-----PLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPLT

Query:  RATEKNFKKEESRKQAAFRKRSESLSGLQS-SVSRSNSSKINLK------RSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKA--SSKP
           E   ++         R RSESLS +Q   ++R  S  +  +      R+S+SLDYRKL R    + SP    + +   K+  +P+  +  +  S +P
Subjt:  RATEKNFKKEESRKQAAFRKRSESLSGLQS-SVSRSNSSKINLK------RSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKA--SSKP

Query:  RWYLMMFGMVKFPAEMELSDIKSRQVRRS-SSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTA
        RWY++MFGMVKFP E+EL DIKSRQ+RR+    +FP+  ++        S   +WR L ALSCK  +SV  TA
Subjt:  RWYLMMFGMVKFPAEMELSDIKSRQVRRS-SSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTA
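The middle line of each alignment block above encodes the match state per column: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal sketch of how identities and positives could be counted from such a midline. The function name `alignment_stats` is hypothetical and not part of CuGenDB or BLAST, and the percentage here is taken over the columns supplied, which may differ from how the reported % identity was computed.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a BLAST-style midline.

    Assumption: a letter in the midline means an identical residue,
    '+' means a conservative substitution, and a space means a
    mismatch or a gap column.
    """
    if not midline or len(query) != len(midline):
        raise ValueError("query and midline must be non-empty and equally long")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    length = len(midline)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": round(100.0 * identities / length, 2),
    }


# First 11 columns of the first GenBank alignment on this page.
stats = alignment_stats("MDDTNESNPLT", "MDDT+ES PLT")
print(stats)
```

Running the same function over the concatenated query and midline rows of a full alignment would give a whole-alignment estimate; the short slice here only illustrates the counting.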


Sequences
CDS sequence
ATGGACGACACCAACGAATCCAATCCTCTGACCTTGAATCACACAGGAGAAGACGAAGAACACGAATCGCTCTCCCTTTCCGATCTTCCACTCGATAACGACAAATCCGA
CGACCACACTTTCGACGGCTTCCGCAAGAATCCGCGAAGATCCTCATCCGAGCCTCTCGATCTCTTCGAATTCTTCACCACCGGACTCATAAGCTCTGAGATATCTCCGG
CCGAGGATTTGATCTTCTGCGGCAGATTGCTTCCTCTCAACGATCAGACTCCGCTTACGCGCGCCACGGAGAAGAATTTCAAAAAGGAAGAGAGCCGAAAGCAGGCTGCC
TTTCGAAAACGCTCCGAGTCGTTGTCCGGATTACAGAGCTCTGTTTCTCGATCGAACAGTTCAAAAATCAACCTCAAGCGGAGCAGTAAATCGCTCGATTACCGGAAACT
CTACCGCCAAGGTAATCCGATTTTCTCGCCCACGGCGGAAGGCGATCGTAATTATCCGATGAAGAGCGCCGTGAGACCTGATGCGCTGAACAAAAAGGCGTCGTCGAAGC
CGCGGTGGTACTTGATGATGTTTGGAATGGTGAAATTTCCGGCGGAGATGGAGCTCAGCGACATTAAGAGCAGACAAGTCCGCCGCAGTTCGTCGGCGCTTTTCCCGGCG
CACGAGGATAAAGGTAAGTTTCCGTGCAATCGGAGCTCCGGCGAGGCGGCCTGGAGACTCCTCCGGGCGCTAAGCTGCAAGAACCACTCTAGCGTAGATGTAACGGCGTC
GTTAACTGCC
mRNA sequence
ATGGACGACACCAACGAATCCAATCCTCTGACCTTGAATCACACAGGAGAAGACGAAGAACACGAATCGCTCTCCCTTTCCGATCTTCCACTCGATAACGACAAATCCGA
CGACCACACTTTCGACGGCTTCCGCAAGAATCCGCGAAGATCCTCATCCGAGCCTCTCGATCTCTTCGAATTCTTCACCACCGGACTCATAAGCTCTGAGATATCTCCGG
CCGAGGATTTGATCTTCTGCGGCAGATTGCTTCCTCTCAACGATCAGACTCCGCTTACGCGCGCCACGGAGAAGAATTTCAAAAAGGAAGAGAGCCGAAAGCAGGCTGCC
TTTCGAAAACGCTCCGAGTCGTTGTCCGGATTACAGAGCTCTGTTTCTCGATCGAACAGTTCAAAAATCAACCTCAAGCGGAGCAGTAAATCGCTCGATTACCGGAAACT
CTACCGCCAAGGTAATCCGATTTTCTCGCCCACGGCGGAAGGCGATCGTAATTATCCGATGAAGAGCGCCGTGAGACCTGATGCGCTGAACAAAAAGGCGTCGTCGAAGC
CGCGGTGGTACTTGATGATGTTTGGAATGGTGAAATTTCCGGCGGAGATGGAGCTCAGCGACATTAAGAGCAGACAAGTCCGCCGCAGTTCGTCGGCGCTTTTCCCGGCG
CACGAGGATAAAGGTAAGTTTCCGTGCAATCGGAGCTCCGGCGAGGCGGCCTGGAGACTCCTCCGGGCGCTAAGCTGCAAGAACCACTCTAGCGTAGATGTAACGGCGTC
GTTAACTGCC
Protein sequence
MDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLISSEISPAEDLIFCGRLLPLNDQTPLTRATEKNFKKEESRKQAA
FRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEMELSDIKSRQVRRSSSALFPA
HEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA
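
The CDS, the protein sequence, and the genome coordinates above can be cross-checked against each other. The sketch below assumes the standard genetic code (NCBI translation table 1) and 1-based inclusive coordinates; the helper names `translate` and `span_length` are illustrative only. It translates the first 42 nucleotides of the CDS and computes the length of the genomic span, which comes out at 780 bp, i.e. 260 codons, in line with the 260 residues listed in the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive genomic interval (assumed convention)."""
    return end - start + 1


# First 42 nt of the CDS listed above.
cds_prefix = "ATGGACGACACCAACGAATCCAATCCTCTGACCTTGAATCAC"
peptide = translate(cds_prefix)
gene_span = span_length(3575274, 3576053)

print(peptide)         # MDDTNESNPLTLNH
print(gene_span)       # 780
print(gene_span // 3)  # 260
```

Passing the full CDS to `translate` and comparing the result with the protein sequence would extend the check to the whole record.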