CuGenDBv2

Moc06g04870 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g04870
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:3577163..3577957
RNA-Seq Expression: Moc06g04870
Synteny: Moc06g04870
Gene Ontology terms: NA
InterPro domains: NA
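
As a quick sanity check on the genome location above, the following minimal sketch parses the `chr:start..end` string and computes the length of the interval. It assumes the coordinates are 1-based and inclusive (the usual GFF convention; the page itself does not state this), and `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr6:3577163..3577957")
span = end - start + 1  # inclusive interval length in bp
print(chrom, start, end, span)
```

Under that assumption the interval is 795 bp long, the same length as the CDS listed on this page, which would be consistent with a single-exon gene model.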


Homology
GenBank top hits | e value | % identity | Alignment
KAG6589708.1 hypothetical protein SDJN03_15131, partial [Cucurbita argyrosperma subsp. sororia] | 6.8e-99 | 74.09
Query:  MDDTNESNPLTLNH---------TGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTP
        MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND + 
Subjt:  MDDTNESNPLTLNH---------TGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTP

Query:  LT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY
        LT    R  EK+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N IFSPTAE DRN  +K+ ++PD LNKKASSKPRWY
Subjt:  LT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWY

Query:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        L+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+E KGK+ C NRSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  LMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA

KAG7023388.1 hypothetical protein SDJN02_14413 [Cucurbita argyrosperma subsp. argyrosperma] | 4.0e-99 | 73.91
Query:  MEMDDTNESNPLTLNH---------TGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQ
        M MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND 
Subjt:  MEMDDTNESNPLTLNH---------TGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQ

Query:  TPLT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPR
        + LT    R  EK+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N IFSPTAE DRN  +K+ ++PD LNKKASSKPR
Subjt:  TPLT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPR

Query:  WYLMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        WYL+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+E KGK+ C NRSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  WYLMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA

XP_022135307.1 uncharacterized protein LOC111007302, partial [Momordica charantia] | 5.4e-120 | 100
Query:  MFMEMDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRAT
        MFMEMDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRAT
Subjt:  MFMEMDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRAT

Query:  EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKF
        EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKF
Subjt:  EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKF

Query:  PAEMELSDIKSRQVRRSSSALFPAHEDK
        PAEMELSDIKSRQVRRSSSALFPAHEDK
Subjt:  PAEMELSDIKSRQVRRSSSALFPAHEDK

XP_022921809.1 uncharacterized protein LOC111429953 [Cucurbita moschata] | 4.0e-99 | 73.91
Query:  MEMDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQ
        M MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND 
Subjt:  MEMDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQ

Query:  TPLT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPR
        + LT    R  EK+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N IFSPTAE DRN  +K+ ++PD LNKKASSKPR
Subjt:  TPLT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPR

Query:  WYLMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        WYL+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+E KGK+ C NRSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  WYLMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA

XP_038879756.1 uncharacterized protein LOC120071507 [Benincasa hispida] | 8.1e-100 | 76.4
Query:  MEMDDTNESNPLTLNH--TGEDEEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPL-TR
        M MDD+ ESNPL   H    +DE  ESLS SDLPLD +KSD HT  +  RKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND +PL TR
Subjt:  MEMDDTNESNPLTLNH--TGEDEEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPL-TR

Query:  ATEKNFKKEE-SRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGM
          EK+F KEE +RKQ  FRKRSESLSGLQSSVSRSNS+K NLKR+S+SLDYRKLYRQ N IFSPTAE DRN  + + ++PD+LNKKASSKPRWYL+MFGM
Subjt:  ATEKNFKKEE-SRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGM

Query:  VKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        VKFPAEM+L DIKSRQVRRSSS LFPA+E+KGKF CNRSSGEAAWR+LRALSCKNH+SVDVTASLTA
Subjt:  VKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BW36 uncharacterized protein LOC103494265 | 3.4e-96 | 73.68
Query:  MDDTNESNPLTL---NHTGED--EEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTR
        MDDT+ESNPLT     H  +D  E  ESLS SDLP+D + SD HT  D FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND +    
Subjt:  MDDTNESNPLTL---NHTGED--EEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTR

Query:  ATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMV
        A +  +K+E SRKQ  FRKRSESLSGLQSSVSRSNS+K NLKR+S+SLDYR+LYRQ N IFSPTAE DRN  +K+ ++PD+LNKK SSKPRWYL+MFGMV
Subjt:  ATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMV

Query:  KFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        KFPAEMELSDIKSRQVRRSSS LFP++E K KF C RSSGEA WR+LRALSCKNH+SVDVTASLTA
Subjt:  KFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA

A0A5A7UVS6 Uncharacterized protein | 3.4e-96 | 73.68
Query:  MDDTNESNPLTL---NHTGED--EEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTR
        MDDT+ESNPLT     H  +D  E  ESLS SDLP+D + SD HT  D FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND +    
Subjt:  MDDTNESNPLTL---NHTGED--EEHESLSLSDLPLDNDKSDDHTF-DGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTR

Query:  ATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMV
        A +  +K+E SRKQ  FRKRSESLSGLQSSVSRSNS+K NLKR+S+SLDYR+LYRQ N IFSPTAE DRN  +K+ ++PD+LNKK SSKPRWYL+MFGMV
Subjt:  ATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMV

Query:  KFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        KFPAEMELSDIKSRQVRRSSS LFP++E K KF C RSSGEA WR+LRALSCKNH+SVDVTASLTA
Subjt:  KFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA

A0A6J1C2C2 uncharacterized protein LOC111007302 | 2.6e-120 | 100
Query:  MFMEMDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRAT
        MFMEMDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRAT
Subjt:  MFMEMDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRAT

Query:  EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKF
        EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKF
Subjt:  EKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKF

Query:  PAEMELSDIKSRQVRRSSSALFPAHEDK
        PAEMELSDIKSRQVRRSSSALFPAHEDK
Subjt:  PAEMELSDIKSRQVRRSSSALFPAHEDK

A0A6J1E1L2 uncharacterized protein LOC111429953 | 1.9e-99 | 73.91
Query:  MEMDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQ
        M MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND 
Subjt:  MEMDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQ

Query:  TPLT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPR
        + LT    R  EK+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N IFSPTAE DRN  +K+ ++PD LNKKASSKPR
Subjt:  TPLT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPR

Query:  WYLMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        WYL+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+E KGK+ C NRSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  WYLMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA

A0A6J1JAL3 uncharacterized protein LOC111485063 | 9.6e-99 | 73.55
Query:  MEMDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQ
        M MDDT+ES PLT  H           E+E  E+LS SDLPLD  KSD HT + FRKNPRRSSSEPLDLFEFF+ G  +SEISPAEDLIFCGRLLPLND 
Subjt:  MEMDDTNESNPLTLNHT---------GEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQ

Query:  TPLT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPR
        + LT    R  +K+F K+ESRKQ  FRKRSESLSGLQSSVSRSN++KINLKR+S+SLDYRKLYRQ N IFSPTAE DRN  +K+ ++PD LNKKASSKPR
Subjt:  TPLT----RATEKNFKKEESRKQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPR

Query:  WYLMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA
        WYL+MFGMVKFPAEM+LSDIKSRQVRRSSSALFPA E KGK+ C NRSSGEA WR+LRALSCKN++SVDVTASLTA
Subjt:  WYLMMFGMVKFPAEMELSDIKSRQVRRSSSALFPAHEDKGKFPC-NRSSGEAAWRLLRALSCKNHSSVDVTASLTA

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT4G30230.1 unknown protein | 1.2e-29 | 36.63
Query:  DTNESNPLTL----NHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSE-----PLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLT
        +TN  NP  L     H  E+E+ ++LSL DLPL              KNP  +++E       +LFEF T+   S +++PAE++IF G+L+PLN Q    
Subjt:  DTNESNPLTL----NHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSE-----PLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLT

Query:  RATEKNFKKEESRKQAAFRKRSESLSGLQS-SVSRSNSSKINLK------RSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKA--SSKP
           E   ++         R RSESLS +Q   ++R  S  +  +      R+S+SLDYRKL R    + SP    + +   K+  +P+  +  +  S +P
Subjt:  RATEKNFKKEESRKQAAFRKRSESLSGLQS-SVSRSNSSKINLK------RSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKA--SSKP

Query:  RWYLMMFGMVKFPAEMELSDIKSRQVRRS-SSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTA
        RWY++MFGMVKFP E+EL DIKSRQ+RR+    +FP+  ++        S   +WR L ALSCK  +SV  TA
Subjt:  RWYLMMFGMVKFPAEMELSDIKSRQVRRS-SSALFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTA


Sequences
CDS sequence
ATGTTCATGGAGATGGACGACACCAACGAATCCAATCCTCTGACCTTGAATCACACAGGAGAAGACGAAGAACACGAATCGCTCTCCCTTTCCGATCTTCCACTCGATAA
CGACAAATCCGACGACCACACTTTCGACGGCTTCCGCAAGAATCCGCGAAGATCCTCATCCGAGCCTCTCGATCTCTTCGAATTCTTCACCACCGGACTCAGAAGCTCTG
AGATATCTCCGGCCGAGGATTTGATCTTCTGCGGCAGATTGCTTCCTCTCAACGATCAGACTCCGCTTACGCGCGCCACGGAGAAGAATTTCAAAAAGGAAGAGAGCCGA
AAGCAGGCTGCCTTTCGAAAACGGTCCGAGTCGTTGTCCGGATTACAGAGCTCTGTTTCTCGATCGAACAGTTCAAAAATCAACCTCAAGCGGAGCAGTAAATCGCTCGA
TTACCGGAAACTCTACCGCCAAGGGAATCCGATTTTCTCGCCCACGGCGGAAGGCGATCGTAATTATCCGATGAAGAGCGCCGTGAGACCTGATGCGCTGAACAAAAAGG
CGTCGTCGAAGCCGCGGTGGTACTTGATGATGTTTGGAATGGTGAAATTTCCGGCGGAGATGGAGCTCAGCGACATTAAGAGCAGACAAGTCCGCCGCAGTTCGTCGGCG
CTTTTCCCGGCGCACGAGGATAAAGGTAAGTTTCCGTGCAATCGGAGCTCCGGCGAGGCGGCCTGGAGACTCCTCCGGGCGCTAAGCTGCAAGAACCACTCTAGCGTAGA
TGTAACGGCGTCGTTAACTGCCTAA
mRNA sequence
ATGTTCATGGAGATGGACGACACCAACGAATCCAATCCTCTGACCTTGAATCACACAGGAGAAGACGAAGAACACGAATCGCTCTCCCTTTCCGATCTTCCACTCGATAA
CGACAAATCCGACGACCACACTTTCGACGGCTTCCGCAAGAATCCGCGAAGATCCTCATCCGAGCCTCTCGATCTCTTCGAATTCTTCACCACCGGACTCAGAAGCTCTG
AGATATCTCCGGCCGAGGATTTGATCTTCTGCGGCAGATTGCTTCCTCTCAACGATCAGACTCCGCTTACGCGCGCCACGGAGAAGAATTTCAAAAAGGAAGAGAGCCGA
AAGCAGGCTGCCTTTCGAAAACGGTCCGAGTCGTTGTCCGGATTACAGAGCTCTGTTTCTCGATCGAACAGTTCAAAAATCAACCTCAAGCGGAGCAGTAAATCGCTCGA
TTACCGGAAACTCTACCGCCAAGGGAATCCGATTTTCTCGCCCACGGCGGAAGGCGATCGTAATTATCCGATGAAGAGCGCCGTGAGACCTGATGCGCTGAACAAAAAGG
CGTCGTCGAAGCCGCGGTGGTACTTGATGATGTTTGGAATGGTGAAATTTCCGGCGGAGATGGAGCTCAGCGACATTAAGAGCAGACAAGTCCGCCGCAGTTCGTCGGCG
CTTTTCCCGGCGCACGAGGATAAAGGTAAGTTTCCGTGCAATCGGAGCTCCGGCGAGGCGGCCTGGAGACTCCTCCGGGCGCTAAGCTGCAAGAACCACTCTAGCGTAGA
TGTAACGGCGTCGTTAACTGCCTAA
Protein sequence
MFMEMDDTNESNPLTLNHTGEDEEHESLSLSDLPLDNDKSDDHTFDGFRKNPRRSSSEPLDLFEFFTTGLRSSEISPAEDLIFCGRLLPLNDQTPLTRATEKNFKKEESR
KQAAFRKRSESLSGLQSSVSRSNSSKINLKRSSKSLDYRKLYRQGNPIFSPTAEGDRNYPMKSAVRPDALNKKASSKPRWYLMMFGMVKFPAEMELSDIKSRQVRRSSSA
LFPAHEDKGKFPCNRSSGEAAWRLLRALSCKNHSSVDVTASLTA
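
The relationship between the CDS and the protein sequence listed above can be checked with a short translation routine. The sketch below is illustrative only: it assumes the standard nuclear genetic code (NCBI translation table 1), and `translate` is a hypothetical helper, not a CuGenDB function. It builds the codon table from the conventional TCAG ordering and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed on this page.
print(translate("ATGTTCATGGAGATGGACGACACC"))
```

Passing the full CDS (the wrapped lines joined into one string) to `translate` and comparing the result with the listed protein sequence is a simple way to confirm that the two records agree.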