CuGenDBv2

Moc02g08730 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g08730
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr2:6160526..6166113
RNA-Seq Expression: Moc02g08730
Synteny: Moc02g08730
Gene Ontology terms: NA
InterPro domains: IPR040344 - Uncharacterized protein At3g17950-like
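
The genome location string can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr2:6160526..6166113")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 5588 bp on chr2, which is longer than the CDS listed further down, consistent with a gene model that contains introns.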


Homology
GenBank top hits (e value, % identity, alignment)
KAG6570710.1 hypothetical protein SDJN03_29625, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.4e-97; % identity: 57.51)
Query:  EDRLKDKFDILLSTLGVRLP-------PSVGEENED-RPEDL-KTPTSSEAAAVLQCPPPPRKPKRLPSM-KRKARGRSIVMPHN-LFVEM---ELMFPP
        ED  +DKF+ILLSTL +RLP       P   E+N++ RP+DL KTP S E AA+L+CPPPPRKP+RLPS+ KRKA GR   MPH+ +F EM   E++FP 
Subjt:  EDRLKDKFDILLSTLGVRLP-------PSVGEENED-RPEDL-KTPTSSEAAAVLQCPPPPRKPKRLPSM-KRKARGRSIVMPHN-LFVEM---ELMFPP

Query:  HDLLGGKTTNTSYRDTLTRTKDRGGGGVVWFGGKENRGGGGGWMVTTRRTGVVGRGVGEGEEEGSR---ENKDCGKVI--GEGWPLGLQPLNVRGIGVPG
        +    G                                                   G  + + SR   E+  C  +   GEGWPLGLQPLNVR +GVPG
Subjt:  HDLLGGKTTNTSYRDTLTRTKDRGGGGVVWFGGKENRGGGGGWMVTTRRTGVVGRGVGEGEEEGSR---ENKDCGKVI--GEGWPLGLQPLNVRGIGVPG

Query:  NRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRPWFFSLCSRESTDANSIDDNG
        NRD  GSVSFNTLMTASP+SF+DSS+DLDTESTGSFF D +ITLGSLIGVSNILELSRRS+RGR+TE+T  KR N RSR W F LCSRESTD +SI DN 
Subjt:  NRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRPWFFSLCSRESTDANSIDDNG

Query:  PSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLCACICGH
        PSLGHFLAEERRAADE RRNQ + +    +L LAE+A EPNSLFINGCVAPPQ SL S+ E GRNGGTEP ND+ VA+LCACICGH
Subjt:  PSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLCACICGH

KAG6605464.1 hypothetical protein SDJN03_02781, partial [Cucurbita argyrosperma subsp. sororia] (e value: 8.6e-87; % identity: 78.16)
Query:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP
        GEGWPLGLQPLN+R +GVPGNRDY GS+SFNTL+TASPISF+DSSSDLDTESTGSFF DKSITLGSLIGVSNILELSRRS+RGR+TE+T +KR N RSR 
Subjt:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP

Query:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC
        W FSLCSR++TDA+S+ +NGPSLG FL EERRAADE RRNQSV+MYGGD++ LA++A EPNSLFINGCVAPPQSSL S+ +  RNGGTEP ND+G+A+LC
Subjt:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC

Query:  ACICGH
        +C+CGH
Subjt:  ACICGH

XP_022148431.1 uncharacterized protein At3g17950-like [Momordica charantia] (e value: 7.7e-112; % identity: 100)
Query:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP
        GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP
Subjt:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP

Query:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC
        WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC
Subjt:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC

Query:  ACICGH
        ACICGH
Subjt:  ACICGH

XP_038902040.1 uncharacterized protein At3g17950-like isoform X1 [Benincasa hispida] (e value: 1.3e-90; % identity: 83.5)
Query:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP
        GEGWPLGLQPLNVR +GVPGNRDY GSVSFNTLMTASPISFSDSSSDLDTESTGSFF DKSITLGSLIGVSNILELSRRS+RGR+TENT +KR N RSR 
Subjt:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP

Query:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC
        WFFSLCSREST+A+S+ D+GPSLGHFLAEERRAADE RRNQSV+MYG D+L LA++  EPNSLFINGCVAPPQSS+GS  E GRNGGTEP NDN  A++C
Subjt:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC

Query:  ACICGH
        ACICGH
Subjt:  ACICGH

XP_038902041.1 uncharacterized protein At3g17950-like isoform X2 [Benincasa hispida] (e value: 1.3e-90; % identity: 83.5)
Query:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP
        GEGWPLGLQPLNVR +GVPGNRDY GSVSFNTLMTASPISFSDSSSDLDTESTGSFF DKSITLGSLIGVSNILELSRRS+RGR+TENT +KR N RSR 
Subjt:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP

Query:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC
        WFFSLCSREST+A+S+ D+GPSLGHFLAEERRAADE RRNQSV+MYG D+L LA++  EPNSLFINGCVAPPQSS+GS  E GRNGGTEP NDN  A++C
Subjt:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC

Query:  ACICGH
        ACICGH
Subjt:  ACICGH

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KCA6 Uncharacterized protein (e value: 6.0e-86; % identity: 81.64)
Query:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP
        G+GWPLGLQPLNVR +GVPGNRDY GSVSFNTLMTASPISFSDSSSDLDTESTGSFF DKSITLGSLIGVSNILELSRRS+RGR+TE+T DKR N +SR 
Subjt:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP

Query:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRR-NQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVL
        WFFSLCSRESTDA+SI ++GPSLGHFLAEERRAADE RR NQS +MYG D+L LA+   EPNSLFINGCVAPPQ S+GSE E   NGGTEPTNDN VA++
Subjt:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRR-NQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVL

Query:  CACICGH
        C+CICGH
Subjt:  CACICGH

A0A5A7V897 Uncharacterized protein (e value: 6.6e-85; % identity: 81.16)
Query:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP
        G+GWPLGLQPLNVR +GVPGNRDY GSVSFNTLMTASPISFSDSSSDLDTESTGSFF DKSITLGSLIGVSNILELSRRS+RGR+TE+T DKR N +SR 
Subjt:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP

Query:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRR-NQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVL
        WFFSLCSRESTDA+SI ++GPSLGHFLAEERRAADE RR NQS +MYG D+L LA+   EPNSLFINGCVAPPQSS+GS  E   NGGTE TNDN VA++
Subjt:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRR-NQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVL

Query:  CACICGH
        C+CICGH
Subjt:  CACICGH

A0A6J1D5C9 uncharacterized protein At3g17950-like (e value: 3.7e-112; % identity: 100)
Query:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP
        GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP
Subjt:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP

Query:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC
        WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC
Subjt:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC

Query:  ACICGH
        ACICGH
Subjt:  ACICGH

A0A6J1G630 uncharacterized protein At3g17950-like (e value: 1.6e-86; % identity: 77.67)
Query:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP
        GEGWPLGLQPLN+R +GVPGNRDY GS+SFNTL+TASPISF+DSSSDLDTESTGSFF DKSITLGSLIGVSNILELSRRS+RGR+TE+T +KR N RSR 
Subjt:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP

Query:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC
        W FSLCSR++TDA+S+ +N PSLG FL EERRAADE RRNQSV+MYGGD++ LA++A EPNSLFINGCVAPPQ SLGS+ +  RNGGTEP ND+G+A+LC
Subjt:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC

Query:  ACICGH
        +C+CGH
Subjt:  ACICGH

A0A6J1KX21 uncharacterized protein At3g17950-like (e value: 5.1e-85; % identity: 78.16)
Query:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP
        GEGWPLGLQPLN+R +GVPGNRDY GS+SFNTL+TASPISF+DSSSDLDTESTGSFF DKSITLGSLIGVSNILELSRRS+RGR+TE+T +KR N RSR 
Subjt:  GEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRP

Query:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC
        W FSLCSR++TDA+S+  NGPSLG FL EERRAADE RRNQSV+MYGGD++ LA+ A EPNSLFINGCVAPPQ SLGS+ + GRNGGTEP ND+G+A+LC
Subjt:  WFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQEPNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLC

Query:  ACICGH
        + +CGH
Subjt:  ACICGH

SwissProt top hits (e value, % identity, alignment)
Q6DR24 Uncharacterized protein At3g17950 (e value: 2.2e-08; % identity: 33.15)
Query:  ASPISFSDSSSDLDTESTGSFFRDKSITLGSLIG------------------VSNILELSRRS--------LRGRKTENTGDKRGNPRSRPWFFSLCSRE
        +SP   S SSSDLDTESTGSFF D+SITLG+L+G                  VS  + +SR S         R R   N+ +   + R + W F     +
Subjt:  ASPISFSDSSSDLDTESTGSFFRDKSITLGSLIG------------------VSNILELSRRS--------LRGRKTENTGDKRGNPRSRPWFFSLCSRE

Query:  STDANSI-----DDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAA------QEP----NSLFINGCVAPPQSS
            N I     D    SLG +L  ERR  DE        +Y   +  L +A       Q+P     +LF +G V PP S+
Subjt:  STDANSI-----DDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAA------QEP----NSLFINGCVAPPQSS

Arabidopsis top hits (e value, % identity, alignment)
AT3G17950.1 unknown protein (e value: 1.5e-09; % identity: 33.15)
Query:  ASPISFSDSSSDLDTESTGSFFRDKSITLGSLIG------------------VSNILELSRRS--------LRGRKTENTGDKRGNPRSRPWFFSLCSRE
        +SP   S SSSDLDTESTGSFF D+SITLG+L+G                  VS  + +SR S         R R   N+ +   + R + W F     +
Subjt:  ASPISFSDSSSDLDTESTGSFFRDKSITLGSLIG------------------VSNILELSRRS--------LRGRKTENTGDKRGNPRSRPWFFSLCSRE

Query:  STDANSI-----DDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAA------QEP----NSLFINGCVAPPQSS
            N I     D    SLG +L  ERR  DE        +Y   +  L +A       Q+P     +LF +G V PP S+
Subjt:  STDANSI-----DDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAA------QEP----NSLFINGCVAPPQSS

AT5G02440.1 unknown protein (e value: 1.0e-24; % identity: 45.91)
Query:  EGWPLGLQPLNVRGIGVPGNRDY--------SGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKR
        EGWPLGL+P+N R  G+     +        +GS+SF++L++ SP   S SSSDLD++S GSFFRD+S TLG+LIG+S+ LELSRRS R R  + TG  R
Subjt:  EGWPLGLQPLNVRGIGVPGNRDY--------SGSVSFNTLMTASPISFSDSSSDLDTESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKR

Query:  GNPRS-------RPWFFSLCSRESTDANSID------------DNGPSLGHFLAEERRA
         +          +PW FS+CS+ ST+A  I             +N  SLGHFL  ERRA
Subjt:  GNPRS-------RPWFFSLCSRESTDANSID------------DNGPSLGHFLAEERRA
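
The unlabelled middle line of each alignment block follows the usual BLASTP convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough illustration of how a % identity figure relates to that line, the sketch below counts identities in a single block. The function name `midline_identity` is hypothetical, and the reported % identity values are computed over the whole alignment, so per-block counts would need to be summed before dividing.

```python
def midline_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned columns) for one alignment block.

    A letter in the midline marks an identical residue; '+' and spaces
    are not counted as identities.
    """
    # Trailing spaces of the midline are often stripped, so pad it back out.
    midline = midline.ljust(len(query))
    identical = sum(1 for symbol in midline[:len(query)] if symbol.isalpha())
    return identical, len(query)


# Final block of the KAG6605464.1 alignment shown above.
identical, columns = midline_identity("ACICGH", "+C+CGH")
print(f"{identical}/{columns} identical in this block")
```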


Sequences
CDS sequence
ATGATGGGGCAAATCGGCGACTACGAGGAAGAAGATCGGTTGAAAGACAAGTTTGATATTTTACTGTCCACTTTGGGAGTACGGTTGCCGCCGTCCGTAGGCGAGGAAAA
TGAAGATCGGCCCGAGGATTTGAAGACTCCCACATCCTCGGAGGCTGCGGCAGTTCTTCAGTGTCCGCCGCCGCCGAGAAAGCCGAAAAGGCTGCCGTCGATGAAACGGA
AGGCGCGTGGGCGGTCAATTGTGATGCCCCATAATTTGTTCGTCGAAATGGAGCTGATGTTTCCTCCGCATGATCTCCTCGGCGGTAAGACAACGAACACTAGTTATAGA
GATACATTAACGAGAACAAAGGACAGAGGGGGCGGGGGTGTGGTTTGGTTTGGCGGCAAGGAGAACAGAGGGGGCGGAGGCGGTTGGATGGTGACAACGAGGAGAACGGG
GGTGGTTGGACGGGGAGTCGGCGAAGGTGAAGAAGAGGGATCAAGGGAGAATAAAGACTGTGGGAAAGTGATAGGGGAGGGGTGGCCTCTCGGTCTACAGCCTCTGAATG
TGAGGGGTATTGGGGTACCTGGAAACCGGGACTATTCAGGATCAGTGTCTTTCAACACTTTGATGACGGCTTCTCCTATTTCCTTCTCGGATTCATCATCCGATTTGGAC
ACTGAGTCGACGGGATCTTTCTTCCGTGACAAGAGCATCACACTTGGGAGTCTGATAGGAGTTTCTAACATTTTGGAACTCTCAAGAAGATCACTCAGGGGAAGAAAAAC
AGAGAACACAGGGGACAAGAGGGGCAACCCCAGGTCTAGACCTTGGTTTTTCTCTTTGTGTTCGAGGGAGAGTACCGATGCCAACAGCATCGACGACAATGGCCCTTCAT
TAGGCCACTTCCTGGCAGAAGAAAGACGAGCGGCCGACGAGTATAGGAGAAACCAGAGTGTAGTCATGTATGGAGGAGATGATTTAGCATTAGCTGAGGCTGCCCAAGAA
CCAAACTCCTTGTTCATCAATGGCTGTGTGGCACCTCCTCAATCAAGCCTTGGTTCAGAAGTTGAAAGAGGGAGAAATGGAGGAACTGAACCTACAAATGACAATGGAGT
TGCAGTTCTTTGTGCTTGCATTTGTGGACACTGA
mRNA sequence
ATGATGGGGCAAATCGGCGACTACGAGGAAGAAGATCGGTTGAAAGACAAGTTTGATATTTTACTGTCCACTTTGGGAGTACGGTTGCCGCCGTCCGTAGGCGAGGAAAA
TGAAGATCGGCCCGAGGATTTGAAGACTCCCACATCCTCGGAGGCTGCGGCAGTTCTTCAGTGTCCGCCGCCGCCGAGAAAGCCGAAAAGGCTGCCGTCGATGAAACGGA
AGGCGCGTGGGCGGTCAATTGTGATGCCCCATAATTTGTTCGTCGAAATGGAGCTGATGTTTCCTCCGCATGATCTCCTCGGCGGTAAGACAACGAACACTAGTTATAGA
GATACATTAACGAGAACAAAGGACAGAGGGGGCGGGGGTGTGGTTTGGTTTGGCGGCAAGGAGAACAGAGGGGGCGGAGGCGGTTGGATGGTGACAACGAGGAGAACGGG
GGTGGTTGGACGGGGAGTCGGCGAAGGTGAAGAAGAGGGATCAAGGGAGAATAAAGACTGTGGGAAAGTGATAGGGGAGGGGTGGCCTCTCGGTCTACAGCCTCTGAATG
TGAGGGGTATTGGGGTACCTGGAAACCGGGACTATTCAGGATCAGTGTCTTTCAACACTTTGATGACGGCTTCTCCTATTTCCTTCTCGGATTCATCATCCGATTTGGAC
ACTGAGTCGACGGGATCTTTCTTCCGTGACAAGAGCATCACACTTGGGAGTCTGATAGGAGTTTCTAACATTTTGGAACTCTCAAGAAGATCACTCAGGGGAAGAAAAAC
AGAGAACACAGGGGACAAGAGGGGCAACCCCAGGTCTAGACCTTGGTTTTTCTCTTTGTGTTCGAGGGAGAGTACCGATGCCAACAGCATCGACGACAATGGCCCTTCAT
TAGGCCACTTCCTGGCAGAAGAAAGACGAGCGGCCGACGAGTATAGGAGAAACCAGAGTGTAGTCATGTATGGAGGAGATGATTTAGCATTAGCTGAGGCTGCCCAAGAA
CCAAACTCCTTGTTCATCAATGGCTGTGTGGCACCTCCTCAATCAAGCCTTGGTTCAGAAGTTGAAAGAGGGAGAAATGGAGGAACTGAACCTACAAATGACAATGGAGT
TGCAGTTCTTTGTGCTTGCATTTGTGGACACTGA
Protein sequence
MMGQIGDYEEEDRLKDKFDILLSTLGVRLPPSVGEENEDRPEDLKTPTSSEAAAVLQCPPPPRKPKRLPSMKRKARGRSIVMPHNLFVEMELMFPPHDLLGGKTTNTSYR
DTLTRTKDRGGGGVVWFGGKENRGGGGGWMVTTRRTGVVGRGVGEGEEEGSRENKDCGKVIGEGWPLGLQPLNVRGIGVPGNRDYSGSVSFNTLMTASPISFSDSSSDLD
TESTGSFFRDKSITLGSLIGVSNILELSRRSLRGRKTENTGDKRGNPRSRPWFFSLCSRESTDANSIDDNGPSLGHFLAEERRAADEYRRNQSVVMYGGDDLALAEAAQE
PNSLFINGCVAPPQSSLGSEVERGRNGGTEPTNDNGVAVLCACICGH
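
The protein sequence corresponds to a translation of the CDS. The sketch below shows one way to check that, assuming the standard nuclear genetic code (the record does not say which table was used). The `translate` helper and the table construction are illustrative, not the database's own pipeline, and only short fragments of the CDS are used here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 nt and last 12 nt of the CDS listed above.
print(translate("ATGATGGGGCAAATCGGCGACTAC"))
print(translate("TGTGGACACTGA"))
```

To check the full record, the CDS lines would be joined into one string before calling `translate`, and the result compared with the joined protein lines.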