CuGenDBv2

MS002132 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS002132
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Protein of unknown function (DUF1639)
Genome location: scaffold30:2573943..2574539
RNA-Seq Expression: MS002132
Synteny: MS002132
Gene Ontology terms: NA
InterPro domains: IPR012438 - Protein of unknown function DUF1639


Homology
GenBank top hits (e value, % identity, alignment)
XP_004137319.1 uncharacterized protein LOC101214785 [Cucumis sativus] (e value: 6.4e-73; identity: 75.61%)
Query:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSK-----HPSHSKQKPI-SASNSIETMREKI
        M++ P+RSNPLHNFSLP LKWGSQRFLKCMKVS  SNSN S L HPS  R+SKSYQFRAR ++S+A NF+K     + +HSKQKP    S+SIE MREKI
Subjt:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSK-----HPSHSKQKPI-SASNSIETMREKI

Query:  MLDIREESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVV
        MLDIREESK+LKFSI +EGGEDESAAARPWNLRTRRAACKAPL+ERNLELGSSS+   + +KEK+RTAL+VSLSKEELE+DFA LVGRLPRRPKKRPR V
Subjt:  MLDIREESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVV

Query:  QKQLD
        QKQ+D
Subjt:  QKQLD

XP_008453422.1 PREDICTED: uncharacterized protein LOC103494136 [Cucumis melo] (e value: 1.3e-73; identity: 76.59%)
Query:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSK-----HPSHSKQKPI-SASNSIETMREKI
        M++ P+RSNPLHNFSLP LKWGSQRFLKCMKVS  SNSN S L HPS  R+SKSYQFRAR +NS+A NF+K     + +HSKQKPI   S+SIE MREKI
Subjt:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSK-----HPSHSKQKPI-SASNSIETMREKI

Query:  MLDIREESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVV
        MLDIREESK+LKFSI +EGGEDESAAARPWNLRTRRAACKAPL+ERNLELGSSS+   + +K+K+RTAL VSLSKEELEEDFA LVGRLPRRPKKRPR V
Subjt:  MLDIREESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVV

Query:  QKQLD
        QKQ+D
Subjt:  QKQLD

XP_022134637.1 uncharacterized protein LOC111006857 [Momordica charantia] (e value: 5.2e-99; identity: 98.49%)
Query:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIRE
        MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSS SNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSKQKPISAS+SIETMREKIMLDIRE
Subjt:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIRE

Query:  ESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVVQKQLD
        ESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEK+RTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVVQKQLD
Subjt:  ESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVVQKQLD

XP_022921716.1 uncharacterized protein LOC111429881 [Cucurbita moschata] (e value: 3.3e-61; identity: 69.12%)
Query:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSK--QKPISASNSIETMREKIMLDI
        MAMAP+RS PLHNFSLPYLKWGSQRFLKCMK+SS+SN        P+A R+S+SY+ R R +NS+ AN ++  S  K  +     S+SIE MREKIMLDI
Subjt:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSK--QKPISASNSIETMREKIMLDI

Query:  REESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSS---STKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVVQ
        REESK+LKFSI +EGGE ESAAARPWNLRTRRAACKAP +ER  E GSSS    TK   EKEK+R+ L VSLSKEELEEDFA LVG+LPRRPKKRPR VQ
Subjt:  REESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSS---STKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVVQ

Query:  KQLD
        KQLD
Subjt:  KQLD

XP_038898793.1 uncharacterized protein LOC120086296 [Benincasa hispida] (e value: 7.5e-74; identity: 76.59%)
Query:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSK-----HPSHSKQKPIS-ASNSIETMREKI
        MAM P+RSNPLHNFSLPYLKWGSQRFLKCMKVS  SNS+ S L HPS QR+SKSYQFRAR +NS+  NF+K     +P+HSKQKP +  S+SIE MREKI
Subjt:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSK-----HPSHSKQKPIS-ASNSIETMREKI

Query:  MLDIREESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVV
        MLDIREESK++KFSI +EGGEDESAAARPWNLRTRRAACKAP EE+N ELGSSS+   + +KEK+RTAL VSLSKEELEEDFA LVGRLPRRPKKRPR V
Subjt:  MLDIREESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVV

Query:  QKQLD
        QKQ+D
Subjt:  QKQLD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LS42 Uncharacterized protein (e value: 3.1e-73; identity: 75.61%)
Query:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSK-----HPSHSKQKPI-SASNSIETMREKI
        M++ P+RSNPLHNFSLP LKWGSQRFLKCMKVS  SNSN S L HPS  R+SKSYQFRAR ++S+A NF+K     + +HSKQKP    S+SIE MREKI
Subjt:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSK-----HPSHSKQKPI-SASNSIETMREKI

Query:  MLDIREESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVV
        MLDIREESK+LKFSI +EGGEDESAAARPWNLRTRRAACKAPL+ERNLELGSSS+   + +KEK+RTAL+VSLSKEELE+DFA LVGRLPRRPKKRPR V
Subjt:  MLDIREESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVV

Query:  QKQLD
        QKQ+D
Subjt:  QKQLD

A0A1S3BXD4 uncharacterized protein LOC103494136 (e value: 6.2e-74; identity: 76.59%)
Query:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSK-----HPSHSKQKPI-SASNSIETMREKI
        M++ P+RSNPLHNFSLP LKWGSQRFLKCMKVS  SNSN S L HPS  R+SKSYQFRAR +NS+A NF+K     + +HSKQKPI   S+SIE MREKI
Subjt:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSK-----HPSHSKQKPI-SASNSIETMREKI

Query:  MLDIREESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVV
        MLDIREESK+LKFSI +EGGEDESAAARPWNLRTRRAACKAPL+ERNLELGSSS+   + +K+K+RTAL VSLSKEELEEDFA LVGRLPRRPKKRPR V
Subjt:  MLDIREESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVV

Query:  QKQLD
        QKQ+D
Subjt:  QKQLD

A0A5A7UX47 DUF1639 domain-containing protein (e value: 6.2e-74; identity: 76.59%)
Query:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSK-----HPSHSKQKPI-SASNSIETMREKI
        M++ P+RSNPLHNFSLP LKWGSQRFLKCMKVS  SNSN S L HPS  R+SKSYQFRAR +NS+A NF+K     + +HSKQKPI   S+SIE MREKI
Subjt:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSK-----HPSHSKQKPI-SASNSIETMREKI

Query:  MLDIREESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVV
        MLDIREESK+LKFSI +EGGEDESAAARPWNLRTRRAACKAPL+ERNLELGSSS+   + +K+K+RTAL VSLSKEELEEDFA LVGRLPRRPKKRPR V
Subjt:  MLDIREESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVV

Query:  QKQLD
        QKQ+D
Subjt:  QKQLD

A0A6J1C056 uncharacterized protein LOC111006857 (e value: 2.5e-99; identity: 98.49%)
Query:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIRE
        MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSS SNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSKQKPISAS+SIETMREKIMLDIRE
Subjt:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIRE

Query:  ESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVVQKQLD
        ESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEK+RTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVVQKQLD
Subjt:  ESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVVQKQLD

A0A6J1E256 uncharacterized protein LOC111429881 (e value: 1.6e-61; identity: 69.12%)
Query:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSK--QKPISASNSIETMREKIMLDI
        MAMAP+RS PLHNFSLPYLKWGSQRFLKCMK+SS+SN        P+A R+S+SY+ R R +NS+ AN ++  S  K  +     S+SIE MREKIMLDI
Subjt:  MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSK--QKPISASNSIETMREKIMLDI

Query:  REESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSS---STKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVVQ
        REESK+LKFSI +EGGE ESAAARPWNLRTRRAACKAP +ER  E GSSS    TK   EKEK+R+ L VSLSKEELEEDFA LVG+LPRRPKKRPR VQ
Subjt:  REESKKLKFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSS---STKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVVQ

Query:  KQLD
        KQLD
Subjt:  KQLD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G25370.1 Protein of unknown function (DUF1639) (e value: 1.6e-18; identity: 35.34%)
Query:  ERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSA----QRESKSYQFRARTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIREE
        +RS  LHNF LP L WG+QR LKC K+ S SN+N++    P      +R S   +F A +  S    F  +  H +     +   IE  R K+M D++ E
Subjt:  ERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSA----QRESKSYQFRARTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIREE

Query:  SKKLKFSIPEEGGEDES--------------------AAARPWNLRTRR-AACKAPLEERNLELGSSSSTK------------ALMEKEKHRTALSVSLS
        + K+  S+  +G  +E                        +PWNLR RR AACK P     +  G     K             ++E EK R   S+ LS
Subjt:  SKKLKFSIPEEGGEDES--------------------AAARPWNLRTRR-AACKAPLEERNLELGSSSSTK------------ALMEKEKHRTALSVSLS

Query:  KEELEEDFAALVG-RLPRRPKKRPRVVQKQLD
        K+E+EEDF  +VG R PRRPKKR + VQK+LD
Subjt:  KEELEEDFAALVG-RLPRRPKKRPRVVQKQLD

AT1G48770.1 Protein of unknown function (DUF1639) (e value: 1.2e-16; identity: 31.84%)
Query:  ERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIREESKKL
        ERS  LHNFSLP L+WG QRFL+C+ + S   S+SS    P     ++S      ++       +K+                                 
Subjt:  ERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIREESKKL

Query:  KFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKAL------MEKEKHRTALSVSLSKEELEEDFAALVGRL-PRRPKKRPRVVQKQL
                  +  AAA+PWNLR RRAAC  P EE  +E+G +     +       +K+  ++  S++LS++E+E+DF+ + G+  P+RPKKRPR+VQK+L
Subjt:  KFSIPEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKAL------MEKEKHRTALSVSLSKEELEEDFAALVGRL-PRRPKKRPRVVQKQL

Query:  D
        +
Subjt:  D

AT1G68340.1 Protein of unknown function (DUF1639) (e value: 3.9e-20; identity: 36.12%)
Query:  ERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANF-SKHPSHSKQKPISASNSIETMREKIMLDIREESKK
        ERS  L NFSLP L WG+QR L+C K                 QR           +  R++NF S H +   +   S    IE  REKIMLD+R  + K
Subjt:  ERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANF-SKHPSHSKQKPISASNSIETMREKIMLDIREESKK

Query:  LKFSI-----------------------PEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALM--------EKEKHRTALSVSLSKEELE
        +K SI                       P E     +   RPWNLR RRAACKA +   +L + S +  K  +        E  K R+ L  +LSK+E+E
Subjt:  LKFSI-----------------------PEEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALM--------EKEKHRTALSVSLSKEELE

Query:  EDFAALVG-RLPRRPKKRPRVVQKQLD
        ED+  ++G + PRRPKKR R VQKQ+D
Subjt:  EDFAALVG-RLPRRPKKRPRVVQKQLD

AT3G18295.1 Protein of unknown function (DUF1639) (e value: 7.6e-24; identity: 37.04%)
Query:  PERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIREESKK
        PERS  LHNF+LPYL+WG QRFL+C+K+           HH      S S+       +S ++    H SH+             +  ++ LD+  ++ +
Subjt:  PERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIREESKK

Query:  LKFSIPEEGGE----DESAAARPWNLRTRRAACKAP----------------LEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVG-RL
         K S+   GG+    D  AAARPWNLRTRRAAC  P                  E  ++ G S       + +  +   SVSL +EE+E+DF+AL+G R 
Subjt:  LKFSIPEEGGE----DESAAARPWNLRTRRAACKAP----------------LEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVG-RL

Query:  PRRPKKRPRVVQKQLD
        PRRPKKRPR+VQKQ++
Subjt:  PRRPKKRPRVVQKQLD

AT3G60410.1 Protein of unknown function (DUF1639) (e value: 2.3e-04; identity: 26.39%)
Query:  APERSNPLHNFSLPYLKW-----GSQRFLKCMKVS---------------------------------------SDSNSNSSALHHPSAQRESKSYQFRA
        +P +S+PLHNF L  L+W      + R  K    S                                       SDS ++ SA    +    SK +  R 
Subjt:  APERSNPLHNFSLPYLKW-----GSQRFLKCMKVS---------------------------------------SDSNSNSSALHHPSAQRESKSYQFRA

Query:  RTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIREESKKLKFSIPEEGGED-ESAAARPWNLRTRR----------------AACKAPLEERNL
        RT N      ++  + S     S + S++   +     I  E ++    I + GG++ +    + WNLR RR                 +C   L E N 
Subjt:  RTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIREESKKLKFSIPEEGGED-ESAAARPWNLRTRR----------------AACKAPLEERNL

Query:  ELGS-----------SSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLP-RRPKKRPRVVQKQLD
         LG+             +  A  E+++ +  LS+SLSK E++ED  AL G  P RRPKKR + VQKQLD
Subjt:  ELGS-----------SSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLP-RRPKKRPRVVQKQLD


Sequences
CDS sequence
ATGGCTATGGCGCCTGAAAGATCAAACCCACTGCACAACTTCTCCTTGCCGTATCTCAAATGGGGTTCCCAGAGATTCCTCAAGTGTATGAAAGTCTCTTCCGACTCCAA
TTCCAATTCGTCTGCGCTTCATCATCCTTCTGCTCAACGTGAATCGAAATCGTATCAATTCCGTGCCAGAACTATGAATTCTAGGGCCGCGAACTTCTCCAAGCACCCGA
GTCATTCCAAGCAGAAACCGATTAGCGCCAGCAATTCCATCGAAACCATGCGAGAGAAGATCATGCTCGATATCAGGGAAGAATCGAAGAAACTCAAGTTTTCGATTCCT
GAAGAGGGGGGCGAGGACGAGTCGGCTGCGGCGAGGCCGTGGAATTTGAGGACGCGCAGAGCGGCGTGTAAGGCTCCTCTGGAAGAGAGGAATCTGGAATTGGGATCATC
ATCGTCGACGAAGGCTCTAATGGAGAAGGAGAAGCACCGGACTGCGTTATCTGTGTCTCTGTCGAAGGAGGAGCTGGAAGAGGACTTCGCGGCTCTGGTCGGTAGGCTAC
CGAGGAGGCCAAAGAAGAGGCCTAGGGTTGTACAAAAGCAATTGGAC
mRNA sequence
ATGGCTATGGCGCCTGAAAGATCAAACCCACTGCACAACTTCTCCTTGCCGTATCTCAAATGGGGTTCCCAGAGATTCCTCAAGTGTATGAAAGTCTCTTCCGACTCCAA
TTCCAATTCGTCTGCGCTTCATCATCCTTCTGCTCAACGTGAATCGAAATCGTATCAATTCCGTGCCAGAACTATGAATTCTAGGGCCGCGAACTTCTCCAAGCACCCGA
GTCATTCCAAGCAGAAACCGATTAGCGCCAGCAATTCCATCGAAACCATGCGAGAGAAGATCATGCTCGATATCAGGGAAGAATCGAAGAAACTCAAGTTTTCGATTCCT
GAAGAGGGGGGCGAGGACGAGTCGGCTGCGGCGAGGCCGTGGAATTTGAGGACGCGCAGAGCGGCGTGTAAGGCTCCTCTGGAAGAGAGGAATCTGGAATTGGGATCATC
ATCGTCGACGAAGGCTCTAATGGAGAAGGAGAAGCACCGGACTGCGTTATCTGTGTCTCTGTCGAAGGAGGAGCTGGAAGAGGACTTCGCGGCTCTGGTCGGTAGGCTAC
CGAGGAGGCCAAAGAAGAGGCCTAGGGTTGTACAAAAGCAATTGGAC
Protein sequence
MAMAPERSNPLHNFSLPYLKWGSQRFLKCMKVSSDSNSNSSALHHPSAQRESKSYQFRARTMNSRAANFSKHPSHSKQKPISASNSIETMREKIMLDIREESKKLKFSIP
EEGGEDESAAARPWNLRTRRAACKAPLEERNLELGSSSSTKALMEKEKHRTALSVSLSKEELEEDFAALVGRLPRRPKKRPRVVQKQLD
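
As a quick consistency check on this record, the CDS can be translated with the standard genetic code and compared against the listed protein sequence. The following is a minimal Python sketch and is not part of CuGenDB; the function name `translate` and the way the codon table is built are illustrative assumptions, and only the first 30 nucleotides of the CDS listed above are used.

```python
# Standard genetic code, built from the conventional TCAG ordering.
# Hypothetical helper for illustration; not a CuGenDB tool.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of the MS002132 CDS as listed in this record.
cds_prefix = "ATGGCTATGGCGCCTGAAAGATCAAACCCA"
print(translate(cds_prefix))  # MAMAPERSNP
```

The translated prefix matches the first ten residues of the protein sequence shown above. Passing the full CDS to the same function should, under the same assumptions, allow the whole protein to be compared.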