; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g28570 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g28570
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:21423133..21426221
RNA-Seq ExpressionMoc09g28570
SyntenyMoc09g28570
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]6.5e-5248.3Show/hide
Query:  GTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSAAV-----VLER
        G   I    ++EPSS GVR+   RIS+A LDR  RRASKFVSA G  +QR +DY+A+   A  Q AL +KAEL+G  +L  +E+E  SAA+      ++ 
Subjt:  GTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSAAV-----VLER

Query:  ELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVILE
        EL +A +E +  K+  E+    LK    E  R    L+ AHA+ + LE+EKF L+K+ DD+ +        ++A+D E+    A+LE  + +LSNGV+LE
Subjt:  ELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVILE

Query:  EAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELY--LAPIKLRYAEKWASGPNGTPGPQDFME
        EAFR+HPDFDGFAKDFSDAGF+FLMKGI    P+L   L+ +K RYAEKWASGP GTPGPQ  ++
Subjt:  EAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELY--LAPIKLRYAEKWASGPNGTPGPQDFME

XP_022150867.1 uncharacterized protein LOC111018913 [Momordica charantia]2.2e-5247.92Show/hide
Query:  DDPEARMDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSAAV-
        +DP+AR+  T +I M FK+EPSS G++E   + SS C DR  ++ASKFV      I++++DY+ K HA  C  A++MK++L+  +L+ V EREA S A+ 
Subjt:  DDPEARMDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSAAV-

Query:  ---VLERELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKL
            LERELKEAR E +  KS  E   AK KS + EV    E  K  + + K LE EKF LM++ND L R         K    EV EL+ ++EL ++KL
Subjt:  ---VLERELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKL

Query:  SNGVILEEAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELYLAPIKLRYAEKWASGPNGTPGP
        SNGV+LEEAF+ H DFD F  DFSD  F+FLMKGI E+A +L L P+K  Y +KWASGP  T GP
Subjt:  SNGVILEEAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELYLAPIKLRYAEKWASGPNGTPGP

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]4.2e-5148.3Show/hide
Query:  MDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSAAV----VLE
        M GT ++   F++EPSS GV++   RIS+ CLDR  +RASKFVS  G  +QR +D +A+   A    A+M+KAEL+G   L  KERE SSAA+     L+
Subjt:  MDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSAAV----VLE

Query:  RELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVIL
         EL +A+ E    ++  +A    LK    E  +H  +L+ AHA+ K LEKEKF L+K+ DDL ++   LEGK    D  +  L A+L+  + +L+NG +L
Subjt:  RELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVIL

Query:  EEAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELY--LAPIKLRYAEKWASGPNGTPGPQDFM
        EE+FR+H DFDGFAKDFSDAGF+FLMKGI    P L   L+ +K +Y+EKWASGPNGTPGPQ  +
Subjt:  EEAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELY--LAPIKLRYAEKWASGPNGTPGPQDFM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]8.8e-5757.21Show/hide
Query:  RLDSELEEEIDNFRFSEDVEDDSDTSTSGQGLEFPSQMPENYLGSLRRKYGIPDDILLKLPKEGERA----------------------------EFLVQ
        RL+S+L EEI+N R S+D E DSD STSGQGLE+PS++PE+YLGSLRR + IP++ILL+LP+EGERA                            EFL +
Subjt:  RLDSELEEEIDNFRFSEDVEDDSDTSTSGQGLEFPSQMPENYLGSLRRKYGIPDDILLKLPKEGERA----------------------------EFLVQ

Query:  TGLASAQVAPNRWGVIFSLVVLFWLRCREVDDLDLLEIDQLLACFEVKRISKKPGRYCLCARKGAGGILKGQTSIKKWVGKWFFASSAWLAKNESNLPFF
        TGLA AQVAPN WGVIF+L +LFWLR R+ ++ +L ++DQLLACFE KRI+KKPGR+ +CARKGAGGI+KG TSIK WV KWF+AS  WLAK+ES   FF
Subjt:  TGLASAQVAPNRWGVIFSLVVLFWLRCREVDDLDLLEIDQLLACFEVKRISKKPGRYCLCARKGAGGILKGQTSIKKWVGKWFFASSAWLAKNESNLPFF

Query:  NVPYSRYKDVRRKRP
        +VP +R+ ++   RP
Subjt:  NVPYSRYKDVRRKRP

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.2e-6738.89Show/hide
Query:  LCARKGAGGILKGQTSIKKWVGKWFFASSAWLAKNESNLPFFNVP----------------------YSRYKD---------------------------
        +CARKG GGI+KG TSIK WVGKWFFAS  WLAK+ES   FF+VP                         YKD                           
Subjt:  LCARKGAGGILKGQTSIKKWVGKWFFASSAWLAKNESNLPFFNVP----------------------YSRYKD---------------------------

Query:  ----VRRKRPRAGQA----------SKSK---EAPTSVVGDLP-------------------TEVDVVEGDLEEGFLQEGR----RSARLTTP--RTRWV
            +   RP +  A           KSK    A  +VVG  P                       V+E DL  G   E R      A   +P    R  
Subjt:  ----VRRKRPRAGQA----------SKSK---EAPTSVVGDLP-------------------TEVDVVEGDLEEGFLQEGR----RSARLTTP--RTRWV

Query:  SPFR-------------------------DWVDDPEARMDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHA
        SP R                         D VDDPEARM GTSN+ M F +EPSS GV++   RIS+ CLDRY RRASKFVS  G  +QR +D  A+   
Subjt:  SPFR-------------------------DWVDDPEARMDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHA

Query:  AICQVALMMKAELEGHNLLTVKERE----ASSAAVVLERELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDL
        A   +A+M+KAEL+G   L  KERE    A  AA  L+ EL +A+ E    +  +E D AK+   + E  +H  +L+ AHA+ K LEKEKF L+K+ DDL
Subjt:  AICQVALMMKAELEGHNLLTVKERE----ASSAAVVLERELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDL

Query:  ERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVILEEAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELY--LAPIKLRYAEKWASGPNGTPGPQ
         ++       ++ +D  +  L  +L+  + +L+NG +LEE+FR+HPDFDGFAKDFSDAGF+FLMKGI    P L   L  +K +Y+EKWASGPNGTP PQ
Subjt:  ERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVILEEAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELY--LAPIKLRYAEKWASGPNGTPGPQ

Query:  DFME
          ++
Subjt:  DFME

TrEMBL top hitse value%identityAlignment
A0A6J1D971 uncharacterized protein LOC1110185383.2e-5248.3Show/hide
Query:  GTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSAAV-----VLER
        G   I    ++EPSS GVR+   RIS+A LDR  RRASKFVSA G  +QR +DY+A+   A  Q AL +KAEL+G  +L  +E+E  SAA+      ++ 
Subjt:  GTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSAAV-----VLER

Query:  ELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVILE
        EL +A +E +  K+  E+    LK    E  R    L+ AHA+ + LE+EKF L+K+ DD+ +        ++A+D E+    A+LE  + +LSNGV+LE
Subjt:  ELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVILE

Query:  EAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELY--LAPIKLRYAEKWASGPNGTPGPQDFME
        EAFR+HPDFDGFAKDFSDAGF+FLMKGI    P+L   L+ +K RYAEKWASGP GTPGPQ  ++
Subjt:  EAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELY--LAPIKLRYAEKWASGPNGTPGPQDFME

A0A6J1DBX9 uncharacterized protein LOC1110189131.1e-5247.92Show/hide
Query:  DDPEARMDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSAAV-
        +DP+AR+  T +I M FK+EPSS G++E   + SS C DR  ++ASKFV      I++++DY+ K HA  C  A++MK++L+  +L+ V EREA S A+ 
Subjt:  DDPEARMDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSAAV-

Query:  ---VLERELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKL
            LERELKEAR E +  KS  E   AK KS + EV    E  K  + + K LE EKF LM++ND L R         K    EV EL+ ++EL ++KL
Subjt:  ---VLERELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKL

Query:  SNGVILEEAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELYLAPIKLRYAEKWASGPNGTPGP
        SNGV+LEEAF+ H DFD F  DFSD  F+FLMKGI E+A +L L P+K  Y +KWASGP  T GP
Subjt:  SNGVILEEAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELYLAPIKLRYAEKWASGPNGTPGP

A0A6J1DF31 uncharacterized protein LOC1110199092.0e-5148.3Show/hide
Query:  MDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSAAV----VLE
        M GT ++   F++EPSS GV++   RIS+ CLDR  +RASKFVS  G  +QR +D +A+   A    A+M+KAEL+G   L  KERE SSAA+     L+
Subjt:  MDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSAAV----VLE

Query:  RELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVIL
         EL +A+ E    ++  +A    LK    E  +H  +L+ AHA+ K LEKEKF L+K+ DDL ++   LEGK    D  +  L A+L+  + +L+NG +L
Subjt:  RELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVIL

Query:  EEAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELY--LAPIKLRYAEKWASGPNGTPGPQDFM
        EE+FR+H DFDGFAKDFSDAGF+FLMKGI    P L   L+ +K +Y+EKWASGPNGTPGPQ  +
Subjt:  EEAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELY--LAPIKLRYAEKWASGPNGTPGPQDFM

A0A6J1DXS5 uncharacterized protein LOC1110255024.3e-5757.21Show/hide
Query:  RLDSELEEEIDNFRFSEDVEDDSDTSTSGQGLEFPSQMPENYLGSLRRKYGIPDDILLKLPKEGERA----------------------------EFLVQ
        RL+S+L EEI+N R S+D E DSD STSGQGLE+PS++PE+YLGSLRR + IP++ILL+LP+EGERA                            EFL +
Subjt:  RLDSELEEEIDNFRFSEDVEDDSDTSTSGQGLEFPSQMPENYLGSLRRKYGIPDDILLKLPKEGERA----------------------------EFLVQ

Query:  TGLASAQVAPNRWGVIFSLVVLFWLRCREVDDLDLLEIDQLLACFEVKRISKKPGRYCLCARKGAGGILKGQTSIKKWVGKWFFASSAWLAKNESNLPFF
        TGLA AQVAPN WGVIF+L +LFWLR R+ ++ +L ++DQLLACFE KRI+KKPGR+ +CARKGAGGI+KG TSIK WV KWF+AS  WLAK+ES   FF
Subjt:  TGLASAQVAPNRWGVIFSLVVLFWLRCREVDDLDLLEIDQLLACFEVKRISKKPGRYCLCARKGAGGILKGQTSIKKWVGKWFFASSAWLAKNESNLPFF

Query:  NVPYSRYKDVRRKRP
        +VP +R+ ++   RP
Subjt:  NVPYSRYKDVRRKRP

A0A6J1DZB3 uncharacterized protein LOC1110256652.0e-6738.89Show/hide
Query:  LCARKGAGGILKGQTSIKKWVGKWFFASSAWLAKNESNLPFFNVP----------------------YSRYKD---------------------------
        +CARKG GGI+KG TSIK WVGKWFFAS  WLAK+ES   FF+VP                         YKD                           
Subjt:  LCARKGAGGILKGQTSIKKWVGKWFFASSAWLAKNESNLPFFNVP----------------------YSRYKD---------------------------

Query:  ----VRRKRPRAGQA----------SKSK---EAPTSVVGDLP-------------------TEVDVVEGDLEEGFLQEGR----RSARLTTP--RTRWV
            +   RP +  A           KSK    A  +VVG  P                       V+E DL  G   E R      A   +P    R  
Subjt:  ----VRRKRPRAGQA----------SKSK---EAPTSVVGDLP-------------------TEVDVVEGDLEEGFLQEGR----RSARLTTP--RTRWV

Query:  SPFR-------------------------DWVDDPEARMDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHA
        SP R                         D VDDPEARM GTSN+ M F +EPSS GV++   RIS+ CLDRY RRASKFVS  G  +QR +D  A+   
Subjt:  SPFR-------------------------DWVDDPEARMDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHA

Query:  AICQVALMMKAELEGHNLLTVKERE----ASSAAVVLERELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDL
        A   +A+M+KAEL+G   L  KERE    A  AA  L+ EL +A+ E    +  +E D AK+   + E  +H  +L+ AHA+ K LEKEKF L+K+ DDL
Subjt:  AICQVALMMKAELEGHNLLTVKERE----ASSAAVVLERELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDL

Query:  ERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVILEEAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELY--LAPIKLRYAEKWASGPNGTPGPQ
         ++       ++ +D  +  L  +L+  + +L+NG +LEE+FR+HPDFDGFAKDFSDAGF+FLMKGI    P L   L  +K +Y+EKWASGPNGTP PQ
Subjt:  ERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVILEEAFRKHPDFDGFAKDFSDAGFRFLMKGIQEMAPELY--LAPIKLRYAEKWASGPNGTPGPQ

Query:  DFME
          ++
Subjt:  DFME

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGAATTGCACAATGTCAATCTATCTCGAACCCGGTCCTTGCTCCGACCTGAAATGCTAAGGGAAGACCTGCACAAGAGGGCAAAATCTCCGACGCTCAAGTCAGT
AAGCAGCTCGGACTCAATATTAGGTTCAAGGAGAAATTCGATACCTGATGGTGAAATGAAACCCTCTATTTATAGGTTTAGGTTTCGACAGGGGTCGCGACAGGTCGATA
CCTTGGTCAAACTTGAGGTCGGATGTATCGAGTTCGAGCAGATCGAACTCGGATATACTCCTGGAAATAACAGGTTAGACTCCGAACTAGAAGAGGAGATAGATAACTTT
AGGTTCTCGGAAGACGTCGAGGATGACAGTGATACCTCAACCTCGGGGCAAGGTTTAGAATTCCCTTCCCAGATGCCTGAGAACTACCTCGGCTCCCTTCGTAGGAAATA
TGGCATACCAGATGATATCCTCCTTAAGCTTCCTAAGGAAGGAGAACGAGCTGAGTTCTTAGTCCAAACTGGGTTAGCTTCTGCTCAAGTGGCCCCCAACAGATGGGGAG
TTATCTTTAGTTTAGTCGTTCTATTTTGGCTAAGGTGTAGGGAGGTAGACGATTTGGACCTCCTCGAAATCGATCAACTCTTAGCCTGTTTTGAGGTTAAGCGTATATCT
AAGAAGCCAGGGAGGTATTGTTTGTGTGCTAGGAAGGGCGCGGGAGGCATTCTGAAGGGTCAGACCTCCATAAAGAAATGGGTTGGGAAGTGGTTCTTCGCCTCCAGTGC
ATGGCTGGCCAAGAACGAGTCCAACCTACCTTTCTTCAACGTCCCGTATAGTCGCTATAAGGATGTTCGGCGCAAGCGCCCACGCGCTGGGCAGGCTTCCAAGAGTAAGG
AGGCACCTACCTCTGTTGTGGGTGACCTTCCCACCGAGGTCGACGTGGTTGAGGGAGATTTAGAGGAAGGCTTCCTCCAAGAAGGAAGAAGAAGCGCAAGGCTCACCACT
CCTAGGACGAGGTGGGTAAGTCCCTTCAGAGACTGGGTAGACGACCCTGAAGCCAGGATGGACGGCACCTCGAATATCGACATGAGTTTCAAGGTGGAACCTTCGAGTGT
CGGGGTGAGGGAGAACACAATGCGGATATCGAGCGCCTGCCTCGACCGCTACTGGAGGAGAGCGTCCAAGTTTGTCAGTGCCCTAGGGTTAGCTATTCAACGAATGTTGG
ATTACTCGGCCAAGACTCACGCTGCCATTTGTCAGGTCGCTCTTATGATGAAGGCCGAGCTGGAGGGGCACAACCTTCTCACTGTGAAGGAGCGGGAGGCTTCCTCGGCT
GCTGTTGTCCTGGAGAGAGAGCTCAAAGAGGCTCGTGCGGAGGCTCAGGCATGGAAATCCTCCTCTGAGGCTGACAAGGCCAAACTCAAAAGTGCTCAAATGGAGGTTGC
TCGGCACCTGGAGAACCTGAAGGGCGCGCACGCTGTGGCTAAGTGCCTTGAGAAGGAGAAGTTTGCGCTGATGAAGCAGAACGACGACCTCGAACGCCTCCGAGACGACC
TGGAGGGCAAGATAAAGGCTCGTGACGTCGAGGTGGCAGAATTGAGGGCTAAGCTCGAGCTCGAAGAGTCCAAGCTCAGCAATGGGGTCATTCTGGAGGAGGCATTTCGC
AAGCATCCTGACTTCGACGGTTTTGCCAAGGACTTCAGCGACGCGGGCTTTAGGTTCCTGATGAAGGGAATCCAGGAAATGGCTCCCGAGCTCTATCTCGCACCCATCAA
GCTAAGGTATGCGGAGAAGTGGGCTTCGGGCCCCAATGGGACCCCTGGCCCCCAAGACTTCATGGAGAATGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGAATTGCACAATGTCAATCTATCTCGAACCCGGTCCTTGCTCCGACCTGAAATGCTAAGGGAAGACCTGCACAAGAGGGCAAAATCTCCGACGCTCAAGTCAGT
AAGCAGCTCGGACTCAATATTAGGTTCAAGGAGAAATTCGATACCTGATGGTGAAATGAAACCCTCTATTTATAGGTTTAGGTTTCGACAGGGGTCGCGACAGGTCGATA
CCTTGGTCAAACTTGAGGTCGGATGTATCGAGTTCGAGCAGATCGAACTCGGATATACTCCTGGAAATAACAGGTTAGACTCCGAACTAGAAGAGGAGATAGATAACTTT
AGGTTCTCGGAAGACGTCGAGGATGACAGTGATACCTCAACCTCGGGGCAAGGTTTAGAATTCCCTTCCCAGATGCCTGAGAACTACCTCGGCTCCCTTCGTAGGAAATA
TGGCATACCAGATGATATCCTCCTTAAGCTTCCTAAGGAAGGAGAACGAGCTGAGTTCTTAGTCCAAACTGGGTTAGCTTCTGCTCAAGTGGCCCCCAACAGATGGGGAG
TTATCTTTAGTTTAGTCGTTCTATTTTGGCTAAGGTGTAGGGAGGTAGACGATTTGGACCTCCTCGAAATCGATCAACTCTTAGCCTGTTTTGAGGTTAAGCGTATATCT
AAGAAGCCAGGGAGGTATTGTTTGTGTGCTAGGAAGGGCGCGGGAGGCATTCTGAAGGGTCAGACCTCCATAAAGAAATGGGTTGGGAAGTGGTTCTTCGCCTCCAGTGC
ATGGCTGGCCAAGAACGAGTCCAACCTACCTTTCTTCAACGTCCCGTATAGTCGCTATAAGGATGTTCGGCGCAAGCGCCCACGCGCTGGGCAGGCTTCCAAGAGTAAGG
AGGCACCTACCTCTGTTGTGGGTGACCTTCCCACCGAGGTCGACGTGGTTGAGGGAGATTTAGAGGAAGGCTTCCTCCAAGAAGGAAGAAGAAGCGCAAGGCTCACCACT
CCTAGGACGAGGTGGGTAAGTCCCTTCAGAGACTGGGTAGACGACCCTGAAGCCAGGATGGACGGCACCTCGAATATCGACATGAGTTTCAAGGTGGAACCTTCGAGTGT
CGGGGTGAGGGAGAACACAATGCGGATATCGAGCGCCTGCCTCGACCGCTACTGGAGGAGAGCGTCCAAGTTTGTCAGTGCCCTAGGGTTAGCTATTCAACGAATGTTGG
ATTACTCGGCCAAGACTCACGCTGCCATTTGTCAGGTCGCTCTTATGATGAAGGCCGAGCTGGAGGGGCACAACCTTCTCACTGTGAAGGAGCGGGAGGCTTCCTCGGCT
GCTGTTGTCCTGGAGAGAGAGCTCAAAGAGGCTCGTGCGGAGGCTCAGGCATGGAAATCCTCCTCTGAGGCTGACAAGGCCAAACTCAAAAGTGCTCAAATGGAGGTTGC
TCGGCACCTGGAGAACCTGAAGGGCGCGCACGCTGTGGCTAAGTGCCTTGAGAAGGAGAAGTTTGCGCTGATGAAGCAGAACGACGACCTCGAACGCCTCCGAGACGACC
TGGAGGGCAAGATAAAGGCTCGTGACGTCGAGGTGGCAGAATTGAGGGCTAAGCTCGAGCTCGAAGAGTCCAAGCTCAGCAATGGGGTCATTCTGGAGGAGGCATTTCGC
AAGCATCCTGACTTCGACGGTTTTGCCAAGGACTTCAGCGACGCGGGCTTTAGGTTCCTGATGAAGGGAATCCAGGAAATGGCTCCCGAGCTCTATCTCGCACCCATCAA
GCTAAGGTATGCGGAGAAGTGGGCTTCGGGCCCCAATGGGACCCCTGGCCCCCAAGACTTCATGGAGAATGCTTAA
Protein sequenceShow/hide protein sequence
MQELHNVNLSRTRSLLRPEMLREDLHKRAKSPTLKSVSSSDSILGSRRNSIPDGEMKPSIYRFRFRQGSRQVDTLVKLEVGCIEFEQIELGYTPGNNRLDSELEEEIDNF
RFSEDVEDDSDTSTSGQGLEFPSQMPENYLGSLRRKYGIPDDILLKLPKEGERAEFLVQTGLASAQVAPNRWGVIFSLVVLFWLRCREVDDLDLLEIDQLLACFEVKRIS
KKPGRYCLCARKGAGGILKGQTSIKKWVGKWFFASSAWLAKNESNLPFFNVPYSRYKDVRRKRPRAGQASKSKEAPTSVVGDLPTEVDVVEGDLEEGFLQEGRRSARLTT
PRTRWVSPFRDWVDDPEARMDGTSNIDMSFKVEPSSVGVRENTMRISSACLDRYWRRASKFVSALGLAIQRMLDYSAKTHAAICQVALMMKAELEGHNLLTVKEREASSA
AVVLERELKEARAEAQAWKSSSEADKAKLKSAQMEVARHLENLKGAHAVAKCLEKEKFALMKQNDDLERLRDDLEGKIKARDVEVAELRAKLELEESKLSNGVILEEAFR
KHPDFDGFAKDFSDAGFRFLMKGIQEMAPELYLAPIKLRYAEKWASGPNGTPGPQDFMENA