; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g19940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g19940
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr2:14812469..14815303
RNA-Seq ExpressionMoc02g19940
SyntenyMoc02g19940
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]3.0e-4042.8Show/hide
Query:  LCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVPLAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNPLLSPLE
        +CAR GA GI+KG TSIK WV KWF+A+G WLA++E         +AIRP+P+ ++ +F+ LK++K  F  GR++ TL+T+KLLL S LLDYNP + P+E
Subjt:  LCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVPLAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNPLLSPLE

Query:  ARRPNSELAMVCGFSQDVWRK---RPRTGQASKNKEAPSPAIAEPPTE--VEVVEGDSEEGSPKKSKKKKRKTHHSE-----DEVRE---LRARRR----
        + RPNSELAMVCGF+ +V RK   +    +A+++ +  +PA+  P +E    V+E +S  G P + K+ + +T   +     +EVRE   L+ RR+    
Subjt:  ARRPNSELAMVCGFSQDVWRK---RPRTGQASKNKEAPSPAIAEPPTE--VEVVEGDSEEGSPKKSKKKKRKTHHSE-----DEVRE---LRARRR----

Query:  ISP------------FGELVNDPEARMGGTSNIEMKFKVEPSSARVRERAMEMSSSY
         SP            F + V+DPEARMGGT ++  +F+VEPSS+ VR++ M   +SY
Subjt:  ISP------------FGELVNDPEARMGGTSNIEMKFKVEPSSARVRERAMEMSSSY

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]7.0e-3746.56Show/hide
Query:  RLPKKPGRYYLCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLL
        R+ KKPGR+Y+CAR GA GI+KG TSIK WV KWF+A+G WLA++E    FF VP      ++IRP+P+ ++ +F+ LK++K +F  GR++ TL+T++LL
Subjt:  RLPKKPGRYYLCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLL

Query:  LASRLLDYNPLLSPLEARRPNSELAMVCGFSQDVWRK---RPRTGQASKNKEAPSPAIAEPPTE--VEVVEGDSEEGSPKKSKKKKRKT
        L S LLDYNP + P+E  RPNS LAMVC F+  V RK   R    +A+++ + P+PA+  P +E    V+E +S  G P + K+ + +T
Subjt:  LASRLLDYNPLLSPLEARRPNSELAMVCGFSQDVWRK---RPRTGQASKNKEAPSPAIAEPPTE--VEVVEGDSEEGSPKKSKKKKRKT

XP_022152115.1 uncharacterized protein LOC111019905 [Momordica charantia]9.4e-3454.55Show/hide
Query:  LCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNP
        +C R GA GI+KG TSIK WVGKWFFA+G WL ++E   PFF +P      ++I+PIP+  + TF+ LKF+K  F  GR+I TL+T+KLLL S LLDYNP
Subjt:  LCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNP

Query:  LLSPLEARRPNSELAMVCGFSQDVWRKRPRTGQASKNKEAPSP
        L+ P+EA RPNSELAMVCGF+  V RK      A K      P
Subjt:  LLSPLEARRPNSELAMVCGFSQDVWRKRPRTGQASKNKEAPSP

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.8e-4537.17Show/hide
Query:  ANRLDSELEEEIDNFGFSEDDGDDSDTSTLGQGLEFPFQMPESYLGSLHTRYSIPDDIILRLP-------------------------------------
        A RL+S+L EEI+N   S DDG+DSD ST GQGLE+P ++PE YLGSL   ++IP++I+LRLP                                     
Subjt:  ANRLDSELEEEIDNFGFSEDDGDDSDTSTLGQGLEFPFQMPESYLGSLHTRYSIPDDIILRLP-------------------------------------

Query:  -----------------------------------------------------KKPGRYYLCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLP
                                                             KKPGR+Y+CAR GA GI+KG TSIK WV KWF+A+G WLA++E    
Subjt:  -----------------------------------------------------KKPGRYYLCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLP

Query:  FFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNPLLSPLEARRPNSELAMVCGFSQDVWRK---RPRTGQASKN
        FF VP      ++IRP+P+ ++ +F+ LK++K +F  GR++ TL+T++LLL S LLDYNP + P+E+ RPNSELAMVCGF+  V RK   R    +A+++
Subjt:  FFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNPLLSPLEARRPNSELAMVCGFSQDVWRK---RPRTGQASKN

Query:  KEAPSPAIAEPPTE--VEVVEGDSEEGSPKKSKKKKRKT
         +  +PA+  P +E    V+E +S  G P + K+ + +T
Subjt:  KEAPSPAIAEPPTE--VEVVEGDSEEGSPKKSKKKKRKT

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]8.4e-6740.61Show/hide
Query:  LCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNP
        +CAR G  GI+KG TSIK WVGKWFFA+G WLA++E    FF VP      ++I+ IP+ ++ TF+ LK +K  F   R+I TL+T+KLLL S LLDYNP
Subjt:  LCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNP

Query:  LLSPLEARRPNSELAMVCGFSQDVWRKR--------------------PRTGQASKNKEAPSPAIAEPPTEVEVVEGDSEE-------------------
        L+  +EA RPNSELAMVCGF+  V RK                     PRT   ++    PS A+  P  E+++  G S E                   
Subjt:  LLSPLEARRPNSELAMVCGFSQDVWRKR--------------------PRTGQASKNKEAPSPAIAEPPTEVEVVEGDSEE-------------------

Query:  -GSPKKSKKKKRKTHHSEDEVRELRARRRI-SPFGELVNDPEARMGGTSNIEMKFKVEPSSARVRERAMEMSSSYFNRCWRRASKFFSAPRSAIQRLLDF
          SP + ++KK+KT  S     E  AR  + +   +LV+DPEARM GTSN+ M+F +EPSS+ V+++   +S++  +R  RRASKF S P S +QR +D 
Subjt:  -GSPKKSKKKKRKTHHSEDEVRELRARRRI-SPFGELVNDPEARMGGTSNIEMKFKVEPSSARVRERAMEMSSSYFNRCWRRASKFFSAPRSAIQRLLDF

Query:  TAEVRLCLPSFVFVCLPFILTFRFPLSLPTCSRCSCQTAIRVKAELDRRELLTLKEREASLAALETVAALEGELKEAQAEAQTWKSTSDDDKVELKIVKA
         AE              FI               S   A+ VKAELD RE L  KERE S AALE    L+GEL +AQ E    ++  D    ++ ++K 
Subjt:  TAEVRLCLPSFVFVCLPFILTFRFPLSLPTCSRCSCQTAIRVKAELDRRELLTLKEREASLAALETVAALEGELKEAQAEAQTWKSTSDDDKVELKIVKA

Query:  EVARHMEHLRGAHAVAKGLEKEKFALLKENDELQRLREDLEGKLSASDSEAAELKAKL
        E  +H  HLR AHA+ KGLEKEKF LLKE D+L ++ E+ +  +    +E  +LK +L
Subjt:  EVARHMEHLRGAHAVAKGLEKEKFALLKENDELQRLREDLEGKLSASDSEAAELKAKL

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092981.5e-4042.8Show/hide
Query:  LCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVPLAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNPLLSPLE
        +CAR GA GI+KG TSIK WV KWF+A+G WLA++E         +AIRP+P+ ++ +F+ LK++K  F  GR++ TL+T+KLLL S LLDYNP + P+E
Subjt:  LCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVPLAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNPLLSPLE

Query:  ARRPNSELAMVCGFSQDVWRK---RPRTGQASKNKEAPSPAIAEPPTE--VEVVEGDSEEGSPKKSKKKKRKTHHSE-----DEVRE---LRARRR----
        + RPNSELAMVCGF+ +V RK   +    +A+++ +  +PA+  P +E    V+E +S  G P + K+ + +T   +     +EVRE   L+ RR+    
Subjt:  ARRPNSELAMVCGFSQDVWRK---RPRTGQASKNKEAPSPAIAEPPTE--VEVVEGDSEEGSPKKSKKKKRKTHHSE-----DEVRE---LRARRR----

Query:  ISP------------FGELVNDPEARMGGTSNIEMKFKVEPSSARVRERAMEMSSSY
         SP            F + V+DPEARMGGT ++  +F+VEPSS+ VR++ M   +SY
Subjt:  ISP------------FGELVNDPEARMGGTSNIEMKFKVEPSSARVRERAMEMSSSY

A0A6J1CR42 uncharacterized protein LOC1110138263.4e-3746.56Show/hide
Query:  RLPKKPGRYYLCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLL
        R+ KKPGR+Y+CAR GA GI+KG TSIK WV KWF+A+G WLA++E    FF VP      ++IRP+P+ ++ +F+ LK++K +F  GR++ TL+T++LL
Subjt:  RLPKKPGRYYLCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLL

Query:  LASRLLDYNPLLSPLEARRPNSELAMVCGFSQDVWRK---RPRTGQASKNKEAPSPAIAEPPTE--VEVVEGDSEEGSPKKSKKKKRKT
        L S LLDYNP + P+E  RPNS LAMVC F+  V RK   R    +A+++ + P+PA+  P +E    V+E +S  G P + K+ + +T
Subjt:  LASRLLDYNPLLSPLEARRPNSELAMVCGFSQDVWRK---RPRTGQASKNKEAPSPAIAEPPTE--VEVVEGDSEEGSPKKSKKKKRKT

A0A6J1DD09 uncharacterized protein LOC1110199054.6e-3454.55Show/hide
Query:  LCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNP
        +C R GA GI+KG TSIK WVGKWFFA+G WL ++E   PFF +P      ++I+PIP+  + TF+ LKF+K  F  GR+I TL+T+KLLL S LLDYNP
Subjt:  LCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNP

Query:  LLSPLEARRPNSELAMVCGFSQDVWRKRPRTGQASKNKEAPSP
        L+ P+EA RPNSELAMVCGF+  V RK      A K      P
Subjt:  LLSPLEARRPNSELAMVCGFSQDVWRKRPRTGQASKNKEAPSP

A0A6J1DXS5 uncharacterized protein LOC1110255028.9e-4637.17Show/hide
Query:  ANRLDSELEEEIDNFGFSEDDGDDSDTSTLGQGLEFPFQMPESYLGSLHTRYSIPDDIILRLP-------------------------------------
        A RL+S+L EEI+N   S DDG+DSD ST GQGLE+P ++PE YLGSL   ++IP++I+LRLP                                     
Subjt:  ANRLDSELEEEIDNFGFSEDDGDDSDTSTLGQGLEFPFQMPESYLGSLHTRYSIPDDIILRLP-------------------------------------

Query:  -----------------------------------------------------KKPGRYYLCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLP
                                                             KKPGR+Y+CAR GA GI+KG TSIK WV KWF+A+G WLA++E    
Subjt:  -----------------------------------------------------KKPGRYYLCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLP

Query:  FFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNPLLSPLEARRPNSELAMVCGFSQDVWRK---RPRTGQASKN
        FF VP      ++IRP+P+ ++ +F+ LK++K +F  GR++ TL+T++LLL S LLDYNP + P+E+ RPNSELAMVCGF+  V RK   R    +A+++
Subjt:  FFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNPLLSPLEARRPNSELAMVCGFSQDVWRK---RPRTGQASKN

Query:  KEAPSPAIAEPPTE--VEVVEGDSEEGSPKKSKKKKRKT
         +  +PA+  P +E    V+E +S  G P + K+ + +T
Subjt:  KEAPSPAIAEPPTE--VEVVEGDSEEGSPKKSKKKKRKT

A0A6J1DZB3 uncharacterized protein LOC1110256654.1e-6740.61Show/hide
Query:  LCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNP
        +CAR G  GI+KG TSIK WVGKWFFA+G WLA++E    FF VP      ++I+ IP+ ++ TF+ LK +K  F   R+I TL+T+KLLL S LLDYNP
Subjt:  LCARMGARGILKGLTSIKKWVGKWFFATGVWLARNEVDLPFFSVP------LAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNP

Query:  LLSPLEARRPNSELAMVCGFSQDVWRKR--------------------PRTGQASKNKEAPSPAIAEPPTEVEVVEGDSEE-------------------
        L+  +EA RPNSELAMVCGF+  V RK                     PRT   ++    PS A+  P  E+++  G S E                   
Subjt:  LLSPLEARRPNSELAMVCGFSQDVWRKR--------------------PRTGQASKNKEAPSPAIAEPPTEVEVVEGDSEE-------------------

Query:  -GSPKKSKKKKRKTHHSEDEVRELRARRRI-SPFGELVNDPEARMGGTSNIEMKFKVEPSSARVRERAMEMSSSYFNRCWRRASKFFSAPRSAIQRLLDF
          SP + ++KK+KT  S     E  AR  + +   +LV+DPEARM GTSN+ M+F +EPSS+ V+++   +S++  +R  RRASKF S P S +QR +D 
Subjt:  -GSPKKSKKKKRKTHHSEDEVRELRARRRI-SPFGELVNDPEARMGGTSNIEMKFKVEPSSARVRERAMEMSSSYFNRCWRRASKFFSAPRSAIQRLLDF

Query:  TAEVRLCLPSFVFVCLPFILTFRFPLSLPTCSRCSCQTAIRVKAELDRRELLTLKEREASLAALETVAALEGELKEAQAEAQTWKSTSDDDKVELKIVKA
         AE              FI               S   A+ VKAELD RE L  KERE S AALE    L+GEL +AQ E    ++  D    ++ ++K 
Subjt:  TAEVRLCLPSFVFVCLPFILTFRFPLSLPTCSRCSCQTAIRVKAELDRRELLTLKEREASLAALETVAALEGELKEAQAEAQTWKSTSDDDKVELKIVKA

Query:  EVARHMEHLRGAHAVAKGLEKEKFALLKENDELQRLREDLEGKLSASDSEAAELKAKL
        E  +H  HLR AHA+ KGLEKEKF LLKE D+L ++ E+ +  +    +E  +LK +L
Subjt:  EVARHMEHLRGAHAVAKGLEKEKFALLKENDELQRLREDLEGKLSASDSEAAELKAKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAAGGGAAGACCTACACAAGAGGACAAAATCTCCAACGCTCAAGTCACTAAGCAGCTCAGACTCAATATTAAGTTCGAGGAGAAATTCGATACCTGATAATGAAAT
GAGGCCCTCTGTTTATAGGTTTAGGCTCCGACAGGGGTTACGACAAGTGATTACCTTGGTTGGAATTGAGGTCGGATGTGTCGGATTCGAGCAGATTGAACTTAGACACA
CTCTTGGAAATGTTGATGGCCCCCTCTCTGCCGGGTTCGATCTCGACCTGACAGAGAAGTTTGTTATCCTTGGGGAAGCTGATTCGCTTGGTGACTTCCTAGGCCGACAC
GATAAATGTTACCATTGGGAAATCTCGAAAACGGAACCGGCGGTCAACGCACACTTGATTACCTTTGACGTTCAGACTTCTTCCGATCTGAGACGTCCAATTTGGGACAT
GTGTGTGGCCAAGATTGAATTTCGGGCTACTATAAATACCACCAAGGTTCTGTTTTCGTTTTTTACGCTTTCTGAAATCTTGGAGTTCGACCTCGTCAGATCGGAATCTT
TAGGTCAGTCTGGCTCATTCTCTTTTTTTAAGTTCACTCTTTTATTTTCAAAGTCATGTCGACATCTTCTAGTCCCTCCAGCCCAAGTAATAGTAGTTCTTCTAATGGGC
AGCTCGGAATTAGGACGCCCTTCGCCCAGAAGGGCCGATTCCGTAGAGAAGTTTGCTAATAGGTTAGACTCCGAGCTAGAGGAGGAGATAGATAATTTTGGGTTCTCAGA
GGATGATGGGGATGATAGTGATACTTCAACCTTGGGACAAGGTTTAGAATTCCCTTTTCAGATGCCTGAGAGTTACCTTGGCTCCCTTCATACGAGATATAGCATTCCGG
ATGACATCATCCTTAGGCTCCCCAAGAAACCAGGGAGGTATTACCTGTGCGCTAGAATGGGCGCGAGAGGCATTCTGAAAGGTCTGACCTCCATAAAGAAATGGGTCGGA
AAATGGTTCTTTGCCACTGGCGTGTGGCTGGCTAGGAACGAGGTCGACCTGCCTTTCTTCAGCGTCCCTCTTGCTATTCGGCCAATTCCTCAACCTTCCGAGCCGACCTT
CAACGCGTTGAAATTTTTCAAGAGCAAGTTTAAGAGCGGCAGGCAGATCAGCACTCTTATAACAAATAAGCTTCTTCTCGCCTCAAGACTGCTCGATTACAACCCTCTCT
TATCTCCACTCGAAGCTCGAAGACCGAATTCCGAACTAGCCATGGTTTGCGGCTTCTCCCAGGATGTTTGGCGCAAGCGCCCACGCACTGGGCAGGCTTCCAAGAATAAG
GAGGCACCTAGCCCTGCTATCGCTGAACCTCCTACCGAGGTCGAGGTGGTTGAGGGAGATTCGGAGGAAGGCTCCCCTAAGAAATCTAAGAAGAAGAAGCGCAAGACCCA
TCACTCCGAGGACGAGGTGAGGGAGTTGCGCGCTAGGCGACGAATCAGTCCTTTCGGGGAATTAGTTAACGACCCCGAGGCTAGAATGGGTGGCACCTCTAACATCGAGA
TGAAGTTTAAGGTTGAGCCATCTAGTGCCAGGGTGAGAGAGAGAGCCATGGAGATGTCGAGCTCCTACTTCAACCGCTGTTGGAGGAGGGCTTCCAAGTTCTTCAGCGCT
CCCAGGTCAGCCATCCAACGACTGTTGGATTTCACTGCCGAGGTGCGACTTTGCTTGCCTTCCTTTGTTTTCGTATGTTTACCTTTTATCCTGACATTTCGGTTCCCTTT
GTCCTTGCCCACTTGCTCACGCTGCAGTTGCCAGACTGCTATCAGAGTAAAGGCCGAGCTCGATAGGCGCGAGCTTTTGACTCTGAAGGAACGAGAAGCCTCTTTGGCCG
CCTTAGAGACTGTCGCGGCTCTGGAAGGAGAACTCAAGGAGGCCCAAGCTGAAGCCCAGACGTGGAAGTCCACCTCCGATGACGACAAGGTGGAGCTAAAAATCGTGAAG
GCGGAGGTTGCCCGGCACATGGAGCACCTGAGGGGTGCGCATGCTGTGGCCAAGGGCCTGGAGAAGGAGAAGTTTGCGCTGTTGAAGGAGAATGATGAACTTCAGCGCCT
TCGAGAGGACCTGGAGGGCAAGCTGAGTGCTAGTGACTCCGAGGCGGCGGAACTTAAGGCTAAGCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGAAGGGAAGACCTACACAAGAGGACAAAATCTCCAACGCTCAAGTCACTAAGCAGCTCAGACTCAATATTAAGTTCGAGGAGAAATTCGATACCTGATAATGAAAT
GAGGCCCTCTGTTTATAGGTTTAGGCTCCGACAGGGGTTACGACAAGTGATTACCTTGGTTGGAATTGAGGTCGGATGTGTCGGATTCGAGCAGATTGAACTTAGACACA
CTCTTGGAAATGTTGATGGCCCCCTCTCTGCCGGGTTCGATCTCGACCTGACAGAGAAGTTTGTTATCCTTGGGGAAGCTGATTCGCTTGGTGACTTCCTAGGCCGACAC
GATAAATGTTACCATTGGGAAATCTCGAAAACGGAACCGGCGGTCAACGCACACTTGATTACCTTTGACGTTCAGACTTCTTCCGATCTGAGACGTCCAATTTGGGACAT
GTGTGTGGCCAAGATTGAATTTCGGGCTACTATAAATACCACCAAGGTTCTGTTTTCGTTTTTTACGCTTTCTGAAATCTTGGAGTTCGACCTCGTCAGATCGGAATCTT
TAGGTCAGTCTGGCTCATTCTCTTTTTTTAAGTTCACTCTTTTATTTTCAAAGTCATGTCGACATCTTCTAGTCCCTCCAGCCCAAGTAATAGTAGTTCTTCTAATGGGC
AGCTCGGAATTAGGACGCCCTTCGCCCAGAAGGGCCGATTCCGTAGAGAAGTTTGCTAATAGGTTAGACTCCGAGCTAGAGGAGGAGATAGATAATTTTGGGTTCTCAGA
GGATGATGGGGATGATAGTGATACTTCAACCTTGGGACAAGGTTTAGAATTCCCTTTTCAGATGCCTGAGAGTTACCTTGGCTCCCTTCATACGAGATATAGCATTCCGG
ATGACATCATCCTTAGGCTCCCCAAGAAACCAGGGAGGTATTACCTGTGCGCTAGAATGGGCGCGAGAGGCATTCTGAAAGGTCTGACCTCCATAAAGAAATGGGTCGGA
AAATGGTTCTTTGCCACTGGCGTGTGGCTGGCTAGGAACGAGGTCGACCTGCCTTTCTTCAGCGTCCCTCTTGCTATTCGGCCAATTCCTCAACCTTCCGAGCCGACCTT
CAACGCGTTGAAATTTTTCAAGAGCAAGTTTAAGAGCGGCAGGCAGATCAGCACTCTTATAACAAATAAGCTTCTTCTCGCCTCAAGACTGCTCGATTACAACCCTCTCT
TATCTCCACTCGAAGCTCGAAGACCGAATTCCGAACTAGCCATGGTTTGCGGCTTCTCCCAGGATGTTTGGCGCAAGCGCCCACGCACTGGGCAGGCTTCCAAGAATAAG
GAGGCACCTAGCCCTGCTATCGCTGAACCTCCTACCGAGGTCGAGGTGGTTGAGGGAGATTCGGAGGAAGGCTCCCCTAAGAAATCTAAGAAGAAGAAGCGCAAGACCCA
TCACTCCGAGGACGAGGTGAGGGAGTTGCGCGCTAGGCGACGAATCAGTCCTTTCGGGGAATTAGTTAACGACCCCGAGGCTAGAATGGGTGGCACCTCTAACATCGAGA
TGAAGTTTAAGGTTGAGCCATCTAGTGCCAGGGTGAGAGAGAGAGCCATGGAGATGTCGAGCTCCTACTTCAACCGCTGTTGGAGGAGGGCTTCCAAGTTCTTCAGCGCT
CCCAGGTCAGCCATCCAACGACTGTTGGATTTCACTGCCGAGGTGCGACTTTGCTTGCCTTCCTTTGTTTTCGTATGTTTACCTTTTATCCTGACATTTCGGTTCCCTTT
GTCCTTGCCCACTTGCTCACGCTGCAGTTGCCAGACTGCTATCAGAGTAAAGGCCGAGCTCGATAGGCGCGAGCTTTTGACTCTGAAGGAACGAGAAGCCTCTTTGGCCG
CCTTAGAGACTGTCGCGGCTCTGGAAGGAGAACTCAAGGAGGCCCAAGCTGAAGCCCAGACGTGGAAGTCCACCTCCGATGACGACAAGGTGGAGCTAAAAATCGTGAAG
GCGGAGGTTGCCCGGCACATGGAGCACCTGAGGGGTGCGCATGCTGTGGCCAAGGGCCTGGAGAAGGAGAAGTTTGCGCTGTTGAAGGAGAATGATGAACTTCAGCGCCT
TCGAGAGGACCTGGAGGGCAAGCTGAGTGCTAGTGACTCCGAGGCGGCGGAACTTAAGGCTAAGCTCTAG
Protein sequenceShow/hide protein sequence
MRREDLHKRTKSPTLKSLSSSDSILSSRRNSIPDNEMRPSVYRFRLRQGLRQVITLVGIEVGCVGFEQIELRHTLGNVDGPLSAGFDLDLTEKFVILGEADSLGDFLGRH
DKCYHWEISKTEPAVNAHLITFDVQTSSDLRRPIWDMCVAKIEFRATINTTKVLFSFFTLSEILEFDLVRSESLGQSGSFSFFKFTLLFSKSCRHLLVPPAQVIVVLLMG
SSELGRPSPRRADSVEKFANRLDSELEEEIDNFGFSEDDGDDSDTSTLGQGLEFPFQMPESYLGSLHTRYSIPDDIILRLPKKPGRYYLCARMGARGILKGLTSIKKWVG
KWFFATGVWLARNEVDLPFFSVPLAIRPIPQPSEPTFNALKFFKSKFKSGRQISTLITNKLLLASRLLDYNPLLSPLEARRPNSELAMVCGFSQDVWRKRPRTGQASKNK
EAPSPAIAEPPTEVEVVEGDSEEGSPKKSKKKKRKTHHSEDEVRELRARRRISPFGELVNDPEARMGGTSNIEMKFKVEPSSARVRERAMEMSSSYFNRCWRRASKFFSA
PRSAIQRLLDFTAEVRLCLPSFVFVCLPFILTFRFPLSLPTCSRCSCQTAIRVKAELDRRELLTLKEREASLAALETVAALEGELKEAQAEAQTWKSTSDDDKVELKIVK
AEVARHMEHLRGAHAVAKGLEKEKFALLKENDELQRLREDLEGKLSASDSEAAELKAKL