; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC07g0254 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC07g0254
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111488680
Genome locationMC07:7723579..7724424
RNA-Seq ExpressionMC07g0254
SyntenyMC07g0254
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575310.1 hypothetical protein SDJN03_25949, partial [Cucurbita argyrosperma subsp. sororia]2.12e-13268.69Show/hide
Query:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQ-------RLLLPSGPVSLPVFLLILAKGHRINA
        L FFKCTRWQLEET++K +CPYHY+CD+IY GDYP AVD LVL FTVA Y+STL  M+A  SS   +       R LLPSGPVSLP+FL +L KGHRIN 
Subjt:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQ-------RLLLPSGPVSLPVFLLILAKGHRINA

Query:  AFPLFLLGPAILNLVYISALSFES-GADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTT
         FPLFL+GPAIL+LVYISAL+F+S G DKDIKYVF EASTMSGILHASLNLD++ILPYYTGLDAL+GS  SG CPSCVCR E LVVGGRL+SYRGWS TT
Subjt:  AFPLFLLGPAILNLVYISALSFES-GADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTT

Query:  FVVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRNSLLE-----GAVYGGIFALVFVHVIKLVLRRWRRVC--GGNEELEKV
        FVVVCAL  RIVCR+SGEK   +   VLR +LEGLGWV IT D VYLSRN  LE     G  YG +F LVFVHV+K+V RRW+ +C  G + EL+ V
Subjt:  FVVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRNSLLE-----GAVYGGIFALVFVHVIKLVLRRWRRVC--GGNEELEKV

KAG7013842.1 hypothetical protein SDJN02_24011, partial [Cucurbita argyrosperma subsp. argyrosperma]8.60e-13268.35Show/hide
Query:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQ-------RLLLPSGPVSLPVFLLILAKGHRINA
        L FFKCTRWQLEET++K +CPYHY+CD+IY GDYP AVD LVL FTVA Y+STL  M+A  SS   +       R LLPSGPVSLP+FL +L KGHRIN 
Subjt:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQ-------RLLLPSGPVSLPVFLLILAKGHRINA

Query:  AFPLFLLGPAILNLVYISALSFES-GADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTT
         FPLFL+GPAIL+LVYISAL+F+S G DKDIKYVF EASTMSGILHASLNLD++ILPYYTGLDAL+GS  SG CPSCVCR + LVVGGRL+SYRGWS TT
Subjt:  AFPLFLLGPAILNLVYISALSFES-GADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTT

Query:  FVVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRNSLLE-----GAVYGGIFALVFVHVIKLVLRRWRRVC--GGNEELEKV
        FVVVCAL  RIVCR+SGEK   +   VLR +LEGLGWV IT D VYLSRN  LE     G  YG +F LVFVHV+K+V RRW+ +C  G + EL+ V
Subjt:  FVVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRNSLLE-----GAVYGGIFALVFVHVIKLVLRRWRRVC--GGNEELEKV

XP_022158846.1 uncharacterized protein LOC111025312 [Momordica charantia]1.46e-188100Show/hide
Query:  MEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQRLLLPSGPVSLPVFLLILAKGHRINAAFPLFLLGPAILNLVYISALS
        MEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQRLLLPSGPVSLPVFLLILAKGHRINAAFPLFLLGPAILNLVYISALS
Subjt:  MEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQRLLLPSGPVSLPVFLLILAKGHRINAAFPLFLLGPAILNLVYISALS

Query:  FESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTFVVVCALGARIVCRLSGEKATT
        FESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTFVVVCALGARIVCRLSGEKATT
Subjt:  FESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTFVVVCALGARIVCRLSGEKATT

Query:  KFGGVLRLLLEGLGWVLITLDSVYLSRNSLLEGAVYGGIFALVFVHVIKLVLRRWRRVCGGNEELEKV
        KFGGVLRLLLEGLGWVLITLDSVYLSRNSLLEGAVYGGIFALVFVHVIKLVLRRWRRVCGGNEELEKV
Subjt:  KFGGVLRLLLEGLGWVLITLDSVYLSRNSLLEGAVYGGIFALVFVHVIKLVLRRWRRVCGGNEELEKV

XP_022929796.1 uncharacterized protein LOC111436298 [Cucurbita moschata]3.30e-12967.12Show/hide
Query:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQ-------RLLLPSGPVSLPVFLLILAKGHRINA
        L FFKCTRWQLEET++K +CPYHY+CD++Y G+YP  VD LVL FTVA Y+STL  M+A  SS   +       R LLPSGPVSLP+FL +L KGHRIN 
Subjt:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQ-------RLLLPSGPVSLPVFLLILAKGHRINA

Query:  AFPLFLLGPAILNLVYISALSFES-GADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTT
         FPLFL+GPAIL+LVYISAL+F++ G DKDIKYVF EASTMSGILHASLNLD++ILPYYTGLDAL+GS  SG CPSCVCR E LVVGGRL+SYRGWS TT
Subjt:  AFPLFLLGPAILNLVYISALSFES-GADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTT

Query:  FVVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRNSLLE-----GAVYGGIFALVFVHVIKLVLRRWRRVC--GGNEELE
        FVVVCAL  RIVCR+SGEK   +   VLR +LEGLGWV IT D VYLS N  LE     G  YG +F LVFVHV+K+V RRW+ +C  G + EL+
Subjt:  FVVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRNSLLE-----GAVYGGIFALVFVHVIKLVLRRWRRVC--GGNEELE

XP_038875593.1 uncharacterized protein LOC120068005 [Benincasa hispida]5.35e-13067.58Show/hide
Query:  FFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPS------QRLLLPSGPVSLPVFLLILAKGHRINAAFP
        FFKCTRWQLEET++K SCP+HY+CDSIY GDYPAA+D LVL FT A YMSTL  M+A  S R        ++ LLPSGP SLP+FL +LAKG+RIN  FP
Subjt:  FFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPS------QRLLLPSGPVSLPVFLLILAKGHRINAAFP

Query:  LFLLGPAILNLVYISALSFESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTFVVV
        LFL+GP IL++VYISAL+F++GADKDIKYVF EASTMSGILHASLNLDSVILPYYTGLDALVGS  SG CPSCVCR+  L VGGR +SYRGWS TTFVVV
Subjt:  LFLLGPAILNLVYISALSFESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTFVVV

Query:  CALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRN-----SLLEGAVYGGIFALVFVHVIKLVLRRWRRVC--GGNEELEKV
        C L  RIVCR++G+K   K   VL+ LLEGLGWVLIT D VYLS N       L+G VYG +F LVFVH+IKL LRRW  +C  G   +L++V
Subjt:  CALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRN-----SLLEGAVYGGIFALVFVHVIKLVLRRWRRVC--GGNEELEKV

TrEMBL top hitse value%identityAlignment
A0A0A0K9S3 Uncharacterized protein9.55e-12464.86Show/hide
Query:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQ-------RLLLPSGPVSLPVFLLILAKGHRINA
        LIFFKCTRWQLEET++K SCP+HY+CD+IY GDYP A+D LVL FT   Y+STL  M+   SS   +       + LLPSGP SLPVFL +LAKGHRIN 
Subjt:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQ-------RLLLPSGPVSLPVFLLILAKGHRINA

Query:  AFPLFLLGPAILNLVYISALSFESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTF
         FPLFL+GP IL L+YISAL+F++GADKDIKYVF EASTMSGILHASLNLD VILPYYTGLDAL+GS  SG C SCVCR+  LVVGGR +SYRGWS TTF
Subjt:  AFPLFLLGPAILNLVYISALSFESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTF

Query:  VVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRN-----SLLEGAVYGGIFALVFVHVIKLVLRRWRRV-CGGN-EELEKV
        V+VC L  RIV R++G +   K    L+ LLEGLGWVLIT D VYLS N       L+G VYG +F LVF+HVIKL LRRW+ + C  N ++L+KV
Subjt:  VVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRN-----SLLEGAVYGGIFALVFVHVIKLVLRRWRRV-CGGN-EELEKV

A0A1S3CC89 uncharacterized protein LOC1034988286.54e-12565.54Show/hide
Query:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQ-------RLLLPSGPVSLPVFLLILAKGHRINA
        LIFFKCTRWQLEET++K SCP+HY+CD+IY GDYPAA+D LVL FT   Y+STL  M+   SS   +       + LLPSGP SLPVFL +LAKGHRIN 
Subjt:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQ-------RLLLPSGPVSLPVFLLILAKGHRINA

Query:  AFPLFLLGPAILNLVYISALSFESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTF
         FPLFL+GP IL L+YISAL+F++GADKDIKYVF EASTMSGILHASLNLD VILPYYTGLDAL+GS  SG C SCVCR+  LVVGGR ++YRGWS TTF
Subjt:  AFPLFLLGPAILNLVYISALSFESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTF

Query:  VVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRN-----SLLEGAVYGGIFALVFVHVIKLVLRRWRRV-CGGN-EELEKV
        V+VC L  RIVCR++G +   K    L+ LLEGLGWVLIT D VYLS N       L+G VYG +F LVFVHVIKL LRRW+ + C  N  +L+KV
Subjt:  VVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRN-----SLLEGAVYGGIFALVFVHVIKLVLRRWRRV-CGGN-EELEKV

A0A6J1E258 uncharacterized protein LOC1110253127.07e-189100Show/hide
Query:  MEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQRLLLPSGPVSLPVFLLILAKGHRINAAFPLFLLGPAILNLVYISALS
        MEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQRLLLPSGPVSLPVFLLILAKGHRINAAFPLFLLGPAILNLVYISALS
Subjt:  MEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQRLLLPSGPVSLPVFLLILAKGHRINAAFPLFLLGPAILNLVYISALS

Query:  FESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTFVVVCALGARIVCRLSGEKATT
        FESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTFVVVCALGARIVCRLSGEKATT
Subjt:  FESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTFVVVCALGARIVCRLSGEKATT

Query:  KFGGVLRLLLEGLGWVLITLDSVYLSRNSLLEGAVYGGIFALVFVHVIKLVLRRWRRVCGGNEELEKV
        KFGGVLRLLLEGLGWVLITLDSVYLSRNSLLEGAVYGGIFALVFVHVIKLVLRRWRRVCGGNEELEKV
Subjt:  KFGGVLRLLLEGLGWVLITLDSVYLSRNSLLEGAVYGGIFALVFVHVIKLVLRRWRRVCGGNEELEKV

A0A6J1EP56 uncharacterized protein LOC1114362981.60e-12967.12Show/hide
Query:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQ-------RLLLPSGPVSLPVFLLILAKGHRINA
        L FFKCTRWQLEET++K +CPYHY+CD++Y G+YP  VD LVL FTVA Y+STL  M+A  SS   +       R LLPSGPVSLP+FL +L KGHRIN 
Subjt:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQ-------RLLLPSGPVSLPVFLLILAKGHRINA

Query:  AFPLFLLGPAILNLVYISALSFES-GADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTT
         FPLFL+GPAIL+LVYISAL+F++ G DKDIKYVF EASTMSGILHASLNLD++ILPYYTGLDAL+GS  SG CPSCVCR E LVVGGRL+SYRGWS TT
Subjt:  AFPLFLLGPAILNLVYISALSFES-GADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTT

Query:  FVVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRNSLLE-----GAVYGGIFALVFVHVIKLVLRRWRRVC--GGNEELE
        FVVVCAL  RIVCR+SGEK   +   VLR +LEGLGWV IT D VYLS N  LE     G  YG +F LVFVHV+K+V RRW+ +C  G + EL+
Subjt:  FVVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRNSLLE-----GAVYGGIFALVFVHVIKLVLRRWRRVC--GGNEELE

A0A6J1JX99 LOW QUALITY PROTEIN: uncharacterized protein LOC1114886806.14e-12765.66Show/hide
Query:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVA-YTSSRPS------QRLLLPSGPVSLPVFLLILAKGHRINA
        L FFKCTRWQLEET++K +CPYHY+CD++Y GDYP  +D LVL FTVA Y+STL  M+  + SSR        +R LLPSGPVSLP+FL +L KGHRIN 
Subjt:  LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVA-YTSSRPS------QRLLLPSGPVSLPVFLLILAKGHRINA

Query:  AFPLFLLGPAILNLVYISALSFES-GADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTT
         FPLFL+GPAIL LVYISAL+F++ G+DKDIKYVF EASTMSGILHASLNLD++I+PYYTGLDAL+GS  SG CPSCVCR+E LVVGG+ +SYRGWS TT
Subjt:  AFPLFLLGPAILNLVYISALSFES-GADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTT

Query:  FVVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRNSLLE-----GAVYGGIFALVFVHVIKLVLRRWRRVC--GGNEELEKV
        FVVVCAL  RIVCR+SGEK   +   VLR +LEGLGW  ITLD VYLS N  LE     G  YG +F LVFVHV+K+V RRW+ +   G + EL+ V
Subjt:  FVVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRNSLLE-----GAVYGGIFALVFVHVIKLVLRRWRRVC--GGNEELEKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41610.1 unknown protein1.9e-7350.89Show/hide
Query:  FFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSR------------PSQRLLLPSGPVSLPVFLLILAKGHR
        FFKCT+WQ E+T++ ++CP+HYFCDSIYAGDYP   D LV  F    Y++TL  +V    SR             ++R LLPSGP+SLP+ +LILAKG R
Subjt:  FFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSR------------PSQRLLLPSGPVSLPVFLLILAKGHR

Query:  INAAFPLFLLGPAILNLVYISALSFESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSR
        IN  FP+ + GPAIL LV +S L FE+  +K+  +VF EAST+SGILHASL LD+VILPYYTG DALV ST SG C SC+CR E L+VGG+++SYRGWS 
Subjt:  INAAFPLFLLGPAILNLVYISALSFESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSR

Query:  TTFVVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRNSLLEGA------VYGGIFALVFVHVIKLV
        TTF+VV  L  RI+C+L  E+   K   V++ +++GL  +++  D VYL+  S +E        V+G +  L+ V+VI  V
Subjt:  TTFVVVCALGARIVCRLSGEKATTKFGGVLRLLLEGLGWVLITLDSVYLSRNSLLEGA------VYGGIFALVFVHVIKLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTAATCTTCTTCAAGTGCACCCGGTGGCAACTGGAAGAAACCATGGAAAAACTCAGTTGTCCTTATCACTACTTCTGCGACAGCATCTATGCCGGGGACTACCCCGCCGC
AGTCGATTTCCTCGTCCTCGCCTTCACCGTCGCCGCCTACATGTCCACCCTTTCCACCATGGTTGCCTATACGTCGTCCCGTCCTAGCCAGAGACTGCTGCTACCATCCG
GCCCGGTTTCCCTCCCGGTTTTCCTCCTCATCTTAGCCAAAGGCCACCGCATCAATGCTGCCTTCCCTCTCTTCCTCCTCGGACCCGCGATCCTCAATCTCGTTTACATT
TCTGCTCTTTCCTTCGAGAGCGGCGCTGACAAGGACATAAAATATGTGTTTTTGGAAGCTTCAACGATGTCGGGCATTCTCCACGCCAGCTTGAACCTGGACTCTGTGAT
TCTGCCGTACTACACGGGGCTGGATGCTCTCGTGGGGTCTACATTGTCCGGGGGATGCCCGTCGTGTGTTTGCAGAGACGAGGCGCTGGTGGTCGGCGGCAGGTTGTTGT
CTTACAGGGGATGGTCGAGGACAACGTTTGTTGTGGTGTGTGCTTTGGGGGCGAGAATTGTTTGTCGGCTGTCGGGAGAGAAGGCAACCACAAAATTTGGTGGGGTTTTG
AGGTTGTTGTTGGAAGGCTTGGGATGGGTGCTTATAACGTTGGACTCTGTTTATTTGAGTAGGAACTCTTTGTTGGAAGGGGCTGTGTATGGTGGAATATTTGCTCTGGT
GTTTGTTCATGTGATTAAATTGGTGCTGAGGAGATGGCGGAGGGTGTGTGGTGGGAATGAGGAATTGGAGAAAGTG
mRNA sequenceShow/hide mRNA sequence
CTAATCTTCTTCAAGTGCACCCGGTGGCAACTGGAAGAAACCATGGAAAAACTCAGTTGTCCTTATCACTACTTCTGCGACAGCATCTATGCCGGGGACTACCCCGCCGC
AGTCGATTTCCTCGTCCTCGCCTTCACCGTCGCCGCCTACATGTCCACCCTTTCCACCATGGTTGCCTATACGTCGTCCCGTCCTAGCCAGAGACTGCTGCTACCATCCG
GCCCGGTTTCCCTCCCGGTTTTCCTCCTCATCTTAGCCAAAGGCCACCGCATCAATGCTGCCTTCCCTCTCTTCCTCCTCGGACCCGCGATCCTCAATCTCGTTTACATT
TCTGCTCTTTCCTTCGAGAGCGGCGCTGACAAGGACATAAAATATGTGTTTTTGGAAGCTTCAACGATGTCGGGCATTCTCCACGCCAGCTTGAACCTGGACTCTGTGAT
TCTGCCGTACTACACGGGGCTGGATGCTCTCGTGGGGTCTACATTGTCCGGGGGATGCCCGTCGTGTGTTTGCAGAGACGAGGCGCTGGTGGTCGGCGGCAGGTTGTTGT
CTTACAGGGGATGGTCGAGGACAACGTTTGTTGTGGTGTGTGCTTTGGGGGCGAGAATTGTTTGTCGGCTGTCGGGAGAGAAGGCAACCACAAAATTTGGTGGGGTTTTG
AGGTTGTTGTTGGAAGGCTTGGGATGGGTGCTTATAACGTTGGACTCTGTTTATTTGAGTAGGAACTCTTTGTTGGAAGGGGCTGTGTATGGTGGAATATTTGCTCTGGT
GTTTGTTCATGTGATTAAATTGGTGCTGAGGAGATGGCGGAGGGTGTGTGGTGGGAATGAGGAATTGGAGAAAGTG
Protein sequenceShow/hide protein sequence
LIFFKCTRWQLEETMEKLSCPYHYFCDSIYAGDYPAAVDFLVLAFTVAAYMSTLSTMVAYTSSRPSQRLLLPSGPVSLPVFLLILAKGHRINAAFPLFLLGPAILNLVYI
SALSFESGADKDIKYVFLEASTMSGILHASLNLDSVILPYYTGLDALVGSTLSGGCPSCVCRDEALVVGGRLLSYRGWSRTTFVVVCALGARIVCRLSGEKATTKFGGVL
RLLLEGLGWVLITLDSVYLSRNSLLEGAVYGGIFALVFVHVIKLVLRRWRRVCGGNEELEKV