; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g36700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g36700
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein
Genome locationchr8:27279738..27281520
RNA-Seq ExpressionMoc08g36700
SyntenyMoc08g36700
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR016140 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain
IPR036312 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN67047.1 hypothetical protein VITISV_001152 [Vitis vinifera]6.0e-4041.84Show/hide
Query:  LHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIISRYRLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVD----------LWCCSKKRAE
        L+  PSG  C   ++  S   + YR       L    + K++S  RL+ VL  TI   Q  FV+G+QI DA+L+ANE+VD          ++    ++A 
Subjt:  LHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIISRYRLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVD----------LWCCSKKRAE

Query:  DLPHLARI---WDCPIISLLINYLGMPLGGNPKASPFWDPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLW
        D  H++R+    DC + S  I YLG+PLGGNPK   FW+P+IER  R+L+ WK + +S  GR+TLIQS L+ +P Y+LS+FK P  +   +E+L RDFLW
Subjt:  DLPHLARI---WDCPIISLLINYLGMPLGGNPKASPFWDPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLW

Query:  KGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALL
         G+G+ +  HLV W++V   K+K  LG   ISI N ALL
Subjt:  KGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALL

CAN70928.1 hypothetical protein VITISV_016057 [Vitis vinifera]3.3e-3842.34Show/hide
Query:  KIISRY---RLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVDLWCCSKKRAED--------------------------------LPHLARIWDCPII
        KII++    RL+ +L  TI   Q AFV+GRQI DA+L+ANE+VD     KKR+ +                                L  LA++ DC   
Subjt:  KIISRY---RLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVDLWCCSKKRAED--------------------------------LPHLARIWDCPII

Query:  SLLINYLGMPLGGNPKASPFWDPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIV
           I YLG+PLGGNPK+  FWDP+IER   +L+ W+ + +S  GR+TLIQS L+ +P Y+LS+FK P  V   +E+L RDFLW G+G  +  HLV W+++
Subjt:  SLLINYLGMPLGGNPKASPFWDPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIV

Query:  TLPKSKGSLGITSISITNKALL
          PK+KG LG   IS+ N ALL
Subjt:  TLPKSKGSLGITSISITNKALL

CAN77234.1 hypothetical protein VITISV_010061 [Vitis vinifera]2.1e-3745.27Show/hide
Query:  VKKIISRYRLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVD----------LWCCSKKRAEDLPHLARI---WDCPIISLLINYLGMPLGGNPKASPF
        + K++S  RL+ VL  TI   Q AFV+GRQI DA+L+ANE+VD          ++    ++A D  HL+R+    DC      I YLG+PLG NPKA  F
Subjt:  VKKIISRYRLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVD----------LWCCSKKRAEDLPHLARI---WDCPIISLLINYLGMPLGGNPKASPF

Query:  WDPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKA
        WDP+IER   +L+ W+ + +S   R+TL QS L+ LP Y+LS+FK P  V   +E+L RDFLW G+G+ +  HLV W++V  PK+ G LG  +IS  N A
Subjt:  WDPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKA

Query:  L
        L
Subjt:  L

KAA0065894.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]7.1e-4148.26Show/hide
Query:  RLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVDLWCCSKKRAE---------------DLP-----HLARIWDCPIISLLINYLGMPLGGNPKASPFW
        RLK+ LP TI+ +Q AFVRGRQITD IL+ANE V      + +A                ++P      +A +W+ P     I+YLG+PLGG P +  FW
Subjt:  RLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVDLWCCSKKRAE---------------DLP-----HLARIWDCPIISLLINYLGMPLGGNPKASPFW

Query:  DPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKAL
          I+++  +KL  WKYS +SK G+LTLIQ+ LSSLPTY LS+FK+P  V K++EK  RDFLWK   D + ++LVRW+IVT PK K  LGIT++  TN AL
Subjt:  DPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKAL

Query:  L
        L
Subjt:  L

XP_022143310.1 uncharacterized protein LOC111013210 [Momordica charantia]9.0e-12969.31Show/hide
Query:  MSKLPITSLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIISRYRLKAVLPSTI
        MSKLPITSLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCY                        YRLKAVLPSTI
Subjt:  MSKLPITSLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIISRYRLKAVLPSTI

Query:  AQHQSAFVRGRQITDAILVANEVVDLWCCSKKR-------------------------------------------------------------------
        AQHQSAFVRGRQITDAILVANEVVDLWCCSKKR                                                                   
Subjt:  AQHQSAFVRGRQITDAILVANEVVDLWCCSKKR-------------------------------------------------------------------

Query:  -------------------------AEDLPHLARIWDCPIISLLINYLGMPLGGNPKASPFWDPIIERRKLESWKYSDISKAGRLTLIQSILSSLPTYYL
                                 AEDLPHLARIWDCPIISLLINYLGMPLGGNPKASPFWDPIIERRKLESWKYSDISKAGRLTLIQSILSSLPTYYL
Subjt:  -------------------------AEDLPHLARIWDCPIISLLINYLGMPLGGNPKASPFWDPIIERRKLESWKYSDISKAGRLTLIQSILSSLPTYYL

Query:  SIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALLTSGCGASSMKITAYGFL
        SIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALLTSGCGASSMKITAYGFL
Subjt:  SIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALLTSGCGASSMKITAYGFL

TrEMBL top hitse value%identityAlignment
A0A5A7VF18 LINE-1 retrotransposable element ORF2 protein3.4e-4148.26Show/hide
Query:  RLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVDLWCCSKKRAE---------------DLP-----HLARIWDCPIISLLINYLGMPLGGNPKASPFW
        RLK+ LP TI+ +Q AFVRGRQITD IL+ANE V      + +A                ++P      +A +W+ P     I+YLG+PLGG P +  FW
Subjt:  RLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVDLWCCSKKRAE---------------DLP-----HLARIWDCPIISLLINYLGMPLGGNPKASPFW

Query:  DPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKAL
          I+++  +KL  WKYS +SK G+LTLIQ+ LSSLPTY LS+FK+P  V K++EK  RDFLWK   D + ++LVRW+IVT PK K  LGIT++  TN AL
Subjt:  DPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKAL

Query:  L
        L
Subjt:  L

A0A6J1CNG6 uncharacterized protein LOC1110132104.3e-12969.31Show/hide
Query:  MSKLPITSLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIISRYRLKAVLPSTI
        MSKLPITSLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCY                        YRLKAVLPSTI
Subjt:  MSKLPITSLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIISRYRLKAVLPSTI

Query:  AQHQSAFVRGRQITDAILVANEVVDLWCCSKKR-------------------------------------------------------------------
        AQHQSAFVRGRQITDAILVANEVVDLWCCSKKR                                                                   
Subjt:  AQHQSAFVRGRQITDAILVANEVVDLWCCSKKR-------------------------------------------------------------------

Query:  -------------------------AEDLPHLARIWDCPIISLLINYLGMPLGGNPKASPFWDPIIERRKLESWKYSDISKAGRLTLIQSILSSLPTYYL
                                 AEDLPHLARIWDCPIISLLINYLGMPLGGNPKASPFWDPIIERRKLESWKYSDISKAGRLTLIQSILSSLPTYYL
Subjt:  -------------------------AEDLPHLARIWDCPIISLLINYLGMPLGGNPKASPFWDPIIERRKLESWKYSDISKAGRLTLIQSILSSLPTYYL

Query:  SIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALLTSGCGASSMKITAYGFL
        SIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALLTSGCGASSMKITAYGFL
Subjt:  SIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALLTSGCGASSMKITAYGFL

A5ARB2 Reverse transcriptase domain-containing protein2.9e-4041.84Show/hide
Query:  LHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIISRYRLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVD----------LWCCSKKRAE
        L+  PSG  C   ++  S   + YR       L    + K++S  RL+ VL  TI   Q  FV+G+QI DA+L+ANE+VD          ++    ++A 
Subjt:  LHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIISRYRLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVD----------LWCCSKKRAE

Query:  DLPHLARI---WDCPIISLLINYLGMPLGGNPKASPFWDPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLW
        D  H++R+    DC + S  I YLG+PLGGNPK   FW+P+IER  R+L+ WK + +S  GR+TLIQS L+ +P Y+LS+FK P  +   +E+L RDFLW
Subjt:  DLPHLARI---WDCPIISLLINYLGMPLGGNPKASPFWDPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLW

Query:  KGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALL
         G+G+ +  HLV W++V   K+K  LG   ISI N ALL
Subjt:  KGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALL

A5BSG1 Uncharacterized protein1.0e-3745.27Show/hide
Query:  VKKIISRYRLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVD----------LWCCSKKRAEDLPHLARI---WDCPIISLLINYLGMPLGGNPKASPF
        + K++S  RL+ VL  TI   Q AFV+GRQI DA+L+ANE+VD          ++    ++A D  HL+R+    DC      I YLG+PLG NPKA  F
Subjt:  VKKIISRYRLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVD----------LWCCSKKRAEDLPHLARI---WDCPIISLLINYLGMPLGGNPKASPF

Query:  WDPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKA
        WDP+IER   +L+ W+ + +S   R+TL QS L+ LP Y+LS+FK P  V   +E+L RDFLW G+G+ +  HLV W++V  PK+ G LG  +IS  N A
Subjt:  WDPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKA

Query:  L
        L
Subjt:  L

A5C6D8 Reverse transcriptase domain-containing protein1.6e-3842.34Show/hide
Query:  KIISRY---RLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVDLWCCSKKRAED--------------------------------LPHLARIWDCPII
        KII++    RL+ +L  TI   Q AFV+GRQI DA+L+ANE+VD     KKR+ +                                L  LA++ DC   
Subjt:  KIISRY---RLKAVLPSTIAQHQSAFVRGRQITDAILVANEVVDLWCCSKKRAED--------------------------------LPHLARIWDCPII

Query:  SLLINYLGMPLGGNPKASPFWDPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIV
           I YLG+PLGGNPK+  FWDP+IER   +L+ W+ + +S  GR+TLIQS L+ +P Y+LS+FK P  V   +E+L RDFLW G+G  +  HLV W+++
Subjt:  SLLINYLGMPLGGNPKASPFWDPIIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIV

Query:  TLPKSKGSLGITSISITNKALL
          PK+KG LG   IS+ N ALL
Subjt:  TLPKSKGSLGITSISITNKALL

SwissProt top hitse value%identityAlignment
A2XBN5 Non-specific lipid-transfer protein 21.2e-0632.1Show/hide
Query:  SLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIIS
        ++ +L  AMV    G     A A+C+   L  C  A     RP+ ACC  +R Q+ C+C + ++P+Y  Y+ S   +K +S
Subjt:  SLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIIS

P0C2F6 Putative ribonuclease H protein At1g657501.3e-1337Show/hide
Query:  IIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALLT
        I+ER   ++  W+   +S AGRLTL +++LSS+P + +S    P  +   L++L R FLW    +K+  HLV+W+ V  PK +G LG+ +    N+AL++
Subjt:  IIER--RKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALLT

P20145 Probable non-specific lipid-transfer protein1.5e-0639.13Show/hide
Query:  LVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQS
        ++  MVV LA    A A A C+   L  C  A     +PSG CC  +R Q+ C C Y ++P Y HY+ S
Subjt:  LVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQS

P86809 Non-specific lipid-transfer protein 28.2e-0840.68Show/hide
Query:  ATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIIS
        ATC    L PCL A T +  PS ACC +++EQ+ C C Y ++P   +Y+ S   +K  S
Subjt:  ATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIIS

Q10ST8 Non-specific lipid-transfer protein 22.6e-0632.1Show/hide
Query:  SLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIIS
        ++ +L  AMV    G     A A C+   L  C  A     RP+ ACC  +R Q+ C+C + ++P+Y  Y+ S   +K +S
Subjt:  SLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIIS

Arabidopsis top hitse value%identityAlignment
AT1G66850.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein1.8e-1043.86Show/hide
Query:  TCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKII
        TCD   L+PCL A T   +PSGACC ++ EQ+SC C + +NP +  Y+ S   +K++
Subjt:  TCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKII

AT3G18280.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein1.7e-0834.78Show/hide
Query:  MSKLPITSLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIISRYRL
        M K  + SLF L A ++++LA    A  A TC    L PC  A T    PS  CC +++EQ  C C Y RNP    ++ +   +K+    +L
Subjt:  MSKLPITSLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIISRYRL

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.0e-1432.06Show/hide
Query:  SLLINYLGMPLGGNPKASPFWDPIIE--RRKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIV
        +L + YLG+PL      +  + P++E  R ++  W    +S AGRL LI S++ SL  +++S F+ P    K ++ +   FLW G         V W+ V
Subjt:  SLLINYLGMPLGGNPKASPFWDPIIE--RRKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKALEKLMRDFLWKGIGDKRGSHLVRWNIV

Query:  TLPKSKGSLGITSISITNKALLTSGCGASSM
          PK +G LGI S+   NK    S  G +++
Subjt:  TLPKSKGSLGITSISITNKALLTSGCGASSM

AT5G38160.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein1.9e-0735.48Show/hide
Query:  EAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIIS
        E   TCD   L  C+ A +    PS  CC +++E E+C C Y +NP Y+ Y+ S   +K ++
Subjt:  EAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIIS

AT5G38170.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein8.4e-0835.53Show/hide
Query:  VAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIIS
        + A  V L+G    EA  TCD   L  C       V PS  CC +++EQ+ C+C Y ++P+Y+ Y+ S+  KK ++
Subjt:  VAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAAACTTCCGATCACAAGCCTTTTCCTTTTGGTTGCTGCAATGGTGGTGCTTCTCGCTGGAGCTCGTGCAGCGGAGGCTGCGGCGACTTGCGACCTTGACAATCT
GAAACCATGTCTCTTGGCTTTCACGTTGCATGTGCGGCCATCGGGAGCTTGCTGCCGGAGGGTGAGGGAGCAAGAATCATGCTACTGTGCGTACTACAGAAATCCAAAAT
ATACGCATTACTTGCAATCTTCTGCCGTCAAAAAGATCATTTCGAGATATCGTTTGAAAGCTGTTCTTCCCTCTACAATAGCTCAACATCAGTCGGCTTTTGTTAGAGGG
CGTCAAATTACCGATGCAATTTTAGTGGCCAACGAAGTGGTCGATCTCTGGTGTTGCTCAAAGAAACGGGCTGAAGATCTCCCCCATCTAGCTAGAATATGGGATTGCCC
AATTATTTCTCTTCTGATTAATTATCTGGGGATGCCACTGGGAGGTAATCCAAAGGCTTCACCTTTTTGGGATCCGATCATCGAGCGCAGAAAACTTGAAAGTTGGAAAT
ATTCTGACATATCAAAAGCTGGTCGCCTAACTCTCATTCAATCGATTCTAAGCAGTTTGCCAACTTATTACCTATCGATCTTCAAATCCCCCATTCAAGTTACTAAAGCA
CTTGAGAAGCTTATGAGGGACTTCCTTTGGAAAGGTATCGGCGATAAACGAGGATCTCATCTTGTCAGATGGAACATTGTCACTCTCCCCAAGTCAAAGGGGAGTCTTGG
CATTACAAGCATTAGCATCACAAACAAGGCTCTACTTACAAGTGGTTGTGGCGCTTCTTCAATGAAAATAACAGCCTATGGGTTTCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCAAACTTCCGATCACAAGCCTTTTCCTTTTGGTTGCTGCAATGGTGGTGCTTCTCGCTGGAGCTCGTGCAGCGGAGGCTGCGGCGACTTGCGACCTTGACAATCT
GAAACCATGTCTCTTGGCTTTCACGTTGCATGTGCGGCCATCGGGAGCTTGCTGCCGGAGGGTGAGGGAGCAAGAATCATGCTACTGTGCGTACTACAGAAATCCAAAAT
ATACGCATTACTTGCAATCTTCTGCCGTCAAAAAGATCATTTCGAGATATCGTTTGAAAGCTGTTCTTCCCTCTACAATAGCTCAACATCAGTCGGCTTTTGTTAGAGGG
CGTCAAATTACCGATGCAATTTTAGTGGCCAACGAAGTGGTCGATCTCTGGTGTTGCTCAAAGAAACGGGCTGAAGATCTCCCCCATCTAGCTAGAATATGGGATTGCCC
AATTATTTCTCTTCTGATTAATTATCTGGGGATGCCACTGGGAGGTAATCCAAAGGCTTCACCTTTTTGGGATCCGATCATCGAGCGCAGAAAACTTGAAAGTTGGAAAT
ATTCTGACATATCAAAAGCTGGTCGCCTAACTCTCATTCAATCGATTCTAAGCAGTTTGCCAACTTATTACCTATCGATCTTCAAATCCCCCATTCAAGTTACTAAAGCA
CTTGAGAAGCTTATGAGGGACTTCCTTTGGAAAGGTATCGGCGATAAACGAGGATCTCATCTTGTCAGATGGAACATTGTCACTCTCCCCAAGTCAAAGGGGAGTCTTGG
CATTACAAGCATTAGCATCACAAACAAGGCTCTACTTACAAGTGGTTGTGGCGCTTCTTCAATGAAAATAACAGCCTATGGGTTTCTTTGA
Protein sequenceShow/hide protein sequence
MSKLPITSLFLLVAAMVVLLAGARAAEAAATCDLDNLKPCLLAFTLHVRPSGACCRRVREQESCYCAYYRNPKYTHYLQSSAVKKIISRYRLKAVLPSTIAQHQSAFVRG
RQITDAILVANEVVDLWCCSKKRAEDLPHLARIWDCPIISLLINYLGMPLGGNPKASPFWDPIIERRKLESWKYSDISKAGRLTLIQSILSSLPTYYLSIFKSPIQVTKA
LEKLMRDFLWKGIGDKRGSHLVRWNIVTLPKSKGSLGITSISITNKALLTSGCGASSMKITAYGFL