; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC07g0566 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC07g0566
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionFamily of unknown function (DUF716)
Genome locationMC07:13703803..13704543
RNA-Seq ExpressionMC07g0566
SyntenyMC07g0566
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006904 - Protein of unknown function DUF716


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_016902581.1 PREDICTED: uncharacterized protein LOC103499449 isoform X1 [Cucumis melo]5.56e-15289.43Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL
        MASLATHLSASLFL+PIG+RRLLCSSS+YL NPSLYRSK WYLSEPKWKNFDLYSLI+ LPI AFSEIFLFLAFSGNPTYRFAFSQQS AIFFFWALAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL

Query:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        I+LRENVDP+L SESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLT++CACSC YLSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYTKAFALK
Subjt:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GDA VHC+LEEDGLRGIALMNL+FIGHA LVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

XP_022138433.1 uncharacterized protein LOC111009606 [Momordica charantia]5.73e-16799.6Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL
        MASLATHLSASLFLVPI VRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL

Query:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
Subjt:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
        GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF

XP_023547967.1 uncharacterized protein LOC111806759 [Cucurbita pepo subsp. pepo]1.81e-14486.99Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL
        MASLATHLSASL LVPIGVRRLL SSS+YL NPSLYRSK WYLSEPKWKNFDLYSL++ LPIAAFSEIFLF+AFSGNPTYRFAF QQS AIFFFWALAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL

Query:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        IILREN+DP+L SESFIFVFAG+AFLVEYSVIGKGITGLGG  YHISGGLTL+CACSCL+LSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYT AFALK
Subjt:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GDA V C+LEEDGLRGIALMNL+FIGH VLVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

XP_031736602.1 uncharacterized protein LOC101208255 isoform X1 [Cucumis sativus]3.74e-15088.62Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL
        MASLAT LSASLFL+PIG+RRLLCSSS+YL NPSLYRSK WYLSEPKWKNFDLYSLI+ LPIAAFSEIFLFLAFSGNPTY+FAFSQQS AIFFFWALAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL

Query:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        I+LRENVDP+L SESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLT++CACSC YLSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYTKAFALK
Subjt:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GD  VHC+LEEDGLRGIALMNL+FIGHA LVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

XP_038874543.1 uncharacterized protein LOC120067159 [Benincasa hispida]6.47e-15190.24Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL
        MASLATHLS+SLFLVPIGVRRLLCSSS+YLKNPSLYRSK WYLSEPKWKNFDLYSLI+ LPIAAFSEIFLFLAFSGNPTYRFAFSQQS AIF FWALAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL

Query:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        IILRENVDP+L SES IFVFAGIAFL+EYSVIGKGITGLGGAFY ISGGLTL+CAC CLYLSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYT AFALK
Subjt:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GDA VHCELEEDGLRGIALMNL+FIGHAVLVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

TrEMBL top hitse value%identityAlignment
A0A5D3B6R4 Uncharacterized protein2.69e-15289.43Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL
        MASLATHLSASLFL+PIG+RRLLCSSS+YL NPSLYRSK WYLSEPKWKNFDLYSLI+ LPI AFSEIFLFLAFSGNPTYRFAFSQQS AIFFFWALAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL

Query:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        I+LRENVDP+L SESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLT++CACSC YLSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYTKAFALK
Subjt:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GDA VHC+LEEDGLRGIALMNL+FIGHA LVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

A0A6J1CCY9 uncharacterized protein LOC1110096062.77e-16799.6Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL
        MASLATHLSASLFLVPI VRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL

Query:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
Subjt:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
        GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF

A0A6J1H5S4 uncharacterized protein LOC111460286 isoform X23.85e-14386.18Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL
        MASLATHLSA+LFLVPIGVRRLL SSSVYL NPSLYRSK WYLSEPKWKNFDLYSL++  PIAAFSEIFLF+AFSGNPTYRFAF QQS AIFFFWALAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL

Query:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        IILREN+DP+L SESFIFVFAG+AFLVEYSVIGKGITGLGG  YHISGGLTL+CACSCL+LSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYT AF+LK
Subjt:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GDA V C+LEEDGLRGIALM+L+FIGH VLVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

B0F814 uncharacterized protein LOC103499449 isoform X12.69e-15289.43Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL
        MASLATHLSASLFL+PIG+RRLLCSSS+YL NPSLYRSK WYLSEPKWKNFDLYSLI+ LPI AFSEIFLFLAFSGNPTYRFAFSQQS AIFFFWALAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL

Query:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        I+LRENVDP+L SESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLT++CACSC YLSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYTKAFALK
Subjt:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GDA VHC+LEEDGLRGIALMNL+FIGHA LVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

B0F830 Uncharacterized protein1.81e-15088.62Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL
        MASLAT LSASLFL+PIG+RRLLCSSS+YL NPSLYRSK WYLSEPKWKNFDLYSLI+ LPIAAFSEIFLFLAFSGNPTY+FAFSQQS AIFFFWALAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAIL

Query:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        I+LRENVDP+L SESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLT++CACSC YLSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYTKAFALK
Subjt:  IILRENVDPVLASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GD  VHC+LEEDGLRGIALMNL+FIGHA LVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G13890.1 Family of unknown function (DUF716)5.7e-2125.65Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWY-------------LSEPKWKNFDLYSLIVALPIAAFSEIFLF-----LAFSGNPTYRF
        M  +   L AS  L+ +G+  L+C+    LK+P  Y +K +Y              +  + +N  L+ LI++L +A   E  +      L     P +RF
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWY-------------LSEPKWKNFDLYSLIVALPIAAFSEIFLF-----LAFSGNPTYRF

Query:  AFSQQSAAIFFFWALAILIILRENVDPVLASESFIFVFAGIAFLVEYSVIGKG----ITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCL
        +    +A  F F  +AI  +L ++   +      +F  A + F + YS          + L      +S  ++ +C+  CL L+ +   F  +  L++ +
Subjt:  AFSQQSAAIFFFWALAILIILRENVDPVLASESFIFVFAGIAFLVEYSVIGKG----ITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCL

Query:  TFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
          +G W  Q GLSLY   F  +GC  +L + +  +    C++++  LR +++++LMF  H VLV+IL F
Subjt:  TFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF

AT5G13890.2 Family of unknown function (DUF716)5.7e-2125.65Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWY-------------LSEPKWKNFDLYSLIVALPIAAFSEIFLF-----LAFSGNPTYRF
        M  +   L AS  L+ +G+  L+C+    LK+P  Y +K +Y              +  + +N  L+ LI++L +A   E  +      L     P +RF
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWY-------------LSEPKWKNFDLYSLIVALPIAAFSEIFLF-----LAFSGNPTYRF

Query:  AFSQQSAAIFFFWALAILIILRENVDPVLASESFIFVFAGIAFLVEYSVIGKG----ITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCL
        +    +A  F F  +AI  +L ++   +      +F  A + F + YS          + L      +S  ++ +C+  CL L+ +   F  +  L++ +
Subjt:  AFSQQSAAIFFFWALAILIILRENVDPVLASESFIFVFAGIAFLVEYSVIGKG----ITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCL

Query:  TFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
          +G W  Q GLSLY   F  +GC  +L + +  +    C++++  LR +++++LMF  H VLV+IL F
Subjt:  TFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF

AT5G13890.3 Family of unknown function (DUF716)5.7e-2125.65Show/hide
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWY-------------LSEPKWKNFDLYSLIVALPIAAFSEIFLF-----LAFSGNPTYRF
        M  +   L AS  L+ +G+  L+C+    LK+P  Y +K +Y              +  + +N  L+ LI++L +A   E  +      L     P +RF
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWY-------------LSEPKWKNFDLYSLIVALPIAAFSEIFLF-----LAFSGNPTYRF

Query:  AFSQQSAAIFFFWALAILIILRENVDPVLASESFIFVFAGIAFLVEYSVIGKG----ITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCL
        +    +A  F F  +AI  +L ++   +      +F  A + F + YS          + L      +S  ++ +C+  CL L+ +   F  +  L++ +
Subjt:  AFSQQSAAIFFFWALAILIILRENVDPVLASESFIFVFAGIAFLVEYSVIGKG----ITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCL

Query:  TFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
          +G W  Q GLSLY   F  +GC  +L + +  +    C++++  LR +++++LMF  H VLV+IL F
Subjt:  TFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCACTGGCAACGCATCTCTCGGCCTCCCTCTTCCTCGTCCCCATAGGCGTCCGCCGCCTCCTCTGTTCTTCCTCCGTCTACCTCAAAAACCCATCTCTTTACCG
ATCGAAAGCCTGGTACTTATCCGAACCCAAATGGAAAAATTTCGATTTATACTCCCTCATCGTCGCCCTCCCCATCGCTGCCTTCTCCGAAATTTTCCTCTTCCTCGCAT
TTTCCGGCAACCCCACCTACAGATTTGCGTTTTCCCAGCAATCGGCGGCCATTTTCTTCTTCTGGGCCCTCGCGATTCTGATCATTCTGCGGGAAAACGTCGACCCTGTT
CTCGCAAGTGAGAGTTTCATCTTTGTTTTCGCTGGAATCGCGTTTCTGGTTGAGTACTCTGTGATTGGGAAGGGGATTACAGGTCTTGGTGGCGCTTTTTACCACATTTC
CGGAGGATTGACCCTTATTTGTGCTTGTTCTTGCCTGTATTTATCCATGAAACCATCTGCATTTTTCGTCGAATTTATACTTTCTTCTTGCTTAACCTTTAAGGGGACAT
GGCTGTTTCAAGCTGGATTGTCTCTATATACCAAGGCATTTGCCTTGAAGGGGTGTCAGGAAATGCTGGTTTTGCCTGCCATAGGGGACGCTGATGTCCATTGTGAGCTT
GAGGAGGATGGCCTGAGGGGCATTGCTTTGATGAACTTAATGTTCATTGGTCATGCTGTTTTGGTTCTGATTTTGGGTTTT
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCACTGGCAACGCATCTCTCGGCCTCCCTCTTCCTCGTCCCCATAGGCGTCCGCCGCCTCCTCTGTTCTTCCTCCGTCTACCTCAAAAACCCATCTCTTTACCG
ATCGAAAGCCTGGTACTTATCCGAACCCAAATGGAAAAATTTCGATTTATACTCCCTCATCGTCGCCCTCCCCATCGCTGCCTTCTCCGAAATTTTCCTCTTCCTCGCAT
TTTCCGGCAACCCCACCTACAGATTTGCGTTTTCCCAGCAATCGGCGGCCATTTTCTTCTTCTGGGCCCTCGCGATTCTGATCATTCTGCGGGAAAACGTCGACCCTGTT
CTCGCAAGTGAGAGTTTCATCTTTGTTTTCGCTGGAATCGCGTTTCTGGTTGAGTACTCTGTGATTGGGAAGGGGATTACAGGTCTTGGTGGCGCTTTTTACCACATTTC
CGGAGGATTGACCCTTATTTGTGCTTGTTCTTGCCTGTATTTATCCATGAAACCATCTGCATTTTTCGTCGAATTTATACTTTCTTCTTGCTTAACCTTTAAGGGGACAT
GGCTGTTTCAAGCTGGATTGTCTCTATATACCAAGGCATTTGCCTTGAAGGGGTGTCAGGAAATGCTGGTTTTGCCTGCCATAGGGGACGCTGATGTCCATTGTGAGCTT
GAGGAGGATGGCCTGAGGGGCATTGCTTTGATGAACTTAATGTTCATTGGTCATGCTGTTTTGGTTCTGATTTTGGGTTTT
Protein sequenceShow/hide protein sequence
MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKAWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWALAILIILRENVDPV
LASESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLICACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCEL
EEDGLRGIALMNLMFIGHAVLVLILGF