CuGenDBv2

MS007744 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS007744
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Family of unknown function (DUF716)
Genome location: scaffold13:970602..971342
RNA-Seq Expression: MS007744
Synteny: MS007744
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR006904 - Protein of unknown function DUF716


Homology
GenBank top hits (e value, % identity, alignment)
XP_016902581.1 PREDICTED: uncharacterized protein LOC103499449 isoform X1 [Cucumis melo] | e value: 3.6e-118 | % identity: 89.84
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL
        MASLATHLSASLFL+PIG+RRLLCSSS+YL NPSLYRSK WYLSEPKWKNFDLYSLI+ LPI AFSEIFLFLAFSGNPTYRFAFSQQS AIFFFW LAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL

Query:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        I+LRENVDP+LVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLT+LCACSC YLSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYTKAFALK
Subjt:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GDA VHC+LEEDGLRGIALMNL+FIGHA LVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

XP_022138433.1 uncharacterized protein LOC111009606 [Momordica charantia] | e value: 2.9e-128 | % identity: 97.98
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL
        MASLATHLSASLFLVPI VRRLLCSSSVYLKNPSLYRSK WYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFW LAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL

Query:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        IILRENVDPVL SESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTL+CACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
Subjt:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
        GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF

XP_023547967.1 uncharacterized protein LOC111806759 [Cucurbita pepo subsp. pepo] | e value: 2.1e-113 | % identity: 87.8
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL
        MASLATHLSASL LVPIGVRRLL SSS+YL NPSLYRSKIWYLSEPKWKNFDLYSL++ LPIAAFSEIFLF+AFSGNPTYRFAF QQS AIFFFW LAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL

Query:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        IILREN+DP+LVSESFIFVFAG+AFLVEYSVIGKGITGLGG  YHISGGLTLLCACSCL+LSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYT AFALK
Subjt:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GDA V C+LEEDGLRGIALMNL+FIGH VLVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

XP_031736602.1 uncharacterized protein LOC101208255 isoform X1 [Cucumis sativus] | e value: 8.9e-117 | % identity: 89.02
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL
        MASLAT LSASLFL+PIG+RRLLCSSS+YL NPSLYRSK WYLSEPKWKNFDLYSLI+ LPIAAFSEIFLFLAFSGNPTY+FAFSQQS AIFFFW LAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL

Query:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        I+LRENVDP+LVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLT+LCACSC YLSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYTKAFALK
Subjt:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GD  VHC+LEEDGLRGIALMNL+FIGHA LVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

XP_038874543.1 uncharacterized protein LOC120067159 [Benincasa hispida] | e value: 2.3e-117 | % identity: 90.65
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL
        MASLATHLS+SLFLVPIGVRRLLCSSS+YLKNPSLYRSK WYLSEPKWKNFDLYSLI+ LPIAAFSEIFLFLAFSGNPTYRFAFSQQS AIF FW LAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL

Query:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        IILRENVDP+LVSES IFVFAGIAFL+EYSVIGKGITGLGGAFY ISGGLTLLCAC CLYLSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYT AFALK
Subjt:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GDA VHCELEEDGLRGIALMNL+FIGHAVLVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

TrEMBL top hits (e value, % identity, alignment)
A0A5D3B6R4 Uncharacterized protein | e value: 1.7e-118 | % identity: 89.84
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL
        MASLATHLSASLFL+PIG+RRLLCSSS+YL NPSLYRSK WYLSEPKWKNFDLYSLI+ LPI AFSEIFLFLAFSGNPTYRFAFSQQS AIFFFW LAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL

Query:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        I+LRENVDP+LVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLT+LCACSC YLSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYTKAFALK
Subjt:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GDA VHC+LEEDGLRGIALMNL+FIGHA LVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

A0A6J1CCY9 uncharacterized protein LOC111009606 | e value: 1.4e-128 | % identity: 97.98
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL
        MASLATHLSASLFLVPI VRRLLCSSSVYLKNPSLYRSK WYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFW LAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL

Query:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        IILRENVDPVL SESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTL+CACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
Subjt:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
        GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF

A0A6J1H5E8 uncharacterized protein LOC111460286 isoform X1 | e value: 1.9e-112 | % identity: 86.99
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL
        MASLATHLSA+LFLVPIGVRRLL SSSVYL NPSLYRSKIWYLSEPKWKNFDLYSL++  PIAAFSEIFLF+AFSGNPTYRFAF QQS AIFFFW LAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL

Query:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        IILREN+DP+LVSESFIFVFAG+AFLVEYSVIGKGITGLGG  YHISGGLTLLCACSCL+LSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYT AF+LK
Subjt:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GDA V C+LEEDGLRGIALM+L+FIGH VLVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

B0F814 uncharacterized protein LOC103499449 isoform X1 | e value: 1.7e-118 | % identity: 89.84
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL
        MASLATHLSASLFL+PIG+RRLLCSSS+YL NPSLYRSK WYLSEPKWKNFDLYSLI+ LPI AFSEIFLFLAFSGNPTYRFAFSQQS AIFFFW LAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL

Query:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        I+LRENVDP+LVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLT+LCACSC YLSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYTKAFALK
Subjt:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GDA VHC+LEEDGLRGIALMNL+FIGHA LVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

B0F830 Uncharacterized protein | e value: 4.3e-117 | % identity: 89.02
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL
        MASLAT LSASLFL+PIG+RRLLCSSS+YL NPSLYRSK WYLSEPKWKNFDLYSLI+ LPIAAFSEIFLFLAFSGNPTY+FAFSQQS AIFFFW LAIL
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAIL

Query:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK
        I+LRENVDP+LVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLT+LCACSC YLSMKPSAFF EF+LSS LTFKGTWLFQ GLSLYTKAFALK
Subjt:  IILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALK

Query:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG
        GC+ MLVLPA GD  VHC+LEEDGLRGIALMNL+FIGHA LVLILG
Subjt:  GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G13890.1 Family of unknown function (DUF716) | e value: 5.7e-21 | % identity: 26.02
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWY-------------LSEPKWKNFDLYSLIVALPIAAFSEIFLF-----LAFSGNPTYRF
        M  +   L AS  L+ +G+  L+C+    LK+P  Y +K +Y              +  + +N  L+ LI++L +A   E  +      L     P +RF
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWY-------------LSEPKWKNFDLYSLIVALPIAAFSEIFLF-----LAFSGNPTYRF

Query:  AFSQQSAAIFFFWTLAILIILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKG----ITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCL
        +    +A  F F  +AI  +L ++   + +    +F  A + F + YS          + L      +S  ++ LC+  CL L+ +   F  +  L++ +
Subjt:  AFSQQSAAIFFFWTLAILIILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKG----ITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCL

Query:  TFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
          +G W  Q GLSLY   F  +GC  +L + +  +    C++++  LR +++++LMF  H VLV+IL F
Subjt:  TFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF

AT5G13890.2 Family of unknown function (DUF716) | e value: 5.7e-21 | % identity: 26.02
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWY-------------LSEPKWKNFDLYSLIVALPIAAFSEIFLF-----LAFSGNPTYRF
        M  +   L AS  L+ +G+  L+C+    LK+P  Y +K +Y              +  + +N  L+ LI++L +A   E  +      L     P +RF
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWY-------------LSEPKWKNFDLYSLIVALPIAAFSEIFLF-----LAFSGNPTYRF

Query:  AFSQQSAAIFFFWTLAILIILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKG----ITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCL
        +    +A  F F  +AI  +L ++   + +    +F  A + F + YS          + L      +S  ++ LC+  CL L+ +   F  +  L++ +
Subjt:  AFSQQSAAIFFFWTLAILIILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKG----ITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCL

Query:  TFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
          +G W  Q GLSLY   F  +GC  +L + +  +    C++++  LR +++++LMF  H VLV+IL F
Subjt:  TFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF

AT5G13890.3 Family of unknown function (DUF716) | e value: 5.7e-21 | % identity: 26.02
Query:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWY-------------LSEPKWKNFDLYSLIVALPIAAFSEIFLF-----LAFSGNPTYRF
        M  +   L AS  L+ +G+  L+C+    LK+P  Y +K +Y              +  + +N  L+ LI++L +A   E  +      L     P +RF
Subjt:  MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWY-------------LSEPKWKNFDLYSLIVALPIAAFSEIFLF-----LAFSGNPTYRF

Query:  AFSQQSAAIFFFWTLAILIILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKG----ITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCL
        +    +A  F F  +AI  +L ++   + +    +F  A + F + YS          + L      +S  ++ LC+  CL L+ +   F  +  L++ +
Subjt:  AFSQQSAAIFFFWTLAILIILRENVDPVLVSESFIFVFAGIAFLVEYSVIGKG----ITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCL

Query:  TFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
          +G W  Q GLSLY   F  +GC  +L + +  +    C++++  LR +++++LMF  H VLV+IL F
Subjt:  TFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILGF
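The alignments above use a BLAST-style midline: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below shows one way a percent-identity figure could be derived from such a midline. It assumes identity is the number of identical columns divided by the alignment length; the function name `percent_identity` is hypothetical and is not part of CuGenDB. The example fragment is the last alignment block of the first GenBank hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumption: a letter in the midline means the query and subject
    residues are identical; '+' and blank count as non-identical.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment")
    # Trailing blanks of the midline are often stripped, so pad it back.
    padded = midline.ljust(length)[:length]
    identical = sum(1 for ch in padded if ch.isalpha())
    return round(100.0 * identical / length, 2)


# Last block of the XP_016902581.1 alignment (query line and midline).
query = "GCQEMLVLPAIGDADVHCELEEDGLRGIALMNLMFIGHAVLVLILG"
midline = "GC+ MLVLPA GDA VHC+LEEDGLRGIALMNL+FIGHA LVLILG"
print(percent_identity(query, midline))
```

To get a value comparable to the reported % identity for a whole hit, the query and midline blocks of that hit would need to be concatenated before calling the function.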


Sequences
CDS sequence
ATGGCGTCACTGGCAACGCATCTCTCGGCCTCCCTCTTCCTCGTCCCCATAGGCGTCCGCCGCCTCCTCTGTTCTTCCTCCGTCTACCTCAAAAACCCATCTCTTTACCG
ATCGAAAATCTGGTACTTATCCGAACCCAAATGGAAAAATTTCGATCTATACTCCCTCATCGTCGCCCTCCCCATCGCTGCCTTCTCCGAAATTTTCCTCTTCCTCGCAT
TTTCCGGCAACCCCACCTACAGATTTGCGTTTTCCCAGCAATCGGCGGCCATTTTCTTCTTCTGGACCCTCGCGATTCTGATCATTCTGCGGGAAAACGTCGACCCTGTT
CTCGTAAGTGAGAGTTTCATCTTTGTTTTCGCTGGAATCGCGTTTCTGGTTGAGTACTCTGTGATTGGGAAGGGGATTACAGGTCTTGGTGGCGCTTTTTACCACATTTC
CGGAGGATTGACCCTTCTTTGTGCTTGTTCTTGCCTGTATTTATCCATGAAACCATCTGCATTTTTCGTCGAATTTATACTTTCTTCTTGCTTAACCTTTAAGGGGACAT
GGCTGTTTCAAGCTGGATTGTCTCTATATACCAAGGCATTTGCCTTGAAGGGGTGTCAGGAAATGCTGGTTTTGCCTGCCATAGGGGACGCTGATGTCCATTGTGAGCTT
GAGGAGGATGGCCTGAGGGGCATTGCTTTGATGAACTTAATGTTCATTGGTCATGCTGTTTTGGTTCTGATTTTGGGTTTT
mRNA sequence
ATGGCGTCACTGGCAACGCATCTCTCGGCCTCCCTCTTCCTCGTCCCCATAGGCGTCCGCCGCCTCCTCTGTTCTTCCTCCGTCTACCTCAAAAACCCATCTCTTTACCG
ATCGAAAATCTGGTACTTATCCGAACCCAAATGGAAAAATTTCGATCTATACTCCCTCATCGTCGCCCTCCCCATCGCTGCCTTCTCCGAAATTTTCCTCTTCCTCGCAT
TTTCCGGCAACCCCACCTACAGATTTGCGTTTTCCCAGCAATCGGCGGCCATTTTCTTCTTCTGGACCCTCGCGATTCTGATCATTCTGCGGGAAAACGTCGACCCTGTT
CTCGTAAGTGAGAGTTTCATCTTTGTTTTCGCTGGAATCGCGTTTCTGGTTGAGTACTCTGTGATTGGGAAGGGGATTACAGGTCTTGGTGGCGCTTTTTACCACATTTC
CGGAGGATTGACCCTTCTTTGTGCTTGTTCTTGCCTGTATTTATCCATGAAACCATCTGCATTTTTCGTCGAATTTATACTTTCTTCTTGCTTAACCTTTAAGGGGACAT
GGCTGTTTCAAGCTGGATTGTCTCTATATACCAAGGCATTTGCCTTGAAGGGGTGTCAGGAAATGCTGGTTTTGCCTGCCATAGGGGACGCTGATGTCCATTGTGAGCTT
GAGGAGGATGGCCTGAGGGGCATTGCTTTGATGAACTTAATGTTCATTGGTCATGCTGTTTTGGTTCTGATTTTGGGTTTT
Protein sequence
MASLATHLSASLFLVPIGVRRLLCSSSVYLKNPSLYRSKIWYLSEPKWKNFDLYSLIVALPIAAFSEIFLFLAFSGNPTYRFAFSQQSAAIFFFWTLAILIILRENVDPV
LVSESFIFVFAGIAFLVEYSVIGKGITGLGGAFYHISGGLTLLCACSCLYLSMKPSAFFVEFILSSCLTFKGTWLFQAGLSLYTKAFALKGCQEMLVLPAIGDADVHCEL
EEDGLRGIALMNLMFIGHAVLVLILGF
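
As a quick consistency check between the CDS and protein sequences listed above, the CDS can be translated with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides of the MS007744 CDS listed above.
cds_start = "ATGGCGTCACTGGCAACGCATCTCTCGGCC"
protein_start = "MASLATHLSA"  # first 10 residues of the listed protein
print(translate(cds_start) == protein_start)
```

Joining the wrapped CDS lines into a single string and passing it to `translate` would allow the same comparison against the full protein sequence.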