; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020077 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020077
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionIntegral membrane HPP family protein
Genome locationscaffold22:1553396..1554301
RNA-Seq ExpressionMS020077
SyntenyMS020077
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007065 - HPP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008437643.1 PREDICTED: uncharacterized protein LOC103482985 [Cucumis melo]7.2e-9077.56Show/hide
Query:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRG-----MRLFGD-RRRRS------GHRKIAASGIVGTA
        MSLQLKPIHHHL H G RH H      Q S   ++QA SA    N S VSLLP  HL N  RG     + LF D RRRRS      GHR I AS I GT 
Subjt:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRG-----MRLFGD-RRRRS------GHRKIAASGIVGTA

Query:  VSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSS
        VSDG+KPEKG  SP LSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGV AFTLLGPGWLARSS
Subjt:  VSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSS

Query:  ALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
        ALAASMAFMI TGSTHPPAASLPILFIDGAK+Q LNFWYALFPGAAGCILLCLI
Subjt:  ALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI

XP_011651222.2 uncharacterized protein LOC105434855 [Cucumis sativus]3.2e-9076.38Show/hide
Query:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRG-----MRLFGDRRRRS-------GHRKIAASGIVGTA
        MSLQLKPIHHHL H G RH H+ + Y Q S    +Q  S     N SFVSLLP+ HL N  RG     + LF D RRR        GHR I AS I GT 
Subjt:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRG-----MRLFGDRRRRS-------GHRKIAASGIVGTA

Query:  VSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSS
        VSDG+KPEKG  SP LSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIF+AQIGCAAIGV AFTLLGPGWLARSS
Subjt:  VSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSS

Query:  ALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
        ALAASMAFMI TGSTHPPAASLPILFIDGAK+Q LNFWYALFPGAAGCILLCLI
Subjt:  ALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI

XP_022137446.1 uncharacterized protein LOC111008888 [Momordica charantia]6.6e-12899.59Show/hide
Query:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFGDRRRRSGHRKIAASGIVGTAVSDGAKPEKGSA
        MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFGDRRRRSGHRKIAASGIVGTAVSDGAKPEKGSA
Subjt:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFGDRRRRSGHRKIAASGIVGTAVSDGAKPEKGSA

Query:  SPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSSALAASMAFMICT
        SPSLSDILWPSAGAFAAMAMLGKMDQILA KGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSSALAASMAFMICT
Subjt:  SPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSSALAASMAFMICT

Query:  GSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
        GSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
Subjt:  GSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI

XP_023001510.1 uncharacterized protein LOC111495629 [Cucurbita maxima]1.1e-8269.35Show/hide
Query:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFG----------DRRRRS---------GHRKIAA
        M+LQLKPIH    HRG      QQ Y Q S  V           N SF+SLLPN HL N NRG  + G          DRRRR          G+R I A
Subjt:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFG----------DRRRRS---------GHRKIAA

Query:  SGIVGTAVSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGP
        SGI G  +SDG+KP+KG  SP LSDILWPSAGAFAAMAMLGKMDQ+LAPKGLSMTIAPLGAVCA+LFA PS+PAARKYN+FMAQIGCAAIGV AFTLLGP
Subjt:  SGIVGTAVSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGP

Query:  GWLARSSALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
        GWLARSSALAASMAFMI TGSTHPPAASLP++FIDGAK+QHLNFWYALFPGAAGC+LLC I
Subjt:  GWLARSSALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI

XP_038894638.1 uncharacterized protein LOC120083135 [Benincasa hispida]3.9e-9679.28Show/hide
Query:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRG--MRLFGDRRRRS-------GHRKIAASGIVGTAVSD
        MSLQLKPIHHHL H GRRH H Q+ Y Q S  V++QA SAS   N SFVSLLPN HL N NRG  + LF +RR+R        GHR I ASGI GT +SD
Subjt:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRG--MRLFGDRRRRS-------GHRKIAASGIVGTAVSD

Query:  GAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSSALA
        G+K EKG  SP LSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAARKYN+F+AQIGCAAIGV AFTLLGPGWLARSSALA
Subjt:  GAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSSALA

Query:  ASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
        ASMAFMI TGSTHPPAASLPILFIDGAK+Q LNFWYALFPGAAGCILLCLI
Subjt:  ASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI

TrEMBL top hitse value%identityAlignment
A0A0A0LR37 Uncharacterized protein2.3e-8975.98Show/hide
Query:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRG-----MRLFGDRRRRS-------GHRKIAASGIVGTA
        MSLQLKPIHHHL H G R  H+ + Y Q S    +Q  S     N SFVSLLP+ HL N  RG     + LF D RRR        GHR I AS I GT 
Subjt:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRG-----MRLFGDRRRRS-------GHRKIAASGIVGTA

Query:  VSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSS
        VSDG+KPEKG  SP LSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIF+AQIGCAAIGV AFTLLGPGWLARSS
Subjt:  VSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSS

Query:  ALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
        ALAASMAFMI TGSTHPPAASLPILFIDGAK+Q LNFWYALFPGAAGCILLCLI
Subjt:  ALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI

A0A1S3AUM8 uncharacterized protein LOC1034829853.5e-9077.56Show/hide
Query:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRG-----MRLFGD-RRRRS------GHRKIAASGIVGTA
        MSLQLKPIHHHL H G RH H      Q S   ++QA SA    N S VSLLP  HL N  RG     + LF D RRRRS      GHR I AS I GT 
Subjt:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRG-----MRLFGD-RRRRS------GHRKIAASGIVGTA

Query:  VSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSS
        VSDG+KPEKG  SP LSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGV AFTLLGPGWLARSS
Subjt:  VSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSS

Query:  ALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
        ALAASMAFMI TGSTHPPAASLPILFIDGAK+Q LNFWYALFPGAAGCILLCLI
Subjt:  ALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI

A0A6J1C6M6 uncharacterized protein LOC1110088883.2e-12899.59Show/hide
Query:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFGDRRRRSGHRKIAASGIVGTAVSDGAKPEKGSA
        MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFGDRRRRSGHRKIAASGIVGTAVSDGAKPEKGSA
Subjt:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFGDRRRRSGHRKIAASGIVGTAVSDGAKPEKGSA

Query:  SPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSSALAASMAFMICT
        SPSLSDILWPSAGAFAAMAMLGKMDQILA KGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSSALAASMAFMICT
Subjt:  SPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSSALAASMAFMICT

Query:  GSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
        GSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
Subjt:  GSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI

A0A6J1E7R0 uncharacterized protein LOC111431576 isoform X13.5e-8269.62Show/hide
Query:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFG----------DRRRRS--------GHRKIAAS
        MSLQLKPIH    HRG      QQ Y Q S  V           N SF+SLLPN HL N  RG+ + G          DRRRR         G+R I AS
Subjt:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFG----------DRRRRS--------GHRKIAAS

Query:  GIVGTAVSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPG
        GI    +SDG+KP+KG  SP LSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCA+LFA PS+PAARKYN+FMAQIGCAAIGV AFTLLGPG
Subjt:  GIVGTAVSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPG

Query:  WLARSSALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
        WLARSSALAASMAFMI TGSTHPPAASLP++FIDGAK+QHLNFWYALFPGAAGC+LLC I
Subjt:  WLARSSALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI

A0A6J1KLD7 uncharacterized protein LOC1114956295.4e-8369.35Show/hide
Query:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFG----------DRRRRS---------GHRKIAA
        M+LQLKPIH    HRG      QQ Y Q S  V           N SF+SLLPN HL N NRG  + G          DRRRR          G+R I A
Subjt:  MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFG----------DRRRRS---------GHRKIAA

Query:  SGIVGTAVSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGP
        SGI G  +SDG+KP+KG  SP LSDILWPSAGAFAAMAMLGKMDQ+LAPKGLSMTIAPLGAVCA+LFA PS+PAARKYN+FMAQIGCAAIGV AFTLLGP
Subjt:  SGIVGTAVSDGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGP

Query:  GWLARSSALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
        GWLARSSALAASMAFMI TGSTHPPAASLP++FIDGAK+QHLNFWYALFPGAAGC+LLC I
Subjt:  GWLARSSALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G47980.1 Integral membrane HPP family protein1.4e-5965.95Show/hide
Query:  RGMRLFGDRRRRS---GHRKIAASGIVGTAVS-DGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAAR
        R +R   +RRR S   G     AS     AVS +  KPEK + +PSLSD++WP+AGAFAAMA++G++DQ+L PKG+SM++APLGAV A+LF TPSAPAAR
Subjt:  RGMRLFGDRRRRS---GHRKIAASGIVGTAVS-DGAKPEKGSASPSLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAAR

Query:  KYNIFMAQIGCAAIGVAAFTLLGPGWLARSSALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI
        KYN+F AQIGCAAIGV AF+  GP WLARS+ALAAS+AFM+ T + HPPAASLP+LFIDGAKL  LNFWYALFPGAA CILLC +
Subjt:  KYNIFMAQIGCAAIGVAAFTLLGPGWLARSSALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLI

AT5G62720.1 Integral membrane HPP family protein1.3e-6066.29Show/hide
Query:  RRSGHRKIAASGIVGTAVSDGAKPEKGSASPS--LSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCA
        RR     +A++G +     D  KP+K +A+ +  LSD++WP+AGAFAAMA+LG+MDQ+L+PKG+SM++APLGAV A+LF TPSAPAARKYNIF+AQIGCA
Subjt:  RRSGHRKIAASGIVGTAVSDGAKPEKGSASPS--LSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCA

Query:  AIGVAAFTLLGPGWLARSSALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLIVS
        AIGV AF++ GPGWLARS ALAAS+AFM+ T + HPPAASLP++FIDGAK  HLNFWYALFPGAA C++LCL+ S
Subjt:  AIGVAAFTLLGPGWLARSSALAASMAFMICTGSTHPPAASLPILFIDGAKLQHLNFWYALFPGAAGCILLCLIVS

AT5G62720.2 Integral membrane HPP family protein2.3e-4162.41Show/hide
Query:  RRSGHRKIAASGIVGTAVSDGAKPEKGSASPS--LSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCA
        RR     +A++G +     D  KP+K +A+ +  LSD++WP+AGAFAAMA+LG+MDQ+L+PKG+SM++APLGAV A+LF TPSAPAARKYNIF+AQIGCA
Subjt:  RRSGHRKIAASGIVGTAVSDGAKPEKGSASPS--LSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCA

Query:  AIGVAAFTLLGPGWLARSSALAASMAFMICTGSTHPPAASL
        AIGV AF++ GPGWLARS ALAAS+AFM+ T + HPP   L
Subjt:  AIGVAAFTLLGPGWLARSSALAASMAFMICTGSTHPPAASL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCTGCAACTGAAGCCAATTCACCACCATCTCCGCCACCGTGGCCGCCGCCATTCCCACCATCAGCAGCAATATGATCAAGCCAGTTCCAATGTACGATTACAAGC
TTCATCGGCATCTCCGCCGCCGAACCAATCGTTCGTTTCTCTGCTGCCGAATTTCCATTTATGGAACGTAAATCGAGGGATGAGATTATTTGGCGATCGGAGAAGACGAA
GCGGTCACCGGAAAATTGCGGCGTCCGGCATTGTTGGTACGGCGGTTTCAGATGGCGCGAAACCAGAAAAGGGCTCTGCCTCTCCTTCCCTCAGCGACATCCTCTGGCCT
TCTGCAGGGGCATTCGCAGCAATGGCAATGCTGGGGAAAATGGATCAGATTCTAGCGCCAAAGGGGCTGTCTATGACAATCGCGCCATTGGGCGCCGTCTGCGCCGTCCT
GTTCGCGACTCCGTCGGCGCCGGCCGCTCGAAAGTACAATATATTCATGGCCCAGATTGGGTGTGCGGCAATTGGGGTTGCGGCGTTTACTCTGTTGGGGCCTGGATGGC
TGGCTCGAAGCTCTGCTCTTGCCGCATCCATGGCGTTCATGATCTGTACTGGTTCCACCCACCCACCTGCTGCGAGCTTGCCGATTCTGTTCATCGATGGAGCGAAGTTG
CAGCATCTGAATTTCTGGTACGCTCTGTTTCCGGGAGCTGCTGGCTGTATTCTGCTTTGTTTGATTGTGAGTTCTCTCTCGCTCGGATTT
mRNA sequenceShow/hide mRNA sequence
ATGAGCCTGCAACTGAAGCCAATTCACCACCATCTCCGCCACCGTGGCCGCCGCCATTCCCACCATCAGCAGCAATATGATCAAGCCAGTTCCAATGTACGATTACAAGC
TTCATCGGCATCTCCGCCGCCGAACCAATCGTTCGTTTCTCTGCTGCCGAATTTCCATTTATGGAACGTAAATCGAGGGATGAGATTATTTGGCGATCGGAGAAGACGAA
GCGGTCACCGGAAAATTGCGGCGTCCGGCATTGTTGGTACGGCGGTTTCAGATGGCGCGAAACCAGAAAAGGGCTCTGCCTCTCCTTCCCTCAGCGACATCCTCTGGCCT
TCTGCAGGGGCATTCGCAGCAATGGCAATGCTGGGGAAAATGGATCAGATTCTAGCGCCAAAGGGGCTGTCTATGACAATCGCGCCATTGGGCGCCGTCTGCGCCGTCCT
GTTCGCGACTCCGTCGGCGCCGGCCGCTCGAAAGTACAATATATTCATGGCCCAGATTGGGTGTGCGGCAATTGGGGTTGCGGCGTTTACTCTGTTGGGGCCTGGATGGC
TGGCTCGAAGCTCTGCTCTTGCCGCATCCATGGCGTTCATGATCTGTACTGGTTCCACCCACCCACCTGCTGCGAGCTTGCCGATTCTGTTCATCGATGGAGCGAAGTTG
CAGCATCTGAATTTCTGGTACGCTCTGTTTCCGGGAGCTGCTGGCTGTATTCTGCTTTGTTTGATTGTGAGTTCTCTCTCGCTCGGATTT
Protein sequenceShow/hide protein sequence
MSLQLKPIHHHLRHRGRRHSHHQQQYDQASSNVRLQASSASPPPNQSFVSLLPNFHLWNVNRGMRLFGDRRRRSGHRKIAASGIVGTAVSDGAKPEKGSASPSLSDILWP
SAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVAAFTLLGPGWLARSSALAASMAFMICTGSTHPPAASLPILFIDGAKL
QHLNFWYALFPGAAGCILLCLIVSSLSLGF