CuGenDBv2

Lag0030827 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030827
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: transmembrane protein 33 homolog
Genome location: chr11:1787361..1790326
RNA-Seq Expression: Lag0030827
Synteny: Lag0030827
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR005344 - TMEM33/Pom33 family
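
The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr11:1787361..1790326")
# Assumption: coordinates are 1-based and inclusive, so the locus
# covers end - start + 1 bases.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 2966 bases, longer than the 858-base CDS listed further down, which would be consistent with the gene containing introns.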


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6607043.1 hypothetical protein SDJN03_00385, partial [Cucurbita argyrosperma subsp. sororia]; e-value: 1.3e-122; % identity: 80.92
Query:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT
        DPDLVVETMSS SSSFQP RQSAPSPPT DQTQ+QS GSTTRN GTSATT S+    RWDRH  LF ++AWV +++  A+IPMVPRN+SHRAY++SLMGT
Subjt:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT

Query:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS
        ACSSLF+LFI CG PRSFDMQALEVYF+ V ATKA VY IYS+TF+ASNL LK ALIPIICVAVEQI+K+VR  FP+S FYRKCLERPC WVESNT TL 
Subjt:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR
        LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFP+TAGYHQSAWAK+GRKVNPFV++FLPFLKP LSAAQRWW R
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR

XP_022949013.1 uncharacterized protein LOC111452483 [Cucurbita moschata]; e-value: 1.7e-124; % identity: 81.63
Query:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT
        DPDLVVETMSS SSSFQP RQSAPSPPTNDQTQ+QSSGSTTRN GTSATT S+    RWDRH  LF ++AWV +++  A++PMVPRN+SHRAY++SLMGT
Subjt:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT

Query:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS
        ACSSLF+LFI CG+PRSFDMQALEVYF+ V ATKA VY IYS+TF+ASNL LK ALIPIICVAVEQI+K+VR  FP+S FYRKCLERPC WVESNT TL 
Subjt:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR
        LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFP+TAGYHQSAWAK+GRKVNPFVS+FLPFLKP LSAAQRWW R
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR

XP_022998724.1 transmembrane protein 33 homolog [Cucurbita maxima]; e-value: 1.5e-128; % identity: 84.81
Query:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT
        DP+LVVETMSSSSSSFQP RQSAPSPPTNDQTQ+QSSGSTTRNSGTSATTDS+    RWDRH  LF ++AWV ++  FAMIPMVPRN+SHRAY++SLMGT
Subjt:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT

Query:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS
        ACSSLF+LFI CG PRSFDMQALEVYFQ V ATKA VY IYS+TF+ASNL LK ALIPIICVAVEQIAKFVR  FP+S FYRKCLERPC W ESNTTTL 
Subjt:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR
        LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAK+GRKVNPFVS+FLPFLKP LSAAQRWW R
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR

XP_023525188.1 uncharacterized protein LOC111788863 [Cucurbita pepo subsp. pepo]; e-value: 1.7e-124; % identity: 82.33
Query:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT
        DPDLVVETMSS SSSFQP RQSA SPPTNDQTQ+QSSGSTTRNSGTSATT S+    RWDRH  LF ++AWV +++  A+IPMVPRN+SHRAY++SLMGT
Subjt:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT

Query:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS
        ACSSLF+LFI CG PRSFDMQALEVYF+ V ATKA VY IYS+TF+ASNL LK ALIPIICVAVEQIAK+V+  FP+S FYRKCLERPC WVESNT TL 
Subjt:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR
        LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAK+GRKVNPFVS+FLPFLKP LSAAQRWW R
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR

XP_038874907.1 uncharacterized protein LOC120067410 [Benincasa hispida]; e-value: 1.1e-131; % identity: 87.99
Query:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT
        DPDLVVETMS SSS FQPA QSAPSPPTNDQTQ QSS    RNS   ATTDS SNS+RW+RHM LFSL+AWVLIVALFAMIPMVPRN+SHRAYRLS+MGT
Subjt:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT

Query:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS
        ACSSLFSLFIKCGMPR+FDMQ LEVYFQAV ATK FVYFIYSVTF+ASNLCLKFALIPIIC A EQIAKF+RRTFPQSFFYRK LERPC WVESNTTTLS
Subjt:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR
        LLSS+VEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAG+HQSAWAKVGRKVNPFV +FLPFLKPS SAAQ WWTR
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9J2X3 Uncharacterized protein; e-value: 5.2e-98; % identity: 65.51
Query:  DPDLVVETMSSSSSSFQPARQSAP----SPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLS
        DPDLVVE MS++SSS QP R SAP    S  +NDQ + ++SGSTTR SGTSATT +   S RWDR    FS++AWV +VA+ A+ P+VPR++SHRAYRLS
Subjt:  DPDLVVETMSSSSSSFQPARQSAP----SPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLS

Query:  LMGTACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNT
         MGTACSSLFSL+   G PR +++QAL+VYFQ++ ATK F+YFIY +TF+ SNLCLKFALIPI+C A+E +AKF+RR F +S  YRK LE PCVWV+SNT
Subjt:  LMGTACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNT

Query:  TTLSLLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR
        TTLS+LSS+ EI LGFLLI+SLF+WQRN +QTFMYWQLLKLMY  P+TAGYH S WAK+GR VNP V ++ PFL+  +SAAQRWW R
Subjt:  TTLSLLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR

A0A6J1CQJ0 uncharacterized protein LOC111013250 isoform X2; e-value: 7.9e-123; % identity: 81.27
Query:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT
        DPDLVV+T+S SS   Q ARQSAPSPPTNDQTQAQSSGSTT NSGTS TTDS+  S     HM LFS++AWVLIVALFAM+PMVP+N+SHRA+RLS MGT
Subjt:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT

Query:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS
          SSLFSL I CG P++ DMQALEVYFQ+V ATKAFVYFIY +TF+ASNLCLKFALIPIIC  +EQIAKF+RR+FP+S FYRKCLERPC WVESNTTTLS
Subjt:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR
        LLSSNVEIALGFLLIISLF+WQRNFV TFMYWQLL+LMYHFPMTAGYHQS WAKVGRKVNPFVS+FLPFLKPSLSAA+RWW R
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR

A0A6J1CQK8 uncharacterized protein LOC111013250 isoform X1; e-value: 7.9e-123; % identity: 81.27
Query:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT
        DPDLVV+T+S SS   Q ARQSAPSPPTNDQTQAQSSGSTT NSGTS TTDS+  S     HM LFS++AWVLIVALFAM+PMVP+N+SHRA+RLS MGT
Subjt:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT

Query:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS
          SSLFSL I CG P++ DMQALEVYFQ+V ATKAFVYFIY +TF+ASNLCLKFALIPIIC  +EQIAKF+RR+FP+S FYRKCLERPC WVESNTTTLS
Subjt:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR
        LLSSNVEIALGFLLIISLF+WQRNFV TFMYWQLL+LMYHFPMTAGYHQS WAKVGRKVNPFVS+FLPFLKPSLSAA+RWW R
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR

A0A6J1GAW8 uncharacterized protein LOC111452483; e-value: 8.5e-125; % identity: 81.63
Query:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT
        DPDLVVETMSS SSSFQP RQSAPSPPTNDQTQ+QSSGSTTRN GTSATT S+    RWDRH  LF ++AWV +++  A++PMVPRN+SHRAY++SLMGT
Subjt:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT

Query:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS
        ACSSLF+LFI CG+PRSFDMQALEVYF+ V ATKA VY IYS+TF+ASNL LK ALIPIICVAVEQI+K+VR  FP+S FYRKCLERPC WVESNT TL 
Subjt:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR
        LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFP+TAGYHQSAWAK+GRKVNPFVS+FLPFLKP LSAAQRWW R
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR

A0A6J1KF32 transmembrane protein 33 homolog; e-value: 7.4e-129; % identity: 84.81
Query:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT
        DP+LVVETMSSSSSSFQP RQSAPSPPTNDQTQ+QSSGSTTRNSGTSATTDS+    RWDRH  LF ++AWV ++  FAMIPMVPRN+SHRAY++SLMGT
Subjt:  DPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGT

Query:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS
        ACSSLF+LFI CG PRSFDMQALEVYFQ V ATKA VY IYS+TF+ASNL LK ALIPIICVAVEQIAKFVR  FP+S FYRKCLERPC W ESNTTTL 
Subjt:  ACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLS

Query:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR
        LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAK+GRKVNPFVS+FLPFLKP LSAAQRWW R
Subjt:  LLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT3G02420.1 unknown protein; e-value: 4.0e-95; % identity: 59.86
Query:  DPDLVVETMSSSSSSFQPARQSAPSPPT------NDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYR
        DPDLVVE MS+SSSS Q AR +A S  +      N+Q ++++SGS  R SG SATT +  +S RWD     FS++AWV ++A+ A++P++P+N+S+RAYR
Subjt:  DPDLVVETMSSSSSSFQPARQSAPSPPT------NDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYR

Query:  LSLMGTACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVES
        LS MGTACSSL+SL+   G PR+++MQ L+VYFQ++ A K F+YFIY +TF+ S+LCLKFALIPI+C A+EQ+AKF+RR F +S  YRK LE PCVWVES
Subjt:  LSLMGTACSSLFSLFIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVES

Query:  NTTTLSLLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR
        NTTTL++LSS  EIA+GFLLIISL +WQRN +QTFMYWQLLKLMY  P+TAGYHQS W+++GR V P + ++ PFL   +SA QRWW R
Subjt:  NTTTLSLLSSNVEIALGFLLIISLFTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR


Sequences
CDS sequence
ATGCAGGATCCTGATCTTGTAGTTGAGACAATGTCTTCCAGCAGTTCATCATTTCAGCCAGCAAGGCAATCAGCACCTTCACCTCCTACGAATGATCAAACTCAGGCACA
GAGCTCAGGTTCAACTACAAGAAATTCAGGAACATCGGCAACTACAGATTCAGATTCAAATTCTACACGCTGGGATCGACATATGTTTCTTTTTTCACTCCATGCCTGGG
TGCTTATTGTGGCATTGTTTGCAATGATACCGATGGTACCCAGAAACATGTCACATAGGGCTTATCGGCTTTCTCTTATGGGCACTGCATGCTCCTCTCTGTTCTCCTTG
TTCATTAAATGTGGGATGCCCAGGTCATTTGATATGCAGGCTTTGGAAGTTTATTTCCAGGCTGTCTTTGCAACAAAAGCTTTTGTCTACTTCATTTACTCTGTCACCTT
TATAGCTTCAAATCTCTGCCTTAAATTTGCTTTAATTCCAATTATATGCGTAGCTGTTGAGCAGATTGCCAAGTTTGTTAGGCGTACTTTTCCTCAATCTTTCTTCTACA
GGAAATGCTTGGAGCGGCCTTGTGTTTGGGTGGAATCAAATACAACCACTCTTTCTCTTCTGTCTTCAAATGTTGAGATTGCATTGGGTTTCCTTCTGATCATCTCCTTG
TTTACATGGCAGCGCAACTTCGTGCAAACATTCATGTACTGGCAGTTACTGAAGCTCATGTATCACTTCCCCATGACTGCTGGGTATCATCAAAGCGCCTGGGCTAAGGT
TGGGAGAAAAGTTAATCCATTTGTGAGCAAATTTCTGCCATTTCTGAAACCTTCACTTTCTGCAGCTCAAAGATGGTGGACCAGGTAG
mRNA sequence
ATGCAGGATCCTGATCTTGTAGTTGAGACAATGTCTTCCAGCAGTTCATCATTTCAGCCAGCAAGGCAATCAGCACCTTCACCTCCTACGAATGATCAAACTCAGGCACA
GAGCTCAGGTTCAACTACAAGAAATTCAGGAACATCGGCAACTACAGATTCAGATTCAAATTCTACACGCTGGGATCGACATATGTTTCTTTTTTCACTCCATGCCTGGG
TGCTTATTGTGGCATTGTTTGCAATGATACCGATGGTACCCAGAAACATGTCACATAGGGCTTATCGGCTTTCTCTTATGGGCACTGCATGCTCCTCTCTGTTCTCCTTG
TTCATTAAATGTGGGATGCCCAGGTCATTTGATATGCAGGCTTTGGAAGTTTATTTCCAGGCTGTCTTTGCAACAAAAGCTTTTGTCTACTTCATTTACTCTGTCACCTT
TATAGCTTCAAATCTCTGCCTTAAATTTGCTTTAATTCCAATTATATGCGTAGCTGTTGAGCAGATTGCCAAGTTTGTTAGGCGTACTTTTCCTCAATCTTTCTTCTACA
GGAAATGCTTGGAGCGGCCTTGTGTTTGGGTGGAATCAAATACAACCACTCTTTCTCTTCTGTCTTCAAATGTTGAGATTGCATTGGGTTTCCTTCTGATCATCTCCTTG
TTTACATGGCAGCGCAACTTCGTGCAAACATTCATGTACTGGCAGTTACTGAAGCTCATGTATCACTTCCCCATGACTGCTGGGTATCATCAAAGCGCCTGGGCTAAGGT
TGGGAGAAAAGTTAATCCATTTGTGAGCAAATTTCTGCCATTTCTGAAACCTTCACTTTCTGCAGCTCAAAGATGGTGGACCAGGTAG
Protein sequence
MQDPDLVVETMSSSSSSFQPARQSAPSPPTNDQTQAQSSGSTTRNSGTSATTDSDSNSTRWDRHMFLFSLHAWVLIVALFAMIPMVPRNMSHRAYRLSLMGTACSSLFSL
FIKCGMPRSFDMQALEVYFQAVFATKAFVYFIYSVTFIASNLCLKFALIPIICVAVEQIAKFVRRTFPQSFFYRKCLERPCVWVESNTTTLSLLSSNVEIALGFLLIISL
FTWQRNFVQTFMYWQLLKLMYHFPMTAGYHQSAWAKVGRKVNPFVSKFLPFLKPSLSAAQRWWTR
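
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch and not part of the database record: it assumes the standard nuclear genetic code, the helper name `translate` is hypothetical, and only the first and last codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 13 codons and last 5 codons of the CDS listed above.
cds_start = "ATGCAGGATCCTGATCTTGTAGTTGAGACAATGTCTTCC"
cds_end = "TGGTGGACCAGGTAG"
print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end, plus the stop codon
```

The two translated fragments agree with the start (MQDPDLVVETMSS) and end (WWTR, followed by a stop codon) of the protein sequence listed above.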