; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007811 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007811
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionAmmonium transporter 1 member 2
Genome locationChr10:13933075..13935580
RNA-Seq ExpressionHG10007811
SyntenyHG10007811
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ97071.1 ammonium transporter 1 member 2 [Cucumis melo var. makuwa]2.5e-11078.41Show/hide
Query:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPR-TLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD
        MASLHLLQPLTFLSSHSAP FSQ S PI SFRPSFP+  FPLK   TLSFALAESDSPKSL PDPQ+LLQELAD FDLSRDYFEKLPRDLRLDLNDAAFD
Subjt:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPR-TLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD

Query:  LSNGPVIDE------TITSQARTGLREIDAQVFCEEIN--VTFVWSL----YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVA
        LSNGPV+DE       I           D       ++   T V SL      GFGKRLISAGRRFQSMGQYGQGELQKIA+VMNTTGKLLSASS  KVA
Subjt:  LSNGPVIDE------TITSQARTGLREIDAQVFCEEIN--VTFVWSL----YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVA

Query:  GQPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKD
         +P+ ETRMFKFGELQVELTADKANIGAAIGFVFGVISWQL QGVQSI ESSLQYAN+NALLLAKSLRGALLAVSY+S VLSAFTTVGLILLARQLKSK+
Subjt:  GQPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKD

Query:  E
        E
Subjt:  E

XP_008443734.2 PREDICTED: uncharacterized protein LOC103487250 [Cucumis melo]1.9e-11078.74Show/hide
Query:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPR-TLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD
        MASLHLLQPLTFLSSHSAP FSQ S PI SFRPSFP+  FPLK   TLSFALAESDSPKSL PDPQ+LLQELAD FDLSRDYFEKLPRDLRLDLNDAAFD
Subjt:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPR-TLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD

Query:  LSNGPVIDE------TITSQARTGLREIDAQVFCEEIN--VTFVWSL----YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVA
        LSNGPVIDE       I           D       ++   T V SL      GFGKRLISAGRRFQSMGQYGQGELQKIA+VMNTTGKLLSASS  KVA
Subjt:  LSNGPVIDE------TITSQARTGLREIDAQVFCEEIN--VTFVWSL----YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVA

Query:  GQPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKD
         +P+ ETRMFKFGELQVELTADKANIGAAIGFVFGVISWQL QGVQSI ESSLQYAN+NALLLAKSLRGALLAVSY+S VLSAFTTVGLILLARQLKSK+
Subjt:  GQPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKD

Query:  E
        E
Subjt:  E

XP_022988171.1 uncharacterized protein LOC111485488 isoform X2 [Cucurbita maxima]4.2e-11078.33Show/hide
Query:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL
        MASLHLLQPLTFLSSHSAP FSQ S PI  F+PSF ++    KP TLSFALAESDS KSLEPDPQ+LLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL
Subjt:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL

Query:  SNGPVIDETITSQARTGLREIDA-QVFCEEINVTFVWSL-----------YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVAG
        SNGPVIDE         L    A +V     + T V  L             G GKRLISAGRRFQSMGQYGQGELQKIAK MNTTGKLLSASSA KVA 
Subjt:  SNGPVIDETITSQARTGLREIDA-QVFCEEINVTFVWSL-----------YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVAG

Query:  QPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKDE
        QPK ETRMFKFGELQVELT DKANIGAAIG VFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSS VLSAFT VGL+LLARQLKSK+E
Subjt:  QPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKDE

XP_023516677.1 uncharacterized protein LOC111780489 [Cucurbita pepo subsp. pepo]5.5e-11078.33Show/hide
Query:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL
        MASLHLLQPLTFLSSHSAP FSQ S PI  F+PSF ++    KP TL FALAESDS KSLEPDPQ+LLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL
Subjt:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL

Query:  SNGPVIDETITSQARTGLREIDA-QVFCEEINVTFVWSL-----------YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVAG
        SNGPVIDE         L    A +V     + T V  L             G GKRLISAGRRFQSMGQYGQGELQKIAK MNTTGKLLSASSA KVA 
Subjt:  SNGPVIDETITSQARTGLREIDA-QVFCEEINVTFVWSL-----------YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVAG

Query:  QPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKDE
        QPK ETRMFKFGELQVELTADKANIGAAIG VFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSS VLSAFT VGL+LLARQLKSK+E
Subjt:  QPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKDE

XP_038879157.1 uncharacterized protein LOC120071143 isoform X1 [Benincasa hispida]5.1e-11681.06Show/hide
Query:  MASLHLLQPLTFLSSHSAPFFSQSSRPIS-SFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD
        MASLHLLQP+TFLSSHS P FS  SRP   SFRP F ++ FPLKP TLSFALAESDSPKSLEPDPQ+LLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD
Subjt:  MASLHLLQPLTFLSSHSAPFFSQSSRPIS-SFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD

Query:  LSNGPVIDETITSQARTGLREIDAQVFCEEIN--------VTFVWSL----YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVA
        LSNGPVIDE         L    A    +            T V SL      GFGKRLISAGRRFQSMGQYGQGELQKIAK+MNTTGKLLSASSASKVA
Subjt:  LSNGPVIDETITSQARTGLREIDAQVFCEEIN--------VTFVWSL----YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVA

Query:  GQPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKD
         QPK ETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSK+
Subjt:  GQPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKD

Query:  E
        E
Subjt:  E

TrEMBL top hitse value%identityAlignment
A0A1S3B8P9 uncharacterized protein LOC1034872509.2e-11178.74Show/hide
Query:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPR-TLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD
        MASLHLLQPLTFLSSHSAP FSQ S PI SFRPSFP+  FPLK   TLSFALAESDSPKSL PDPQ+LLQELAD FDLSRDYFEKLPRDLRLDLNDAAFD
Subjt:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPR-TLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD

Query:  LSNGPVIDE------TITSQARTGLREIDAQVFCEEIN--VTFVWSL----YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVA
        LSNGPVIDE       I           D       ++   T V SL      GFGKRLISAGRRFQSMGQYGQGELQKIA+VMNTTGKLLSASS  KVA
Subjt:  LSNGPVIDE------TITSQARTGLREIDAQVFCEEIN--VTFVWSL----YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVA

Query:  GQPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKD
         +P+ ETRMFKFGELQVELTADKANIGAAIGFVFGVISWQL QGVQSI ESSLQYAN+NALLLAKSLRGALLAVSY+S VLSAFTTVGLILLARQLKSK+
Subjt:  GQPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKD

Query:  E
        E
Subjt:  E

A0A5D3BEI6 Ammonium transporter 1 member 21.2e-11078.41Show/hide
Query:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPR-TLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD
        MASLHLLQPLTFLSSHSAP FSQ S PI SFRPSFP+  FPLK   TLSFALAESDSPKSL PDPQ+LLQELAD FDLSRDYFEKLPRDLRLDLNDAAFD
Subjt:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPR-TLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD

Query:  LSNGPVIDE------TITSQARTGLREIDAQVFCEEIN--VTFVWSL----YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVA
        LSNGPV+DE       I           D       ++   T V SL      GFGKRLISAGRRFQSMGQYGQGELQKIA+VMNTTGKLLSASS  KVA
Subjt:  LSNGPVIDE------TITSQARTGLREIDAQVFCEEIN--VTFVWSL----YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVA

Query:  GQPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKD
         +P+ ETRMFKFGELQVELTADKANIGAAIGFVFGVISWQL QGVQSI ESSLQYAN+NALLLAKSLRGALLAVSY+S VLSAFTTVGLILLARQLKSK+
Subjt:  GQPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKD

Query:  E
        E
Subjt:  E

A0A6J1H959 uncharacterized protein LOC111461699 isoform X23.0e-10978Show/hide
Query:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL
        MASLHLLQPLTFLSSHSAP  SQ S PI  F+PSF  +    KP TLSFALAESDS KSLEPDPQ+LLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL
Subjt:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL

Query:  SNGPVIDETITSQARTGLREIDA-QVFCEEINVTFVWSL-----------YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVAG
        SNGPVIDE         L    A +V     + T V  L             G GKRLISAGRRFQSMGQYGQGELQKIAK MNTTGKLLSASSA KVA 
Subjt:  SNGPVIDETITSQARTGLREIDA-QVFCEEINVTFVWSL-----------YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVAG

Query:  QPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKDE
        QPK ETRMFKFGELQVELTADKANIGAAIG VFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSS VLSAFT VGL+LLARQLKSK++
Subjt:  QPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKDE

A0A6J1H9C7 uncharacterized protein LOC111461699 isoform X11.3e-10978.33Show/hide
Query:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL
        MASLHLLQPLTFLSSHSAP  SQ S PI  F+PSF  +    KP TLSFALAESDS KSLEPDPQ+LLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL
Subjt:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL

Query:  SNGPVIDETITSQARTGLREIDA-QVFCEEINVTFVWSL-----------YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVAG
        SNGPVIDE         L    A +V     + T V  L             G GKRLISAGRRFQSMGQYGQGELQKIAK MNTTGKLLSASSA KVA 
Subjt:  SNGPVIDETITSQARTGLREIDA-QVFCEEINVTFVWSL-----------YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVAG

Query:  QPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKDE
        QPK ETRMFKFGELQVELTADKANIGAAIG VFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSS VLSAFT VGL+LLARQLKSK+E
Subjt:  QPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKDE

A0A6J1JGH5 uncharacterized protein LOC111485488 isoform X22.0e-11078.33Show/hide
Query:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL
        MASLHLLQPLTFLSSHSAP FSQ S PI  F+PSF ++    KP TLSFALAESDS KSLEPDPQ+LLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL
Subjt:  MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDL

Query:  SNGPVIDETITSQARTGLREIDA-QVFCEEINVTFVWSL-----------YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVAG
        SNGPVIDE         L    A +V     + T V  L             G GKRLISAGRRFQSMGQYGQGELQKIAK MNTTGKLLSASSA KVA 
Subjt:  SNGPVIDETITSQARTGLREIDA-QVFCEEINVTFVWSL-----------YLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVAG

Query:  QPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKDE
        QPK ETRMFKFGELQVELT DKANIGAAIG VFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSS VLSAFT VGL+LLARQLKSK+E
Subjt:  QPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G37360.1 unknown protein5.2e-7455.92Show/hide
Query:  ASLHLLQPLTFLSSHSAPFFSQSSRPI-SSFRPSFPRRSFPLKPRTLSFALAESDSPKSL---EPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAA
        +S+ LLQPL  LSS S  FFSQ S    SS +P+  +R    K  TL FAL ESDS K L   EP  + LL +L+  FDL  DYF++LP DLRLDLNDAA
Subjt:  ASLHLLQPLTFLSSHSAPFFSQSSRPI-SSFRPSFPRRSFPLKPRTLSFALAESDSPKSL---EPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAA

Query:  FDLSNGPVIDETITSQARTGL------REIDAQVF------CEEINVTFVWSLYLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSAS-
        FDLSNGPVIDE       T L       + D            E+ +         FGKRLISAG+RFQ MGQY +GELQKIAK M TTG +LSA ++S 
Subjt:  FDLSNGPVIDETITSQARTGL------REIDAQVF------CEEINVTFVWSLYLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSAS-

Query:  KVAGQPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLK
         V+ + K  TRMFKFGELQV +T +KA  GAAI F++G++SWQ+ QG+QSIPE+SLQYANDNALL+ KSLRG+LLA+ Y+S VLS FTT GLILLA+QL 
Subjt:  KVAGQPKKETRMFKFGELQVELTADKANIGAAIGFVFGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLK

Query:  SKDE
        S+ E
Subjt:  SKDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTCTTCACCTTCTTCAACCTCTCACTTTTCTCTCTTCCCATTCTGCCCCTTTCTTTTCCCAATCTTCCCGTCCGATCAGTAGTTTCAGACCCTCCTTTCCCAG
AAGGTCCTTTCCACTGAAACCCCGCACTCTTTCCTTTGCTCTCGCCGAATCGGACTCTCCCAAGTCTTTAGAACCCGACCCCCAAATTCTCCTTCAAGAACTGGCCGACA
GTTTTGATCTCTCCCGAGATTACTTTGAAAAACTTCCCCGTGATCTTCGTCTTGATCTCAATGACGCTGCTTTTGATCTTTCTAATGGACCCGTTATTGACGAGACAATT
ACAAGTCAGGCAAGAACAGGACTCAGGGAAATTGATGCTCAAGTATTTTGTGAAGAAATAAATGTGACCTTTGTTTGGTCTCTCTATTTAGGATTTGGCAAGCGTTTGAT
ATCTGCTGGAAGAAGGTTCCAGTCGATGGGACAGTATGGTCAGGGTGAACTACAGAAGATTGCCAAAGTAATGAATACAACTGGAAAGCTTCTGTCTGCATCCTCTGCTT
CTAAAGTAGCTGGACAGCCTAAGAAAGAAACCAGAATGTTTAAGTTTGGAGAGCTGCAAGTTGAACTGACTGCAGATAAGGCGAACATCGGTGCAGCAATTGGTTTTGTT
TTTGGAGTAATTTCATGGCAACTGGGTCAGGGTGTCCAAAGTATTCCTGAGAGTTCTCTGCAATATGCAAATGACAATGCTTTACTTCTAGCCAAGTCTTTGAGAGGCGC
TCTACTTGCCGTTTCGTACTCGTCGGTGGTTTTGTCTGCTTTCACTACTGTGGGATTAATCTTACTTGCTAGACAACTTAAATCAAAGGACGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTCTTCACCTTCTTCAACCTCTCACTTTTCTCTCTTCCCATTCTGCCCCTTTCTTTTCCCAATCTTCCCGTCCGATCAGTAGTTTCAGACCCTCCTTTCCCAG
AAGGTCCTTTCCACTGAAACCCCGCACTCTTTCCTTTGCTCTCGCCGAATCGGACTCTCCCAAGTCTTTAGAACCCGACCCCCAAATTCTCCTTCAAGAACTGGCCGACA
GTTTTGATCTCTCCCGAGATTACTTTGAAAAACTTCCCCGTGATCTTCGTCTTGATCTCAATGACGCTGCTTTTGATCTTTCTAATGGACCCGTTATTGACGAGACAATT
ACAAGTCAGGCAAGAACAGGACTCAGGGAAATTGATGCTCAAGTATTTTGTGAAGAAATAAATGTGACCTTTGTTTGGTCTCTCTATTTAGGATTTGGCAAGCGTTTGAT
ATCTGCTGGAAGAAGGTTCCAGTCGATGGGACAGTATGGTCAGGGTGAACTACAGAAGATTGCCAAAGTAATGAATACAACTGGAAAGCTTCTGTCTGCATCCTCTGCTT
CTAAAGTAGCTGGACAGCCTAAGAAAGAAACCAGAATGTTTAAGTTTGGAGAGCTGCAAGTTGAACTGACTGCAGATAAGGCGAACATCGGTGCAGCAATTGGTTTTGTT
TTTGGAGTAATTTCATGGCAACTGGGTCAGGGTGTCCAAAGTATTCCTGAGAGTTCTCTGCAATATGCAAATGACAATGCTTTACTTCTAGCCAAGTCTTTGAGAGGCGC
TCTACTTGCCGTTTCGTACTCGTCGGTGGTTTTGTCTGCTTTCACTACTGTGGGATTAATCTTACTTGCTAGACAACTTAAATCAAAGGACGAGTAA
Protein sequenceShow/hide protein sequence
MASLHLLQPLTFLSSHSAPFFSQSSRPISSFRPSFPRRSFPLKPRTLSFALAESDSPKSLEPDPQILLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDLSNGPVIDETI
TSQARTGLREIDAQVFCEEINVTFVWSLYLGFGKRLISAGRRFQSMGQYGQGELQKIAKVMNTTGKLLSASSASKVAGQPKKETRMFKFGELQVELTADKANIGAAIGFV
FGVISWQLGQGVQSIPESSLQYANDNALLLAKSLRGALLAVSYSSVVLSAFTTVGLILLARQLKSKDE