; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013152 (gene) of Snake gourd v1 genome

Gene IDTan0013152
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG10:18905616..18912162
RNA-Seq ExpressionTan0013152
SyntenyTan0013152
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137519.1 uncharacterized protein LOC101204111 isoform X2 [Cucumis sativus]4.1e-9670.07Show/hide
Query:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI
        M TAL ALQF FLSP ASKK  YPFFS S  RNGVQF+GC N SN R AQG FDP+L  VLELATNSELYELEQI F P YF+PLM+SI NRG+ DY+MI
Subjt:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI

Query:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA
        EEDLEERD FIS+LESRFL+L  DARSTLRGWRPSYRDVLLTVRKK NVLCSTKLSSEDLEAEIFLH       EESVRQ+N++G+LQLGL+QWKVQT A
Subjt:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA

Query:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLA
        ATDG+ DLQSLIL+          G LIT+VKM Q+FA TL GK+F+EA NYQIKKEIIKK   +   N  ++++L +AQ  LA
Subjt:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLA

XP_008467154.1 PREDICTED: uncharacterized protein LOC103504561 isoform X2 [Cucumis melo]2.7e-9570.07Show/hide
Query:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI
        MTT L ALQF FLSP  SKK  YPFFS S  R GVQF+GC N SN R AQG FDPEL  VLELATNSELYELE I F P YF+PLM+SI NRG+TDY+MI
Subjt:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI

Query:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA
        EEDLEERD FISTLESRFL+L  DARSTLRGWRPSYRDVLLTVRKK NV CSTKLSSEDLEAEIFLH       EESVR++N++GSLQLGL+ WKVQTLA
Subjt:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA

Query:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLA
        ATDG+ DLQSLILK          G LIT+VKM Q+FA TL GK+F+EA NYQIKKEIIKK   +   N  ++++L +AQ  LA
Subjt:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLA

XP_022137611.1 uncharacterized protein LOC111009011 isoform X2 [Momordica charantia]7.0e-9668.92Show/hide
Query:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI
        M TAL ALQF F+SP  SKKF YPFFS S  RNGVQF+ CAN S SR  +GAFDPEL  VLELATNSELYELEQI F P YF+PLM+SI NRG+TDY+MI
Subjt:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI

Query:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA
        EEDLEERD+FIS LESRFL+L  DARSTLRGWRPSYRDVLLTVRKK NV CSTKLSSEDLEAEIFLH       EESVRQSN++GSLQLGL++WKVQTLA
Subjt:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA

Query:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLANKCNLATMKLGF
        ATDG+ DL+SLILK          G LIT V+M QMFA TL GK+F+EA NYQIKKEIIKK   +   N  ++++L +AQ  LA     A+  LGF
Subjt:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLANKCNLATMKLGF

XP_031736268.1 uncharacterized protein LOC101204111 isoform X1 [Cucumis sativus]4.1e-9670.07Show/hide
Query:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI
        M TAL ALQF FLSP ASKK  YPFFS S  RNGVQF+GC N SN R AQG FDP+L  VLELATNSELYELEQI F P YF+PLM+SI NRG+ DY+MI
Subjt:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI

Query:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA
        EEDLEERD FIS+LESRFL+L  DARSTLRGWRPSYRDVLLTVRKK NVLCSTKLSSEDLEAEIFLH       EESVRQ+N++G+LQLGL+QWKVQT A
Subjt:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA

Query:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLA
        ATDG+ DLQSLIL+          G LIT+VKM Q+FA TL GK+F+EA NYQIKKEIIKK   +   N  ++++L +AQ  LA
Subjt:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLA

XP_038893597.1 uncharacterized protein LOC120082484 [Benincasa hispida]9.5e-10172.39Show/hide
Query:  MTTALVALQFPFLSPTASKKFPYP-FFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSM
        MTTAL ALQF FLSP ASKKF YP FFS SR RNGVQF+ CAN  N R AQGAFDPEL  VLELATNSELYELEQI F P YF+PLM+SI NRG+TDY+M
Subjt:  MTTALVALQFPFLSPTASKKFPYP-FFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSM

Query:  IEEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTL
        IEEDLEERD+FISTLESRFLYL  DARSTLRGWRPSYRDVLLTVRKK NV CSTKLSSEDLEAEIFLH       EESV QSN++GSLQLGLNQWKVQTL
Subjt:  IEEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTL

Query:  AATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLANKCNLATMKLGF
        AATDG+ DLQSLILK          G LITMVKM Q+FA TL GK+FQEA NYQIKKEIIKK   +   N  ++++L +AQ  LA     A+  LGF
Subjt:  AATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLANKCNLATMKLGF

TrEMBL top hitse value%identityAlignment
A0A0A0LT63 Uncharacterized protein2.0e-9670.07Show/hide
Query:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI
        M TAL ALQF FLSP ASKK  YPFFS S  RNGVQF+GC N SN R AQG FDP+L  VLELATNSELYELEQI F P YF+PLM+SI NRG+ DY+MI
Subjt:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI

Query:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA
        EEDLEERD FIS+LESRFL+L  DARSTLRGWRPSYRDVLLTVRKK NVLCSTKLSSEDLEAEIFLH       EESVRQ+N++G+LQLGL+QWKVQT A
Subjt:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA

Query:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLA
        ATDG+ DLQSLIL+          G LIT+VKM Q+FA TL GK+F+EA NYQIKKEIIKK   +   N  ++++L +AQ  LA
Subjt:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLA

A0A1S3CSV2 uncharacterized protein LOC103504561 isoform X21.3e-9570.07Show/hide
Query:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI
        MTT L ALQF FLSP  SKK  YPFFS S  R GVQF+GC N SN R AQG FDPEL  VLELATNSELYELE I F P YF+PLM+SI NRG+TDY+MI
Subjt:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI

Query:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA
        EEDLEERD FISTLESRFL+L  DARSTLRGWRPSYRDVLLTVRKK NV CSTKLSSEDLEAEIFLH       EESVR++N++GSLQLGL+ WKVQTLA
Subjt:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA

Query:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLA
        ATDG+ DLQSLILK          G LIT+VKM Q+FA TL GK+F+EA NYQIKKEIIKK   +   N  ++++L +AQ  LA
Subjt:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLA

A0A5D3BIW0 Uncharacterized protein3.2e-9469.82Show/hide
Query:  MTTALVALQFPFLSPTASKKFPYP-FFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSM
        MTT L ALQF FLSP  SKK  YP FFS S  R GVQF+GC N SN R AQG FDPEL  VLELATNSELYELE I F P YF+PLM+SI NRG+TDY+M
Subjt:  MTTALVALQFPFLSPTASKKFPYP-FFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSM

Query:  IEEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTL
        IEEDLEERD FISTLESRFL+L  DARSTLRGWRPSYRDVLLTVRKK NV CSTKLSSEDLEAEIFLH       EESVR++N++GSLQLGL+ WKVQTL
Subjt:  IEEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTL

Query:  AATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLA
        AATDG+ DLQSLILK          G LIT+VKM Q+FA TL GK+F+EA NYQIKKEIIKK   +   N  ++++L +AQ  LA
Subjt:  AATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLA

A0A6J1C746 uncharacterized protein LOC111009011 isoform X18.4e-9568.69Show/hide
Query:  MTTALVALQFPFLSPTASKKFPYP-FFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSM
        M TAL ALQF F+SP  SKKF YP FFS S  RNGVQF+ CAN S SR  +GAFDPEL  VLELATNSELYELEQI F P YF+PLM+SI NRG+TDY+M
Subjt:  MTTALVALQFPFLSPTASKKFPYP-FFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSM

Query:  IEEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTL
        IEEDLEERD+FIS LESRFL+L  DARSTLRGWRPSYRDVLLTVRKK NV CSTKLSSEDLEAEIFLH       EESVRQSN++GSLQLGL++WKVQTL
Subjt:  IEEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTL

Query:  AATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLANKCNLATMKLGF
        AATDG+ DL+SLILK          G LIT V+M QMFA TL GK+F+EA NYQIKKEIIKK   +   N  ++++L +AQ  LA     A+  LGF
Subjt:  AATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLANKCNLATMKLGF

A0A6J1C749 uncharacterized protein LOC111009011 isoform X23.4e-9668.92Show/hide
Query:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI
        M TAL ALQF F+SP  SKKF YPFFS S  RNGVQF+ CAN S SR  +GAFDPEL  VLELATNSELYELEQI F P YF+PLM+SI NRG+TDY+MI
Subjt:  MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMI

Query:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA
        EEDLEERD+FIS LESRFL+L  DARSTLRGWRPSYRDVLLTVRKK NV CSTKLSSEDLEAEIFLH       EESVRQSN++GSLQLGL++WKVQTLA
Subjt:  EEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHF------EESVRQSNIDGSLQLGLNQWKVQTLA

Query:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLANKCNLATMKLGF
        ATDG+ DL+SLILK          G LIT V+M QMFA TL GK+F+EA NYQIKKEIIKK   +   N  ++++L +AQ  LA     A+  LGF
Subjt:  ATDGSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKK---VLVWNFVNQLSLTLAQILLANKCNLATMKLGF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G73470.1 unknown protein5.4e-5444.06Show/hide
Query:  RNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMIEEDLEERDEFISTLESRFLYLVVDARSTLRG
        R  + F   +  ++   ++  +DPEL  V ELAT+SELYELE+I F P YF+PL++SI N+G  D  MI +D+E RD FI  LESRFL+L  DARSTLRG
Subjt:  RNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMIEEDLEERDEFISTLESRFLYLVVDARSTLRG

Query:  WRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHFE---------------ESVRQSNIDGSLQLGLNQWKVQTLAATD-GSFDLQSLILKILVLLEL
        WRPSYR+VLL VR   N+ CS++L +EDLEAEIFL+                 E+   S  +GSL+LGL++WKV+ LAA   G+ ++QS+ILK       
Subjt:  WRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHFE---------------ESVRQSNIDGSLQLGLNQWKVQTLAATD-GSFDLQSLILKILVLLEL

Query:  FTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKKVLVWNFVNQLSLTLAQILLANKCNLATMKLGFLGFVKPCLATKARL
           G +IT  K+ Q+ A  L GK+F EA NYQI+KE++KK              A I L ++  L   K GF G     +  K  +
Subjt:  FTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKKVLVWNFVNQLSLTLAQILLANKCNLATMKLGFLGFVKPCLATKARL

AT1G73470.2 unknown protein3.0e-3643.84Show/hide
Query:  MIEEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHFE---------------ESVRQSNIDGSLQL
        MI +D+E RD FI  LESRFL+L  DARSTLRGWRPSYR+VLL VR   N+ CS++L +EDLEAEIFL+                 E+   S  +GSL+L
Subjt:  MIEEDLEERDEFISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHFE---------------ESVRQSNIDGSLQL

Query:  GLNQWKVQTLAATD-GSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKKVLVWNFVNQLSLTLAQILLANKCNLAT
        GL++WKV+ LAA   G+ ++QS+ILK          G +IT  K+ Q+ A  L GK+F EA NYQI+KE++KK              A I L ++  L  
Subjt:  GLNQWKVQTLAATD-GSFDLQSLILKILVLLELFTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKKVLVWNFVNQLSLTLAQILLANKCNLAT

Query:  MKLGFLGFVKPCLATKARL
         K GF G     +  K  +
Subjt:  MKLGFLGFVKPCLATKARL

AT1G73470.3 unknown protein5.4e-5444.06Show/hide
Query:  RNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMIEEDLEERDEFISTLESRFLYLVVDARSTLRG
        R  + F   +  ++   ++  +DPEL  V ELAT+SELYELE+I F P YF+PL++SI N+G  D  MI +D+E RD FI  LESRFL+L  DARSTLRG
Subjt:  RNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMIEEDLEERDEFISTLESRFLYLVVDARSTLRG

Query:  WRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHFE---------------ESVRQSNIDGSLQLGLNQWKVQTLAATD-GSFDLQSLILKILVLLEL
        WRPSYR+VLL VR   N+ CS++L +EDLEAEIFL+                 E+   S  +GSL+LGL++WKV+ LAA   G+ ++QS+ILK       
Subjt:  WRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHFE---------------ESVRQSNIDGSLQLGLNQWKVQTLAATD-GSFDLQSLILKILVLLEL

Query:  FTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKKVLVWNFVNQLSLTLAQILLANKCNLATMKLGFLGFVKPCLATKARL
           G +IT  K+ Q+ A  L GK+F EA NYQI+KE++KK              A I L ++  L   K GF G     +  K  +
Subjt:  FTHGILITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKKVLVWNFVNQLSLTLAQILLANKCNLATMKLGFLGFVKPCLATKARL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGACTGCTTTGGTTGCGCTTCAATTCCCCTTCCTTTCTCCAACCGCTTCCAAAAAATTTCCATATCCCTTTTTCAGCTTTTCTAGGGGTAGAAATGGTGTACAATT
TCTGGGTTGTGCGAATGTTTCCAATTCAAGAATTGCCCAAGGTGCTTTTGACCCAGAATTGCATTTTGTGCTTGAACTCGCCACGAATTCTGAATTATACGAGCTTGAAC
AAATCTTCTTCGACCCCATTTACTTCAACCCTTTGATGGAATCGATTGCGAACAGAGGCGAAACTGATTATTCTATGATTGAGGAAGACCTTGAAGAGAGAGATGAGTTT
ATTTCAACACTAGAATCTCGATTCTTATATCTTGTTGTTGATGCTCGGTCGACATTAAGGGGTTGGAGACCATCTTATAGAGATGTCTTGCTTACAGTGAGAAAAAAGTT
CAACGTTCTTTGCTCAACCAAGCTGTCATCCGAAGACCTTGAAGCAGAAATATTTCTTCATTTTGAAGAATCAGTAAGGCAGTCTAACATTGATGGTAGTCTACAACTTG
GACTCAATCAGTGGAAAGTGCAAACTTTAGCAGCTACAGATGGCTCATTTGACCTGCAATCCTTGATATTAAAGATATTGGTTTTATTAGAACTTTTCACACATGGCATT
TTGATAACTATGGTTAAAATGTTGCAAATGTTTGCTATGACATTATTTGGGAAGATATTCCAAGAAGCAACCAACTATCAAATTAAAAAGGAAATCATCAAAAAGGTACT
CGTGTGGAACTTTGTCAACCAACTATCCCTAACTCTTGCCCAAATTCTTTTGGCTAACAAATGCAACCTAGCCACCATGAAATTGGGATTTTTAGGATTCGTCAAGCCTT
GCTTAGCCACTAAAGCTCGGTTAAATCCCATCAAAACTCTAAAATTCAAACTGGCCAATTCTTTCCACCAACTATCATGTTTTGTTCTTATTTTTTACAATTCGTTCTTG
ACTTCTTCATTTGTTGATTTAAGTGGACTCGAGAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGACGACTGCTTTGGTTGCGCTTCAATTCCCCTTCCTTTCTCCAACCGCTTCCAAAAAATTTCCATATCCCTTTTTCAGCTTTTCTAGGGGTAGAAATGGTGTACAATT
TCTGGGTTGTGCGAATGTTTCCAATTCAAGAATTGCCCAAGGTGCTTTTGACCCAGAATTGCATTTTGTGCTTGAACTCGCCACGAATTCTGAATTATACGAGCTTGAAC
AAATCTTCTTCGACCCCATTTACTTCAACCCTTTGATGGAATCGATTGCGAACAGAGGCGAAACTGATTATTCTATGATTGAGGAAGACCTTGAAGAGAGAGATGAGTTT
ATTTCAACACTAGAATCTCGATTCTTATATCTTGTTGTTGATGCTCGGTCGACATTAAGGGGTTGGAGACCATCTTATAGAGATGTCTTGCTTACAGTGAGAAAAAAGTT
CAACGTTCTTTGCTCAACCAAGCTGTCATCCGAAGACCTTGAAGCAGAAATATTTCTTCATTTTGAAGAATCAGTAAGGCAGTCTAACATTGATGGTAGTCTACAACTTG
GACTCAATCAGTGGAAAGTGCAAACTTTAGCAGCTACAGATGGCTCATTTGACCTGCAATCCTTGATATTAAAGATATTGGTTTTATTAGAACTTTTCACACATGGCATT
TTGATAACTATGGTTAAAATGTTGCAAATGTTTGCTATGACATTATTTGGGAAGATATTCCAAGAAGCAACCAACTATCAAATTAAAAAGGAAATCATCAAAAAGGTACT
CGTGTGGAACTTTGTCAACCAACTATCCCTAACTCTTGCCCAAATTCTTTTGGCTAACAAATGCAACCTAGCCACCATGAAATTGGGATTTTTAGGATTCGTCAAGCCTT
GCTTAGCCACTAAAGCTCGGTTAAATCCCATCAAAACTCTAAAATTCAAACTGGCCAATTCTTTCCACCAACTATCATGTTTTGTTCTTATTTTTTACAATTCGTTCTTG
ACTTCTTCATTTGTTGATTTAAGTGGACTCGAGAGATAA
Protein sequenceShow/hide protein sequence
MTTALVALQFPFLSPTASKKFPYPFFSFSRGRNGVQFLGCANVSNSRIAQGAFDPELHFVLELATNSELYELEQIFFDPIYFNPLMESIANRGETDYSMIEEDLEERDEF
ISTLESRFLYLVVDARSTLRGWRPSYRDVLLTVRKKFNVLCSTKLSSEDLEAEIFLHFEESVRQSNIDGSLQLGLNQWKVQTLAATDGSFDLQSLILKILVLLELFTHGI
LITMVKMLQMFAMTLFGKIFQEATNYQIKKEIIKKVLVWNFVNQLSLTLAQILLANKCNLATMKLGFLGFVKPCLATKARLNPIKTLKFKLANSFHQLSCFVLIFYNSFL
TSSFVDLSGLER