; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013023 (gene) of Snake gourd v1 genome

Gene IDTan0013023
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationLG06:72253446..72263367
RNA-Seq ExpressionTan0013023
SyntenyTan0013023
Gene Ontology termsGO:0016788 - hydrolase activity, acting on ester bonds (molecular function)
InterPro domainsIPR012908 - GPI inositol-deacylase PGAP1-like
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593743.1 hypothetical protein SDJN03_13219, partial [Cucurbita argyrosperma subsp. sororia]1.9e-14688.05Show/hide
Query:  MAVSSFSSLHFKP--SFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKT
        MAV SFS LHFKP  SFS+ P  +FSCS+RPAVILPGLGNNSGDYDKLRL+L++RYGV +VVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYL KT
Subjt:  MAVSSFSSLHFKP--SFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKT

Query:  DEAIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLF
        D+AIQEA++LAQG  LSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPK  PGVIDQTRGLLNYVDKYCSK GYNPELKYVCIAGRYIQGARLF
Subjt:  DEAIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLF

Query:  GNSNANTIHAAASIANNQPTPQLTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
          S+ANTI AAASIANNQP P+LTIAN+TSSSTNT TFRARF+GQGYKQVCGESEVWGDGVVPEVSAHLEGAIN+SF+GVYHSPVGSDDELRP
Subjt:  GNSNANTIHAAASIANNQPTPQLTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP

XP_004154031.1 uncharacterized protein LOC101213079 [Cucumis sativus]4.9e-14788.74Show/hide
Query:  MAVSSFSSLHFKPSFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE
        MAV SFS LH KPSF S  HS+FSCSLRPAVILPGLGNNSGDYDKLRLLLKER+GV +VVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE
Subjt:  MAVSSFSSLHFKPSFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE

Query:  AIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLFGN
        AIQEAK+LAQGGTLSLIGHSAGGWLARVYMEEFGIS ISMLLTLGTPHLPPPKG PGVIDQTRGLLNYVDK CSKAGYNPELK+VCIAGRYIQG+RLFGN
Subjt:  AIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLFGN

Query:  SNANTIHAAASIANNQPTPQLTIANDTSSSTN--TSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
        S+ANTI AAASI++NQPTP+L I N+TS+ST+  T++ RARF+GQGYKQVCGESEVWGDGVVPEVSAHLEGA+NISFDGVYHSPVGSDDELRP
Subjt:  SNANTIHAAASIANNQPTPQLTIANDTSSSTN--TSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP

XP_023000108.1 uncharacterized protein LOC111494403 isoform X1 [Cucurbita maxima]1.9e-14688.74Show/hide
Query:  MAVSSFSSLHFKPSFSSYPHS--RFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKT
        MAV SFS LHFKPSFS    S  +FSCS+RPAVILPGLGNNSGDYDKLRL+L++RYGV +VVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYL KT
Subjt:  MAVSSFSSLHFKPSFSSYPHS--RFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKT

Query:  DEAIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLF
        DEAIQEAK+LAQG  LSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPK  PGVIDQTRGLLNYVDKYCSK GYNPELKYVCIAGRYIQGARLF
Subjt:  DEAIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLF

Query:  GNSNANTIHAAASIANNQPTPQLTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
          S+ANTI AAASIANNQP P+LTI N TSSSTNT TFRARF+GQGYKQVCGESEVWGDGVVPEVSAHLEGAIN+SFDGVYHSPVGSDDELRP
Subjt:  GNSNANTIHAAASIANNQPTPQLTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP

XP_023000109.1 uncharacterized protein LOC111494403 isoform X2 [Cucurbita maxima]1.9e-14688.74Show/hide
Query:  MAVSSFSSLHFKPSFSSYPHS--RFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKT
        MAV SFS LHFKPSFS    S  +FSCS+RPAVILPGLGNNSGDYDKLRL+L++RYGV +VVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYL KT
Subjt:  MAVSSFSSLHFKPSFSSYPHS--RFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKT

Query:  DEAIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLF
        DEAIQEAK+LAQG  LSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPK  PGVIDQTRGLLNYVDKYCSK GYNPELKYVCIAGRYIQGARLF
Subjt:  DEAIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLF

Query:  GNSNANTIHAAASIANNQPTPQLTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
          S+ANTI AAASIANNQP P+LTI N TSSSTNT TFRARF+GQGYKQVCGESEVWGDGVVPEVSAHLEGAIN+SFDGVYHSPVGSDDELRP
Subjt:  GNSNANTIHAAASIANNQPTPQLTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP

XP_038906865.1 uncharacterized protein LOC120092752 [Benincasa hispida]7.3e-15191.47Show/hide
Query:  MAVSSFSSLHFKPSFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE
        MAV SFS LHFKPSF S  HS+ SCSLRPAVILPGLGNNSGDYDKLRLLLKERYGV AVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE
Subjt:  MAVSSFSSLHFKPSFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE

Query:  AIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLFGN
        AIQEAK+ AQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKG PGVIDQTRGLLNYVDK CSKAGYNPELKYVCIAGRYIQGARLFGN
Subjt:  AIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLFGN

Query:  SNANTIHAAASIANNQPTPQLTIANDTSSSTNTS--TFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
        S+ANTI AAASI+NNQP+P+L I NDTSSST+T+  T RARF+GQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
Subjt:  SNANTIHAAASIANNQPTPQLTIANDTSSSTNTS--TFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP

TrEMBL top hitse value%identityAlignment
A0A0A0K9V1 Uncharacterized protein2.4e-14788.74Show/hide
Query:  MAVSSFSSLHFKPSFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE
        MAV SFS LH KPSF S  HS+FSCSLRPAVILPGLGNNSGDYDKLRLLLKER+GV +VVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE
Subjt:  MAVSSFSSLHFKPSFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE

Query:  AIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLFGN
        AIQEAK+LAQGGTLSLIGHSAGGWLARVYMEEFGIS ISMLLTLGTPHLPPPKG PGVIDQTRGLLNYVDK CSKAGYNPELK+VCIAGRYIQG+RLFGN
Subjt:  AIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLFGN

Query:  SNANTIHAAASIANNQPTPQLTIANDTSSSTN--TSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
        S+ANTI AAASI++NQPTP+L I N+TS+ST+  T++ RARF+GQGYKQVCGESEVWGDGVVPEVSAHLEGA+NISFDGVYHSPVGSDDELRP
Subjt:  SNANTIHAAASIANNQPTPQLTIANDTSSSTN--TSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP

A0A1S3CBR0 uncharacterized protein LOC1034990043.4e-14688.4Show/hide
Query:  MAVSSFSSLHFKPSFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE
        MAV SFS LH KPSF S   S+FSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGV +VVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE
Subjt:  MAVSSFSSLHFKPSFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE

Query:  AIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLFGN
        AIQEAK+LAQGGTLSLIGHSAGGWLARVYMEEFGIS ISMLLTLGTPHLPPPKG PGVIDQTRGLLNYVDK CSKAGYNPELK+VCIAGRYIQGARLFG+
Subjt:  AIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLFGN

Query:  SNANTIHAAASIANNQPTPQLTIANDTSSSTN--TSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
        S+ANTI AAASI++NQPTP+L I ND+S+ST+   ++ R+RF+GQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
Subjt:  SNANTIHAAASIANNQPTPQLTIANDTSSSTN--TSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP

A0A5D3DMM6 GPI inositol-deacylase PGAP1-like protein4.4e-14688.4Show/hide
Query:  MAVSSFSSLHFKPSFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE
        MAV SFS LH KPS  S   S+FSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGV +VVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE
Subjt:  MAVSSFSSLHFKPSFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDE

Query:  AIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLFGN
        AIQEAK+LAQGGTLSLIGHSAGGWLARVYMEEFGIS ISMLLTLGTPHLPPPKG PGVIDQTRGLLNYVDK CSKAGYNPELK+VCIAGRYIQGARLFG+
Subjt:  AIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLFGN

Query:  SNANTIHAAASIANNQPTPQLTIANDTSSSTN--TSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
        S+ANTI AAASI++NQPTP+L I ND+S+ST+  T++ R+RF+GQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
Subjt:  SNANTIHAAASIANNQPTPQLTIANDTSSSTN--TSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP

A0A6J1KIZ4 uncharacterized protein LOC111494403 isoform X29.0e-14788.74Show/hide
Query:  MAVSSFSSLHFKPSFSSYPHS--RFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKT
        MAV SFS LHFKPSFS    S  +FSCS+RPAVILPGLGNNSGDYDKLRL+L++RYGV +VVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYL KT
Subjt:  MAVSSFSSLHFKPSFSSYPHS--RFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKT

Query:  DEAIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLF
        DEAIQEAK+LAQG  LSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPK  PGVIDQTRGLLNYVDKYCSK GYNPELKYVCIAGRYIQGARLF
Subjt:  DEAIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLF

Query:  GNSNANTIHAAASIANNQPTPQLTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
          S+ANTI AAASIANNQP P+LTI N TSSSTNT TFRARF+GQGYKQVCGESEVWGDGVVPEVSAHLEGAIN+SFDGVYHSPVGSDDELRP
Subjt:  GNSNANTIHAAASIANNQPTPQLTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP

A0A6J1KLP2 uncharacterized protein LOC111494403 isoform X19.0e-14788.74Show/hide
Query:  MAVSSFSSLHFKPSFSSYPHS--RFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKT
        MAV SFS LHFKPSFS    S  +FSCS+RPAVILPGLGNNSGDYDKLRL+L++RYGV +VVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYL KT
Subjt:  MAVSSFSSLHFKPSFSSYPHS--RFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKT

Query:  DEAIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLF
        DEAIQEAK+LAQG  LSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPK  PGVIDQTRGLLNYVDKYCSK GYNPELKYVCIAGRYIQGARLF
Subjt:  DEAIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLF

Query:  GNSNANTIHAAASIANNQPTPQLTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
          S+ANTI AAASIANNQP P+LTI N TSSSTNT TFRARF+GQGYKQVCGESEVWGDGVVPEVSAHLEGAIN+SFDGVYHSPVGSDDELRP
Subjt:  GNSNANTIHAAASIANNQPTPQLTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP

SwissProt top hitse value%identityAlignment
Q6CF60 Putative GPI inositol-deacylase C3.3e-0528.48Show/hide
Query:  PAVILPGLGNNSGDYDKLRLL------LKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDW--YLKKT-DEAIQEAKDLAQG-GTLSLIG
        P + +PG   N+G Y ++R +      L E+YG        S ID+       LD N     L  R +LD   YL    +  +Q  +D  +   ++ L+G
Subjt:  PAVILPGLGNNSGDYDKLRLL------LKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDW--YLKKT-DEAIQEAKDLAQG-GTLSLIG

Query:  HSAGGWLAR--VYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLN
        HS GG ++R  + ++ +    ++ + TL +PHL PP  F G I +    +N
Subjt:  HSAGGWLAR--VYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLN

Arabidopsis top hitse value%identityAlignment
AT5G17670.1 alpha/beta-Hydrolases superfamily protein8.7e-11066.33Show/hide
Query:  MAVSSFSSLHFKPSFS-----SYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYL
        MA +  +SL  +P+FS     S   S  +   RPAVILPGLGNN+GDY KL + L E YGV AVV  VSR+DW RNAAGL+DP YWRGTLRPRPVLDWYL
Subjt:  MAVSSFSSLHFKPSFS-----SYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYL

Query:  KKTDEAIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGA
         + D+A++EA +LAQG  L LIGHSAGGWLARVYMEE+G S IS+LLTLGTPHLPPP+G PGVIDQTRGLL YV++ C+KA Y PELKYVCIAGRYI+GA
Subjt:  KKTDEAIQEAKDLAQGGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGA

Query:  RLFGNSNAN-TIHAAASIANNQPTPQLTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP
        RL  N++A+        I + +   +L IA    S+  + TFRARF+GQGYKQVCG ++VWGDGVVPEVSAHLEGA+N+SFDGVYHSPVGSDDE RP
Subjt:  RLFGNSNAN-TIHAAASIANNQPTPQLTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTTTCGTCGTTTTCATCTCTGCATTTCAAGCCCTCATTCTCCTCTTATCCTCATTCCCGATTTTCCTGCTCACTCCGCCCTGCCGTCATCCTTCCTGGTTTGGG
GAACAACTCAGGTGACTACGATAAATTGAGGCTTCTTTTGAAAGAGCGGTACGGCGTCACCGCCGTGGTTGTTAAGGTGTCGAGAATTGATTGGCTGAGGAATGCGGCTG
GATTACTTGATCCTAACTACTGGCGCGGCACTCTCCGGCCCCGGCCTGTCCTCGATTGGTATCTGAAGAAAACAGATGAGGCCATCCAAGAAGCCAAGGACCTCGCTCAA
GGTGGGACTTTGTCCTTAATTGGGCACTCAGCGGGCGGATGGCTTGCACGAGTCTATATGGAAGAATTTGGAATATCTCATATTTCCATGTTGCTGACTCTTGGAACCCC
TCACCTGCCACCTCCAAAAGGTTTTCCAGGAGTGATTGATCAGACGAGGGGACTTCTGAATTATGTGGACAAATATTGCTCAAAAGCTGGTTACAATCCTGAACTTAAAT
ATGTATGTATAGCAGGAAGGTACATTCAAGGGGCTCGCTTGTTCGGTAACTCTAATGCAAACACCATTCATGCAGCCGCTTCCATCGCCAACAACCAACCAACTCCACAG
TTAACCATCGCCAACGACACAAGTAGCTCGACAAACACATCCACATTTCGCGCTCGCTTCATCGGGCAAGGATATAAGCAGGTGTGTGGGGAGAGTGAGGTATGGGGAGA
TGGGGTGGTTCCAGAAGTGTCTGCCCATTTGGAAGGAGCTATCAACATTAGCTTTGATGGAGTCTATCACTCTCCTGTTGGTTCTGATGATGAACTCAGACCCTGCGACG
ACAAATGTGAAGAAATTCCAGGTGGTGATGTCATTGTAGGCAACAATGACCATGGTGATTTCATTGATTTCGATGGTGAGGAATTTCAGCTCGACTAA
mRNA sequenceShow/hide mRNA sequence
GTTTACCATTCCCCCAAAAGATTTGAGGCGAAAACAGGTTCCACATGTTGACACCTCTCTTAAGTTTCCACCAGATAGACCGTTTTCCCGAACTGCCCAATGCAATGCAC
TCGCGCCATCGCTACAACTTCTGACGAACTTCCAACACTCCTCTGAGCTCTGCACAGAGTAAGGATTATAGCATAGCCATGGCGGTTTCGTCGTTTTCATCTCTGCATTT
CAAGCCCTCATTCTCCTCTTATCCTCATTCCCGATTTTCCTGCTCACTCCGCCCTGCCGTCATCCTTCCTGGTTTGGGGAACAACTCAGGTGACTACGATAAATTGAGGC
TTCTTTTGAAAGAGCGGTACGGCGTCACCGCCGTGGTTGTTAAGGTGTCGAGAATTGATTGGCTGAGGAATGCGGCTGGATTACTTGATCCTAACTACTGGCGCGGCACT
CTCCGGCCCCGGCCTGTCCTCGATTGGTATCTGAAGAAAACAGATGAGGCCATCCAAGAAGCCAAGGACCTCGCTCAAGGTGGGACTTTGTCCTTAATTGGGCACTCAGC
GGGCGGATGGCTTGCACGAGTCTATATGGAAGAATTTGGAATATCTCATATTTCCATGTTGCTGACTCTTGGAACCCCTCACCTGCCACCTCCAAAAGGTTTTCCAGGAG
TGATTGATCAGACGAGGGGACTTCTGAATTATGTGGACAAATATTGCTCAAAAGCTGGTTACAATCCTGAACTTAAATATGTATGTATAGCAGGAAGGTACATTCAAGGG
GCTCGCTTGTTCGGTAACTCTAATGCAAACACCATTCATGCAGCCGCTTCCATCGCCAACAACCAACCAACTCCACAGTTAACCATCGCCAACGACACAAGTAGCTCGAC
AAACACATCCACATTTCGCGCTCGCTTCATCGGGCAAGGATATAAGCAGGTGTGTGGGGAGAGTGAGGTATGGGGAGATGGGGTGGTTCCAGAAGTGTCTGCCCATTTGG
AAGGAGCTATCAACATTAGCTTTGATGGAGTCTATCACTCTCCTGTTGGTTCTGATGATGAACTCAGACCCTGCGACGACAAATGTGAAGAAATTCCAGGTGGTGATGTC
ATTGTAGGCAACAATGACCATGGTGATTTCATTGATTTCGATGGTGAGGAATTTCAGCTCGACTAAAATGCGGAAGAGAACAGGAAAAGCGGTGGTGAACATAGTAACGT
CCATGAACACAATGAACTGGTCATAGTCAACCTTG
Protein sequenceShow/hide protein sequence
MAVSSFSSLHFKPSFSSYPHSRFSCSLRPAVILPGLGNNSGDYDKLRLLLKERYGVTAVVVKVSRIDWLRNAAGLLDPNYWRGTLRPRPVLDWYLKKTDEAIQEAKDLAQ
GGTLSLIGHSAGGWLARVYMEEFGISHISMLLTLGTPHLPPPKGFPGVIDQTRGLLNYVDKYCSKAGYNPELKYVCIAGRYIQGARLFGNSNANTIHAAASIANNQPTPQ
LTIANDTSSSTNTSTFRARFIGQGYKQVCGESEVWGDGVVPEVSAHLEGAINISFDGVYHSPVGSDDELRPCDDKCEEIPGGDVIVGNNDHGDFIDFDGEEFQLD