; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019738 (gene) of Snake gourd v1 genome

Gene IDTan0019738
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionButirosin biosynthesis
Genome locationLG06:79520666..79523832
RNA-Seq ExpressionTan0019738
SyntenyTan0019738
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575087.1 hypothetical protein SDJN03_25726, partial [Cucurbita argyrosperma subsp. sororia]1.1e-15687.66Show/hide
Query:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVR---PLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDD
        MTD L LQFAHRSL LG SASL IPIRS F V+   P SFC AHSPLCSGS LV RS LR+IS INAS N GMSNSMVNE EPKELRDESDFEA+FSG D
Subjt:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVR---PLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDD

Query:  YISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLP
        +ISVCGFGSLLSERSARSTFPELINFR+ARLNGFRR+FGNVAP+FFERGIAKPETKEISSLCAEPCEGETI +TVFEIKKSEIPAFIQREIEFRFLAVLP
Subjt:  YISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLP

Query:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSG
        ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGN+DI+ QHYGRHNIDKIWRDDILPCRVYLRHC+LAAKNL +TAYNNFLDHTFLGDR TTIREYLA++GSG
Subjt:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSG

Query:  IMEEEPPESVKFRYGG
        IMEEEPPES+KFRYGG
Subjt:  IMEEEPPESVKFRYGG

KAG6593988.1 hypothetical protein SDJN03_13464, partial [Cucurbita argyrosperma subsp. sororia]2.5e-15687.77Show/hide
Query:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVRPLSFCIAHSPLCSGSG------LVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFS
        M +CLTLQFAHRSL L  SAS   PIRS F VR LSFCI HS LCSGSG      LV  SNLR I  INAS NRGMSNS+VN++EPKELRDESDFE+VFS
Subjt:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVRPLSFCIAHSPLCSGSG------LVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFS

Query:  GDDYISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLA
         D YISVCGFGSLLSERSARSTFPELINFR+ARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFIQREIEFRFLA
Subjt:  GDDYISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLA

Query:  VLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAAN
        VLPETLDGKLYDKP+VLCSRSTDEEFF+VRCKGNED+YFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNL DTAYNNFLDHTFLGDRRTTIREYLA N
Subjt:  VLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAAN

Query:  GSGIMEEEPPESVKFRYGG
        GSGIMEEEPPES+KFRYGG
Subjt:  GSGIMEEEPPESVKFRYGG

XP_022930375.1 uncharacterized protein LOC111436842 [Cucurbita moschata]2.5e-15687.77Show/hide
Query:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVRPLSFCIAHSPLCSGSG------LVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFS
        M +CL+LQFAHRSL L  SAS   PIRS F VR LSFCI HS LCSGSG      LVF SNLR I  INAS NRGMSNS+ N++EPKELRDESDFE+VFS
Subjt:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVRPLSFCIAHSPLCSGSG------LVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFS

Query:  GDDYISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLA
         D YISVCGFGSLLSERSARSTFPELINFR+ARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFIQREIEFRFLA
Subjt:  GDDYISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLA

Query:  VLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAAN
        VLPETLDGKLYDKPAVLCSRSTDEEFF+VRCKGNED+YFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNL DTAYNNFLDHTFLGDRRTTIREYLA N
Subjt:  VLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAAN

Query:  GSGIMEEEPPESVKFRYGG
        GSGIMEEEPPES+KFRYGG
Subjt:  GSGIMEEEPPESVKFRYGG

XP_022959219.1 uncharacterized protein LOC111460273 isoform X1 [Cucurbita moschata]2.2e-15787.66Show/hide
Query:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVR---PLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDD
        MTD L LQFAHRSL LGHSASL IP+RS F V+   P SFC AHSPLCSGS LV RS LR+IS INAS N GMSNS+VNE EPKELRDESDFEA+FSG D
Subjt:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVR---PLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDD

Query:  YISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLP
        +ISVCGFGSLLSERSARSTFPELINFR+ARLNGFRR+FGNVAP+FFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFIQREIEFRFLAVLP
Subjt:  YISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLP

Query:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSG
        ETLDGKLYDKPAVLC RSTDEEFFQVRCKGNEDI+ QHYGRHNIDKIWRDDILPCRVYLRHC+LAAKNL +TAYNNFLDHTFLGDR TTIREYLA++GSG
Subjt:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSG

Query:  IMEEEPPESVKFRYGG
        IMEEEPPES+KFRYGG
Subjt:  IMEEEPPESVKFRYGG

XP_023548382.1 uncharacterized protein LOC111807043 isoform X1 [Cucurbita pepo subsp. pepo]1.1e-15687.66Show/hide
Query:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVR---PLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDD
        MTD L LQFAHRSL LGHSASL I IRS F V+   P SFC AHSPLCSGS LV RS LR+IS INAS N GMSNSMVNE EPKELRDESDFEA+FSG D
Subjt:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVR---PLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDD

Query:  YISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLP
        + SVCGFGSLLSERSARSTFPELINFR+ARLNGFRR+FGNVAP+FFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFIQREIEFRFLAVLP
Subjt:  YISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLP

Query:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSG
        ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGN+DI+ QHYGRHNIDKIWRDDILPCRVYLRHC+LAAKNL +TAYNNFLDHTFLGDR TTIREYLA++GSG
Subjt:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSG

Query:  IMEEEPPESVKFRYGG
        IMEEEPPES+KFRYGG
Subjt:  IMEEEPPESVKFRYGG

TrEMBL top hitse value%identityAlignment
A0A6J1C9Z2 uncharacterized protein LOC1110096711.2e-14885.35Show/hide
Query:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVRPLSFCIAHSPLCSG-SGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDDYI
        MTD LTLQFAHRSLSLG   SL +PIR  F ++ + F +  SPLCSG SGLVFRSNLR++S I+AS +RGMSNSMVNE EPKELRDESDFEAV   D+YI
Subjt:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVRPLSFCIAHSPLCSG-SGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDDYI

Query:  SVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLPET
        S+CGFGSLLSERSARSTFPEL+NFR+ARL+GFRR+F NVAPVFFERGIAKPETKEISSLCAEPCEGETIII+VFEIKKSEIPAFI REIEFRFLAVLPE 
Subjt:  SVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLPET

Query:  LDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSGIM
        LDG LYDKPAVLCSRSTDEEFFQVRCKGN+DIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNL D AYNNFLDHTFLGDR TTIR+YLAANGSGIM
Subjt:  LDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSGIM

Query:  EEEPPESVKFRYGG
        EEEPPES+KFRYGG
Subjt:  EEEPPESVKFRYGG

A0A6J1EWR6 uncharacterized protein LOC1114368421.2e-15687.77Show/hide
Query:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVRPLSFCIAHSPLCSGSG------LVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFS
        M +CL+LQFAHRSL L  SAS   PIRS F VR LSFCI HS LCSGSG      LVF SNLR I  INAS NRGMSNS+ N++EPKELRDESDFE+VFS
Subjt:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVRPLSFCIAHSPLCSGSG------LVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFS

Query:  GDDYISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLA
         D YISVCGFGSLLSERSARSTFPELINFR+ARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFIQREIEFRFLA
Subjt:  GDDYISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLA

Query:  VLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAAN
        VLPETLDGKLYDKPAVLCSRSTDEEFF+VRCKGNED+YFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNL DTAYNNFLDHTFLGDRRTTIREYLA N
Subjt:  VLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAAN

Query:  GSGIMEEEPPESVKFRYGG
        GSGIMEEEPPES+KFRYGG
Subjt:  GSGIMEEEPPESVKFRYGG

A0A6J1H5P5 uncharacterized protein LOC111460273 isoform X22.8e-14582.91Show/hide
Query:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVR---PLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDD
        MTD L LQFAHRSL LGHSASL IP+RS F V+   P SFC AHSPLCSGS LV RS LR+IS INAS N GMSNS+VNE EPKELRDESDFEA+FSG D
Subjt:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVR---PLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDD

Query:  YISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLP
        +ISVCGFGSLLSERSARSTFPELINFR+ARLNGFRR+FGNVAP+FFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFIQREIEFRFL    
Subjt:  YISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLP

Query:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSG
                   AVLC RSTDEEFFQVRCKGNEDI+ QHYGRHNIDKIWRDDILPCRVYLRHC+LAAKNL +TAYNNFLDHTFLGDR TTIREYLA++GSG
Subjt:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSG

Query:  IMEEEPPESVKFRYGG
        IMEEEPPES+KFRYGG
Subjt:  IMEEEPPESVKFRYGG

A0A6J1H7C8 uncharacterized protein LOC111460273 isoform X11.1e-15787.66Show/hide
Query:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVR---PLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDD
        MTD L LQFAHRSL LGHSASL IP+RS F V+   P SFC AHSPLCSGS LV RS LR+IS INAS N GMSNS+VNE EPKELRDESDFEA+FSG D
Subjt:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVR---PLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDD

Query:  YISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLP
        +ISVCGFGSLLSERSARSTFPELINFR+ARLNGFRR+FGNVAP+FFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFIQREIEFRFLAVLP
Subjt:  YISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLP

Query:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSG
        ETLDGKLYDKPAVLC RSTDEEFFQVRCKGNEDI+ QHYGRHNIDKIWRDDILPCRVYLRHC+LAAKNL +TAYNNFLDHTFLGDR TTIREYLA++GSG
Subjt:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSG

Query:  IMEEEPPESVKFRYGG
        IMEEEPPES+KFRYGG
Subjt:  IMEEEPPESVKFRYGG

A0A6J1KHJ0 uncharacterized protein LOC1114944341.6e-15386.75Show/hide
Query:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVRPLSFCIAHSPLCSGSG----LVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGD
        M + L+LQFAHRSL L  SAS   PIRS F VR  SFCI HS LCSGSG    LVF SNLR +  INAS NRGMSNS+VN++ PKELRDESDFE+VFS D
Subjt:  MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVRPLSFCIAHSPLCSGSG----LVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGD

Query:  DYISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVL
         YISVCGFGSLLSERSARSTFPELINFR+ARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFIQREIEFRFLAVL
Subjt:  DYISVCGFGSLLSERSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVL

Query:  PETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGS
        PETLDGKLYDKPAVLCSRSTDEEFF+VRCKGNED+YFQHYGRHNIDKIWRDDILPCRVYLRHCVLAA NL D AYNNFLDHTFLGDRRTTIREYLA NGS
Subjt:  PETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGS

Query:  GIMEEEPPESVKFRYGG
        GIMEEEPPES+KFRYGG
Subjt:  GIMEEEPPESVKFRYGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G16060.1 unknown protein1.1e-10667.86Show/hide
Query:  PLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDDYISVCGFGSLLSERSARSTFPELINFRMARLNGFRR
        P SF  + +P C       RS +   SS      R    +M    +  EL DESDFE + S D+ IS+ GFGSLLSERSARSTFP+L NFR+A+L GFRR
Subjt:  PLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDDYISVCGFGSLLSERSARSTFPELINFRMARLNGFRR

Query:  LFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYF
        +F + AP+FFERGIA PETKEISSL  EPCEGE++++TVFEIK SEIPAFI RE+EFRFLAV+PETL+GK Y   AVLC R +DEEFFQ+RCKGN+ IYF
Subjt:  LFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYF

Query:  QHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSGIMEEEPPESVKFRYGG
        QHYGR  IDKIWRDDILPCR+YLRHCVLAAKNL D AYNNFLDHTFLGDR+TTIREYL++ GSGIMEEEPPE++K RYGG
Subjt:  QHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSGIMEEEPPESVKFRYGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGATTGTCTCACTCTCCAATTTGCTCATCGTTCTCTCTCTCTAGGGCATTCCGCTTCCCTTCACATTCCAATCCGCTCTACGTTTCTGGTTCGGCCTCTCTCTTT
CTGCATAGCGCACTCTCCTCTTTGTTCCGGCTCCGGACTGGTTTTTCGAAGTAACCTCCGGAAAATATCTTCGATCAACGCGTCCTTCAACCGCGGAATGTCGAATTCCA
TGGTGAACGAAGATGAACCGAAGGAACTGCGAGATGAATCCGATTTTGAAGCCGTCTTCTCCGGTGACGATTACATTTCCGTTTGTGGCTTCGGCTCTCTTCTCTCTGAG
AGGAGTGCGCGAAGTACCTTTCCTGAATTGATCAACTTTAGAATGGCGAGATTGAACGGTTTCAGACGCCTTTTCGGAAATGTAGCTCCTGTATTCTTTGAGCGCGGCAT
TGCTAAACCTGAAACCAAGGAGATTTCAAGCTTGTGTGCGGAGCCTTGCGAAGGAGAAACTATCATCATTACGGTTTTCGAGATTAAGAAGTCTGAGATTCCAGCTTTTA
TACAGAGAGAGATTGAGTTTCGATTTCTAGCCGTTCTTCCTGAAACATTAGATGGAAAGCTATATGATAAACCAGCGGTGCTTTGTTCTCGATCCACTGATGAGGAATTC
TTCCAAGTGCGATGCAAAGGAAATGAGGACATTTATTTTCAGCATTATGGTCGTCATAATATTGATAAGATTTGGAGAGATGATATCTTACCCTGTCGTGTCTATCTTCG
ACACTGTGTTTTAGCTGCAAAAAACTTGAGCGACACAGCTTATAATAACTTTCTGGATCACACTTTCCTTGGAGATCGTAGAACAACCATCCGTGAATATTTGGCTGCTA
ATGGTTCAGGCATTATGGAAGAGGAACCCCCAGAATCCGTCAAGTTTCGATATGGCGGTTGA
mRNA sequenceShow/hide mRNA sequence
TATATATTTCCCACAAAATCACAATTCCTACTCACTCTCAGTCTCAGTCGCTCCACAAGTACGAAACGGAAACACAGGCATCGTTTGAGTTTCTAATTCTCACGGGCGAT
GACTGATTGTCTCACTCTCCAATTTGCTCATCGTTCTCTCTCTCTAGGGCATTCCGCTTCCCTTCACATTCCAATCCGCTCTACGTTTCTGGTTCGGCCTCTCTCTTTCT
GCATAGCGCACTCTCCTCTTTGTTCCGGCTCCGGACTGGTTTTTCGAAGTAACCTCCGGAAAATATCTTCGATCAACGCGTCCTTCAACCGCGGAATGTCGAATTCCATG
GTGAACGAAGATGAACCGAAGGAACTGCGAGATGAATCCGATTTTGAAGCCGTCTTCTCCGGTGACGATTACATTTCCGTTTGTGGCTTCGGCTCTCTTCTCTCTGAGAG
GAGTGCGCGAAGTACCTTTCCTGAATTGATCAACTTTAGAATGGCGAGATTGAACGGTTTCAGACGCCTTTTCGGAAATGTAGCTCCTGTATTCTTTGAGCGCGGCATTG
CTAAACCTGAAACCAAGGAGATTTCAAGCTTGTGTGCGGAGCCTTGCGAAGGAGAAACTATCATCATTACGGTTTTCGAGATTAAGAAGTCTGAGATTCCAGCTTTTATA
CAGAGAGAGATTGAGTTTCGATTTCTAGCCGTTCTTCCTGAAACATTAGATGGAAAGCTATATGATAAACCAGCGGTGCTTTGTTCTCGATCCACTGATGAGGAATTCTT
CCAAGTGCGATGCAAAGGAAATGAGGACATTTATTTTCAGCATTATGGTCGTCATAATATTGATAAGATTTGGAGAGATGATATCTTACCCTGTCGTGTCTATCTTCGAC
ACTGTGTTTTAGCTGCAAAAAACTTGAGCGACACAGCTTATAATAACTTTCTGGATCACACTTTCCTTGGAGATCGTAGAACAACCATCCGTGAATATTTGGCTGCTAAT
GGTTCAGGCATTATGGAAGAGGAACCCCCAGAATCCGTCAAGTTTCGATATGGCGGTTGATTTTGCAGAACAAAATAACTGCAGAGAAGGCTTTATAGATCCTTAGAAGC
TTTGAGATATTACTGGAGATTAGATATGAATCTTCTTCTGCCATGGAGATTGAATGGCGGCTGCCATGAAGATTCGTGATGAGTCTGGCAGCCTGATTTTCGGTTCGAAA
AGTAAGGTAAATTTTTTAAGCTATGTATTATGCATTACGTTGGTGGTTCGGAAACAGATTTTTTTTCCCCTTTTTCCGTTTACTAAAAGGTTGTATAAAAAATTGTTTCC
GAAATTATTTTTAGAGAT
Protein sequenceShow/hide protein sequence
MTDCLTLQFAHRSLSLGHSASLHIPIRSTFLVRPLSFCIAHSPLCSGSGLVFRSNLRKISSINASFNRGMSNSMVNEDEPKELRDESDFEAVFSGDDYISVCGFGSLLSE
RSARSTFPELINFRMARLNGFRRLFGNVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIQREIEFRFLAVLPETLDGKLYDKPAVLCSRSTDEEF
FQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHCVLAAKNLSDTAYNNFLDHTFLGDRRTTIREYLAANGSGIMEEEPPESVKFRYGG