; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015692 (gene) of Snake gourd v1 genome

Gene IDTan0015692
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPolyketide_cyc domain-containing protein
Genome locationLG06:80572380..80604347
RNA-Seq ExpressionTan0015692
SyntenyTan0015692
Gene Ontology termsNA
InterPro domainsIPR005031 - Coenzyme Q-binding protein COQ10, START domain
IPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575162.1 hypothetical protein SDJN03_25801, partial [Cucurbita argyrosperma subsp. sororia]9.8e-7358.11Show/hide
Query:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS
        MLS LNSS+P++SS  +SC  SRL PT PA ASA  AV V FR DPSLS VA+ S   TKSS+  SD+TY AKHFRSRFR YYSNSDP FSD+D N DYS
Subjt:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS

Query:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------
        DASE ETI E DGGVSI+IEKLGNNSRRIYSRIGIDA LQ VWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   +L +                
Subjt:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------

Query:  -RPFPS----------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST
            PS                                         +++TIL YRVDVKPKL+LPV+L+EGRLCDEIK+NLMCIREE +K  S+T
Subjt:  -RPFPS----------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST

XP_022959182.1 uncharacterized protein LOC111460246 isoform X2 [Cucurbita moschata]6.3e-7257.77Show/hide
Query:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS
        MLS LNSS+P++SS  +SC  SRL PT PA ASA  AV V FR DPSLS VA+ S   TKSS+  SD+TY AKHFRSRF  YYSNSDP FSD+D N DYS
Subjt:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS

Query:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------
        DASE ETI E DGGVSI+IEKLGNNSRRIYSRIGIDA LQAVWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   +L +                
Subjt:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------

Query:  -RPFPS----------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST
            PS                                         + +TIL YRVDVKPKL+LP++L+EGRLCDEIK+NLMCIREE +K  S+T
Subjt:  -RPFPS----------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST

XP_023006623.1 uncharacterized protein LOC111499296 [Cucurbita maxima]4.0e-7458.25Show/hide
Query:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS
        MLS LNSS+P++SS  +SC  SRL PT PA ASA  AVVV FR DPSLS VA+ S   TKSS+  SD+TYPAK+FRSRFR YYSNSDPTFSD+D N +YS
Subjt:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS

Query:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------
        DASE ETI E DGGVSI+IEKLGNNSRRIYSRIGID  LQ VWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   +L +                
Subjt:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------

Query:  -RPFPS----------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSSTC
            PS                                         +++TIL YRVDVKPKL+LPV+L+EGRLCDEIK+NLMCIREE +K  S+TC
Subjt:  -RPFPS----------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSSTC

XP_023548259.1 uncharacterized protein LOC111806945 [Cucurbita pepo subsp. pepo]3.0e-7458.78Show/hide
Query:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS
        MLS LNSS+P++SS  +SC  SRL P  PA ASA  AV V FR DPSLS VA+ S   TKSS+  SD+TYPAKHFRSRFR YYSNSDP FSD+D N DYS
Subjt:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS

Query:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------
        DASE ETI E DGGVSI+IEKLGNNSRRIYSRIGIDA LQAVWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   +L +                
Subjt:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------

Query:  -RPFPS----------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST
            PS                                         D++TIL YRVDVKPKL+LPV+L+EGRLCDEIK+NLMCIREE +K  S+T
Subjt:  -RPFPS----------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST

XP_038874642.1 uncharacterized protein LOC120067209 [Benincasa hispida]2.0e-7360.21Show/hide
Query:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPSTKSSDPSSDT-TYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDA
        MLS L+SSEP++SS   S  LSRLA T PA A A  AVVV FRVDPSLS + IP+TKS+  SSDT TYP KHFRS FRNYYSNSD TFSDSD NGDYSDA
Subjt:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPSTKSSDPSSDT-TYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDA

Query:  SEPETILED-GGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY------------------
        SE ET  +D GG+SI+IEKLG+NSRRIYSRIGIDAPLQAVWNILTDY RLADFIPGLALSQIL+K GNH RLFQV   +L +                  
Subjt:  SEPETILED-GGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY------------------

Query:  ----------------------------------RPFPSSDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST
                                                +IH+IL YRVDVKPKLLLPV+LLEGRLC EIK NL+CIREEV+K  S+T
Subjt:  ----------------------------------RPFPSSDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST

TrEMBL top hitse value%identityAlignment
A0A0A0KCX4 Polyketide_cyc domain-containing protein4.6e-6856.23Show/hide
Query:  MLSLLNSSEPSHSSGALSCFLS-----RLAPTLPAIASA--AVVVKFRVDPSLSCVAIPSTKSSD---PSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDV
        MLS LNSSEPS SS + S  L+     RLAPT PA  SA  AVV  FRV PSLS +AI +TK +      S TTYP KHFRSRFRNYYSNS+PTFSD D 
Subjt:  MLSLLNSSEPSHSSGALSCFLS-----RLAPTLPAIASA--AVVVKFRVDPSLSCVAIPSTKSSD---PSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDV

Query:  NGDYSDASEPETILED-GGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY-----------
        NGDYSD S+ ETI +D GG+SI+IEKLG NSRRIYSRIGIDAPLQAVWNILTDY RLADFIPGLA+SQIL+K  NH RLFQV   +L +           
Subjt:  NGDYSDASEPETILED-GGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY-----------

Query:  ------------------------------------------RPFPSSDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST
                                                    F   +IH+ L Y VDVKPKLLLPV+LLEGRLC EIK NL+CIREEV+K  S+T
Subjt:  ------------------------------------------RPFPSSDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST

A0A1S3C7G7 uncharacterized protein LOC1034977437.1e-6956.9Show/hide
Query:  MLSLLNSSEPSHSSGALSCFL-----SRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPSTKSSD-PS--SDTTYPAKHFRSRFRNYYSNSDPTFSDSDV
        M S LNSSEP++SS + S  L     SRL+ T PA +SA  AVV  FRV PSLS +AI +TKS+  PS  S TTYP KHFRSRFRNYYSNS+PTFSDSD 
Subjt:  MLSLLNSSEPSHSSGALSCFL-----SRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPSTKSSD-PS--SDTTYPAKHFRSRFRNYYSNSDPTFSDSDV

Query:  NGDYSDASEPETILED-GGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY-----------
        NGDYSD S+ ETI +D GG+ I+IEKLG NSRRIYSRIGIDAPLQAVWNILTDY RLADFIPGLA+SQIL+K GNHARLFQV   +L +           
Subjt:  NGDYSDASEPETILED-GGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY-----------

Query:  ------------------------------------------RPFPSSDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST
                                                    F   ++H+ L Y VDVKPKLLLPV+LLEGRLC EIK NLMCIREEV+K  S+T
Subjt:  ------------------------------------------RPFPSSDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST

A0A6J1H3U6 uncharacterized protein LOC111460246 isoform X23.1e-7257.77Show/hide
Query:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS
        MLS LNSS+P++SS  +SC  SRL PT PA ASA  AV V FR DPSLS VA+ S   TKSS+  SD+TY AKHFRSRF  YYSNSDP FSD+D N DYS
Subjt:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS

Query:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------
        DASE ETI E DGGVSI+IEKLGNNSRRIYSRIGIDA LQAVWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   +L +                
Subjt:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------

Query:  -RPFPS----------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST
            PS                                         + +TIL YRVDVKPKL+LP++L+EGRLCDEIK+NLMCIREE +K  S+T
Subjt:  -RPFPS----------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSST

A0A6J1H450 uncharacterized protein LOC111460246 isoform X17.5e-7155.52Show/hide
Query:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS
        MLS LNSS+P++SS  +SC  SRL PT PA ASA  AV V FR DPSLS VA+ S   TKSS+  SD+TY AKHFRSRF  YYSNSDP FSD+D N DYS
Subjt:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS

Query:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------
        DASE ETI E DGGVSI+IEKLGNNSRRIYSRIGIDA LQAVWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   +L +                
Subjt:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------

Query:  -RPFPS----------------------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREE
            PS                                                     + +TIL YRVDVKPKL+LP++L+EGRLCDEIK+NLMCIREE
Subjt:  -RPFPS----------------------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREE

Query:  VYKARSST
         +K  S+T
Subjt:  VYKARSST

A0A6J1L5G6 uncharacterized protein LOC1114992961.9e-7458.25Show/hide
Query:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS
        MLS LNSS+P++SS  +SC  SRL PT PA ASA  AVVV FR DPSLS VA+ S   TKSS+  SD+TYPAK+FRSRFR YYSNSDPTFSD+D N +YS
Subjt:  MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYS

Query:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------
        DASE ETI E DGGVSI+IEKLGNNSRRIYSRIGID  LQ VWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   +L +                
Subjt:  DASEPETILE-DGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLY----------------

Query:  -RPFPS----------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSSTC
            PS                                         +++TIL YRVDVKPKL+LPV+L+EGRLCDEIK+NLMCIREE +K  S+TC
Subjt:  -RPFPS----------------------------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSSTC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G01650.1 Polyketide cyclase / dehydrase and lipid transport protein3.6e-2534.18Show/hide
Query:  SSDPSSDTTYPAKH-FRSRFRN----YYSNSDPTFSDSDVNGDY--SDASEPETILEDGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLAD
        S  PSS     ++  F  RF +    + SN D T +++D   DY  +D    E ++ D GV I+++KL  +SRRI S+IG++A L +VW++LTDY +L+D
Subjt:  SSDPSSDTTYPAKH-FRSRFRN----YYSNSDPTFSDSDVNGDY--SDASEPETILEDGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLAD

Query:  FIPGLALSQILYKTGNHARLFQVLFSSLL-------------------------------------YRPFPS--------------------SDIHTILL
        FIPGL +S+++ K GN  RLFQ+   +L                                      ++ F                       D  T L 
Subjt:  FIPGLALSQILYKTGNHARLFQVLFSSLL-------------------------------------YRPFPS--------------------SDIHTILL

Query:  YRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYK
        Y VDVKPK+ LPV+L+EGRLC EI+ NLM IR+   K
Subjt:  YRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYK

AT4G01650.2 Polyketide cyclase / dehydrase and lipid transport protein1.2e-2335.26Show/hide
Query:  DASEPETILEDGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLL------------------
        D    E ++ D GV I+++KL  +SRRI S+IG++A L +VW++LTDY +L+DFIPGL +S+++ K GN  RLFQ+   +L                   
Subjt:  DASEPETILEDGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLL------------------

Query:  -------------------YRPFPS--------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYK
                           ++ F                       D  T L Y VDVKPK+ LPV+L+EGRLC EI+ NLM IR+   K
Subjt:  -------------------YRPFPS--------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYK

AT5G08720.1 CONTAINS InterPro DOMAIN/s: Streptomyces cyclase/dehydrase (InterPro:IPR005031)9.9e-0747.92Show/hide
Query:  VSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALS
        V  +++ +    RRI   I +D+  Q+VWN+LTDY RLADFIP L  S
Subjt:  VSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTCCCTTCTTAATTCATCAGAACCGTCGCATTCTTCTGGAGCGCTCTCCTGCTTCCTTTCCCGTTTAGCTCCGACATTGCCGGCCATCGCCTCAGCCGCCGTAGT
AGTTAAGTTTAGAGTCGACCCTTCTCTCTCATGTGTCGCCATCCCCAGCACCAAATCAAGTGACCCTTCCTCAGATACTACTTATCCTGCCAAACATTTTCGGTCCAGGT
TCCGAAACTATTATTCGAATTCCGACCCCACGTTCTCAGATAGTGATGTCAATGGCGATTACTCTGACGCATCAGAGCCAGAAACCATATTGGAAGACGGTGGTGTAAGC
ATCAAAATCGAGAAGTTGGGAAACAACTCTCGCAGAATTTACTCGAGAATTGGTATTGACGCCCCACTTCAGGCCGTGTGGAACATCTTGACAGATTATGGTAGACTGGC
AGATTTCATACCCGGTCTTGCTCTCAGCCAAATACTCTATAAGACTGGCAACCATGCCCGACTCTTTCAGGTACTTTTTTCTTCTCTTCTTTACCGCCCCTTTCCCTCTT
CTGATATACATACAATTCTATTGTATAGGGTTGATGTAAAGCCAAAACTTCTGTTGCCCGTTCAGCTTCTTGAGGGTAGGCTTTGTGATGAGATAAAGGTGAACCTAATG
TGTATTCGAGAAGAAGTATATAAAGCTCGCTCAAGCACCTGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTCCCTTCTTAATTCATCAGAACCGTCGCATTCTTCTGGAGCGCTCTCCTGCTTCCTTTCCCGTTTAGCTCCGACATTGCCGGCCATCGCCTCAGCCGCCGTAGT
AGTTAAGTTTAGAGTCGACCCTTCTCTCTCATGTGTCGCCATCCCCAGCACCAAATCAAGTGACCCTTCCTCAGATACTACTTATCCTGCCAAACATTTTCGGTCCAGGT
TCCGAAACTATTATTCGAATTCCGACCCCACGTTCTCAGATAGTGATGTCAATGGCGATTACTCTGACGCATCAGAGCCAGAAACCATATTGGAAGACGGTGGTGTAAGC
ATCAAAATCGAGAAGTTGGGAAACAACTCTCGCAGAATTTACTCGAGAATTGGTATTGACGCCCCACTTCAGGCCGTGTGGAACATCTTGACAGATTATGGTAGACTGGC
AGATTTCATACCCGGTCTTGCTCTCAGCCAAATACTCTATAAGACTGGCAACCATGCCCGACTCTTTCAGGTACTTTTTTCTTCTCTTCTTTACCGCCCCTTTCCCTCTT
CTGATATACATACAATTCTATTGTATAGGGTTGATGTAAAGCCAAAACTTCTGTTGCCCGTTCAGCTTCTTGAGGGTAGGCTTTGTGATGAGATAAAGGTGAACCTAATG
TGTATTCGAGAAGAAGTATATAAAGCTCGCTCAAGCACCTGCTAA
Protein sequenceShow/hide protein sequence
MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASAAVVVKFRVDPSLSCVAIPSTKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDASEPETILEDGGVS
IKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLYRPFPSSDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLM
CIREEVYKARSSTC