CuGenDBv2

Tan0013059 (gene) of Snake gourd v1 genome

Gene ID: Tan0013059
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Endoglucanase
Genome location: LG02:7132752..7135649
RNA-Seq Expression: Tan0013059
Synteny: Tan0013059
Gene Ontology terms: NA
InterPro domains: NA
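
The genome location field packs a sequence name and a coordinate range into one string. A minimal sketch of splitting it and computing the span is given here. The helper names `parse_location` and `span_length` are hypothetical, and the sketch assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end = parse_location("LG02:7132752..7135649")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2898 bp on LG02.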


Homology
GenBank top hits (e value, % identity, alignment)
KAG7022261.1 hypothetical protein SDJN02_15992 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 3.0e-114; % identity: 72.58
Query:  MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTF
        MSM+EGS SSHL++G ESK+Q+D LTPEDIAW DSCLIK++PD  DGNWN IKDALLEVLDLYPQGFES LAVS  VPGG NDDIDVD+L F NVK+PT 
Subjt:  MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTF

Query:  PSRDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINT
          RD+DDPM             NDVD SL LSFS NPLLPTYKEEDNV G T+D  DIDIDMLLSNNVK+PTF S+DSDD MNETG+ASEDQQ+HD+I+T
Subjt:  PSRDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINT

Query:  SLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISS
        SLSLS NKNPFLPTYKEE +GKE+IQT SSHDL EIG EPPINDIF+VWDLNLPP+E++LV+QLNKALSENS ESV   DSN   ++D  + LLD LI S
Subjt:  SLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISS

Query:  ISGLSLEQNK
        IS LSLE  K
Subjt:  ISGLSLEQNK

XP_022925089.1 uncharacterized protein LOC111432435 isoform X1 [Cucurbita moschata]; e value: 3.0e-114; % identity: 72.58
Query:  MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTF
        MSM+EGS SSHL++G ESK+Q+D LTPEDIAW DSCLIK++PD  DGNWN IKDALLEVLDLYPQGFES LAVS  VPGG NDDIDVD+L F NVK+PT 
Subjt:  MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTF

Query:  PSRDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINT
          RD+DDPM             NDVD SL LSFS NPLLPTYKEEDNV G T+D  DIDIDMLLSNNVK+PTF S+DSDD MNETG+ASEDQQ+HD+I+T
Subjt:  PSRDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINT

Query:  SLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISS
        SLSLS NKNPFLPTYKEE +GKE+IQT SSHDL EIG EPPINDIF+VWDLNLPP+E++LV+QLNKALSENS ESV   DSN   ++D  + LLD LI S
Subjt:  SLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISS

Query:  ISGLSLEQNK
        IS LSLE  K
Subjt:  ISGLSLEQNK

XP_022925097.1 uncharacterized protein LOC111432435 isoform X2 [Cucurbita moschata]; e value: 2.5e-113; % identity: 72.4
Query:  MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPS
        M+EGS SSHL++G ESK+Q+D LTPEDIAW DSCLIK++PD  DGNWN IKDALLEVLDLYPQGFES LAVS  VPGG NDDIDVD+L F NVK+PT   
Subjt:  MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPS

Query:  RDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSL
        RD+DDPM             NDVD SL LSFS NPLLPTYKEEDNV G T+D  DIDIDMLLSNNVK+PTF S+DSDD MNETG+ASEDQQ+HD+I+TSL
Subjt:  RDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSL

Query:  SLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSIS
        SLS NKNPFLPTYKEE +GKE+IQT SSHDL EIG EPPINDIF+VWDLNLPP+E++LV+QLNKALSENS ESV   DSN   ++D  + LLD LI SIS
Subjt:  SLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSIS

Query:  GLSLEQNK
         LSLE  K
Subjt:  GLSLEQNK

XP_022971265.1 uncharacterized protein LOC111470037 [Cucurbita maxima]; e value: 1.8e-114; % identity: 73.2
Query:  MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPS
        M+EGS SS L++G ESK+Q+D LTPEDIAW DSCLIKE+PD  DGNWN IKDALLEV DLYPQGFES LAVS NVPGGTNDD+DVDML F NVK+ T   
Subjt:  MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPS

Query:  RDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSLSL
        RD+DDPM             NDVD SL LSFS NPLLPTYKEEDNV GGT+DDI+I+MLLSNNVK+PTF SRDSDD MNETG+ASEDQQ+HD+I+TSLSL
Subjt:  RDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSLSL

Query:  S-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISGL
        S NKNPFLPTYKEE +GKE+IQT SSHDL EIG EPPINDIF+VWDLNLPP+E++LV+QLNKALSENS ESV   DSN S L+D  + LLD LI SIS L
Subjt:  S-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISGL

Query:  SLEQNK
        SL   K
Subjt:  SLEQNK

XP_023529376.1 uncharacterized protein LOC111792251 [Cucurbita pepo subsp. pepo]; e value: 9.7e-113; % identity: 72.26
Query:  MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTF
        MSM+EGS SSHL++G ESK+Q+D LTPEDIAW DSCLIKE+PD  DGNWN IKDALLEVLDLYPQGFES LAVS  VPGG ND IDVDML F NVK+PT 
Subjt:  MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTF

Query:  PSRDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINT
          RD+DDPM             NDVD SL LSFS NPLLP YKEEDNV GGT+D  DIDIDMLLSNNVK+PTF ++DSDD MNETG+ASEDQQ+HD+I+T
Subjt:  PSRDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINT

Query:  SLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISS
        SLSLS NKNPFLPTYKEE + KE+IQT SSHDL EIG EPPINDIF+VWDLNLPP+E++LV+QLNKALSENS ESV   DSN   ++D  + LLD LI S
Subjt:  SLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISS

Query:  ISGLSLEQNK
        IS LSLE  K
Subjt:  ISGLSLEQNK

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BBI4 uncharacterized protein LOC103488123; e value: 4.3e-90; % identity: 62.87
Query:  MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPS
        MVEGSISSHLD GPESK+QVD LT EDIAWVDSCLIKE+PD SDGNWN +KDALLE+LDLYPQGFESSLA+S NVPG +N DIDVDML  NNVKEPTF S
Subjt:  MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPS

Query:  RDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSLSL
        RD+DD MNET TA E                                                           D PMN+TGIASED Q HD+I+TSL L
Subjt:  RDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSLSL

Query:  S-NKNPFLPTYKEEAEGK-ETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISG
        +  KNPFLPTYKEE EG  E  Q G  H+LSEIGSE PINDIF VWDLN PPVEDEL++QLNKAL+ENS ESVPSMDSNL  L+DL+E LLDDLI+SIS 
Subjt:  S-NKNPFLPTYKEEAEGK-ETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISG

Query:  LSLEQNK
        LSLEQ K
Subjt:  LSLEQNK

A0A5D3BCZ2 Uncharacterized protein; e value: 4.3e-90; % identity: 62.87
Query:  MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPS
        MVEGSISSHLD GPESK+QVD LT EDIAWVDSCLIKE+PD SDGNWN +KDALLE+LDLYPQGFESSLA+S NVPG +N DIDVDML  NNVKEPTF S
Subjt:  MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPS

Query:  RDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSLSL
        RD+DD MNET TA E                                                           D PMN+TGIASED Q HD+I+TSL L
Subjt:  RDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSLSL

Query:  S-NKNPFLPTYKEEAEGK-ETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISG
        +  KNPFLPTYKEE EG  E  Q G  H+LSEIGSE PINDIF VWDLN PPVEDEL++QLNKAL+ENS ESVPSMDSNL  L+DL+E LLDDLI+SIS 
Subjt:  S-NKNPFLPTYKEEAEGK-ETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISG

Query:  LSLEQNK
        LSLEQ K
Subjt:  LSLEQNK

A0A6J1EAW3 uncharacterized protein LOC111432435 isoform X2; e value: 1.2e-113; % identity: 72.4
Query:  MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPS
        M+EGS SSHL++G ESK+Q+D LTPEDIAW DSCLIK++PD  DGNWN IKDALLEVLDLYPQGFES LAVS  VPGG NDDIDVD+L F NVK+PT   
Subjt:  MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPS

Query:  RDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSL
        RD+DDPM             NDVD SL LSFS NPLLPTYKEEDNV G T+D  DIDIDMLLSNNVK+PTF S+DSDD MNETG+ASEDQQ+HD+I+TSL
Subjt:  RDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSL

Query:  SLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSIS
        SLS NKNPFLPTYKEE +GKE+IQT SSHDL EIG EPPINDIF+VWDLNLPP+E++LV+QLNKALSENS ESV   DSN   ++D  + LLD LI SIS
Subjt:  SLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSIS

Query:  GLSLEQNK
         LSLE  K
Subjt:  GLSLEQNK

A0A6J1EB39 uncharacterized protein LOC111432435 isoform X1; e value: 1.5e-114; % identity: 72.58
Query:  MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTF
        MSM+EGS SSHL++G ESK+Q+D LTPEDIAW DSCLIK++PD  DGNWN IKDALLEVLDLYPQGFES LAVS  VPGG NDDIDVD+L F NVK+PT 
Subjt:  MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTF

Query:  PSRDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINT
          RD+DDPM             NDVD SL LSFS NPLLPTYKEEDNV G T+D  DIDIDMLLSNNVK+PTF S+DSDD MNETG+ASEDQQ+HD+I+T
Subjt:  PSRDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINT

Query:  SLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISS
        SLSLS NKNPFLPTYKEE +GKE+IQT SSHDL EIG EPPINDIF+VWDLNLPP+E++LV+QLNKALSENS ESV   DSN   ++D  + LLD LI S
Subjt:  SLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISS

Query:  ISGLSLEQNK
        IS LSLE  K
Subjt:  ISGLSLEQNK

A0A6J1I1I4 uncharacterized protein LOC111470037; e value: 8.5e-115; % identity: 73.2
Query:  MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPS
        M+EGS SS L++G ESK+Q+D LTPEDIAW DSCLIKE+PD  DGNWN IKDALLEV DLYPQGFES LAVS NVPGGTNDD+DVDML F NVK+ T   
Subjt:  MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPS

Query:  RDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSLSL
        RD+DDPM             NDVD SL LSFS NPLLPTYKEEDNV GGT+DDI+I+MLLSNNVK+PTF SRDSDD MNETG+ASEDQQ+HD+I+TSLSL
Subjt:  RDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSLSL

Query:  S-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISGL
        S NKNPFLPTYKEE +GKE+IQT SSHDL EIG EPPINDIF+VWDLNLPP+E++LV+QLNKALSENS ESV   DSN S L+D  + LLD LI SIS L
Subjt:  S-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISGL

Query:  SLEQNK
        SL   K
Subjt:  SLEQNK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G38980.1 unknown protein; e value: 3.5e-12; % identity: 26.74
Query:  LTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQNDND
        L+PE +AW DSC+I  + D+ + NW   +DAL E++D++P+ F  S         GT   +  D ++    +  T   R   +P   +  +  +  N+  
Subjt:  LTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQNDND

Query:  VDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSR---DSDDPMNETGIASEDQQNHDEINTSLSLSNKNPFLPTYKEEAEGKET
         ++   L+F  +P         N L    D    + +  N  +EP   S+      + + E G  S  +   ++  +  S   K+ F+ TY E+      
Subjt:  VDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSR---DSDDPMNETGIASEDQQNHDEINTSLSLSNKNPFLPTYKEEAEGKET

Query:  IQTGSSHDLSEIGSEPPINDIFRVWDLNL---PPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYL-LDDLISSISGLSLEQ
               +++E   +    +IF+VWDL +      ED LV QL KAL E+S  +V  +   L+  + + E   +DDLIS IS LSL +
Subjt:  IQTGSSHDLSEIGSEPPINDIFRVWDLNL---PPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYL-LDDLISSISGLSLEQ


Sequences
CDS sequence
ATGAGCATGGTCGAAGGATCTATTTCTTCTCATCTTGACAATGGACCAGAATCTAAAGATCAAGTCGATGCTCTTACTCCTGAAGACATTGCTTGGGTTGATTCTTGTCT
GATTAAAGAGGTACCAGATACTTCAGATGGCAATTGGAACCAGATAAAGGATGCCTTGTTAGAAGTCCTTGATCTGTATCCTCAAGGTTTTGAATCTTCTCTTGCTGTAA
GTGGTAATGTTCCAGGAGGTACTAACGATGATATCGACGTTGACATGCTTCTCTTTAATAATGTGAAGGAGCCTACATTTCCCTCAAGAGATACCGATGATCCTATGAAT
GAAACAGGAACAGCTTCAGAAGATCCTCAAAACGATAATGATGTTGATTTGTCTCTGCCGTTATCTTTCAGCAAGAATCCACTTTTACCCACTTACAAAGAGGAGGATAA
TGTTCTGGGAGGTACTAATGATGATATCGACATCGACATGCTTCTCTCTAATAATGTGAAGGAGCCTACATTTCTCTCGAGAGATAGCGATGACCCTATGAATGAAACAG
GAATAGCTTCGGAAGATCAACAAAACCACGATGAGATCAATACTTCTCTGTCGCTATCTAACAAGAATCCATTTTTACCTACTTACAAAGAGGAGGCAGAAGGGAAGGAG
ACCATTCAAACTGGATCTAGCCATGATTTATCAGAAATTGGATCTGAGCCCCCAATCAATGATATTTTCCGGGTCTGGGATTTGAACCTTCCTCCAGTCGAAGACGAGCT
TGTCAAGCAGCTGAACAAAGCCCTTTCTGAAAATTCTGCTGAATCAGTCCCTTCAATGGATAGTAATCTCAGTTCGTTGGAAGACTTACAGGAATATTTACTTGATGACT
TGATCAGTAGCATTTCTGGCCTGTCTTTGGAACAGAATAAATAA
mRNA sequence
GTACATCCGGCCTTCCAATTTAAACGAGAGTTCTTCAACCGTCATCTCAGTTTCTCCTGGCAATAACAAGCAACCCCTCCGGATTCTCTCATCTCCAATCTCCAGCCAAA
TTAGGGTTTCAATTCTATGAGCATGGTCGAAGGATCTATTTCTTCTCATCTTGACAATGGACCAGAATCTAAAGATCAAGTCGATGCTCTTACTCCTGAAGACATTGCTT
GGGTTGATTCTTGTCTGATTAAAGAGGTACCAGATACTTCAGATGGCAATTGGAACCAGATAAAGGATGCCTTGTTAGAAGTCCTTGATCTGTATCCTCAAGGTTTTGAA
TCTTCTCTTGCTGTAAGTGGTAATGTTCCAGGAGGTACTAACGATGATATCGACGTTGACATGCTTCTCTTTAATAATGTGAAGGAGCCTACATTTCCCTCAAGAGATAC
CGATGATCCTATGAATGAAACAGGAACAGCTTCAGAAGATCCTCAAAACGATAATGATGTTGATTTGTCTCTGCCGTTATCTTTCAGCAAGAATCCACTTTTACCCACTT
ACAAAGAGGAGGATAATGTTCTGGGAGGTACTAATGATGATATCGACATCGACATGCTTCTCTCTAATAATGTGAAGGAGCCTACATTTCTCTCGAGAGATAGCGATGAC
CCTATGAATGAAACAGGAATAGCTTCGGAAGATCAACAAAACCACGATGAGATCAATACTTCTCTGTCGCTATCTAACAAGAATCCATTTTTACCTACTTACAAAGAGGA
GGCAGAAGGGAAGGAGACCATTCAAACTGGATCTAGCCATGATTTATCAGAAATTGGATCTGAGCCCCCAATCAATGATATTTTCCGGGTCTGGGATTTGAACCTTCCTC
CAGTCGAAGACGAGCTTGTCAAGCAGCTGAACAAAGCCCTTTCTGAAAATTCTGCTGAATCAGTCCCTTCAATGGATAGTAATCTCAGTTCGTTGGAAGACTTACAGGAA
TATTTACTTGATGACTTGATCAGTAGCATTTCTGGCCTGTCTTTGGAACAGAATAAATAATAGGGACATAGTTGTTCTGGAGCCACTGATTTCTGAAAGTTCATGCTGCT
GACACGTGGCACTCTGTTATTCGATAACTTCAAATGTCTCACATTGGAATAGTGGTGATTTTAAAGACGCTACTGAGAGGCTTAGATGATATAGTTCAACGGTATGTTTA
TTTTTTCTGGTGCAAACTAGTGGCATATGTGAATACCTTTATCTCAAAATCTTATTTGACAGGTTTGGTTAATGATGGCGTCTTAGTAATTTTCTTTTTCTTTTGGGAAC
ATGTCTTAGTAATTAGTATATCTAAGAGGTTGTGATGTTATGGAGGGTGGGATGTTGTTTAAAGTTTAATTGTATTTTTG
Protein sequence
MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMN
ETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSLSLSNKNPFLPTYKEEAEGKE
TIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISGLSLEQNK
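
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code. The `translate` helper is hypothetical, and the two input strings are short fragments copied from the start and the end of the CDS above rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGCATGGTCGAAGGATCT"))  # first seven codons of the CDS
print(translate("GAACAGAATAAATAA"))        # last five codons, including the stop
```

The two fragments translate to MSMVEGS and EQNK, matching the first and last residues of the protein sequence listed above.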