; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021066 (gene) of Snake gourd v1 genome

Gene IDTan0021066
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG09:70254068..70255334
RNA-Seq ExpressionTan0021066
SyntenyTan0021066
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587646.1 hypothetical protein SDJN03_16211, partial [Cucurbita argyrosperma subsp. sororia]5.4e-8978.22Show/hide
Query:  MEACIDSRKRVRDESNDSLFNFIGSKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRES
        ME CIDSRKRVRDESN+SLFNF+GSKILR DSAE NFISPD ++DAP+ SV+SDA+SI SKQ G IH  DSGLDSFQ   IQEDLLKILDEAD S DRE 
Subjt:  MEACIDSRKRVRDESNDSLFNFIGSKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRES

Query:  AIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVAL
        AI DLDSVI SFEKEI VP P+VQPELGYLLEASDDELGLPPA  KGE+E VNF  ++SGS GMKGFLGFEDE VPNYCWLENLSSE E N+ +EEV AL
Subjt:  AIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVAL

Query:  GGLFDH-TDGTAELPPYRSETMYCL
        GGL DH TDG  ELPPYRSETM+CL
Subjt:  GGLFDH-TDGTAELPPYRSETMYCL

KAG6589552.1 hypothetical protein SDJN03_14975, partial [Cucurbita argyrosperma subsp. sororia]5.1e-7974.01Show/hide
Query:  MEACIDSRKRVRDESNDSLFNFIG--SKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDR
        MEAC+DSRKR+RDESNDSLFNFIG  SK +RLDSA L+      V+DAPI SV+SDAKSIDS          SGLDSFQA+ IQ+DLLKILD+ DA  DR
Subjt:  MEACIDSRKRVRDESNDSLFNFIG--SKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDR

Query:  ESAIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESN-QGDEEV
        E  IQDLDSVIRSFEKEIQVP PSVQPELG+LLEASDDELGLPPAG+K E EAVNFAA+F GSGGMKG LG EDE VPNYCWLENL SENE N + +EEV
Subjt:  ESAIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESN-QGDEEV

Query:  VALGGLFDHTDGTAELPPYRSETMYCL
        V LGGLFDHTD   EL  YRSETM CL
Subjt:  VALGGLFDHTDGTAELPPYRSETMYCL

KAG7021606.1 hypothetical protein SDJN02_15332, partial [Cucurbita argyrosperma subsp. argyrosperma]1.6e-8877.78Show/hide
Query:  MEACIDSRKRVRDESNDSLFNFIGSKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRES
        ME CIDSRKRVRDESN+SLFNF+GSKILR DSAE NFISPD ++DAP+ SV+SDA+SI SKQ G IH  DSGLDSFQ   IQEDLLKIL+EAD S DRE 
Subjt:  MEACIDSRKRVRDESNDSLFNFIGSKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRES

Query:  AIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVAL
        AI DLDSVI SFEKEI VP P+VQPELGYLLEASDDELGLPPA  KGE+E VNF  ++SGS GMKGFLGFEDE VPNYCWLENLSSE E N+ +EEV AL
Subjt:  AIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVAL

Query:  GGLFDH-TDGTAELPPYRSETMYCL
        GGL DH TDG  ELPPYRSETM+CL
Subjt:  GGLFDH-TDGTAELPPYRSETMYCL

XP_023516749.1 uncharacterized protein LOC111780554 [Cucurbita pepo subsp. pepo]8.7e-7974.01Show/hide
Query:  MEACIDSRKRVRDESNDSLFNFIG--SKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDR
        MEAC+DSRKR+RDESNDSLFNFIG  SK +RLDSA L+      V+DAP  SV+SDAKSIDS          SGLDSFQA+ IQ+DLLKILD+ DA  DR
Subjt:  MEACIDSRKRVRDESNDSLFNFIG--SKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDR

Query:  ESAIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENE-SNQGDEEV
        ES IQDLDSVIRSFEKEIQVP PSVQPELG+LLEASDDELGLPPAG+K E EAVNFAA+F GSGGMKG LG EDE VPNYCWLENL SENE S + +EEV
Subjt:  ESAIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENE-SNQGDEEV

Query:  VALGGLFDHTDGTAELPPYRSETMYCL
        V LGGLFDHTD   EL  YRSETM CL
Subjt:  VALGGLFDHTDGTAELPPYRSETMYCL

XP_023531746.1 uncharacterized protein LOC111793909 [Cucurbita pepo subsp. pepo]3.0e-8776.44Show/hide
Query:  MEACIDSRKRVRDESNDSLFNFIGSKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRES
        ME CIDSRKRVRDESN+SLFNF+GSKILR DSAE NFISPD  +DAP+ SV+SDA+SI SKQ G IH  DSGLDSFQ   I+EDLLKILDEAD S DRE 
Subjt:  MEACIDSRKRVRDESNDSLFNFIGSKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRES

Query:  AIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVAL
        AI DLDSVI SFEKEI VP P+VQPELGYLLEASDDELGLPPA  KGE+E VNF  ++SG  GMKGFLGFEDE VPNYCWLENLSSE E N+ ++EV AL
Subjt:  AIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVAL

Query:  GGLFDH-TDGTAELPPYRSETMYCL
        GGL DH TDG  E+PPYRSETM+CL
Subjt:  GGLFDH-TDGTAELPPYRSETMYCL

TrEMBL top hitse value%identityAlignment
A0A0A0LS21 Uncharacterized protein1.9e-5559.32Show/hide
Query:  MEACIDSRKRVRDESNDSLFNFIG--SKILRLDS-AELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQA-HAIQEDLLKILDEADAST
        ME  +DSRKR+RD+SNDSLFN IG  SK LRL++ A  NF       DAP+                         DSF + H IQEDLLKILD+ DAS 
Subjt:  MEACIDSRKRVRDESNDSLFNFIG--SKILRLDS-AELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQA-HAIQEDLLKILDEADAST

Query:  DRESAIQDLDSVIRSFEKEIQVP----APSVQPELGYLLEASDDELGLPP-AGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENE--
        DRE+ IQDLDSVIRSFEKEI+VP     P VQPELG+LLEASDDELGLPP AG+K E+E     A+FSGSGG+KG LGFEDE+V NYCW +NL  E +  
Subjt:  DRESAIQDLDSVIRSFEKEIQVP----APSVQPELGYLLEASDDELGLPP-AGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENE--

Query:  SNQGDEEVVALGGLFDHTDGTAELP-PYRSETMYCL
        S + +EEVVALGGLFDHTD  AELP  YRSE M CL
Subjt:  SNQGDEEVVALGGLFDHTDGTAELP-PYRSETMYCL

A0A1S3BWV1 uncharacterized protein LOC1034943333.7e-5960.78Show/hide
Query:  MEACIDSRKRVRDESN-DSLFNFIG--SKILRLDS-AELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQA-HAIQEDLLKILDEADAS
        ME C+D+RKR+RD+SN DSLFN IG  SK LRL++ A+ NF       DAP+                         DSFQ+ H IQEDLLKILD+ DAS
Subjt:  MEACIDSRKRVRDESN-DSLFNFIG--SKILRLDS-AELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQA-HAIQEDLLKILDEADAS

Query:  TDRESAIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENES--NQG
         DRE+AIQDLDSVIRSFEKEI+VP P VQPELG+LLEASDDELGLPPAG+K E+E     A+FSGSGG+KG LGFEDE+V NYCW +NL  E +    + 
Subjt:  TDRESAIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENES--NQG

Query:  DEEVVALGGLFDHTDGTAELP-PYRSETMYCL
        +EEVVALGGLFDHTD  AELP  YRSE M CL
Subjt:  DEEVVALGGLFDHTDGTAELP-PYRSETMYCL

A0A5A7UZW3 Uncharacterized protein3.7e-5960.78Show/hide
Query:  MEACIDSRKRVRDESN-DSLFNFIG--SKILRLDS-AELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQA-HAIQEDLLKILDEADAS
        ME C+D+RKR+RD+SN DSLFN IG  SK LRL++ A+ NF       DAP+                         DSFQ+ H IQEDLLKILD+ DAS
Subjt:  MEACIDSRKRVRDESN-DSLFNFIG--SKILRLDS-AELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQA-HAIQEDLLKILDEADAS

Query:  TDRESAIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENES--NQG
         DRE+AIQDLDSVIRSFEKEI+VP P VQPELG+LLEASDDELGLPPAG+K E+E     A+FSGSGG+KG LGFEDE+V NYCW +NL  E +    + 
Subjt:  TDRESAIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENES--NQG

Query:  DEEVVALGGLFDHTDGTAELP-PYRSETMYCL
        +EEVVALGGLFDHTD  AELP  YRSE M CL
Subjt:  DEEVVALGGLFDHTDGTAELP-PYRSETMYCL

A0A5D3DXI3 Uncharacterized protein1.3e-5961.21Show/hide
Query:  MEACIDSRKRVRDESN-DSLFNFIG--SKILRLDS-AELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQA-HAIQEDLLKILDEADAS
        ME C+D+RKR+RD+SN DSLFN IG  SK LRL++ A+ NF       DAP+                         DSFQ+ H IQEDLLKILD+ DAS
Subjt:  MEACIDSRKRVRDESN-DSLFNFIG--SKILRLDS-AELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQA-HAIQEDLLKILDEADAS

Query:  TDRESAIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENE--SNQG
         DRE+AIQDLDSVIRSFEKEI+VP P VQPELG+LLEASDDELGLPPAG+K E+E     A+FSGSGG+KG LGFEDE+V NYCW +NL  E +  S + 
Subjt:  TDRESAIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENE--SNQG

Query:  DEEVVALGGLFDHTDGTAELP-PYRSETMYCL
        +EEVVALGGLFDHTD  AELP  YRSE M CL
Subjt:  DEEVVALGGLFDHTDGTAELP-PYRSETMYCL

A0A6J1E5H5 uncharacterized protein LOC1114296743.7e-7571.37Show/hide
Query:  MEACIDSRKRVRDESNDSLFNFIG--SKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDR
        MEAC+DSRKR+RDESNDSLFNFIG  SK +RLDSA L+      V+DAPI SV+SDAKSI               DSFQA+ IQ+DLLKILD+ DA  DR
Subjt:  MEACIDSRKRVRDESNDSLFNFIG--SKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDR

Query:  ESAIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESN-QGDEEV
        ES IQDLDSVIRSFEKEIQVP PS QPELG+LLEASDDELGLPPAG+K E EAVNFAA+F GSG MKG LG EDE VPNYCWLENL SENE N + +EEV
Subjt:  ESAIQDLDSVIRSFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESN-QGDEEV

Query:  VALGGLFDHTDGTAELPPYRSETMYCL
        V LGGLFDHTD   EL  YRSETM CL
Subjt:  VALGGLFDHTDGTAELPPYRSETMYCL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13360.1 unknown protein5.5e-1534.57Show/hide
Query:  SIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRESAIQDLDSVIRSFEKEIQV----------PAPSVQPELGYLLEASDDELGLPP----
        S + K++      ++ LDS +   +++DL  +LD++D     E   QDLDSV++SFE E+             A   QP+LGYLLEASDDELGLPP    
Subjt:  SIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRESAIQDLDSVIRSFEKEIQV----------PAPSVQPELGYLLEASDDELGLPP----

Query:  -----AGQKGEVEAV-NFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVALGGLFDHTD---GTAELPPYRSETM
             A ++   E V +     S S G+    GFED  V NY  L+  S   +      + VA+ GLF+ +D    + +L  +RSE++
Subjt:  -----AGQKGEVEAV-NFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVALGGLFDHTD---GTAELPPYRSETM

AT1G13360.2 unknown protein5.2e-1335.09Show/hide
Query:  SIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRESAIQDLDSVIRSFEKEIQV----------PAPSVQPELGYLLEASDDELGLPP----
        S + K++      ++ LDS +   +++DL  +LD++D     E   QDLDSV++SFE E+             A   QP+LGYLLEASDDELGLPP    
Subjt:  SIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRESAIQDLDSVIRSFEKEIQV----------PAPSVQPELGYLLEASDDELGLPP----

Query:  -----AGQKGEVEAV-NFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVALGGLFDHT
             A ++   E V +     S S G+    GFED  V NY  L+  S   +      + VA+ G F +T
Subjt:  -----AGQKGEVEAV-NFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVALGGLFDHT

AT1G13360.3 unknown protein2.6e-1235.09Show/hide
Query:  SIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRESAIQDLDSVIRSFEKEIQV----------PAPSVQPELGYLLEASDDELGLPP----
        S + K++      ++ LDS +   +++DL  +LD++D     E   QDLDSV++SFE E+             A   QP+LGYLLEASDDELGLPP    
Subjt:  SIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRESAIQDLDSVIRSFEKEIQV----------PAPSVQPELGYLLEASDDELGLPP----

Query:  -----AGQKGEVEAV-NFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVALGGLFDHT
             A ++   E V +     S S G+    GFED  V NY  L+  S   +   G + V   G  F H+
Subjt:  -----AGQKGEVEAV-NFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVALGGLFDHT

AT3G25870.1 unknown protein8.6e-0833.09Show/hide
Query:  DLLKILDEADASTDRESAIQDLDSVIRSFEKEIQVPAPSV-----QPELGYLLEASDDELGLPPAGQKGEV-------EAVNFAAQFSGSGGMKGFL-GF
        D+ ++ D+    +  +   QDLDSV++SFE E+     ++     QP+LGYL EASDDELGLPP     +        E V    + S      G L GF
Subjt:  DLLKILDEADASTDRESAIQDLDSVIRSFEKEIQVPAPSV-----QPELGYLLEASDDELGLPPAGQKGEV-------EAVNFAAQFSGSGGMKGFL-GF

Query:  EDEVVPNYCWLENLSSENESNQGDEEVVALGGLFDHTDG
        ED V          +     + GD+      GLF++ DG
Subjt:  EDEVVPNYCWLENLSSENESNQGDEEVVALGGLFDHTDG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCCTGCATTGACAGCAGGAAGCGAGTACGCGACGAATCCAATGATTCTCTATTCAATTTCATCGGATCCAAGATCCTCCGACTTGACTCGGCTGAATTGAACTT
CATCTCGCCGGATCATGTCAACGATGCACCGATTGGTTCCGTTGCGTCTGATGCAAAATCGATCGATTCGAAACAGATTGGAATAATCCACGATGGTGATTCAGGCCTGG
ACTCCTTTCAGGCACACGCAATTCAAGAAGACCTACTGAAGATTCTCGACGAAGCCGACGCTTCCACAGATCGCGAGTCGGCGATTCAAGATCTCGACTCGGTGATCAGA
AGCTTCGAGAAGGAAATTCAGGTGCCGGCACCTTCTGTTCAGCCTGAACTCGGATACCTTCTAGAAGCGTCGGACGACGAATTAGGGCTTCCACCGGCCGGCCAGAAAGG
GGAGGTCGAGGCCGTTAATTTTGCGGCGCAATTTTCAGGTAGCGGTGGTATGAAAGGGTTTTTAGGGTTTGAGGATGAAGTTGTTCCGAATTACTGTTGGCTGGAAAATT
TGAGCAGTGAGAATGAATCGAATCAGGGGGATGAAGAGGTGGTGGCGTTGGGTGGATTGTTCGATCATACGGACGGCACGGCGGAGTTGCCGCCGTATCGATCGGAGACG
ATGTACTGTTTATAA
mRNA sequenceShow/hide mRNA sequence
TGAAATTTGAAGTAAATAAAAGAATAATTGTACGCGCCAAAGTGCGTTTGAAGTGTACGAATTCCCCACTATTTAATCGCCAGATTTTGCTTCGTCCATTTCCCCGTCAC
GAACTCTCTCTCTCTCTCTCTTTGTTTCTTCTAGTTGATTTTGCGTTAATTTCATGCGATTTCCACACCAAATTTATTTCCTTCTCTCTCCCGATTCATCACTTTCAGTT
CTATTACTAATTGATTGCTGAACTCCTCTCCAATTCGACGGCGTTCTTCTTTTCCTCTGTTGTTCTGCAATGGAAGCCTGCATTGACAGCAGGAAGCGAGTACGCGACGA
ATCCAATGATTCTCTATTCAATTTCATCGGATCCAAGATCCTCCGACTTGACTCGGCTGAATTGAACTTCATCTCGCCGGATCATGTCAACGATGCACCGATTGGTTCCG
TTGCGTCTGATGCAAAATCGATCGATTCGAAACAGATTGGAATAATCCACGATGGTGATTCAGGCCTGGACTCCTTTCAGGCACACGCAATTCAAGAAGACCTACTGAAG
ATTCTCGACGAAGCCGACGCTTCCACAGATCGCGAGTCGGCGATTCAAGATCTCGACTCGGTGATCAGAAGCTTCGAGAAGGAAATTCAGGTGCCGGCACCTTCTGTTCA
GCCTGAACTCGGATACCTTCTAGAAGCGTCGGACGACGAATTAGGGCTTCCACCGGCCGGCCAGAAAGGGGAGGTCGAGGCCGTTAATTTTGCGGCGCAATTTTCAGGTA
GCGGTGGTATGAAAGGGTTTTTAGGGTTTGAGGATGAAGTTGTTCCGAATTACTGTTGGCTGGAAAATTTGAGCAGTGAGAATGAATCGAATCAGGGGGATGAAGAGGTG
GTGGCGTTGGGTGGATTGTTCGATCATACGGACGGCACGGCGGAGTTGCCGCCGTATCGATCGGAGACGATGTACTGTTTATAATATGATGTATTAATTGTTGTATTGGA
ACAGGAAGAAACGAAAAACCGCAAAAAGAAAAGAAAATTGAATGGGTGAATTCTCTTTGATGAAAAACAGAGGAAACAAAATGACTACAGAGAACTTGTGTTGATCTTGA
TCTATGTTTCACACTTCAATTGGCTCAGGAAAAATAAAGGCTTCCTTTATTTGTTTCCTTTGACAATTTTTTAATGGTTTTGGGAATTTCAAAAGAGTGTTCTAGTTTAT
TCAATTATTATTTAATAATATTATATAATTTATAATATATATTTTTTAGTTGAACAA
Protein sequenceShow/hide protein sequence
MEACIDSRKRVRDESNDSLFNFIGSKILRLDSAELNFISPDHVNDAPIGSVASDAKSIDSKQIGIIHDGDSGLDSFQAHAIQEDLLKILDEADASTDRESAIQDLDSVIR
SFEKEIQVPAPSVQPELGYLLEASDDELGLPPAGQKGEVEAVNFAAQFSGSGGMKGFLGFEDEVVPNYCWLENLSSENESNQGDEEVVALGGLFDHTDGTAELPPYRSET
MYCL