; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003446 (gene) of Snake gourd v1 genome

Gene IDTan0003446
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGATA zinc finger domain-containing protein 10
Genome locationLG03:67261654..67264541
RNA-Seq ExpressionTan0003446
SyntenyTan0003446
Gene Ontology termsGO:0022904 - respiratory electron transport chain (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR016174 - Di-haem cytochrome, transmembrane
IPR025067 - Protein of unknown function DUF4079


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604398.1 hypothetical protein SDJN03_05007, partial [Cucurbita argyrosperma subsp. sororia]6.1e-11684.21Show/hide
Query:  MATLTRA-----SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPF
        MATLT A     SSSSYMCLTKV P  F S+SFPI LP L  +PR S FSLASPVT  RKVRF P+ ARD EF D        VE+RET E+RLYSLAPF
Subjt:  MATLTRA-----SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPF

Query:  PLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLL
        PLLFVAALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS++VEEKANAKDLHPKLLGGMFFFFALGATGGITSLL
Subjt:  PLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLL

Query:  TSDKPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        TSDKPIFESPHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  TSDKPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

KAG7034548.1 hypothetical protein SDJN02_04278, partial [Cucurbita argyrosperma subsp. argyrosperma]5.5e-11785.55Show/hide
Query:  MATLTRA--SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLL
        MATLT A  SSSSYMCLTKV P  F S+SFPI LP L  +PR S FSLASPVT  RKVRF P+ ARD EF D VE+ +   E RET E+RLYSLAPFPLL
Subjt:  MATLTRA--SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLL

Query:  FVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD
        FVAALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS++VEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD
Subjt:  FVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD

Query:  KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        KPIFESPHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

XP_022925916.1 uncharacterized protein LOC111433189 [Cucurbita moschata]2.1e-11685.5Show/hide
Query:  MATLTRA-SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLLF
        MATLT A SSSSYMCLTKV P  F S+SFPI LP L  +PR S FSLASPVT  RKVRF P+ ARD EF D        VE+RET E+RLYSLAPFPLLF
Subjt:  MATLTRA-SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLLF

Query:  VAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
        VAALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS++VEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
Subjt:  VAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDK

Query:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        PIFESPHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

XP_022977650.1 uncharacterized protein LOC111477900 [Cucurbita maxima]3.6e-11685.5Show/hide
Query:  MATLTRA-SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLLF
        MATLT A SSSSYMCLTKV P  F S+SFPI LP L  +PR S FSLASPVT  RKVRF P+ ARD EF D        VE RET E+RLYSLAPFPLLF
Subjt:  MATLTRA-SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLLF

Query:  VAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
        VAALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS++VEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
Subjt:  VAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDK

Query:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        PIFESPHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

XP_038880983.1 uncharacterized protein LOC120072635 [Benincasa hispida]4.1e-12085.61Show/hide
Query:  MATLTRA--SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLL
        MATLT A  SSSSYMCLTKVS    PS+SFPIS P  P +PR S FSLASPVT    VRF P+ ARDDEF DF       VE++ETSE+RLYSL+PFPLL
Subjt:  MATLTRA--SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLL

Query:  FVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD
        F+AALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD
Subjt:  FVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD

Query:  KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        KPIFESPHAVTGFIGL LLTVQTLLPSLFE+NPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
Subjt:  KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

TrEMBL top hitse value%identityAlignment
A0A0A0KIF5 Uncharacterized protein1.0e-10880.3Show/hide
Query:  MATLTRASSS-SYMCLTKVSPSSFPSSSFPISLPILP-NIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLL
        MATLT  SSS SY+CLTKVSP   P  S  ++LP +P N    S FS ASP+     VRF P+ AR+DEF DF E  +      ETSE RLYSL+PFPLL
Subjt:  MATLTRASSS-SYMCLTKVSPSSFPSSSFPISLPILP-NIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLL

Query:  FVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD
        F+AALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD+VEEKA AKDLHPKLLGGMFFFFALGATGG+TSLLTSD
Subjt:  FVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD

Query:  KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        KPI ESPHAVTGFIGL LLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
Subjt:  KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

A0A1S3CHJ9 uncharacterized protein LOC1035008272.9e-11181.27Show/hide
Query:  MATLTRA----SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPR-GSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPF
        MATLT A    SSSSYMCLTKVSP   PS+SFPI  P LP  PR  S FS ASP+     +RF P  AR+D F DFVE+ +      ETSE RLYSL+PF
Subjt:  MATLTRA----SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPR-GSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPF

Query:  PLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLL
        PLLF+AALPG GTVRSLFGPFVELVKS +LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD+VEEKA AKDLHPKLLGGMFFFFALGATGGI SLL
Subjt:  PLLFVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLL

Query:  TSDKPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        TSDKPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAA GLQLGLSY
Subjt:  TSDKPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

A0A6J1CRM9 uncharacterized protein LOC1110135571.2e-11282.58Show/hide
Query:  MATLTRASSSSYMCLTKVSPS-SFPS-SSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLL
        MATLT    +SYMCL+KV P    PS SSFPISLP LP  PR SCFSL SPVT T K R  P S++DD+F +        VE++E SE+ LYSLAPFPLL
Subjt:  MATLTRASSSSYMCLTKVSPS-SFPS-SSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLL

Query:  FVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD
        FVAALPGAGTVRSLFGPFVELVKSW+LP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD+VEEKA AKDLHPKLLGGMFFFFALGATGGITSLLTSD
Subjt:  FVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD

Query:  KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGL Y
Subjt:  KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

A0A6J1EDG7 uncharacterized protein LOC1114331891.0e-11685.5Show/hide
Query:  MATLTRA-SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLLF
        MATLT A SSSSYMCLTKV P  F S+SFPI LP L  +PR S FSLASPVT  RKVRF P+ ARD EF D        VE+RET E+RLYSLAPFPLLF
Subjt:  MATLTRA-SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLLF

Query:  VAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
        VAALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS++VEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
Subjt:  VAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDK

Query:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        PIFESPHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

A0A6J1IMY7 uncharacterized protein LOC1114779001.7e-11685.5Show/hide
Query:  MATLTRA-SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLLF
        MATLT A SSSSYMCLTKV P  F S+SFPI LP L  +PR S FSLASPVT  RKVRF P+ ARD EF D        VE RET E+RLYSLAPFPLLF
Subjt:  MATLTRA-SSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLLF

Query:  VAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
        VAALPGAGTVRSLFGPFVELVKSW+LPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS++VEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDK
Subjt:  VAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDK

Query:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS
        PIFESPHAVTGFIGLALLT+Q+LLPSLFEDNPGLRN+HGILGSGIMTLFLIHAALGLQLGLS
Subjt:  PIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G36885.1 unknown protein2.3e-8465.15Show/hide
Query:  MATLTRASSSSYMCLTKVSP-SSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEI-DDYEVEIRETSESRLYSLAPFPLL
        MA +T  S+ S      +SP SS PSS    S   L      S F   S   R RK+   P  +   +  + +E   D E EIRET    + S++P PLL
Subjt:  MATLTRASSSSYMCLTKVSP-SSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEI-DDYEVEIRETSESRLYSLAPFPLL

Query:  FVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD
         VA+LPGA TVRS+FGP VE+VKS +LP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD++EEKA AKDLHPKLL GMFFFFALGATGG+ SLLTSD
Subjt:  FVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD

Query:  KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        KPIFESPHAVTG IGL LLTVQT+LPSLF++ P LRNVHGILGSGIM LFL+HAA GLQLGLS+
Subjt:  KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY

AT2G36885.2 unknown protein1.2e-8265.15Show/hide
Query:  MATLTRASSSSYMCLTKVSP-SSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEI-DDYEVEIRETSESRLYSLAPFPLL
        MA +T  S+ S      +SP SS PSS    S   L      S F   S   R RK+   P  +   +  + +E   D E EIRET    + S++P PLL
Subjt:  MATLTRASSSSYMCLTKVSP-SSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEI-DDYEVEIRETSESRLYSLAPFPLL

Query:  FVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD
         VA+LPGA TVRS+FGP VE+VKS +LP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD++EEKA AKDLHPKLL GMFFFFALGATGG+ SLLTSD
Subjt:  FVAALPGAGTVRSLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSD

Query:  KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
        KPIFESPHAVTG IGL LLTVQT+LPSLF+  P LRNVHGILGSGIM LFL+HAA GLQLGLS+
Subjt:  KPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACGCTAACTCGAGCTTCTTCTTCTTCTTACATGTGTTTAACCAAGGTATCACCCAGCTCATTTCCTTCCTCTTCATTCCCCATTTCGTTGCCAATTCTTCCCAA
CATCCCTCGCGGTTCTTGTTTCTCTTTGGCTTCCCCTGTGACTAGAACGAGAAAAGTTCGGTTCAAGCCGACTTCTGCTCGGGACGATGAGTTCTTCGATTTTGTCGAAA
TTGATGACTATGAGGTGGAAATCAGGGAGACGAGCGAGTCGCGTTTGTACTCTCTTGCGCCTTTTCCTTTACTGTTCGTCGCTGCTCTTCCTGGAGCGGGAACTGTGAGG
TCTCTCTTTGGTCCTTTTGTTGAGCTTGTTAAGTCTTGGAGTCTTCCTGAATGGTTGGTACACTGGGGTCATCCAGGCAACATGGCAGTTGTGCTCTTCGCCATGGGTGG
CTATGGAACGTATCTAGGTTTCCGTATCCGTTACTCTGACAATGTGGAGGAGAAGGCCAATGCCAAAGACTTGCATCCAAAGCTTCTAGGCGGGATGTTTTTCTTCTTTG
CTCTTGGAGCAACAGGTGGAATCACTTCTCTACTTACATCAGACAAACCTATATTTGAGAGTCCACATGCTGTAACGGGGTTCATCGGTCTCGCGCTCTTGACTGTACAA
ACTCTTCTACCCTCACTTTTTGAGGATAATCCTGGACTGAGAAATGTTCATGGTATTTTGGGTAGTGGAATCATGACACTGTTTCTCATCCATGCTGCACTTGGACTTCA
ACTTGGCCTCAGTTACTAA
mRNA sequenceShow/hide mRNA sequence
TTAATCTTTAAAAAAAAGAGAAAAAAAAATCCTTTGAAATGGAAGCATGAGTCTCAGCGTGTCTCTTTCTGGGCCATTCCCTCCATTTCTGCACTTTTTCTTCCATCTCC
TCCCGTCATTTTCTCTCTCTCATTCAAAATCCCATCTCCGTCTCTCTGCAACTGGAAGCCACACTCCAAACCGTTATCTCTCTCCATGGCTACGCTAACTCGAGCTTCTT
CTTCTTCTTACATGTGTTTAACCAAGGTATCACCCAGCTCATTTCCTTCCTCTTCATTCCCCATTTCGTTGCCAATTCTTCCCAACATCCCTCGCGGTTCTTGTTTCTCT
TTGGCTTCCCCTGTGACTAGAACGAGAAAAGTTCGGTTCAAGCCGACTTCTGCTCGGGACGATGAGTTCTTCGATTTTGTCGAAATTGATGACTATGAGGTGGAAATCAG
GGAGACGAGCGAGTCGCGTTTGTACTCTCTTGCGCCTTTTCCTTTACTGTTCGTCGCTGCTCTTCCTGGAGCGGGAACTGTGAGGTCTCTCTTTGGTCCTTTTGTTGAGC
TTGTTAAGTCTTGGAGTCTTCCTGAATGGTTGGTACACTGGGGTCATCCAGGCAACATGGCAGTTGTGCTCTTCGCCATGGGTGGCTATGGAACGTATCTAGGTTTCCGT
ATCCGTTACTCTGACAATGTGGAGGAGAAGGCCAATGCCAAAGACTTGCATCCAAAGCTTCTAGGCGGGATGTTTTTCTTCTTTGCTCTTGGAGCAACAGGTGGAATCAC
TTCTCTACTTACATCAGACAAACCTATATTTGAGAGTCCACATGCTGTAACGGGGTTCATCGGTCTCGCGCTCTTGACTGTACAAACTCTTCTACCCTCACTTTTTGAGG
ATAATCCTGGACTGAGAAATGTTCATGGTATTTTGGGTAGTGGAATCATGACACTGTTTCTCATCCATGCTGCACTTGGACTTCAACTTGGCCTCAGTTACTAAATCATA
ACTACATTTGTAGCACCATATAGGTTGTTGTTCTGAATTTTTCAGTATTATATCCTTCTGTCACACGACCTGCATTTGTTAGAAAGACGCCTCCGTGCAATCCTTTCAGA
TGAGGAAAACTTCATATTACTGCCTTTTCCTTGGATCTAGATGCAAAATGGAGGTGGTTGTGCCAAGCTGCATTACTTGGAAATAACCAGAGCAATGATTTGAAGTAATA
GCCTTTCCTGTTCACTGTTTATTCATTATTCTTATATTTTTACAAAGTTATAGGGGCTTCTCATTTACTATCTGTTCTTTCTCTTTCTGTTGACACTATATGATGTTAAT
CTTGTTTTCTAATTGATTTGGTATGTTGCGACGATCAATTCATAAATGTAG
Protein sequenceShow/hide protein sequence
MATLTRASSSSYMCLTKVSPSSFPSSSFPISLPILPNIPRGSCFSLASPVTRTRKVRFKPTSARDDEFFDFVEIDDYEVEIRETSESRLYSLAPFPLLFVAALPGAGTVR
SLFGPFVELVKSWSLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLALLTVQ
TLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY