; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007230 (gene) of Snake gourd v1 genome

Gene IDTan0007230
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMembrane fusion protein Use1
Genome locationLG03:70570126..70591440
RNA-Seq ExpressionTan0007230
SyntenyTan0007230
Gene Ontology termsGO:0015031 - protein transport (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR019150 - Vesicle transport protein, Use1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008438519.1 PREDICTED: uncharacterized protein LOC103483591 [Cucumis melo]2.0e-10590.87Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKL VP PDVEESSEPSTS SV + SSVAEG 
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG

Query:  KST-LSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTG
         +T  SP GLRRRF  SS V+DRSHGTIKEDSSAPVKLDAAAI+HIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLE+TEKILDSTEKAVEDSLATTG
Subjt:  KST-LSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTG

Query:  RVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
        RVNKRAV+IYSESSKTSCFTWLAIF MTC+F+MVVLLIRVT
Subjt:  RVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT

XP_022135542.1 uncharacterized protein LOC111007470 [Momordica charantia]6.0e-11094.58Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG
        M LSKTEINLKRLLATAP QKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVP PDVEESSEPSTS SV +FSSV EG 
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG

Query:  KSTLSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTGR
         +TLS PGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLE+TEKILDSTEKAVEDSLATTGR
Subjt:  KSTLSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTGR

Query:  VNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
        VNKRAVEIYSESSKTSCFTWLAIF MTCVFVMVVLLIRVT
Subjt:  VNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT

XP_022921329.1 uncharacterized protein LOC111429634 isoform X1 [Cucurbita moschata]2.4e-10692.15Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTP+G+SRVSKALLGDY+EKIEAIASKLAVP PD EESSEPSTS SV +FSSVAEG 
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG

Query:  -KSTLSPPGLRRRFQP-SSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATT
          +  SPPGLRRRF P SSVV+DRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLA+QLKESSLIMSKSLESTEKILDSTEKAVEDSLATT
Subjt:  -KSTLSPPGLRRRFQP-SSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATT

Query:  GRVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
        GRVNKRAV+IYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
Subjt:  GRVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT

XP_023515489.1 uncharacterized protein LOC111779632 isoform X1 [Cucurbita pepo subsp. pepo]9.0e-10691.74Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTP+G+SRVSKALLGDY+EKIEAIASKLAVP PD EESSEPSTS SV +FSSVA+G 
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG

Query:  -KSTLSPPGLRRRFQP-SSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATT
          +  SPPGLRRRF P SSVV+DRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLA+QLKESSLIMSKSLESTEKILDSTEKAVEDSLATT
Subjt:  -KSTLSPPGLRRRFQP-SSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATT

Query:  GRVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
        GRVNKRAV+IYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
Subjt:  GRVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT

XP_038879132.1 uncharacterized protein LOC120071129 isoform X3 [Benincasa hispida]1.1e-10692.53Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVP PDVEESSEPSTS S  + SSVAEG 
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG

Query:  KST-LSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTG
         +T  SP GLRRRF  SS+V+DRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTG
Subjt:  KST-LSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTG

Query:  RVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
        +VNKRAV+IYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
Subjt:  RVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT

TrEMBL top hitse value%identityAlignment
A0A1S3AWP3 uncharacterized protein LOC1034835919.7e-10690.87Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKL VP PDVEESSEPSTS SV + SSVAEG 
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG

Query:  KST-LSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTG
         +T  SP GLRRRF  SS V+DRSHGTIKEDSSAPVKLDAAAI+HIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLE+TEKILDSTEKAVEDSLATTG
Subjt:  KST-LSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTG

Query:  RVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
        RVNKRAV+IYSESSKTSCFTWLAIF MTC+F+MVVLLIRVT
Subjt:  RVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT

A0A5A7U6J9 Cation exchanger family protein9.7e-10690.87Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKL VP PDVEESSEPSTS SV + SSVAEG 
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG

Query:  KST-LSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTG
         +T  SP GLRRRF  SS V+DRSHGTIKEDSSAPVKLDAAAI+HIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLE+TEKILDSTEKAVEDSLATTG
Subjt:  KST-LSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTG

Query:  RVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
        RVNKRAV+IYSESSKTSCFTWLAIF MTC+F+MVVLLIRVT
Subjt:  RVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT

A0A6J1C2Z7 uncharacterized protein LOC1110074702.9e-11094.58Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG
        M LSKTEINLKRLLATAP QKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVP PDVEESSEPSTS SV +FSSV EG 
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG

Query:  KSTLSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTGR
         +TLS PGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLE+TEKILDSTEKAVEDSLATTGR
Subjt:  KSTLSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTGR

Query:  VNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
        VNKRAVEIYSESSKTSCFTWLAIF MTCVFVMVVLLIRVT
Subjt:  VNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT

A0A6J1E043 uncharacterized protein LOC111429634 isoform X11.1e-10692.15Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTP+G+SRVSKALLGDY+EKIEAIASKLAVP PD EESSEPSTS SV +FSSVAEG 
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG

Query:  -KSTLSPPGLRRRFQP-SSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATT
          +  SPPGLRRRF P SSVV+DRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLA+QLKESSLIMSKSLESTEKILDSTEKAVEDSLATT
Subjt:  -KSTLSPPGLRRRFQP-SSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATT

Query:  GRVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
        GRVNKRAV+IYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
Subjt:  GRVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT

A0A6J1JGJ9 uncharacterized protein LOC111484922 isoform X12.2e-10591.74Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG
        MGLSKTEINLKRLLATA HQKDQAKLIHYVTTLREQLEQLAEEKTP+G+SRVSKALLGDY+EKIEAIASKLAVP PD EESSEPSTS SV +FSSVAEG 
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG

Query:  -KSTLSPPGLRRRFQP-SSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATT
          +  SPPGLRRRF P SSVV+DRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLA+QLKESSLIMSKSLESTEKILDSTEKAVEDSLATT
Subjt:  -KSTLSPPGLRRRFQP-SSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATT

Query:  GRVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
        GRVNKRAV+IYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
Subjt:  GRVNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G54110.1 Membrane fusion protein Use11.1e-6156.67Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG
        MG+ KTEIN  RLL+ AP+Q++Q+KL+HYV TLREQLEQL+EEKT EGL RV+ A + +Y EKIEA+ S++    P  E S E     S  D S   E  
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG

Query:  KSTLSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTGR
          T + P LRRR  P+S  +     +   D S P+KLD AA + + K RKLQEDLTDEMV LA+QLKE S ++S+S+++TEKILDSTE+A+E SLA+TG 
Subjt:  KSTLSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTGR

Query:  VNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
           RA +IYSESSKTSCF WL I  MTCVF+MVV+LIRVT
Subjt:  VNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT

AT3G55600.1 Membrane fusion protein Use17.2e-6961.25Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG
        MG+SKTEINL+RLL+ AP+Q++Q+KL+HYV TLREQLEQL+EEKTPEGL RV+KA + +Y EKIEA+ASK+A   P+ E S EP    S    S   E  
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGG

Query:  KSTLSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTGR
          + + P LRRR  P+S   ++S      DSS P+KLD AA +HI+KHRKLQEDLTDEMV LA+QLKE S  +S+S+++TEKILDSTE+A+E SLA+TG 
Subjt:  KSTLSPPGLRRRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTGR

Query:  VNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT
           RA +IYS+SSKTSCF WL IF M CVF+MVVLLIRVT
Subjt:  VNKRAVEIYSESSKTSCFTWLAIFVMTCVFVMVVLLIRVT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTAAGTAAAACTGAAATCAACTTGAAGCGGCTGTTGGCAACTGCTCCTCATCAGAAGGATCAGGCTAAACTTATACATTATGTGACTACTCTAAGAGAACAGTT
GGAACAACTTGCTGAAGAGAAAACACCAGAAGGCTTATCAAGAGTTTCAAAGGCTTTACTTGGGGATTATTCAGAGAAGATAGAGGCCATTGCTTCCAAATTAGCCGTTC
CTTCGCCTGATGTGGAAGAGTCTTCTGAGCCCTCTACTAGTGTTTCTGTTGGAGATTTTTCTTCTGTAGCAGAAGGAGGCAAAAGCACTCTTTCACCCCCGGGACTAAGG
AGAAGATTTCAGCCTTCCTCTGTTGTTGATGATAGATCTCATGGCACCATCAAAGAAGACTCATCAGCACCTGTAAAGTTGGATGCTGCAGCAATATCCCACATCGAGAA
ACACAGGAAGCTTCAAGAGGACTTGACCGATGAAATGGTTGGGTTAGCAAAGCAACTGAAAGAGAGCAGTCTGATAATGAGCAAATCTTTGGAGAGCACTGAAAAGATAT
TGGATTCCACAGAGAAGGCTGTTGAGGATAGCTTGGCAACCACCGGTCGGGTCAATAAACGTGCTGTTGAGATTTACTCGGAAAGCTCGAAAACTTCATGCTTCACTTGG
TTGGCGATCTTTGTAATGACCTGTGTTTTCGTAATGGTTGTGCTTCTCATTCGAGTTACTTGA
mRNA sequenceShow/hide mRNA sequence
GTTTTCCTTTAGAATTTTAGATTTTGAGTTTATGGTAGTTTAATGGAGAGGAGAAGAGTAAAATTGGAAAATCGATATAGGATTAGGGTGTGTTGAAGGATCCTTTTGTC
TTCTAGTGGACACAAATTGCGAGACGCTCCTCCCTTCTCCTTCAACCTCTCGACAAAGCAAGAAGGAACACGCAAACTCAACCCGACCAGCCGCCGGTGTGAGCTTTGCC
GCAGTCGTCCGGATCTGAAGCAATCTGAAACTTCGCGCCATCGCCGGTCTCCTCTACACGTTACTGCAGCCGCCGACCACGTCTGGTCGTACGCCGAGCGTGCTCCATCA
CGCGACCGCAGCCACACGGAGGGTTGCCGCCGTCGACCTGCACCACACGAGCCGCACGCCGAATCTGGGAGTTAATGCGCCGCCACCGGAAATTGCTTCAAGTTGAAGTT
CCGCCAGCCGTTGAGGCGTTGCACGCCGGATTCACCGTTTTCGCCATCGACCGGTCTACGCTTGGTTCGTGTTTTTCAAAGTCTTAGCAACCGGTTTTGATCAAGATTCG
ACCGGTTCTGGATAGATCGGCGCCCGAGCAACCCAAAACTGGTACTCACGCGCTGCCGCCGCCGGTTAGTCAACGTCGTCGCCGGCCGGTCAACGCCGGCACTCGCCGGT
TAACTCCAGTTTGACCAGTTTCGACCCCCGAGCTCGTTATGAACCTCTGTGGTGGAGATCTATTTATGGAACGTGTGTAAGTGGAACGTTGGAGCAAGAATGAGAGTTTA
GGAACTTTCAGAGCTAAAATCCAAGGTATTGGAAGTACTGGTGCAAATGAGAGAGTACATGAGAACTTCCATTTGTATTCTTCAACGTGTAATCCTCTGATGAAACCTCA
GTTTCACAGAAGTTGAAGTACGGGATTATGGCATCATGATTTTGGAAGTCTTCAACAAGGAGTTACAACCCTAGTCAAGATTTGGTCCTTCATTGAGTTATAATCCACTT
TGCTATGTTAAATCTAGTGCTAGATTTAATACACTCGTCAAGAACACAACCCTCTGGATCTTTAGTCATTAAAATAGAAAACATCCGATTCTATGAAATATGTTTTTGCT
CAAGCAGCTATTGTTACTCTCCAATATCTCTTTGAAGCCAACAATCAAGTTAACTATCACATTTGTAAGAAAATCTTTAAAATTTGGAGTAGACGAAAAGTAATATGCAG
AAATGAATTGGTGGCACTGTTACTATCACCGAGTGACAGCAATGGTAATCTTTGGCATTTTGAAAAGATGCTCCCAAGTAAAAAAATACTTAATTAATTCATTGAAAGCA
AATCAAGCAAATAAACTAAAGACTAAACAAAGAGTACAACTTGCTGAAAAAGAAGACTTACAGTGCTAAAGTAACAGGTCTGCTAGGTGCTGGCTAAGATTATGCTCTCC
TTTATGATTGAATCATGCAATAGACCCAAGTGAGTAATCTACGACCATCTCCAAATGGCATATCTACTTGAAACATCATCAATATGAACAAGATGAAGGGATTAGAAGAC
TATCTCTCACATTTACAACCTAGAGGTTCTAAATTTATGTCAAGGCTTTAGTCATGACCAAGAGGATTCTTTTCCTGTTTTCTAAATGGGTTTAAGTAAAACTGAAATCA
ACTTGAAGCGGCTGTTGGCAACTGCTCCTCATCAGAAGGATCAGGCTAAACTTATACATTATGTGACTACTCTAAGAGAACAGTTGGAACAACTTGCTGAAGAGAAAACA
CCAGAAGGCTTATCAAGAGTTTCAAAGGCTTTACTTGGGGATTATTCAGAGAAGATAGAGGCCATTGCTTCCAAATTAGCCGTTCCTTCGCCTGATGTGGAAGAGTCTTC
TGAGCCCTCTACTAGTGTTTCTGTTGGAGATTTTTCTTCTGTAGCAGAAGGAGGCAAAAGCACTCTTTCACCCCCGGGACTAAGGAGAAGATTTCAGCCTTCCTCTGTTG
TTGATGATAGATCTCATGGCACCATCAAAGAAGACTCATCAGCACCTGTAAAGTTGGATGCTGCAGCAATATCCCACATCGAGAAACACAGGAAGCTTCAAGAGGACTTG
ACCGATGAAATGGTTGGGTTAGCAAAGCAACTGAAAGAGAGCAGTCTGATAATGAGCAAATCTTTGGAGAGCACTGAAAAGATATTGGATTCCACAGAGAAGGCTGTTGA
GGATAGCTTGGCAACCACCGGTCGGGTCAATAAACGTGCTGTTGAGATTTACTCGGAAAGCTCGAAAACTTCATGCTTCACTTGGTTGGCGATCTTTGTAATGACCTGTG
TTTTCGTAATGGTTGTGCTTCTCATTCGAGTTACTTGAAATACCAATTCACATTCTAAGAAAAGGTTGAAGCATTCTTGACTTTTGTGTACAAACTTTCAAGGGATAAGA
AAGCTAGAATAGATGTTGAGAATTCATATCAAAGCTTTTTATTTTTTAATAATTAATACTGTCACACAAAGTTAGCAGTGGTTTCAAAATTTGCAGTTCGACATATGTAG
TTGTAATTTGAGAAATTTTTAGAAATAGCCTCTGATTTATTTGTACAGATTTGTTGAGTATGGGTATCTTGAAGTAGATTTATTGGTGTTTTAAGTTTT
Protein sequenceShow/hide protein sequence
MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPSPDVEESSEPSTSVSVGDFSSVAEGGKSTLSPPGLR
RRFQPSSVVDDRSHGTIKEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLIMSKSLESTEKILDSTEKAVEDSLATTGRVNKRAVEIYSESSKTSCFTW
LAIFVMTCVFVMVVLLIRVT