; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007571 (gene) of Snake gourd v1 genome

Gene IDTan0007571
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionFilamentous hemagglutinin transporter
Genome locationLG05:4354127..4355577
RNA-Seq ExpressionTan0007571
SyntenyTan0007571
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140015.1 uncharacterized protein LOC101202760 [Cucumis sativus]6.4e-11084.5Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD
        MAAEVSSLVRVLT YNK+DRHRT G+ES  EKLTPLITRDLL+GGYSKFTE QELDLDL VPSGWE+RLDLKSGKMFIQRCNVQDFNN+  NQTV KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD

Query:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF
        LNFPPSPN SKFQL+NH VDET+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPSYSSSSSSAAA +EF+EE+N 
Subjt:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF

Query:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        K LS        +PIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPA KKPRIDLNISI
Subjt:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_008456306.1 PREDICTED: uncharacterized protein LOC103496294 [Cucumis melo]1.4e-10983.72Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD
        MAAEVSSLVRVLT YNK+DRH T GNES  EKL PLITRDLL+GGYSKFTE QELDLDL VPSGWE+RLDLKSGKMFIQRCNVQDFNN+  NQTV KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD

Query:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF
        LNFPPSPNYSKFQL+NH VDET+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPSYSSSSSSAAA +EF+EE+  
Subjt:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF

Query:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        K         +++PIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_022947770.1 uncharacterized protein LOC111451530 isoform X1 [Cucurbita moschata]2.3e-10782.63Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD
        MAA+V+S VRVLT YNKDD HRTV N+S P+ LTPLITRDLL+GG SKFT+PQELDLDLQ+PSGWEK LDLKSGKMFIQR NVQDFNNHQTNQTV+KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD

Query:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS---SPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN
        LNFPPS NYSKF+LSNH V ET+LDLKL SS   S  PSPSPRSNYQSVCTLDKVKSALERA+KNPIRKRSSLWK SPSPSYSSSSSS   AREFQEEDN
Subjt:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS---SPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN

Query:  FKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        F   S SSS++AAA IA GCPGCLSYVLVMKNNPTCPRC SVV LPA KKPR+DLNISI
Subjt:  FKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_022970865.1 uncharacterized protein LOC111469711 [Cucurbita maxima]2.2e-11084.5Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD
        MAAEV+S VRVLTGYNKDD H TV NES P+ LTPLITRDLL+GG SKFT+PQELDLDLQ+PSGWEKRLDLKSGKMFIQR NVQDFNNHQTNQTV+KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD

Query:  LNFPPSPNYSKFQLSNHFVDETNLDLKL--VSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF
        LNFPPS NYSKF+LSNH V ET+L+LKL   SSSP PSPSPRSNYQSVCTLDKVKSALERA+KNPIRKRSSLWKSSPSPSYSSSSSS   AREFQEEDNF
Subjt:  LNFPPSPNYSKFQLSNHFVDETNLDLKL--VSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF

Query:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
           S SSS++AAA IA GCPGCLSYVLVMKNNPTCPRC SVV LPA KKPR+DLNISI
Subjt:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

XP_038901828.1 uncharacterized protein LOC120088523 [Benincasa hispida]2.8e-11386.05Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD
        MAAEVSSLVRVLTGYNKDDRHRTVGN+S  EKLTPLITRDLLSGGYSK+TE QELDLDL VPSGWE+RLDLKSGK FIQRCNVQDFNN   NQTV KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD

Query:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS-SPSPSP-SPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF
        LNFPPSPN+SKFQ SNH VDET+LDLKLVSS SPSPSP SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPSYSSSSSSA A +EF++EDN 
Subjt:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS-SPSPSP-SPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF

Query:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        KSLS        +PIAAGCPGCLSYVLVMKNNPTCPRC+SVVPLPA KKPRIDLNISI
Subjt:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

TrEMBL top hitse value%identityAlignment
A0A0A0KAN5 Uncharacterized protein3.1e-11084.5Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD
        MAAEVSSLVRVLT YNK+DRHRT G+ES  EKLTPLITRDLL+GGYSKFTE QELDLDL VPSGWE+RLDLKSGKMFIQRCNVQDFNN+  NQTV KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD

Query:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF
        LNFPPSPN SKFQL+NH VDET+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPSYSSSSSSAAA +EF+EE+N 
Subjt:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF

Query:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        K LS        +PIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPA KKPRIDLNISI
Subjt:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A1S3C2I3 uncharacterized protein LOC1034962946.9e-11083.72Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD
        MAAEVSSLVRVLT YNK+DRH T GNES  EKL PLITRDLL+GGYSKFTE QELDLDL VPSGWE+RLDLKSGKMFIQRCNVQDFNN+  NQTV KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD

Query:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF
        LNFPPSPNYSKFQL+NH VDET+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPSYSSSSSSAAA +EF+EE+  
Subjt:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF

Query:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        K         +++PIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A5D3CB81 Putative YUP8H12R.23 protein6.9e-11083.72Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD
        MAAEVSSLVRVLT YNK+DRH T GNES  EKL PLITRDLL+GGYSKFTE QELDLDL VPSGWE+RLDLKSGKMFIQRCNVQDFNN+  NQTV KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD

Query:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF
        LNFPPSPNYSKFQL+NH VDET+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPSYSSSSSSAAA +EF+EE+  
Subjt:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF

Query:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        K         +++PIAAGCPGCLSYVLVMKNNPTCPRCSS+VPLPAAKKPRIDLNISI
Subjt:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A6J1G7U1 uncharacterized protein LOC111451530 isoform X11.1e-10782.63Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD
        MAA+V+S VRVLT YNKDD HRTV N+S P+ LTPLITRDLL+GG SKFT+PQELDLDLQ+PSGWEK LDLKSGKMFIQR NVQDFNNHQTNQTV+KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD

Query:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS---SPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN
        LNFPPS NYSKF+LSNH V ET+LDLKL SS   S  PSPSPRSNYQSVCTLDKVKSALERA+KNPIRKRSSLWK SPSPSYSSSSSS   AREFQEEDN
Subjt:  LNFPPSPNYSKFQLSNHFVDETNLDLKLVSS---SPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDN

Query:  FKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
        F   S SSS++AAA IA GCPGCLSYVLVMKNNPTCPRC SVV LPA KKPR+DLNISI
Subjt:  FKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

A0A6J1I433 uncharacterized protein LOC1114697111.1e-11084.5Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD
        MAAEV+S VRVLTGYNKDD H TV NES P+ LTPLITRDLL+GG SKFT+PQELDLDLQ+PSGWEKRLDLKSGKMFIQR NVQDFNNHQTNQTV+KLQD
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQD

Query:  LNFPPSPNYSKFQLSNHFVDETNLDLKL--VSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF
        LNFPPS NYSKF+LSNH V ET+L+LKL   SSSP PSPSPRSNYQSVCTLDKVKSALERA+KNPIRKRSSLWKSSPSPSYSSSSSS   AREFQEEDNF
Subjt:  LNFPPSPNYSKFQLSNHFVDETNLDLKL--VSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNF

Query:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
           S SSS++AAA IA GCPGCLSYVLVMKNNPTCPRC SVV LPA KKPR+DLNISI
Subjt:  KSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16500.1 unknown protein2.3e-4946.85Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLL------SGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFI-QRCNV----------
        MAA+VSSLVR+L+ + KDDR   V + + P     L+TRDLL       GG     +  ELDLD+QVP+GWEKRLDLKSGK+++ Q+CN           
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLL------SGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFI-QRCNV----------

Query:  QDFNNHQTNQTVSKLQDLNFPP-SPNYSKFQLSNHF--VDETNLDLKLVSSSPS-PSPSPRSNY---------QSVCTLDKVKSALERAEKNPIRKRSSL
           +  QTNQTV + QDLN PP S  +    L + F   D+T+L+LKLV SS S P P P S++          SVCTLDKVK ALERAEK+  +++S  
Subjt:  QDFNNHQTNQTVSKLQDLNFPP-SPNYSKFQLSNHF--VDETNLDLKLVSSSPS-PSPSPRSNY---------QSVCTLDKVKSALERAEKNPIRKRSSL

Query:  WKSSPSPSYSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
                                ED+     T+S+  AA+ +AAGCPGCLSYV V KNNP CPRC S VPLPA KKP+IDLNIS+
Subjt:  WKSSPSPSYSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI

AT1G79160.1 unknown protein2.6e-5351.52Show/hide
Query:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQ-ELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQD----FNNHQTNQTV
        MAA+VSSLVR+L+GY KDDR   V + +  +    L+TRDLL  G     +   ELDLDLQVP+G+EKRLDLKSGK+++QRCN        N  QTNQTV
Subjt:  MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQ-ELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQD----FNNHQTNQTV

Query:  SKLQDLNFPPSPNYSKFQLSNHFVDETNLDLKLVSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEE
           QDLNFPP P  +   L N F D+T  +LKL+ SS S  P+  SN QSVCTLDKVKSALERAE++P     +++K   SP             +    
Subjt:  SKLQDLNFPPSPNYSKFQLSNHFVDETNLDLKLVSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEE

Query:  DNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPA---AKKPRIDLNISI
        D+++      + A A+P+ AGCPGCLSYVLVM NNP CPRC ++VPLP     KKP+IDLNISI
Subjt:  DNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPA---AKKPRIDLNISI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCGAAGTGAGCTCGCTCGTGCGAGTTCTTACAGGCTACAACAAAGACGACCGTCATCGGACGGTCGGAAACGAATCTGTCCCCGAGAAATTAACGCCTCTGAT
CACCCGAGACTTACTCAGCGGCGGCTATTCCAAATTTACAGAGCCCCAAGAATTGGACCTCGATCTTCAAGTTCCTTCCGGCTGGGAAAAAAGACTCGACTTGAAGTCGG
GGAAAATGTTCATTCAAAGATGCAATGTTCAAGATTTCAACAACCATCAAACGAATCAAACAGTGTCAAAGCTTCAAGATTTGAACTTTCCGCCGTCCCCCAATTACTCC
AAATTCCAATTGTCCAATCATTTCGTCGACGAAACGAATTTGGATTTGAAATTGGTTTCTTCGTCGCCGTCGCCGTCGCCGTCGCCGAGGAGTAATTATCAGAGTGTTTG
TACTTTGGATAAGGTCAAATCGGCGCTCGAACGGGCTGAGAAAAATCCCATCAGAAAACGCTCGTCGCTGTGGAAATCGTCGCCGTCGCCGTCGTATTCGTCGTCGTCGT
CGTCAGCCGCGGCGGCGAGAGAGTTTCAAGAAGAAGACAACTTTAAATCTCTGTCGACGTCGTCATCAGCAGCGGCGGCGGCTCCGATTGCCGCCGGCTGCCCTGGGTGT
TTGTCGTATGTGTTGGTGATGAAGAACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCCGCGAAGAAACCTCGGATTGATCTGAACATTTCAATTTG
A
mRNA sequenceShow/hide mRNA sequence
TTTAATAGTTTATATAACGGCCTGTTAACTTTCTTCCATATATAGAGACCTCTCTGCTGCCATTGTCGAAGAAAACCGACACTCCCATCTCTCTCTCTCCCTACCTTCTC
TCTCTTCGTCTCTCAAAACCAGAAAAAAAAAAATGGCCGCCGAAGTGAGCTCGCTCGTGCGAGTTCTTACAGGCTACAACAAAGACGACCGTCATCGGACGGTCGGAAAC
GAATCTGTCCCCGAGAAATTAACGCCTCTGATCACCCGAGACTTACTCAGCGGCGGCTATTCCAAATTTACAGAGCCCCAAGAATTGGACCTCGATCTTCAAGTTCCTTC
CGGCTGGGAAAAAAGACTCGACTTGAAGTCGGGGAAAATGTTCATTCAAAGATGCAATGTTCAAGATTTCAACAACCATCAAACGAATCAAACAGTGTCAAAGCTTCAAG
ATTTGAACTTTCCGCCGTCCCCCAATTACTCCAAATTCCAATTGTCCAATCATTTCGTCGACGAAACGAATTTGGATTTGAAATTGGTTTCTTCGTCGCCGTCGCCGTCG
CCGTCGCCGAGGAGTAATTATCAGAGTGTTTGTACTTTGGATAAGGTCAAATCGGCGCTCGAACGGGCTGAGAAAAATCCCATCAGAAAACGCTCGTCGCTGTGGAAATC
GTCGCCGTCGCCGTCGTATTCGTCGTCGTCGTCGTCAGCCGCGGCGGCGAGAGAGTTTCAAGAAGAAGACAACTTTAAATCTCTGTCGACGTCGTCATCAGCAGCGGCGG
CGGCTCCGATTGCCGCCGGCTGCCCTGGGTGTTTGTCGTATGTGTTGGTGATGAAGAACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCCGCGAAG
AAACCTCGGATTGATCTGAACATTTCAATTTGATTTAAAGAAGTGGAAAAGTATTGTTATTGTAGATGGAGGGACAGAAATTATCAGCTTCAAGACATAGATTTCTTTGT
TTTTTTTTTTTGTTTTTTTTTTTGGGTAGCTGTAGAAACTATATGTAAAAAAAAATGCAAAAGGGAAGGTGGGGATTTCAATTCTTAGTATGATTTCTCATATGATAATT
TTATTTCATCTCTCAATTATCAATTAGATAATTAGTTATTTTGATGTTAAATTGATATTCATGGCGA
Protein sequenceShow/hide protein sequence
MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQDLNFPPSPNYS
KFQLSNHFVDETNLDLKLVSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGC
LSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI