CuGenDBv2

Tan0018053 (gene) of Snake gourd v1 genome

Gene ID: Tan0018053
Organism: Trichosanthes anguina (Snake gourd v1)
Description: UPF0587 protein C1orf123 homolog
Genome location: LG02:77471166..77475225
RNA-Seq Expression: Tan0018053
Synteny: Tan0018053
Gene Ontology terms: GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR008584 - CXXC motif containing zinc binding protein, eukaryotic


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6585732.1 CXXC motif containing zinc binding protein, partial [Cucurbita argyrosperma subsp. sororia] | e-value 2.8e-90 | identity 95.78%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
        SPLMLFDCRGYEPV F+FGPGWKAESIEGTKFEDIDLSGGE+AEYDEKG CPVMISNLEATF+ VK
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
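
The % identity value can be reproduced from the middle line of an alignment, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal Python sketch under that BLAST-style assumption; the function name `percent_identity` is hypothetical and is not part of CuGenDBv2.

```python
def percent_identity(midline_segments):
    """Return % identity from the middle lines of a BLAST-style alignment.

    Each segment must be exactly as long as its Query line, so any
    trailing spaces in the middle line have to be preserved.
    """
    midline = "".join(midline_segments)
    if not midline:
        raise ValueError("empty alignment")
    # Letters mark identical residues; '+' and ' ' are not identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Toy example (not taken from this record): 4 identities over 6 columns.
print(percent_identity(["MV F+K"]))  # 66.67
```

For the KAG6585732.1 alignment above, 159 of the 166 middle-line positions are letters, which gives the listed 95.78.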

XP_008444495.1 PREDICTED: UPF0587 protein C1orf123 homolog [Cucumis melo] | e-value 7.1e-86 | identity 90.36%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGE+SQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
        SPLMLFDCRGYEP+ FVFGPGWK ESIEGTKFEDIDL+GGEFAEYDEKG CPVMISNL+A F+L+K
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

XP_022132352.1 UPF0587 protein C1orf123 [Momordica charantia] | e-value 4.2e-86 | identity 89.76%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKI AELENLTNLQPQDGCDDPNF YLFK+KCGRCGE+SQKETC+TLNET+ L  GKGTTNLVQKCKFCGRDGT+TMIPGRG+PLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
        SPLMLFDCRGYEP+DF+FGPGWKAESIEGTKFEDIDLS GEFAEYDEKG CPVMIS L+ATFDLVK
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

XP_022952030.1 UPF0587 protein C1orf123 homolog [Cucurbita moschata] | e-value 2.8e-90 | identity 95.78%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
        SPLMLFDCRGYEPV F+FGPGWKAESIEGTKFEDIDLSGGE+AEYDEKG CPVMISNLEATF+ VK
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

XP_038884333.1 CXXC motif containing zinc binding protein isoform X1 [Benincasa hispida] | e-value 1.8e-89 | identity 95.18%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGE+SQKETCVTLNETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRGQPLTQETSE GKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
        SPLMLFDCRGYEP+DFVFGPGWKAESIEGTKFEDIDLS GEFAEYDEKG CPVMIS LEATF+LVK
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3BAF1 UPF0587 protein C1orf123 homolog | e-value 3.5e-86 | identity 90.36%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGE+SQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
        SPLMLFDCRGYEP+ FVFGPGWK ESIEGTKFEDIDL+GGEFAEYDEKG CPVMISNL+A F+L+K
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

A0A5A7V299 UPF0587 protein C1orf123-like protein | e-value 3.5e-86 | identity 90.36%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGE+SQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
        SPLMLFDCRGYEP+ FVFGPGWK ESIEGTKFEDIDL+GGEFAEYDEKG CPVMISNL+A F+L+K
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

A0A6J1BTL8 UPF0587 protein C1orf123 | e-value 2.0e-86 | identity 89.76%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKI AELENLTNLQPQDGCDDPNF YLFK+KCGRCGE+SQKETC+TLNET+ L  GKGTTNLVQKCKFCGRDGT+TMIPGRG+PLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
        SPLMLFDCRGYEP+DF+FGPGWKAESIEGTKFEDIDLS GEFAEYDEKG CPVMIS L+ATFDLVK
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

A0A6J1GKL1 UPF0587 protein C1orf123 homolog | e-value 1.4e-90 | identity 95.78%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
        SPLMLFDCRGYEPV F+FGPGWKAESIEGTKFEDIDLSGGE+AEYDEKG CPVMISNLEATF+ VK
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

A0A6J1KKK0 UPF0587 protein C1orf123 homolog | e-value 1.4e-90 | identity 95.78%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
        SPLMLFDCRGYEPV F+FGPGWKAESIEGTKFEDIDLSGGE+AEYDEKG CPVMISNLEATF+ VK
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

SwissProt top hits (e-value, % identity, alignment)
Q32P66 CXXC motif containing zinc binding protein | e-value 1.7e-26 | identity 39.51%
Query:  LKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLML
        L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+  ++ 
Subjt:  LKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLML

Query:  FDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
        F+CRG EPVDF    G+ AE +E GT F DI+L   ++ +YDEK    V I   E T   VK
Subjt:  FDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

Q3B8G0 CXXC motif containing zinc binding protein | e-value 1.7e-29 | identity 40.49%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MV F L+ KA LENLT L+P       +F +  K+KCG CGE+S K   +TL +++PL+ G+G+ ++VQ+CK C R+ +I ++     P   E SE+ K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATF
          ++ F+CRG EP+DF    G+ AE  E GT F +I+L   ++ +YDEK    V I  +E  F
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATF

Q498R7 CXXC motif containing zinc binding protein | e-value 2.3e-26 | identity 38.92%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
          ++ F+CRG EPVDF    G+ AE +E GT F DI+L   ++ +YDEK    V I   E T   VK
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

Q8BHG2 CXXC motif containing zinc binding protein | e-value 3.0e-26 | identity 38.32%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
          ++ F+CRG EPVDF    G+ A+ +E GT F DI+L   ++ +YDEK    V I   E T   VK
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

Q9NWV4 CXXC motif containing zinc binding protein | e-value 1.6e-27 | identity 39.52%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +P   E +E+ K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
          ++ F+CRG EPVDF    G+ AE +E GT F DI+L   ++ +YDEK    V I   E T   VK
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIE-GTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK

Arabidopsis top hits (e-value, % identity, alignment)
AT4G32930.1 unknown protein | e-value 5.5e-68 | identity 68.26%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGE++ KETCVTLNET     G+GT +LVQKCKFCGR+G +TMIPG+G+PLT E SE+G+ 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGG-EFAEYDEKGGCPVMISNLEATFDLVK
        +PLM+FDCRGYEP+DF FG  WKA++  GTKF++IDLS G EF EYDEKG CPVMISN  A+F + K
Subjt:  SPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGG-EFAEYDEKGGCPVMISNLEATFDLVK

AT4G32930.2 unknown protein | e-value 8.7e-66 | identity 65.14%
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQK--------CKFCGRDGTITMIPGRGQPLTQ
        MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGE++ KETCVTLNET     G+GT +LVQK        CKFCGR+G +TMIPG+G+PLT 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQK--------CKFCGRDGTITMIPGRGQPLTQ

Query:  ETSESGKSSPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGG-EFAEYDEKGGCPVMISNLEATFDLVK
        E SE+G+ +PLM+FDCRGYEP+DF FG  WKA++  GTKF++IDLS G EF EYDEKG CPVMISN  A+F + K
Subjt:  ETSESGKSSPLMLFDCRGYEPVDFVFGPGWKAESIEGTKFEDIDLSGG-EFAEYDEKGGCPVMISNLEATFDLVK


Sequences
CDS sequence
ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCCCAGGATGGTTGCGACGATCCCAACTTCACTTACCTTTTCAAAGTGAAATG
CGGGAGATGCGGGGAGTTGAGCCAGAAAGAAACGTGTGTGACCTTGAATGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTTGTTCAAAAGTGCAAGTTCT
GTGGGAGGGACGGAACTATTACAATGATTCCGGGGCGAGGTCAACCATTGACTCAGGAAACAAGTGAATCAGGGAAGTCATCTCCCTTGATGTTATTTGACTGCAGAGGT
TATGAGCCTGTGGACTTCGTATTTGGACCTGGATGGAAAGCAGAATCTATTGAGGGGACCAAATTTGAGGATATTGACTTGTCTGGAGGTGAGTTTGCAGAGTATGATGA
GAAGGGAGGATGCCCTGTCATGATTTCCAATCTAGAGGCCACATTTGACTTGGTAAAGTAA
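
As a consistency check, the CDS can be translated and compared with the protein sequence listed for this gene. The sketch below is a minimal Python example that assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last three codons of the CDS listed above.
print(translate("ATGGTGAACTTCTTGCTTAAGATCAAAGCG"))  # MVNFLLKIKA
print(translate("GTAAAGTAA"))  # VK
```

Passing the full 501-nucleotide CDS (167 codons including the stop) to `translate` should yield a 166-residue protein.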
mRNA sequence
ATGGATTTGACTGAGCTTCGTAGACCCGCACCGAAACCAGAGCCCATGGACCTCTAAATCCAAAATTCGACGCCTTGTTGAAGTGGAAGTGTGATTGATTGAGCAAAACC
GCTCAAAAAATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCCCAGGATGGTTGCGACGATCCCAACTTCACTTACCTTTTCAA
AGTGAAATGCGGGAGATGCGGGGAGTTGAGCCAGAAAGAAACGTGTGTGACCTTGAATGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTTGTTCAAAAGT
GCAAGTTCTGTGGGAGGGACGGAACTATTACAATGATTCCGGGGCGAGGTCAACCATTGACTCAGGAAACAAGTGAATCAGGGAAGTCATCTCCCTTGATGTTATTTGAC
TGCAGAGGTTATGAGCCTGTGGACTTCGTATTTGGACCTGGATGGAAAGCAGAATCTATTGAGGGGACCAAATTTGAGGATATTGACTTGTCTGGAGGTGAGTTTGCAGA
GTATGATGAGAAGGGAGGATGCCCTGTCATGATTTCCAATCTAGAGGCCACATTTGACTTGGTAAAGTAAGAGAGTAATCCCCACCTTAGATCTCTCATTGTAAAAAAAT
TGACCATATTGAATGGCCGTGGTGTTATTGATGGTCACCCAGTCTATACTTTTATGCATCAGTTCTCTTCATCTTTCTGTATTATGAAATATGAATTAAGTTTGAGGAGT
TGAATGAATGGTCATTATTTGTTTTCTTTTGTTGGTAGTTCTTTTTCCTTTTCCTTTTCTGGTATGGATACTCCTTCCATGATCAATATCTTAAAAAAAAAAAAAAAAGA
GATGTAGGGTTTACTAACATTAAACAAAATATTTATAGAGACAAAATGGACTTATGGTATTCTCCTAGTACTCAGAATTTTTTTTCCTACAAATTTCGTGTTTTTTCGTA
GAGATGTTTACCACTTATTGTTATCCACTTTTTTTCATGTGTTTTAAAAAACCAACCCAAATTTTGAAAACTAATTGTTTTTAGAATTTAGTTAAACATTCCCATGTTTT
TTTAAAAAAGTAAAAGGCATGGTAGTGAATTGTAGAGAATAGTCCAATTTTTTTGATTGGTTTTTTACAATGGGTTAAAAAGGCATAACTAATTAGAACTATTTTGGATT
TTCAGCATGTTGTCCTCAATAATATCCTTACTTAAAAGCGTCCGTTTAACTGGTGAGAATTTTACTACGTGGAAATCCAACCTGAATATGATTATGGTTGTTGATGACTT
ACAGTTTGTATTGACGGAGGAATGTTCTCAGGTCCCTACTCGAAACGCTCCTCAATCTGTTAAGGAAGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCT
ACATTTTGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGTTGCAGGAAATGTTTGGACAACCGTCTCGACAG
ATTCAACACGAATCTCTCAAATACGTTTATAATTCCCGTATGAAGGAGGGTTCATCGGTGAGAGAACACATTCTTGATCTGATGGTCCACTTCAACGTGGTTGAGATGAA
TGAAGCGGTCATTGACGAGCAAAGTCAGGTATCGTTCATCCTGAAATTTCTTCCGAAGAGTTTCCTGCAATTTCGCAGCAATGAGGTGATGAACAAGATAGAGTATAACC
TGACTACTTTACTTAATGAACTACAGACTTTCCAGTCTCTTATGAAGAATAAGGGCGGCAATGATGGAGAGCAAATCTGTTTGCCCATTCCGAGAAGGTTCCGAGAAGGT
TCATCCTCCGGGACTAAGTCCTCAGAGCTCATCTCTCTGGGCTTAAGAAGACCC
Protein sequence
MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLMLFDCRG
YEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK
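
The InterPro annotation for this gene refers to a CXXC motif (two cysteines separated by any two residues). A minimal Python sketch for locating such motifs in the protein sequence follows; `find_cxxc` is a hypothetical helper, and a pattern match alone does not show that a given site binds zinc.

```python
import re

# Protein sequence of Tan0018053 as listed above (166 residues).
PROTEIN = (
    "MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGELSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLMLFDCRG"
    "YEPVDFVFGPGWKAESIEGTKFEDIDLSGGEFAEYDEKGGCPVMISNLEATFDLVK"
)


def find_cxxc(seq):
    """Return (1-based start, motif) for every C-x-x-C match.

    The lookahead lets overlapping matches be reported as well.
    """
    return [(m.start() + 1, m.group(1)) for m in re.finditer(r"(?=(C..C))", seq)]


print(find_cxxc(PROTEIN))  # [(37, 'CGRC'), (71, 'CKFC')]
```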