; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012667 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012667
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF4283 domain-containing protein
Genome locationchr1:43173078..43175679
RNA-Seq ExpressionLag0012667
SyntenyLag0012667
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN33672.1 hypothetical protein [Cucumis melo subsp. melo]1.4e-3431.53Show/hide
Query:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDDTPGVVIAPRSNNVVDRNVSYSEALKKSLIETPTQ
        IE+K F ++  NR+ +L   ITE    KSFS+ +T ESL+W+++ F  L   PLT +FF + R ++                   Y   ++K+       
Subjt:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDDTPGVVIAPRSNNVVDRNVSYSEALKKSLIETPTQ

Query:  HLTQDATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVP
                          +V++ R+YFHD+W  I+  L ++L    S  P   +KALI  +D +QA  +   +GW  VG++ V+F  W+ ++      VP
Subjt:  HLTQDATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVP

Query:  SYGGWIKVKNLPLDKWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISIPSTSCSPITVKIDPLFVEDHNIGYKASIHG
        SYGGWIKV+ +PL  W+L++F  IG+ CGG++E A +T    D++E  I+V++N + FIPA I +         +++        ++G   SIHG
Subjt:  SYGGWIKVKNLPLDKWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISIPSTSCSPITVKIDPLFVEDHNIGYKASIHG

KAA0039967.1 hypothetical protein E6C27_scaffold122G002490 [Cucumis melo var. makuwa]1.4e-3432.29Show/hide
Query:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDD----------TPGVV-----IAPRSNN---VVDR
        IE+K F ++  NR+ +    ITE    KSFS+ +T ESL+W+++ F  L   P T +FF E R ++            G +     +  R      +V  
Subjt:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDD----------TPGVV-----IAPRSNN---VVDR

Query:  NVSYSEALKKSL-----IETPTQHLTQDATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYK
            SEA+ K         + T ++ +        +     +V++ R++FHD+W  I+  L ++L       P + DKALI  ++E+QA+ L   +GW  
Subjt:  NVSYSEALKKSL-----IETPTQHLTQDATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYK

Query:  VGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI
        VG++ V+F  WS +       +PSYGGWIKV+ +PL  W+L++F  IG+ CGG++E A +T    D+ E  IK+++N + FIPA I +
Subjt:  VGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI

KAA0040039.1 hypothetical protein E6C27_scaffold366G00060 [Cucumis melo var. makuwa]1.1e-2928.87Show/hide
Query:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNET-----------------------RVDD--TPGVVIAPR
        IE+K F ++  N + +    ITE    KSFS+ +T ESL+W+++ F  L   P T +FF E                        RVDD      ++ P 
Subjt:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNET-----------------------RVDD--TPGVVIAPR

Query:  --------------------SNNVVDRNV---------------------SYSEALKKSL-----IETPTQHLTQDATKPQFVALYLSSSVIIQRKYFHD
                            S+    R +                     SY+EA+ K         + T+++ +        +     + ++ R+YFHD
Subjt:  --------------------SNNVVDRNV---------------------SYSEALKKSL-----IETPTQHLTQDATKPQFVALYLSSSVIIQRKYFHD

Query:  NWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECG
        +W  I+  L ++L       P   DKALI  ++E+QA+ L   +GW  VG++ V+F  WS +       +PSYGGWIKV+ +PL  W+L++F  IG+ CG
Subjt:  NWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECG

Query:  GYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI
        G++E A +T    D+ E  IK+++N T FIPA I +
Subjt:  GYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI

TYK10355.1 hypothetical protein E5676_scaffold367G00330 [Cucumis melo var. makuwa]1.1e-2928.57Show/hide
Query:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDD-------------------------TPGVVIAPR
        IE+K F ++  N++ +L   ITE    KSFS+ +T ESL+W+++ F  L   P T +FF E R +D                             ++ P 
Subjt:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDD-------------------------TPGVVIAPR

Query:  ------------------------------SNNVVDR-----------NVSYSEA-LKKSLIETPTQHLTQDATKP----QFVALYLSSSVIIQRKYFHD
                                      +  + DR             SY+EA +K S  +  T   T +  K        +     +V++ R++FHD
Subjt:  ------------------------------SNNVVDR-----------NVSYSEA-LKKSLIETPTQHLTQDATKP----QFVALYLSSSVIIQRKYFHD

Query:  NWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECG
        +W  I+  L ++L       P   DKALI  ++E+QA  +   +GW  VG++ V+F  W+ +       +PSYGGWIKV+ +PL  W+L++F  IG+ CG
Subjt:  NWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECG

Query:  GYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI
        G+IE A +T    D++E  I++++N + FIPA I +
Subjt:  GYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI

TYK24535.1 hypothetical protein E5676_scaffold266G00770 [Cucumis melo var. makuwa]1.9e-3432.29Show/hide
Query:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDD----------TPGVV-----IAPRSNN---VVDR
        IE+K F ++  NR+ +    ITE    KSFS+ +T ESL+W+++ F  L   P T +FF E R ++            G +     +  R      +V  
Subjt:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDD----------TPGVV-----IAPRSNN---VVDR

Query:  NVSYSEALKKSL-----IETPTQHLTQDATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYK
            SEA+ K         + T ++ +        +     +V++ R++FHD+W  I+  L ++L       P   DKALI  ++E+QA+ L   +GW  
Subjt:  NVSYSEALKKSL-----IETPTQHLTQDATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYK

Query:  VGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI
        VG++ V+F  WS +       +PSYGGWIKV+ +PL  W+L++F  IG+ CGG++E A +T    D+ E  IK+++N + FIPA I +
Subjt:  VGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI

TrEMBL top hitse value%identityAlignment
A0A5A7TEK8 DUF4283 domain-containing protein6.9e-3532.29Show/hide
Query:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDD----------TPGVV-----IAPRSNN---VVDR
        IE+K F ++  NR+ +    ITE    KSFS+ +T ESL+W+++ F  L   P T +FF E R ++            G +     +  R      +V  
Subjt:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDD----------TPGVV-----IAPRSNN---VVDR

Query:  NVSYSEALKKSL-----IETPTQHLTQDATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYK
            SEA+ K         + T ++ +        +     +V++ R++FHD+W  I+  L ++L       P + DKALI  ++E+QA+ L   +GW  
Subjt:  NVSYSEALKKSL-----IETPTQHLTQDATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYK

Query:  VGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI
        VG++ V+F  WS +       +PSYGGWIKV+ +PL  W+L++F  IG+ CGG++E A +T    D+ E  IK+++N + FIPA I +
Subjt:  VGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI

A0A5A7TFK7 DUF4283 domain-containing protein5.1e-3028.87Show/hide
Query:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNET-----------------------RVDD--TPGVVIAPR
        IE+K F ++  N + +    ITE    KSFS+ +T ESL+W+++ F  L   P T +FF E                        RVDD      ++ P 
Subjt:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNET-----------------------RVDD--TPGVVIAPR

Query:  --------------------SNNVVDRNV---------------------SYSEALKKSL-----IETPTQHLTQDATKPQFVALYLSSSVIIQRKYFHD
                            S+    R +                     SY+EA+ K         + T+++ +        +     + ++ R+YFHD
Subjt:  --------------------SNNVVDRNV---------------------SYSEALKKSL-----IETPTQHLTQDATKPQFVALYLSSSVIIQRKYFHD

Query:  NWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECG
        +W  I+  L ++L       P   DKALI  ++E+QA+ L   +GW  VG++ V+F  WS +       +PSYGGWIKV+ +PL  W+L++F  IG+ CG
Subjt:  NWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECG

Query:  GYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI
        G++E A +T    D+ E  IK+++N T FIPA I +
Subjt:  GYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI

A0A5D3CFS8 DUF4283 domain-containing protein5.1e-3028.57Show/hide
Query:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDD-------------------------TPGVVIAPR
        IE+K F ++  N++ +L   ITE    KSFS+ +T ESL+W+++ F  L   P T +FF E R +D                             ++ P 
Subjt:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDD-------------------------TPGVVIAPR

Query:  ------------------------------SNNVVDR-----------NVSYSEA-LKKSLIETPTQHLTQDATKP----QFVALYLSSSVIIQRKYFHD
                                      +  + DR             SY+EA +K S  +  T   T +  K        +     +V++ R++FHD
Subjt:  ------------------------------SNNVVDR-----------NVSYSEA-LKKSLIETPTQHLTQDATKP----QFVALYLSSSVIIQRKYFHD

Query:  NWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECG
        +W  I+  L ++L       P   DKALI  ++E+QA  +   +GW  VG++ V+F  W+ +       +PSYGGWIKV+ +PL  W+L++F  IG+ CG
Subjt:  NWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECG

Query:  GYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI
        G+IE A +T    D++E  I++++N + FIPA I +
Subjt:  GYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI

A0A5D3DLP0 DUF4283 domain-containing protein9.0e-3532.29Show/hide
Query:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDD----------TPGVV-----IAPRSNN---VVDR
        IE+K F ++  NR+ +    ITE    KSFS+ +T ESL+W+++ F  L   P T +FF E R ++            G +     +  R      +V  
Subjt:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDD----------TPGVV-----IAPRSNN---VVDR

Query:  NVSYSEALKKSL-----IETPTQHLTQDATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYK
            SEA+ K         + T ++ +        +     +V++ R++FHD+W  I+  L ++L       P   DKALI  ++E+QA+ L   +GW  
Subjt:  NVSYSEALKKSL-----IETPTQHLTQDATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYK

Query:  VGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI
        VG++ V+F  WS +       +PSYGGWIKV+ +PL  W+L++F  IG+ CGG++E A +T    D+ E  IK+++N + FIPA I +
Subjt:  VGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLDKWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISI

E5GB30 DUF4283 domain-containing protein6.9e-3531.53Show/hide
Query:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDDTPGVVIAPRSNNVVDRNVSYSEALKKSLIETPTQ
        IE+K F ++  NR+ +L   ITE    KSFS+ +T ESL+W+++ F  L   PLT +FF + R ++                   Y   ++K+       
Subjt:  IERKTFSINPANRNPNL-FRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDDTPGVVIAPRSNNVVDRNVSYSEALKKSLIETPTQ

Query:  HLTQDATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVP
                          +V++ R+YFHD+W  I+  L ++L    S  P   +KALI  +D +QA  +   +GW  VG++ V+F  W+ ++      VP
Subjt:  HLTQDATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVP

Query:  SYGGWIKVKNLPLDKWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISIPSTSCSPITVKIDPLFVEDHNIGYKASIHG
        SYGGWIKV+ +PL  W+L++F  IG+ CGG++E A +T    D++E  I+V++N + FIPA I +         +++        ++G   SIHG
Subjt:  SYGGWIKVKNLPLDKWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISIPSTSCSPITVKIDPLFVEDHNIGYKASIHG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACCACCACCTTCAGAATTGAAAGGAAGACTTTCTCCATTAACCCTGCAAATAGAAACCCGAACCTTTTCCGTATCACAGAAACAACCAACGACAAAAGTTTCTC
TCTGACTTTGACTAAGGAGTCTCTGAAATGGATTCAAACTTTCTTTGACAGACTATGCTCCCTTCCTCTGACTCAGAAGTTCTTCAATGAAACGCGAGTTGATGACACGC
CCGGAGTAGTTATTGCACCTCGAAGCAACAATGTGGTCGACAGGAATGTTTCTTATAGTGAAGCATTAAAGAAGAGCCTTATTGAGACACCCACCCAGCATCTTACCCAA
GATGCAACAAAACCCCAGTTTGTTGCCCTCTATCTATCCTCCTCGGTTATCATACAAAGAAAATACTTCCACGATAACTGGTATGACATTATGCGAGTCCTGCAGAAAGA
ACTCTTAGCCTTTTCCTCCGTCAGCCCGATACAACCAGACAAAGCTCTTATCGCTTGCGAAGATGAAGACCAAGCGCGAACTCTAGCCAATATCCAAGGATGGTATAAGG
TTGGAAAATATCAGGTTCGTTTTTTACCGTGGAGTGCGGAAAACATGTTGGGCCAGCCAAAGGTTCCATCATATGGGGGTTGGATCAAAGTAAAAAATCTCCCTTTGGAC
AAGTGGTCCCTTGATACTTTCAAGTTCATCGGCAATGAGTGTGGAGGGTATATAGAAACAGCTAGCAAAACCCTTTCCCGTATGGATATGATGGAAATCGGCATCAAAGT
TCAAGAAAATGGCACCGATTTCATCCCAGCAGAAATCTCCATTCCATCGACATCTTGTAGCCCCATCACGGTTAAAATTGACCCCCTATTTGTGGAAGACCACAACATAG
GATACAAAGCTAGTATCCATGGAAAGATCCCGACCACTTCGATGATTATGGATTGCCCACGCGCCGCCGTCGAGGAAGAAAAGATGAATAGATGCGATATCCAAGGTCCA
CGCGCCTGTGATTATACTCGGAGAGGCACAAATATGGAAAAATCCCTTTTATCGTACCGACGCGATACCCAAAGTGCCCCCAATGATGTGCTGCTGATATCTTTGGATGT
GATTTATGCCGTAATACCAGAGGAGGGCCCACATATATCCTCATCTCCTGTTGAACTCACTTCCCCAAAATCCTCTAGCACAAGCCTCATTCTACCTATAGCTCCCAACA
AATCCCCAAAATGTCAGTCGACCATCCCTGCTAGCCGACAAACCCCACCTCCTCAATACCCATCTCCAAAGCCCACTCATAAAAGCCCAAACCCACCAATCTTTTATAAC
CAACCCCCTTCACCCAGATATGGACCAAAGACACTAGGCCCACCTACTGAAATCCATAAAGGTAGTAAAAAGCCCACTATGATCAATAATAAAAAGACCTACCTCCTCAC
GGGATCGGTAAACTCCACCAATACCGAACTACACGTATCAGACTCGGAAGATGCTCTATCATCTCCCTGCACGACAGTGATGGAGGACTCACCATTACCGACCCCACAAA
AAGCAATTATACCAGCAGCTTCTCCTCCGTCCATCCGCAACCTTTTCGAGCCCCAAATTGAACAACAGCCATATCTAGAAGAACTCATTCCCCTGTGTAGGGAAGAACCA
GCCCTCCTGTGTGACCAAACTCCAATGAATATGGAGGAAACAACTCTTATAGAAGTGGACATTGTAAAAGAGAATGATGATGAACAGGATAGCGGAACAGATGAAAAAGA
TCCAGCGAAAAGAGCCCTCATCAAGGATTTCATCTCTTCCCATAACCCATCTCTGGTGATCCTACAAGAAACTAAGTTAGCTTCCATCGACAGGAAGATCATTAAATCTC
TATGGAGCTCCAGGAGCATTTCTTGGGCTGCTGTTAATGCCACAGGCTCCTCGGGTGGCATTGCCATTATGTGGAATGAGGGGTCTATGGCCCAAGCTCTTATCATGATA
AGAAGATCTTTTGGAGAGAATTGGCCGATCTTCAAGCTCTCTGTCTGCCAAATTGGATTGTTGGAGGGGATTTCAACATTATTAGATGGACATGGGAAAAATCCACTCAC
ACAGCCCCCACCCGAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCACCACCACCTTCAGAATTGAAAGGAAGACTTTCTCCATTAACCCTGCAAATAGAAACCCGAACCTTTTCCGTATCACAGAAACAACCAACGACAAAAGTTTCTC
TCTGACTTTGACTAAGGAGTCTCTGAAATGGATTCAAACTTTCTTTGACAGACTATGCTCCCTTCCTCTGACTCAGAAGTTCTTCAATGAAACGCGAGTTGATGACACGC
CCGGAGTAGTTATTGCACCTCGAAGCAACAATGTGGTCGACAGGAATGTTTCTTATAGTGAAGCATTAAAGAAGAGCCTTATTGAGACACCCACCCAGCATCTTACCCAA
GATGCAACAAAACCCCAGTTTGTTGCCCTCTATCTATCCTCCTCGGTTATCATACAAAGAAAATACTTCCACGATAACTGGTATGACATTATGCGAGTCCTGCAGAAAGA
ACTCTTAGCCTTTTCCTCCGTCAGCCCGATACAACCAGACAAAGCTCTTATCGCTTGCGAAGATGAAGACCAAGCGCGAACTCTAGCCAATATCCAAGGATGGTATAAGG
TTGGAAAATATCAGGTTCGTTTTTTACCGTGGAGTGCGGAAAACATGTTGGGCCAGCCAAAGGTTCCATCATATGGGGGTTGGATCAAAGTAAAAAATCTCCCTTTGGAC
AAGTGGTCCCTTGATACTTTCAAGTTCATCGGCAATGAGTGTGGAGGGTATATAGAAACAGCTAGCAAAACCCTTTCCCGTATGGATATGATGGAAATCGGCATCAAAGT
TCAAGAAAATGGCACCGATTTCATCCCAGCAGAAATCTCCATTCCATCGACATCTTGTAGCCCCATCACGGTTAAAATTGACCCCCTATTTGTGGAAGACCACAACATAG
GATACAAAGCTAGTATCCATGGAAAGATCCCGACCACTTCGATGATTATGGATTGCCCACGCGCCGCCGTCGAGGAAGAAAAGATGAATAGATGCGATATCCAAGGTCCA
CGCGCCTGTGATTATACTCGGAGAGGCACAAATATGGAAAAATCCCTTTTATCGTACCGACGCGATACCCAAAGTGCCCCCAATGATGTGCTGCTGATATCTTTGGATGT
GATTTATGCCGTAATACCAGAGGAGGGCCCACATATATCCTCATCTCCTGTTGAACTCACTTCCCCAAAATCCTCTAGCACAAGCCTCATTCTACCTATAGCTCCCAACA
AATCCCCAAAATGTCAGTCGACCATCCCTGCTAGCCGACAAACCCCACCTCCTCAATACCCATCTCCAAAGCCCACTCATAAAAGCCCAAACCCACCAATCTTTTATAAC
CAACCCCCTTCACCCAGATATGGACCAAAGACACTAGGCCCACCTACTGAAATCCATAAAGGTAGTAAAAAGCCCACTATGATCAATAATAAAAAGACCTACCTCCTCAC
GGGATCGGTAAACTCCACCAATACCGAACTACACGTATCAGACTCGGAAGATGCTCTATCATCTCCCTGCACGACAGTGATGGAGGACTCACCATTACCGACCCCACAAA
AAGCAATTATACCAGCAGCTTCTCCTCCGTCCATCCGCAACCTTTTCGAGCCCCAAATTGAACAACAGCCATATCTAGAAGAACTCATTCCCCTGTGTAGGGAAGAACCA
GCCCTCCTGTGTGACCAAACTCCAATGAATATGGAGGAAACAACTCTTATAGAAGTGGACATTGTAAAAGAGAATGATGATGAACAGGATAGCGGAACAGATGAAAAAGA
TCCAGCGAAAAGAGCCCTCATCAAGGATTTCATCTCTTCCCATAACCCATCTCTGGTGATCCTACAAGAAACTAAGTTAGCTTCCATCGACAGGAAGATCATTAAATCTC
TATGGAGCTCCAGGAGCATTTCTTGGGCTGCTGTTAATGCCACAGGCTCCTCGGGTGGCATTGCCATTATGTGGAATGAGGGGTCTATGGCCCAAGCTCTTATCATGATA
AGAAGATCTTTTGGAGAGAATTGGCCGATCTTCAAGCTCTCTGTCTGCCAAATTGGATTGTTGGAGGGGATTTCAACATTATTAGATGGACATGGGAAAAATCCACTCAC
ACAGCCCCCACCCGAGCCATGA
Protein sequenceShow/hide protein sequence
MATTTFRIERKTFSINPANRNPNLFRITETTNDKSFSLTLTKESLKWIQTFFDRLCSLPLTQKFFNETRVDDTPGVVIAPRSNNVVDRNVSYSEALKKSLIETPTQHLTQ
DATKPQFVALYLSSSVIIQRKYFHDNWYDIMRVLQKELLAFSSVSPIQPDKALIACEDEDQARTLANIQGWYKVGKYQVRFLPWSAENMLGQPKVPSYGGWIKVKNLPLD
KWSLDTFKFIGNECGGYIETASKTLSRMDMMEIGIKVQENGTDFIPAEISIPSTSCSPITVKIDPLFVEDHNIGYKASIHGKIPTTSMIMDCPRAAVEEEKMNRCDIQGP
RACDYTRRGTNMEKSLLSYRRDTQSAPNDVLLISLDVIYAVIPEEGPHISSSPVELTSPKSSSTSLILPIAPNKSPKCQSTIPASRQTPPPQYPSPKPTHKSPNPPIFYN
QPPSPRYGPKTLGPPTEIHKGSKKPTMINNKKTYLLTGSVNSTNTELHVSDSEDALSSPCTTVMEDSPLPTPQKAIIPAASPPSIRNLFEPQIEQQPYLEELIPLCREEP
ALLCDQTPMNMEETTLIEVDIVKENDDEQDSGTDEKDPAKRALIKDFISSHNPSLVILQETKLASIDRKIIKSLWSSRSISWAAVNATGSSGGIAIMWNEGSMAQALIMI
RRSFGENWPIFKLSVCQIGLLEGISTLLDGHGKNPLTQPPPEP