; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000693 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000693
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr4:13153485..13155308
RNA-Seq ExpressionLag0000693
SyntenyLag0000693
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO61345.1 reverse transcriptase [Corchorus capsularis]1.8e-4133.61Show/hide
Query:  LSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGNLILLKAPTTF
        L+++E   +      +  T  +  N L+GK++S+R +N+E  R  M  VWK   G+ +  IG+NLF+   +S+  ++++  ++PW F+  L++LK+   +
Subjt:  LSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGNLILLKAPTTF

Query:  DLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIKYERLPDFCYYCGM
        D  ED+  +   FW Q H LPLG  NE +   +G   G V+ ++T      WG+FLR R ++ + +PL R + +     GKIL + +YE+LPDFCY CG 
Subjt:  DLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIKYERLPDFCYYCGM

Query:  IDHNDGSCPKL----KTQSKPEKQFGSWMRAASPPRQS
        + H +  C K     + + K  K++G W+RA  P  +S
Subjt:  IDHNDGSCPKL----KTQSKPEKQFGSWMRAASPPRQS

XP_023888198.1 uncharacterized protein LOC112000324 [Quercus suber]9.7e-4036.08Show/hide
Query:  DVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNR-LVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDG
        D+  K   L L+ +ET    T  L  +   N   NR LV K+ ++R +NIEA  +T+K +W+  +   + ++  N+ LI   +E    KIL + PW FD 
Subjt:  DVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNR-LVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDG

Query:  NLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIKYE
         LI L  P+  +  +D  F+ A FWVQ H+LPL   N   A  +GS LG V++V+ +   EC GR +R+RV + I++PLCR   ++ G +G +  + +YE
Subjt:  NLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIKYE

Query:  RLPDFCYYCGMIDHNDGSC----PKLKTQSKPEKQFGSWMRA--ASPPRQSRFHT
        R+P FCY+CG+++H++  C       +T  K  +Q+G W+RA  A+P +    HT
Subjt:  RLPDFCYYCGMIDHNDGSC----PKLKTQSKPEKQFGSWMRA--ASPPRQSRFHT

XP_023926510.1 uncharacterized protein LOC112037926 [Quercus suber]2.8e-3936.08Show/hide
Query:  DVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNR-LVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDG
        D+  K   L L+ +ET    T  L  +   N   NR LV K+ ++R +NIEA  +T+K +W+  +   + ++  N  LI   +E    KIL + PW FD 
Subjt:  DVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNR-LVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDG

Query:  NLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIKYE
         LI L  P+  +  +D  F+ A FWVQ H+LPL   N   A  +GS LG V++V+ +   EC GR +R+RV + I++PLCR   ++ G +G +  + +YE
Subjt:  NLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIKYE

Query:  RLPDFCYYCGMIDHNDGSC----PKLKTQSKPEKQFGSWMRA--ASPPRQSRFHT
        R+P FCY+CG+++H++  C       +T  K  +Q+G W+RA  A+P +    HT
Subjt:  RLPDFCYYCGMIDHNDGSC----PKLKTQSKPEKQFGSWMRA--ASPPRQSRFHT

XP_038708494.1 uncharacterized protein LOC120003548 [Tripterygium wilfordii]2.8e-3933.33Show/hide
Query:  DVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGN
        D+ ++   L L+ +E+  +  +     T ++  +  L G+++S R  N EAF  TMK +WK  RG+    +G+NL+L+   S++  +K+LD SPW FD +
Subjt:  DVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGN

Query:  LILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSG-KILAAIKYE
         +LLK       P D+VFE+   WV+F++LP  + NE V   LG+ +G +++V+ +     WG++LRVR+++ I +PL R + + G   G  I   ++YE
Subjt:  LILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSG-KILAAIKYE

Query:  RLPDFCYYCGMIDHNDGSCPKLKTQSKPEKQFGSWMRAASPPRQSRFHTNNRDYEGFRGGRGRFGYRQRRRNPWR-SNVGEEEKETNHRGSGDSQSSDES
        RLP+FCY CG + H D  C   +T+    K +G W+R +SPPR  R  T + D +       R   R+   +  + ++ G+ +   N RG    +SS   
Subjt:  RLPDFCYYCGMIDHNDGSCPKLKTQSKPEKQFGSWMRAASPPRQSRFHTNNRDYEGFRGGRGRFGYRQRRRNPWR-SNVGEEEKETNHRGSGDSQSSDES

Query:  GDE
        GD+
Subjt:  GDE

XP_038716189.1 uncharacterized protein At4g02000-like [Tripterygium wilfordii]9.7e-4036.59Show/hide
Query:  DVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGN
        D+ ++   L L  +E+  +  +     T ++  +  L G+++S R  N EAF  TMK +WK  RG+ I  +G+NL+L++  S++  +K+LD SPW FD +
Subjt:  DVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGN

Query:  LILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSG-KILAAIKYE
         +LLK       P D+VFE+   WV+F +LP  + NE V   LG+ +G +++V+ +     WG++LRVR+ + I +PL R + + G   G  I   ++YE
Subjt:  LILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSG-KILAAIKYE

Query:  RLPDFCYYCGMIDHNDGSCPKLKTQSKPEKQFGSWMRAASPPRQSR
        RLP+FCY CG + H D  C   +T     K +G W+R +SPPR  R
Subjt:  RLPDFCYYCGMIDHNDGSCPKLKTQSKPEKQFGSWMRAASPPRQSR

TrEMBL top hitse value%identityAlignment
A0A1R3GTB5 Reverse transcriptase8.6e-4233.61Show/hide
Query:  LSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGNLILLKAPTTF
        L+++E   +      +  T  +  N L+GK++S+R +N+E  R  M  VWK   G+ +  IG+NLF+   +S+  ++++  ++PW F+  L++LK+   +
Subjt:  LSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGNLILLKAPTTF

Query:  DLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIKYERLPDFCYYCGM
        D  ED+  +   FW Q H LPLG  NE +   +G   G V+ ++T      WG+FLR R ++ + +PL R + +     GKIL + +YE+LPDFCY CG 
Subjt:  DLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIKYERLPDFCYYCGM

Query:  IDHNDGSCPKL----KTQSKPEKQFGSWMRAASPPRQS
        + H +  C K     + + K  K++G W+RA  P  +S
Subjt:  IDHNDGSCPKL----KTQSKPEKQFGSWMRAASPPRQS

A0A1R3K847 Uncharacterized protein2.3e-3935.1Show/hide
Query:  LVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGNLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRN
        L+GK++S+R +NI+  R  +  VWK   G+ +  IG+ L++   +SE  ++++  + PW F+  L++LK    F   E++  E   FW+Q H LPLG   
Subjt:  LVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGNLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRN

Query:  EIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIKYERLPDFCYYCGMIDHNDGSCPKL----KTQSKPEKQFG
        E V   +G   G V  ++T      WG+FLR+R  + +++PL R + +     GKIL + +YE+LPDFCY CG ++H +  C K     +   K +K++G
Subjt:  EIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIKYERLPDFCYYCGMIDHNDGSCPKL----KTQSKPEKQFG

Query:  SWMRAASP
         W+RA  P
Subjt:  SWMRAASP

A0A2N9F689 Uncharacterized protein2.0e-3835.77Show/hide
Query:  MDVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWK--FCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCF
        +DVI  L  L L++ E   I   E   + +Q K +  L+ K+++QR  N    + TM  +WK    RG+   +IGDNLFL     +  ++ +L  SPWCF
Subjt:  MDVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWK--FCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCF

Query:  DGNLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIK
        D  L++LK         ++  ++A FWVQ + LP+      V   +G  +G V+ VE       WGRFLRVRV++ + +PL R   +  G    IL   +
Subjt:  DGNLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIK

Query:  YERLPDFCYYCGMIDHNDGSCP-KLKTQSK-----PEKQFGSWMRA
        YERLP+FC+YCG +DH +  C  +L+ + K      E Q+G W+RA
Subjt:  YERLPDFCYYCGMIDHNDGSCP-KLKTQSK-----PEKQFGSWMRA

A0A5C7HUW5 CCHC-type domain-containing protein2.6e-3839.27Show/hide
Query:  LVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGNLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRN
        LVGKV++ + +N EAF+K ++ +W     + I+ +GDN F+    S + R  I    PW FD NLI+L+ PT       + F   +FWVQ H+LPL   N
Subjt:  LVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGNLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRN

Query:  EIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAI-KYERLPDFCYYCGMIDHNDGSCP----KLKTQSKPEKQF
          +A  L  ++G V  + T ++ ECWGRF+RV+V++ I +PL R L +N   SG++  AI  YERLP+FCY CG+I H    CP    +L+       ++
Subjt:  EIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAI-KYERLPDFCYYCGMIDHNDGSCP----KLKTQSKPEKQF

Query:  GSWMRAASPPRQSRFHTNN
        GSW+RAAS  +    ++ N
Subjt:  GSWMRAASPPRQSRFHTNN

A0A5C7IY45 CCHC-type domain-containing protein4.9e-3732Show/hide
Query:  DVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGN
        D+     AL + +KE+             + +L   LVGKV+  +++N EAFR  M+ VW+   G+ I+ I DN+F    ++ + R ++L   PW FD  
Subjt:  DVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGN

Query:  LILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILA-AIKYE
        L++L+ P+      +M F   +FWVQ H+LP+   +E +   LG+ +G V+ V+         RF+RVRV +++DEPL RSL ++    GKI    ++YE
Subjt:  LILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILA-AIKYE

Query:  RLPDFCYYCGMIDHNDGSCPKL----KTQSKPEKQFGSWMRAASPPRQSRFHT-NNRDYEGFRG-GRGRFGY-----RQRRRNPWRSNVGEEEKETNHRG
        R+PDFCY C  + HN G C       K  ++   +  +WMR  SPP++  + +   R  +G R     R+G+       R +  WRS   + +K+   +G
Subjt:  RLPDFCYYCGMIDHNDGSCPKL----KTQSKPEKQFGSWMRAASPPRQSRFHT-NNRDYEGFRG-GRGRFGY-----RQRRRNPWRSNVGEEEKETNHRG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G31430.1 unknown protein9.1e-1230.15Show/hide
Query:  EQGRDKILDESPWCFDGNLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSL
        E+  + +L   PW F+  +ILL+       P+  +F    FWVQ   +P    N  V   +G  LG V   + N        F RV +   I  PL    
Subjt:  EQGRDKILDESPWCFDGNLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSL

Query:  TINGGPSGKILAAIKYERLPDFCYYCGMIDHNDGSC
                  L   +YERL  FC  CGM+ H+ G+C
Subjt:  TINGGPSGKILAAIKYERLPDFCYYCGMIDHNDGSC

AT3G42140.1 zinc ion binding;nucleic acid binding1.6e-0825.9Show/hide
Query:  QSEQGRDKILDESPWCFDGNLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCR
        QSE+    IL   PW F+  + +++  T   L  D  F+   FW+Q   +PL      +  ++G R+G+   +ETN                     L R
Subjt:  QSEQGRDKILDESPWCFDGNLILLKAPTTFDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCR

Query:  SLTINGGPSGKILAAIKYERLPDFCYYCGMIDHNDGSCP
         +++            +YE+L +FC  CGM+ H+   CP
Subjt:  SLTINGGPSGKILAAIKYERLPDFCYYCGMIDHNDGSCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTAATCGAAAAGCTAACAGCGCTCCAGCTTTCTCAAAAGGAAACTAGCGGCATTGACACATCGGAGTTAGGTCTGCAAACGACTCAAAATAAGCTCCAAAACAG
ATTGGTGGGAAAGGTTATCTCTCAAAGGATAATTAACATTGAAGCATTCAGAAAGACCATGAAAGGAGTGTGGAAATTTTGCAGAGGGATAACTATTGATAACATTGGGG
ACAATCTTTTCCTAATTAATCCTCAATCAGAACAGGGAAGGGACAAAATCTTGGATGAGAGTCCATGGTGTTTTGATGGCAACTTGATACTTCTCAAAGCTCCAACGACT
TTTGATTTACCAGAAGATATGGTTTTTGAAGATGCCCAATTCTGGGTTCAATTTCATCACTTACCGCTTGGACTAAGGAATGAAATAGTGGCTCACACTCTTGGGAGTCG
ATTAGGGATAGTTCAAAGAGTTGAGACGAATGAAGCGGATGAATGTTGGGGTCGCTTCCTTCGAGTTCGAGTCAAGATGAGGATCGATGAACCATTATGTCGAAGCCTAA
CTATAAATGGCGGACCATCTGGAAAAATTTTGGCAGCAATTAAATACGAGAGACTACCGGATTTCTGTTACTATTGTGGGATGATTGATCACAACGATGGATCTTGCCCA
AAACTCAAGACTCAATCGAAACCAGAAAAACAATTCGGTTCATGGATGAGAGCGGCATCTCCCCCGCGACAGAGTCGTTTCCACACAAATAACAGAGACTATGAGGGATT
TAGAGGAGGAAGAGGAAGGTTTGGCTACAGACAAAGAAGACGAAACCCATGGCGATCAAATGTGGGGGAAGAGGAAAAGGAAACCAACCATAGAGGATCAGGGGACTCCC
AAAGCTCCGACGAAAGCGGTGACGAACGGCAGCCTGAGAACAATCGCCGGTCACCCAATGCACCCGTTTCCGATGATCTCACCACGCAATCAATACCATCGACGGAGATT
ACGGGTAATAAATGCCCTCTTTCCTTCAAAAATGATATTTGTGTAGATACTCTCCCTAATAATGCAGGTGGCGTGATTCACGGCAATGAAGATGGAGAAATTGATGGCCG
AAACAAATCTGGAAAATTGAAATCAAACGAGCCAAATCAGGGAGAAGGGAGAAGGAGAAACGGCATGGAGGATCTAGGGCAAGAGGAGTCGGTGGAAAAAATCAATGTTG
GTGGGCCAGGGGCTAAAATAACCCATTCTATCATCAAGCCCATGGAAGTCACTCTGCCTACTGATAAGATTTTTAAACCCAACTCAAGCCCATTTGCATCTTTAACGAAG
TGCAGCCCAATTGTAGCAAATTCAAGTTTGAAATGTAGCCCATTTTATACTTCAACAATGAATCCAGTAGCTTCCAATGACCCAACTTCCAGCATGTTTTCCAGTAAGGC
CAATGAACAAAATAAATTTCAAAATCAATTGAATGGTGAAGACAACATGGAAGAGGATCTCTTGAAGCCCACACAGCACCCGACAGAAGTTCATCAGATATTACTAAAAA
ATGCTGGGGAATTACCAAACGGAAGGGAGGAAAATGGGGCAATAAAAATTTCAACCAGCAGTGAGAATAAGCAATCAAGAGGATGGAAGAGGGTAAATAGGCCAAGAGAT
ATAGTGAAGAACAACTACCCCACACAACAAACAGATCAGTTTGCTCACAAGAAGAGGGCGTTAGATGGCGACAACTCAGAAAATTGTCAAAGCAAGAGACTCAAATCACC
ACTTCCTCTAAATGATGGAAGTCTATCGGCGGAGCCTGTTAAACAGGCCCGCCGGGAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTAATCGAAAAGCTAACAGCGCTCCAGCTTTCTCAAAAGGAAACTAGCGGCATTGACACATCGGAGTTAGGTCTGCAAACGACTCAAAATAAGCTCCAAAACAG
ATTGGTGGGAAAGGTTATCTCTCAAAGGATAATTAACATTGAAGCATTCAGAAAGACCATGAAAGGAGTGTGGAAATTTTGCAGAGGGATAACTATTGATAACATTGGGG
ACAATCTTTTCCTAATTAATCCTCAATCAGAACAGGGAAGGGACAAAATCTTGGATGAGAGTCCATGGTGTTTTGATGGCAACTTGATACTTCTCAAAGCTCCAACGACT
TTTGATTTACCAGAAGATATGGTTTTTGAAGATGCCCAATTCTGGGTTCAATTTCATCACTTACCGCTTGGACTAAGGAATGAAATAGTGGCTCACACTCTTGGGAGTCG
ATTAGGGATAGTTCAAAGAGTTGAGACGAATGAAGCGGATGAATGTTGGGGTCGCTTCCTTCGAGTTCGAGTCAAGATGAGGATCGATGAACCATTATGTCGAAGCCTAA
CTATAAATGGCGGACCATCTGGAAAAATTTTGGCAGCAATTAAATACGAGAGACTACCGGATTTCTGTTACTATTGTGGGATGATTGATCACAACGATGGATCTTGCCCA
AAACTCAAGACTCAATCGAAACCAGAAAAACAATTCGGTTCATGGATGAGAGCGGCATCTCCCCCGCGACAGAGTCGTTTCCACACAAATAACAGAGACTATGAGGGATT
TAGAGGAGGAAGAGGAAGGTTTGGCTACAGACAAAGAAGACGAAACCCATGGCGATCAAATGTGGGGGAAGAGGAAAAGGAAACCAACCATAGAGGATCAGGGGACTCCC
AAAGCTCCGACGAAAGCGGTGACGAACGGCAGCCTGAGAACAATCGCCGGTCACCCAATGCACCCGTTTCCGATGATCTCACCACGCAATCAATACCATCGACGGAGATT
ACGGGTAATAAATGCCCTCTTTCCTTCAAAAATGATATTTGTGTAGATACTCTCCCTAATAATGCAGGTGGCGTGATTCACGGCAATGAAGATGGAGAAATTGATGGCCG
AAACAAATCTGGAAAATTGAAATCAAACGAGCCAAATCAGGGAGAAGGGAGAAGGAGAAACGGCATGGAGGATCTAGGGCAAGAGGAGTCGGTGGAAAAAATCAATGTTG
GTGGGCCAGGGGCTAAAATAACCCATTCTATCATCAAGCCCATGGAAGTCACTCTGCCTACTGATAAGATTTTTAAACCCAACTCAAGCCCATTTGCATCTTTAACGAAG
TGCAGCCCAATTGTAGCAAATTCAAGTTTGAAATGTAGCCCATTTTATACTTCAACAATGAATCCAGTAGCTTCCAATGACCCAACTTCCAGCATGTTTTCCAGTAAGGC
CAATGAACAAAATAAATTTCAAAATCAATTGAATGGTGAAGACAACATGGAAGAGGATCTCTTGAAGCCCACACAGCACCCGACAGAAGTTCATCAGATATTACTAAAAA
ATGCTGGGGAATTACCAAACGGAAGGGAGGAAAATGGGGCAATAAAAATTTCAACCAGCAGTGAGAATAAGCAATCAAGAGGATGGAAGAGGGTAAATAGGCCAAGAGAT
ATAGTGAAGAACAACTACCCCACACAACAAACAGATCAGTTTGCTCACAAGAAGAGGGCGTTAGATGGCGACAACTCAGAAAATTGTCAAAGCAAGAGACTCAAATCACC
ACTTCCTCTAAATGATGGAAGTCTATCGGCGGAGCCTGTTAAACAGGCCCGCCGGGAGCCATGA
Protein sequenceShow/hide protein sequence
MDVIEKLTALQLSQKETSGIDTSELGLQTTQNKLQNRLVGKVISQRIINIEAFRKTMKGVWKFCRGITIDNIGDNLFLINPQSEQGRDKILDESPWCFDGNLILLKAPTT
FDLPEDMVFEDAQFWVQFHHLPLGLRNEIVAHTLGSRLGIVQRVETNEADECWGRFLRVRVKMRIDEPLCRSLTINGGPSGKILAAIKYERLPDFCYYCGMIDHNDGSCP
KLKTQSKPEKQFGSWMRAASPPRQSRFHTNNRDYEGFRGGRGRFGYRQRRRNPWRSNVGEEEKETNHRGSGDSQSSDESGDERQPENNRRSPNAPVSDDLTTQSIPSTEI
TGNKCPLSFKNDICVDTLPNNAGGVIHGNEDGEIDGRNKSGKLKSNEPNQGEGRRRNGMEDLGQEESVEKINVGGPGAKITHSIIKPMEVTLPTDKIFKPNSSPFASLTK
CSPIVANSSLKCSPFYTSTMNPVASNDPTSSMFSSKANEQNKFQNQLNGEDNMEEDLLKPTQHPTEVHQILLKNAGELPNGREENGAIKISTSSENKQSRGWKRVNRPRD
IVKNNYPTQQTDQFAHKKRALDGDNSENCQSKRLKSPLPLNDGSLSAEPVKQARREP