CuGenDBv2

Lag0028592 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028592
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr8:25983670..25985151
RNA-Seq Expression: Lag0028592
Synteny: Lag0028592
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR025558 - Domain of unknown function DUF4283
    IPR025836 - Zinc knuckle CX2CX4HX4C
    IPR040256 - Uncharacterized protein At4g02000-like
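The genome location above is written as sequence:start..end. As a minimal sketch of how such a string could be split into its parts, the following Python snippet parses it and computes the span. The function names are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

# Matches locations written as "<sequence>:<start>..<end>", as shown on this page.
LOCATION_RE = re.compile(r"^(?P<seq>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (sequence name, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group("seq"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seq_id, start, end = parse_location("chr8:25983670..25985151")
print(seq_id, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 1482 bp on chr8.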


Homology
GenBank top hits (columns: e value, % identity, alignment)
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense] (e value: 2.1e-35, identity: 31.54%)
Query:  IVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQ-EQTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFYDKS
        I + Y +L + E E A+V+ + E  I    +  +  LV K+ T K++  E F  ++ +IW Q  Q  ++ VG N F+  F N + + ++   GPW + KS
Subjt:  IVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQ-EQTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFYDKS

Query:  LLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLK--KREEDRWIAITY
        L++LE+PKG       +F    F++  H +P  C ++ +T  +   +G  E+V++  E ++ WG  +R+K+QVD+T PLKR + +K  K EE   +A+ Y
Subjt:  LLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLK--KREEDRWIAITY

Query:  EKLPDFCYGCGHLGHTLKEC----EKDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSFQRRGRGRSREEGRESWRSKAVQEE----MEDDNVNP
        E+LPDFC+ CG +GH+++EC     K    + +   +G W+R         A P+  S         G S E GR    S+ ++ +    M   NV P
Subjt:  EKLPDFCYGCGHLGHTLKEC----EKDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSFQRRGRGRSREEGRESWRSKAVQEE----MEDDNVNP

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia] (e value: 2.7e-35, identity: 30.51%)
Query:  VESIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWK-QEQTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFY
        ++ I   +   K T  E  +V  +  G+  ++    +  +V K+ T KRI  E   S+M  +W+    T  + +G N+++  F++   K +++  GPW +
Subjt:  VESIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWK-QEQTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFY

Query:  DKSLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGW-GCALRIKIQVDVTIPLKRGIFLKKRE-EDRWIA
        +KSLL+L  P       D  F + +F+I  H +P+ C S +    +G+ LG VE  ++E +   GW G  +R+++++DV+ PL+RGI LK  + +D W  
Subjt:  DKSLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGW-GCALRIKIQVDVTIPLKRGIFLKKRE-EDRWIA

Query:  ITYEKLPDFCYGCGHLGHTLKECE---KDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSFQRRGRGRSREEGR----------ESWRS----KAVQ
        + YEKLPDFCY CG +GH+ +ECE   K   TN  +  YG WLR  + +K   + P     +   R GRG     GR          E+WR     ++  
Subjt:  ITYEKLPDFCYGCGHLGHTLKECE---KDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSFQRRGRGRSREEGR----------ESWRS----KAVQ

Query:  EEMEDDNVNPIQTDVGKGVAAAGLSEKEAKI
            ++ V+ +  +  + V AA ++ ++AKI
Subjt:  EEMEDDNVNPIQTDVGKGVAAAGLSEKEAKI

XP_030940122.1 uncharacterized protein LOC115965057 [Quercus lobata] (e value: 1.3e-34, identity: 35.47%)
Query:  ESIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQEQTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFYDK
        E + + +  L  TE E   +  L E     +++  +N LV K+ TQ+ I  E     M  +WK  +T+       LFL +F + + K +I+E+ PW ++K
Subjt:  ESIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQEQTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFYDK

Query:  SLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLKK----REEDRWIA
        +L+LL+E  G++  ++ + K+  F +    LP    +R++   IGS LGKV  VD+ E+  Q WG  LR++IQ+DVT   K+ I+ KK     +E RW+ 
Subjt:  SLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLKK----REEDRWIA

Query:  ITYEKLPDFCYGCGHLGHTLKEC----EKDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSF
          YE+LP+FCY CG +GH+  EC    E + G  +E   YG WLR         AEPV+ S +SF
Subjt:  ITYEKLPDFCYGCGHLGHTLKEC----EKDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSF

XP_030964252.1 uncharacterized protein LOC115985457 [Quercus lobata] (e value: 3.2e-36, identity: 35.74%)
Query:  ESIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQEQTI-IDHVGFNLFLCKFRNAQIKGQIVELGPWFYD
        E + + +  L  TE E   +  L E     +++  +N LV K+ TQ+ I  E     M  +WK  +T+ I  +   LFL +F + + K +I+E+ PW ++
Subjt:  ESIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQEQTI-IDHVGFNLFLCKFRNAQIKGQIVELGPWFYD

Query:  KSLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLK-KREEDRWIAIT
        K+L+LL+E  G++  ++ + K+  F++    LP    +R++   IGS LGKV  VD+ E+  Q WG  LR++IQ+DVT  L R   +  + +E RW+   
Subjt:  KSLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLK-KREEDRWIAIT

Query:  YEKLPDFCYGCGHLGHTLKEC----EKDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSF
        YE+LP+FCY CG +GH+ +EC    E + G  +E   YG WLR         AEPV+ S FSF
Subjt:  YEKLPDFCYGCGHLGHTLKEC----EKDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSF

XP_035541689.1 uncharacterized protein LOC118344688 [Juglans regia] (e value: 7.8e-35, identity: 32.94%)
Query:  LKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQEQTI-IDHVGFNLFLCKFRNAQIKGQIVELGPWFYDKSLLLLEEP
        LK+TE E+   Y+L E EI  S+    + LV  +   + +    F + M ++W  E  I    +G N FL KF N  ++ +++   PW +D++L+ ++E 
Subjt:  LKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQEQTI-IDHVGFNLFLCKFRNAQIKGQIVELGPWFYDKSLLLLEEP

Query:  KGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLKKREEDRWIAITYEKLPDFCYG
        KG +  +D +F    F++  H LP+A  ++ +  ++G+  GKV MVD++E+ +  WG  LR+K+ ++++ PL RG  +   +   WI   YE+LP FCY 
Subjt:  KGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLKKREEDRWIAITYEKLPDFCYG

Query:  CGHLGHTLKECEK---DCGTNEEDLP-YGPWLREPVKIKVLEAEPVRYSSFSFQR
        CG + H+   C +   D  T+ E  P YGPWLR     K     P  Y + + QR
Subjt:  CGHLGHTLKECEK---DCGTNEEDLP-YGPWLREPVKIKVLEAEPVRYSSFSFQR

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5C7GU64 CCHC-type domain-containing protein (e value: 1.0e-35, identity: 31.54%)
Query:  IVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQ-EQTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFYDKS
        I + Y +L + E E A+V+ + E  I    +  +  LV K+ T K++  E F  ++ +IW Q  Q  ++ VG N F+  F N + + ++   GPW + KS
Subjt:  IVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQ-EQTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFYDKS

Query:  LLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLK--KREEDRWIAITY
        L++LE+PKG       +F    F++  H +P  C ++ +T  +   +G  E+V++  E ++ WG  +R+K+QVD+T PLKR + +K  K EE   +A+ Y
Subjt:  LLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLK--KREEDRWIAITY

Query:  EKLPDFCYGCGHLGHTLKEC----EKDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSFQRRGRGRSREEGRESWRSKAVQEE----MEDDNVNP
        E+LPDFC+ CG +GH+++EC     K    + +   +G W+R         A P+  S         G S E GR    S+ ++ +    M   NV P
Subjt:  EKLPDFCYGCGHLGHTLKEC----EKDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSFQRRGRGRSREEGRESWRSKAVQEE----MEDDNVNP

A0A6J1BSZ1 uncharacterized protein LOC111005481 (e value: 2.5e-34, identity: 32.37%)
Query:  SIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQE--QTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFYD
        +++ ++ + K+T  E      +    ++ + K  E +L+CK+ +++ I   V  + +   WK +     +D +GFN+FL  F  +  + +I+ +GPW +D
Subjt:  SIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQE--QTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFYD

Query:  KSLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLKKREE--DRWIAI
        ++L++++ P       D +F+ VS ++HF  L  AC ++   TR+G+ +G  E V+        WG  LR++++ DV  PL RGI L         WI I
Subjt:  KSLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLKKREE--DRWIAI

Query:  TYEKLPDFCYGCGHLGHTLKECEKDC-GTNEEDLPYGPWLR
         YE+LPDF Y CG L H LK+C   C  +  ++L YGPWLR
Subjt:  TYEKLPDFCYGCGHLGHTLKECEKDC-GTNEEDLPYGPWLR

A0A6J1D765 uncharacterized protein LOC111017902 (e value: 1.3e-35, identity: 30.51%)
Query:  VESIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWK-QEQTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFY
        ++ I   +   K T  E  +V  +  G+  ++    +  +V K+ T KRI  E   S+M  +W+    T  + +G N+++  F++   K +++  GPW +
Subjt:  VESIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWK-QEQTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFY

Query:  DKSLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGW-GCALRIKIQVDVTIPLKRGIFLKKRE-EDRWIA
        +KSLL+L  P       D  F + +F+I  H +P+ C S +    +G+ LG VE  ++E +   GW G  +R+++++DV+ PL+RGI LK  + +D W  
Subjt:  DKSLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGW-GCALRIKIQVDVTIPLKRGIFLKKRE-EDRWIA

Query:  ITYEKLPDFCYGCGHLGHTLKECE---KDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSFQRRGRGRSREEGR----------ESWRS----KAVQ
        + YEKLPDFCY CG +GH+ +ECE   K   TN  +  YG WLR  + +K   + P     +   R GRG     GR          E+WR     ++  
Subjt:  ITYEKLPDFCYGCGHLGHTLKECE---KDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSFQRRGRGRSREEGR----------ESWRS----KAVQ

Query:  EEMEDDNVNPIQTDVGKGVAAAGLSEKEAKI
            ++ V+ +  +  + V AA ++ ++AKI
Subjt:  EEMEDDNVNPIQTDVGKGVAAAGLSEKEAKI

A0A6P9E2G0 uncharacterized protein LOC118344688 (e value: 3.8e-35, identity: 32.94%)
Query:  LKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQEQTI-IDHVGFNLFLCKFRNAQIKGQIVELGPWFYDKSLLLLEEP
        LK+TE E+   Y+L E EI  S+    + LV  +   + +    F + M ++W  E  I    +G N FL KF N  ++ +++   PW +D++L+ ++E 
Subjt:  LKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQEQTI-IDHVGFNLFLCKFRNAQIKGQIVELGPWFYDKSLLLLEEP

Query:  KGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLKKREEDRWIAITYEKLPDFCYG
        KG +  +D +F    F++  H LP+A  ++ +  ++G+  GKV MVD++E+ +  WG  LR+K+ ++++ PL RG  +   +   WI   YE+LP FCY 
Subjt:  KGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLKKREEDRWIAITYEKLPDFCYG

Query:  CGHLGHTLKECEK---DCGTNEEDLP-YGPWLREPVKIKVLEAEPVRYSSFSFQR
        CG + H+   C +   D  T+ E  P YGPWLR     K     P  Y + + QR
Subjt:  CGHLGHTLKECEK---DCGTNEEDLP-YGPWLREPVKIKVLEAEPVRYSSFSFQR

A0A6P9EGW2 uncharacterized protein LOC118348645 (e value: 5.5e-34, identity: 33.47%)
Query:  ESIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQEQTI-IDHVGFNLFLCKFRNAQIKGQIVELGPWFYD
        E I      LK+TE E+   Y+L+E EI  S     + LV  +   + +    F + M ++W  E  I    +G N FL +F N+ ++ +++   PW +D
Subjt:  ESIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQEQTI-IDHVGFNLFLCKFRNAQIKGQIVELGPWFYD

Query:  KSLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLKKREEDRWIAITY
        ++L+ ++E KG +  +D +F    F++    LP+A  ++ ++ ++G+  GKV MVD++E+ +  WG  LR+K+ +D++ PL RG  +   +   WI   Y
Subjt:  KSLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLKKREEDRWIAITY

Query:  EKLPDFCYGCGHLGHTLKECEK---DCGTN-EEDLPYGPWLR
        EKLP FCY CG + HT   C +   D  T+ +  L YGPWLR
Subjt:  EKLPDFCYGCGHLGHTLKECEK---DCGTN-EEDLPYGPWLR

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G42140.1 zinc ion binding; nucleic acid binding (e value: 2.0e-04, identity: 22.3%)
Query:  FRNAQIKGQIVELGPWFYDKSLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPL
        F++ +    I+  GPW ++  + +++  +      D EFK + F+I    +P    +    T IG  +                G  L   +  DV++  
Subjt:  FRNAQIKGQIVELGPWFYDKSLLLLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPL

Query:  KRGIFLKKREEDRWIAITYEKLPDFCYGCGHLGHTLKEC
                      +   YEKL +FC  CG L H   EC
Subjt:  KRGIFLKKREEDRWIAITYEKLPDFCYGCGHLGHTLKEC


Sequences
CDS sequence
ATGGCTACCAACAACGATGTTGAGTCAATTGTGCGGCAGTATGGGGATCTGAAAGTGACAGAGGCAGAGAAAGCAAGTGTATATCATTTGCAAGAAGGGGAAATCGATAT
CTCAAGGAAAAAGTTTGAAAATGCATTAGTTTGCAAGATTTTTACACAAAAAAGAATTCCACCAGAGGTGTTTTCTTCCATGATGCTGAAGATTTGGAAGCAGGAGCAAA
CTATTATTGATCATGTCGGTTTCAACTTGTTTTTGTGCAAGTTTAGAAACGCTCAAATCAAAGGCCAAATCGTGGAATTGGGACCATGGTTCTATGATAAATCTTTGCTG
CTATTGGAAGAACCAAAGGGAGATATTTACAGTGAAGATCAAGAGTTCAAGTATGTTTCTTTTTTTATACACTTTCATAAGTTACCTTATGCTTGTTTTTCCAGGGATTC
AACCACAAGGATTGGAAGTCTCCTAGGAAAGGTTGAAATGGTGGATCTGGAAGAGGAGGAAAAACAAGGTTGGGGTTGTGCTTTGCGAATAAAAATTCAGGTGGATGTAA
CGATTCCATTGAAAAGAGGCATTTTTCTGAAAAAAAGAGAAGAAGATAGATGGATTGCGATCACCTATGAAAAACTTCCTGATTTTTGTTATGGTTGTGGTCATCTGGGG
CACACCTTGAAAGAATGTGAGAAGGATTGTGGCACGAATGAGGAAGATCTACCTTATGGTCCTTGGCTTAGAGAACCAGTCAAAATAAAAGTCCTTGAAGCAGAACCGGT
TCGTTATAGTTCTTTTAGTTTCCAACGGAGAGGGAGAGGTAGAAGTAGGGAGGAAGGAAGAGAGAGTTGGCGAAGTAAAGCAGTGCAAGAAGAGATGGAAGATGACAACG
TCAACCCAATCCAGACTGATGTTGGAAAGGGAGTGGCAGCGGCTGGTTTGTCGGAAAAGGAGGCGAAAATCGGCAGGGGGCGGAATCCGGCAAGTTGGTTACTAAAATGG
AAGGAAGTTGGCGGCAACAGTATGGATAAGCAATTTATTACAACAAACATTGTTGGGAATAAACGAAGTTCAGAAGTTGAGGTTGTAGGTGGGAGTAACAAAAAGGCTTT
GGTTACCAAGGAAATTGAATTTGCAAAAGCGGTGGAGGCTGAAGGACAACCCCGTCGAGCACAATAA
mRNA sequence
ATGGCTACCAACAACGATGTTGAGTCAATTGTGCGGCAGTATGGGGATCTGAAAGTGACAGAGGCAGAGAAAGCAAGTGTATATCATTTGCAAGAAGGGGAAATCGATAT
CTCAAGGAAAAAGTTTGAAAATGCATTAGTTTGCAAGATTTTTACACAAAAAAGAATTCCACCAGAGGTGTTTTCTTCCATGATGCTGAAGATTTGGAAGCAGGAGCAAA
CTATTATTGATCATGTCGGTTTCAACTTGTTTTTGTGCAAGTTTAGAAACGCTCAAATCAAAGGCCAAATCGTGGAATTGGGACCATGGTTCTATGATAAATCTTTGCTG
CTATTGGAAGAACCAAAGGGAGATATTTACAGTGAAGATCAAGAGTTCAAGTATGTTTCTTTTTTTATACACTTTCATAAGTTACCTTATGCTTGTTTTTCCAGGGATTC
AACCACAAGGATTGGAAGTCTCCTAGGAAAGGTTGAAATGGTGGATCTGGAAGAGGAGGAAAAACAAGGTTGGGGTTGTGCTTTGCGAATAAAAATTCAGGTGGATGTAA
CGATTCCATTGAAAAGAGGCATTTTTCTGAAAAAAAGAGAAGAAGATAGATGGATTGCGATCACCTATGAAAAACTTCCTGATTTTTGTTATGGTTGTGGTCATCTGGGG
CACACCTTGAAAGAATGTGAGAAGGATTGTGGCACGAATGAGGAAGATCTACCTTATGGTCCTTGGCTTAGAGAACCAGTCAAAATAAAAGTCCTTGAAGCAGAACCGGT
TCGTTATAGTTCTTTTAGTTTCCAACGGAGAGGGAGAGGTAGAAGTAGGGAGGAAGGAAGAGAGAGTTGGCGAAGTAAAGCAGTGCAAGAAGAGATGGAAGATGACAACG
TCAACCCAATCCAGACTGATGTTGGAAAGGGAGTGGCAGCGGCTGGTTTGTCGGAAAAGGAGGCGAAAATCGGCAGGGGGCGGAATCCGGCAAGTTGGTTACTAAAATGG
AAGGAAGTTGGCGGCAACAGTATGGATAAGCAATTTATTACAACAAACATTGTTGGGAATAAACGAAGTTCAGAAGTTGAGGTTGTAGGTGGGAGTAACAAAAAGGCTTT
GGTTACCAAGGAAATTGAATTTGCAAAAGCGGTGGAGGCTGAAGGACAACCCCGTCGAGCACAATAA
Protein sequence
MATNNDVESIVRQYGDLKVTEAEKASVYHLQEGEIDISRKKFENALVCKIFTQKRIPPEVFSSMMLKIWKQEQTIIDHVGFNLFLCKFRNAQIKGQIVELGPWFYDKSLL
LLEEPKGDIYSEDQEFKYVSFFIHFHKLPYACFSRDSTTRIGSLLGKVEMVDLEEEEKQGWGCALRIKIQVDVTIPLKRGIFLKKREEDRWIAITYEKLPDFCYGCGHLG
HTLKECEKDCGTNEEDLPYGPWLREPVKIKVLEAEPVRYSSFSFQRRGRGRSREEGRESWRSKAVQEEMEDDNVNPIQTDVGKGVAAAGLSEKEAKIGRGRNPASWLLKW
KEVGGNSMDKQFITTNIVGNKRSSEVEVVGGSNKKALVTKEIEFAKAVEAEGQPRRAQ
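
The protein sequence above should correspond to a translation of the listed CDS. As a minimal sketch of how that relationship could be checked, the Python snippet below translates a nucleotide string with the standard genetic code. The `translate` helper is hypothetical and not part of the database, and it assumes the CDS is in frame from its first base and uses the standard nuclear code.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS (assumed in frame from base 1) up to the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed on this page.
print(translate("ATGGCTACCAACAACGATGTTGAG"))  # MATNNDVE
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is one way to confirm that the two listings agree.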