; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019038 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019038
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr5:37899090..37900628
RNA-Seq ExpressionLag0019038
SyntenyLag0019038
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense]5.7e-3332.64Show/hide
Query:  ELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNI-RIKFKCVGNNIFLGKFFYPKDKERVINESPRIYDK
        E++   ++L L E+E   V+ + +D + D D+     LV K++T + VN E F  ++ +IWN   +++ + VG N F+  F   + + +V N  P ++ K
Subjt:  ELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNI-RIKFKCVGNNIFLGKFFYPKDKERVINESPRIYDK

Query:  ALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAY
        +L+  E+P      ++ +F  A FWV  ++IP  C+ ++    L   IG+   I T E   CWG+ +RV+V +DI+KPLKR + +K G   E   V + Y
Subjt:  ALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAY

Query:  EKLSDFCYGCGKLGHLLMDCISLPRE----NDQEPRFGASMR
        E+L DFC+ CG++GH + +C+    +    + Q+ +FG+ MR
Subjt:  EKLSDFCYGCGKLGHLLMDCISLPRE----NDQEPRFGASMR

XP_006484927.1 uncharacterized protein LOC102626623 [Citrus sinensis]5.7e-3334.73Show/hide
Query:  DELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNIR-IKFKCVGNNIFLGKFFYPKDKERVINESPRIYD
        +EL  +  ++ L +EE + V  +   + +D ++   + LV K++  R+VN E     M   W   + IK + +G+NIF+ KF   +DK+RV+ E P  +D
Subjt:  DELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNIR-IKFKCVGNNIFLGKFFYPKDKERVINESPRIYD

Query:  KALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIA
        KAL+   EP    S   Q F + SFWV  + +P  C+ +     LG  IG+ E + TDE+G C G   RVR+S+DI+KPLKR +++K     E+  + I 
Subjt:  KALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIA

Query:  YEKLSDFCYGCGKLGHLLMDCISLPRENDQEPRFGASMR
        Y++L DFC+ CG +GH   +CI    +  ++  +G  M+
Subjt:  YEKLSDFCYGCGKLGHLLMDCISLPRENDQEPRFGASMR

XP_015380691.1 uncharacterized protein LOC107174364 [Citrus sinensis]3.7e-3233.05Show/hide
Query:  LVLKLVTTRNVNPEIFMKMMSRIWNNIR-IKFKCVGNNIFLGKFFYPKDKERVINESPRIYDKALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLC
        LV K++ TR VN E F   + ++W  ++ +K + +GNN F+ KF    DK+RV+   P  +D+ALL   EP      ++Q F + +FW+   N+P  C+ 
Subjt:  LVLKLVTTRNVNPEIFMKMMSRIWNNIR-IKFKCVGNNIFLGKFFYPKDKERVINESPRIYDKALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLC

Query:  RKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAYEKLSDFCYGCGKLGHLLMDCISLPRENDQEPRFGASM
        ++    LG MIG  E I TDE+G C G+  R+RV I+I+ PLK+ + +K    ++ + + + YE+L DFCY CG +GH   +C     +  ++      +
Subjt:  RKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAYEKLSDFCYGCGKLGHLLMDCISLPRENDQEPRFGASM

Query:  RYSGYLKRFVGGRGNSGESSNFGGRGRGRAGAW
         Y G++K  +            GGR R     W
Subjt:  RYSGYLKRFVGGRGNSGESSNFGGRGRGRAGAW

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]4.3e-3333.11Show/hide
Query:  RLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWN-NIRIKFKCVGNNIFLGKFFYPKDKERVINESPRIYDKALLAFEEPN
        +L+ EE  +   +D D V   +Q    SLV KL+  R ++ ++  +++   W    ++  + +G N+FL  F    D  RV+   P  +DKAL+  ++P 
Subjt:  RLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWN-NIRIKFKCVGNNIFLGKFFYPKDKERVINESPRIYDKALLAFEEPN

Query:  SNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAYEKLSDFCYG
        S+ + SE EF   +FW+H +++P   L +  AI LGN IG F  ++ +E G  WG SLR+RV IDI+KPL+RGI +         W+ I YE+L DFCY 
Subjt:  SNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAYEKLSDFCYG

Query:  CGKLGHLLMDCISLPRENDQEPRFGASMRYSGYLKRFVGGRGNSGESSNFGGRGRGR---AGAWGPNDNGKMGSEDKEDIVGIGEEGGREVQA
        CG +GH   DC +       + R  A+  Y  +L RFVG +  +G      G+   R    G+   N   +   E K+ +     + G + QA
Subjt:  CGKLGHLLMDCISLPRENDQEPRFGASMRYSGYLKRFVGGRGNSGESSNFGGRGRGR---AGAWGPNDNGKMGSEDKEDIVGIGEEGGREVQA

XP_024042073.1 uncharacterized protein LOC112099188 [Citrus clementina]1.4e-3131.8Show/hide
Query:  DELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNIR-IKFKCVGNNIFLGKFFYPKDKERVINESPRIYD
        +EL  +  ++ + EE+++ +  L  +  D  +Q     L+ K++ +R VN E     + ++W   + +K + +GNNIF+ KF    DK RV++  P  +D
Subjt:  DELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNIR-IKFKCVGNNIFLGKFFYPKDKERVINESPRIYD

Query:  KALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIA
        +AL+   +P      S+Q+F +  FWV  +NIP  C+ R     LG +I K E + TDE+G+C+G+  R+R+SI+I++PLK  + +K     +   + + 
Subjt:  KALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIA

Query:  YEKLSDFCYGCGKLGHLLMDCISLPRENDQEPRFGASMR
        YE+L DFC+ CG + H   +C     +  +E  FG  M+
Subjt:  YEKLSDFCYGCGKLGHLLMDCISLPRENDQEPRFGASMR

TrEMBL top hitse value%identityAlignment
A0A1S8AC25 CCHC-type domain-containing protein (Fragment)1.6e-3332.81Show/hide
Query:  DELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNIR-IKFKCVGNNIFLGKFFYPKDKERVINESPRIYD
        +EL  +  ++RLS+EE   V   +  +++  ++     LV K++ TR V+ E     M R+W   R +K + +G N+F+ KF    DK  ++   P  +D
Subjt:  DELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNIR-IKFKCVGNNIFLGKFFYPKDKERVINESPRIYD

Query:  KALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGI-LMKPGAMAEEKWVTI
        +AL+   EP       +Q+F + SFWV  +++P  C+ +  A  LG +IGK E + TD  G C+GQ LR+R+S+DI+KPLK+ I L +    A++  + +
Subjt:  KALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGI-LMKPGAMAEEKWVTI

Query:  AYEKLSDFCYGCGKLGHLLMDCISLPRENDQEPRFGASMRYSGYLKRFVGGRG
         YE+L DFC+ CG++GH   +C     ++  E  +G  ++ +   ++   GRG
Subjt:  AYEKLSDFCYGCGKLGHLLMDCISLPRENDQEPRFGASMRYSGYLKRFVGGRG

A0A5C7GU64 CCHC-type domain-containing protein2.8e-3332.64Show/hide
Query:  ELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNI-RIKFKCVGNNIFLGKFFYPKDKERVINESPRIYDK
        E++   ++L L E+E   V+ + +D + D D+     LV K++T + VN E F  ++ +IWN   +++ + VG N F+  F   + + +V N  P ++ K
Subjt:  ELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNI-RIKFKCVGNNIFLGKFFYPKDKERVINESPRIYDK

Query:  ALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAY
        +L+  E+P      ++ +F  A FWV  ++IP  C+ ++    L   IG+   I T E   CWG+ +RV+V +DI+KPLKR + +K G   E   V + Y
Subjt:  ALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAY

Query:  EKLSDFCYGCGKLGHLLMDCISLPRE----NDQEPRFGASMR
        E+L DFC+ CG++GH + +C+    +    + Q+ +FG+ MR
Subjt:  EKLSDFCYGCGKLGHLLMDCISLPRE----NDQEPRFGASMR

A0A5C7H9Y2 CCHC-type domain-containing protein1.2e-3128.02Show/hide
Query:  DELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIW-NNIRIKFKCVGNNIFLGKFFYPKDKERVINESPRIYD
        D++S + + L L +++   + R+     +  +Q  + SL+ K +T + +N E F   +S IW     +  + +G NIF  +F    D++R++   P ++D
Subjt:  DELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIW-NNIRIKFKCVGNNIFLGKFFYPKDKERVINESPRIYD

Query:  KALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIA
        K LL   E + +   ++ +FRY  FW+  +N+P  CL R+  + LG ++G+ + I+  E G C GQ +R+RV ID+  PLKRG+ +  G   +   V I 
Subjt:  KALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIA

Query:  YEKLSDFCYGCGKLGHLLMDCISLPRE--NDQEPRFGASMR-YSGYLKRFVGGRGNSGESSNFGG--------RGRGRAGAWGPNDNGKMGSEDKEDIVG
        YE+L +FCY CGK+GHL+ DC    +E  +    +FG  MR  S    +  G + NS E S  GG        R +G +  W    +  +   D E +  
Subjt:  YEKLSDFCYGCGKLGHLLMDCISLPRE--NDQEPRFGASMR-YSGYLKRFVGGRGNSGESSNFGG--------RGRGRAGAWGPNDNGKMGSEDKEDIVG

Query:  IGE-EGGREVQAKEGLS---LVPENE----ATSKLKEVVAKRTVMES--AITSVIKANPII--ESTDYARPLEKDLDGKGITDTDRFNSIKMDQADTWNE
        + E + G  ++ K  +S   +V + E    A S  KE + + +   S    T+V   NP+I    ++    +  + +G  IT+  R+  +  ++  + NE
Subjt:  IGE-EGGREVQAKEGLS---LVPENE----ATSKLKEVVAKRTVMES--AITSVIKANPII--ESTDYARPLEKDLDGKGITDTDRFNSIKMDQADTWNE

Query:  -MMNFSQNSVDKKERWGQDDRKKSVMVSSTNGPSGEAQL
           +  +   D       D +K SV      G  G   L
Subjt:  -MMNFSQNSVDKKERWGQDDRKKSVMVSSTNGPSGEAQL

A0A6J1BSZ1 uncharacterized protein LOC1110054811.5e-3131.28Show/hide
Query:  LSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNIRIKFK--CVGNNIFLGKFFYPKDKERVINESPRIYDK
        L  +  + +L+ EE  +   +D   ++ T +    SL+ KL++ R+++  +    +   W      F    +G NIFL  F    D+ R++   P  +D+
Subjt:  LSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNIRIKFK--CVGNNIFLGKFFYPKDKERVINESPRIYDK

Query:  ALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAY
        AL+  + P S     + +FR  S WVHF+++   C+ +  A  LGN IG FE + ++ +  CWG  LRVRV  D+ KPL RGI +         W+ I Y
Subjt:  ALLAFEEPNSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAY

Query:  EKLSDFCYGCGKLGHLLMDCISLPREN-DQEPRFGASMRYSGY
        E+L DF Y CG+L H+L DC     ++  +  ++G  +R+ G+
Subjt:  EKLSDFCYGCGKLGHLLMDCISLPREN-DQEPRFGASMRYSGY

A0A6J1DU55 uncharacterized protein LOC1110231352.1e-3333.11Show/hide
Query:  RLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWN-NIRIKFKCVGNNIFLGKFFYPKDKERVINESPRIYDKALLAFEEPN
        +L+ EE  +   +D D V   +Q    SLV KL+  R ++ ++  +++   W    ++  + +G N+FL  F    D  RV+   P  +DKAL+  ++P 
Subjt:  RLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWN-NIRIKFKCVGNNIFLGKFFYPKDKERVINESPRIYDKALLAFEEPN

Query:  SNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAYEKLSDFCYG
        S+ + SE EF   +FW+H +++P   L +  AI LGN IG F  ++ +E G  WG SLR+RV IDI+KPL+RGI +         W+ I YE+L DFCY 
Subjt:  SNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAYEKLSDFCYG

Query:  CGKLGHLLMDCISLPRENDQEPRFGASMRYSGYLKRFVGGRGNSGESSNFGGRGRGR---AGAWGPNDNGKMGSEDKEDIVGIGEEGGREVQA
        CG +GH   DC +       + R  A+  Y  +L RFVG +  +G      G+   R    G+   N   +   E K+ +     + G + QA
Subjt:  CGKLGHLLMDCISLPRENDQEPRFGASMRYSGYLKRFVGGRGNSGESSNFGGRGRGR---AGAWGPNDNGKMGSEDKEDIVGIGEEGGREVQA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGATGAGCTTTCAAACCAATTAGACAGCTTGAGGTTGTCGGAGGAGGAGAGGAATTTAGTCTACAGATTGGATGATGATGAGGTTGACGATACTGATCAAAAGTT
CACAAATTCTTTAGTCTTGAAGCTTGTGACGACAAGAAATGTGAACCCTGAGATTTTTATGAAGATGATGTCGCGTATATGGAACAACATCCGTATAAAATTCAAGTGTG
TGGGGAACAACATTTTCCTGGGGAAATTCTTCTATCCCAAGGACAAAGAAAGAGTGATCAACGAGAGTCCCAGGATATACGACAAGGCTTTGCTGGCTTTTGAAGAACCC
AACAGCAACGCTAGTTTCTCGGAGCAAGAGTTCAGGTACGCATCCTTTTGGGTTCATTTTTATAACATTCCTTCTGGTTGTCTTTGCAGGAAACGTGCAATTGCCCTTGG
AAACATGATAGGAAAATTCGAAACGATTAACACGGATGAAGATGGTAACTGTTGGGGTCAATCGCTGAGGGTTCGTGTTTCTATTGATATCAGTAAGCCTCTGAAACGTG
GTATTTTGATGAAGCCAGGAGCCATGGCAGAGGAGAAATGGGTGACTATTGCGTACGAAAAATTATCCGATTTTTGCTATGGTTGCGGTAAGTTGGGGCACCTGCTAATG
GATTGCATAAGTCTACCCCGAGAGAACGATCAGGAACCTCGTTTTGGGGCTTCAATGCGCTATTCAGGTTACCTGAAGAGGTTTGTAGGAGGAAGAGGGAATTCAGGCGA
GTCTTCCAATTTCGGGGGAAGAGGGCGTGGAAGAGCAGGAGCCTGGGGACCTAATGACAATGGCAAGATGGGATCAGAGGATAAGGAGGACATTGTAGGGATTGGAGAAG
AAGGGGGAAGAGAGGTTCAGGCGAAAGAGGGGTTATCTCTGGTGCCGGAAAACGAAGCAACCTCTAAATTGAAGGAGGTTGTAGCGAAGAGAACGGTTATGGAATCTGCT
ATCACGTCGGTCATTAAGGCTAATCCCATTATTGAGTCCACTGATTATGCTCGACCGTTGGAGAAAGATTTGGACGGAAAAGGCATTACTGATACAGATAGATTTAATTC
CATTAAGATGGATCAGGCAGACACGTGGAATGAAATGATGAATTTTTCTCAAAATTCTGTGGACAAGAAAGAAAGGTGGGGCCAGGATGACAGAAAAAAATCTGTAATGG
TTTCCTCCACGAATGGGCCCTCTGGGGAAGCCCAACTTGAAGATCCATCAAATAATGGACTGCTATCAGAAGAGAAACAAGAAGTCAGTGATCCAAAAAGCCCTTTAACA
ATTAGTAGTCCCAAGAAATTGAAGCGGTTGGATAGAGGTAAGCGGGTAACCGAAGGAATTTCAACTTTGCAAAATAACCAGATGATGGTTCAGAAAGCTCCTAAACGTAA
GGGAGAAGAGGAGTTACAAGAAGACAAGAAGAAACTATGTTTGGCCAGTCTTAATCATTTCTTAGCATTATCGGTGGAGGCTGTGGAACAGCCACGCCGAGCTCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGATGAGCTTTCAAACCAATTAGACAGCTTGAGGTTGTCGGAGGAGGAGAGGAATTTAGTCTACAGATTGGATGATGATGAGGTTGACGATACTGATCAAAAGTT
CACAAATTCTTTAGTCTTGAAGCTTGTGACGACAAGAAATGTGAACCCTGAGATTTTTATGAAGATGATGTCGCGTATATGGAACAACATCCGTATAAAATTCAAGTGTG
TGGGGAACAACATTTTCCTGGGGAAATTCTTCTATCCCAAGGACAAAGAAAGAGTGATCAACGAGAGTCCCAGGATATACGACAAGGCTTTGCTGGCTTTTGAAGAACCC
AACAGCAACGCTAGTTTCTCGGAGCAAGAGTTCAGGTACGCATCCTTTTGGGTTCATTTTTATAACATTCCTTCTGGTTGTCTTTGCAGGAAACGTGCAATTGCCCTTGG
AAACATGATAGGAAAATTCGAAACGATTAACACGGATGAAGATGGTAACTGTTGGGGTCAATCGCTGAGGGTTCGTGTTTCTATTGATATCAGTAAGCCTCTGAAACGTG
GTATTTTGATGAAGCCAGGAGCCATGGCAGAGGAGAAATGGGTGACTATTGCGTACGAAAAATTATCCGATTTTTGCTATGGTTGCGGTAAGTTGGGGCACCTGCTAATG
GATTGCATAAGTCTACCCCGAGAGAACGATCAGGAACCTCGTTTTGGGGCTTCAATGCGCTATTCAGGTTACCTGAAGAGGTTTGTAGGAGGAAGAGGGAATTCAGGCGA
GTCTTCCAATTTCGGGGGAAGAGGGCGTGGAAGAGCAGGAGCCTGGGGACCTAATGACAATGGCAAGATGGGATCAGAGGATAAGGAGGACATTGTAGGGATTGGAGAAG
AAGGGGGAAGAGAGGTTCAGGCGAAAGAGGGGTTATCTCTGGTGCCGGAAAACGAAGCAACCTCTAAATTGAAGGAGGTTGTAGCGAAGAGAACGGTTATGGAATCTGCT
ATCACGTCGGTCATTAAGGCTAATCCCATTATTGAGTCCACTGATTATGCTCGACCGTTGGAGAAAGATTTGGACGGAAAAGGCATTACTGATACAGATAGATTTAATTC
CATTAAGATGGATCAGGCAGACACGTGGAATGAAATGATGAATTTTTCTCAAAATTCTGTGGACAAGAAAGAAAGGTGGGGCCAGGATGACAGAAAAAAATCTGTAATGG
TTTCCTCCACGAATGGGCCCTCTGGGGAAGCCCAACTTGAAGATCCATCAAATAATGGACTGCTATCAGAAGAGAAACAAGAAGTCAGTGATCCAAAAAGCCCTTTAACA
ATTAGTAGTCCCAAGAAATTGAAGCGGTTGGATAGAGGTAAGCGGGTAACCGAAGGAATTTCAACTTTGCAAAATAACCAGATGATGGTTCAGAAAGCTCCTAAACGTAA
GGGAGAAGAGGAGTTACAAGAAGACAAGAAGAAACTATGTTTGGCCAGTCTTAATCATTTCTTAGCATTATCGGTGGAGGCTGTGGAACAGCCACGCCGAGCTCAATGA
Protein sequenceShow/hide protein sequence
MVDELSNQLDSLRLSEEERNLVYRLDDDEVDDTDQKFTNSLVLKLVTTRNVNPEIFMKMMSRIWNNIRIKFKCVGNNIFLGKFFYPKDKERVINESPRIYDKALLAFEEP
NSNASFSEQEFRYASFWVHFYNIPSGCLCRKRAIALGNMIGKFETINTDEDGNCWGQSLRVRVSIDISKPLKRGILMKPGAMAEEKWVTIAYEKLSDFCYGCGKLGHLLM
DCISLPRENDQEPRFGASMRYSGYLKRFVGGRGNSGESSNFGGRGRGRAGAWGPNDNGKMGSEDKEDIVGIGEEGGREVQAKEGLSLVPENEATSKLKEVVAKRTVMESA
ITSVIKANPIIESTDYARPLEKDLDGKGITDTDRFNSIKMDQADTWNEMMNFSQNSVDKKERWGQDDRKKSVMVSSTNGPSGEAQLEDPSNNGLLSEEKQEVSDPKSPLT
ISSPKKLKRLDRGKRVTEGISTLQNNQMMVQKAPKRKGEEELQEDKKKLCLASLNHFLALSVEAVEQPRRAQ