; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0009453 (gene) of Chayote v1 genome

Gene IDSed0009453
OrganismSechium edule (Chayote v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG06:15389895..15390950
RNA-Seq ExpressionSed0009453
SyntenySed0009453
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG53376.1 hypothetical protein EZV62_022545 [Acer yangbiense]3.2e-4333.01Show/hide
Query:  MRLNEDEDVVCDMRDFARFQNNNDNLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESP
        + L E++ VV +M +        D ++ CLVGKVLS K +N ++FK LI+ IW     + V+ +GDN+F F F +  ++NR++  GPW F  ++++LE  
Subjt:  MRLNEDEDVVCDMRDFARFQNNNDNLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESP

Query:  NAIVQPKDMSFSKADFWVQVHSVPKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCF
                + F++ADFWVQ+H +P  C+     + L  +IGEV++I S      W KF+ ++ R+DI KPL+R LK       E++ +  +YEKLP+FC+
Subjt:  NAIVQPKDMSFSKADFWVQVHSVPKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCF

Query:  DCGLIGHTKKECDSEEGK----NGEGSSYGPWLRTVEFIKPVERRSPGGRGGRGGGRDSSYRNSDEPWCRSGPSEQDFESDIGIPKEPQW----RQPMAS
         CG +G+  KEC  EE +     G  + +G WL+     K   R +  G G            S     RS  +  D E D  I  +P +    R+ ++S
Subjt:  DCGLIGHTKKECDSEEGK----NGEGSSYGPWLRTVEFIKPVERRSPGGRGGRGGGRDSSYRNSDEPWCRSGPSEQDFESDIGIPKEPQW----RQPMAS

Query:  SDSSRKQSK
          +++K+ K
Subjt:  SDSSRKQSK

TXG53482.1 hypothetical protein EZV62_022651 [Acer yangbiense]5.4e-4339.15Show/hide
Query:  LSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVPK
        ++ CL GK+LS  L+N  +F+SLI  IWKV   + ++ + +N++ F F+   ++ ++   GPW FD ++++LE P      K + F + DFWVQ+ ++P 
Subjt:  LSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVPK

Query:  FCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKEC---DSEEGKNGEG
         C+  E  + LG  IGEV ++DS P G    KFVR+R  VDITKPLRR L        E + +P +YE+LP FCF CGL+GHT   C   D     N + 
Subjt:  FCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKEC---DSEEGKNGEG

Query:  SSYGPWLRTVEFIKPVERRSPGGRGGRGGGRDSSY
          YG W+R     KPV  R    R   G  R  +Y
Subjt:  SSYGPWLRTVEFIKPVERRSPGGRGGRGGGRDSSY

XP_006485824.1 uncharacterized protein LOC102613298 [Citrus sinensis]1.2e-4240.98Show/hide
Query:  LSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVPK
        ++ CLVGK+L  + +  +  ++ ++  W+  K+  V+ +GDNIF FKF S  EK RI+  GPW FD +++++  P  I   K   F+ A FWVQ+H++P 
Subjt:  LSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVPK

Query:  FCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKECDSEEGKNGEGSSY
         C+  E IQ +G +IG+V ++++   G     FVR+R  V++T+PL + L    EDGG+IS L   YEKLP+FCF CGLIGH  +EC   +G++ +  +Y
Subjt:  FCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKECDSEEGKNGEGSSY

Query:  GPWLR
        G W+R
Subjt:  GPWLR

XP_015388532.1 uncharacterized protein LOC107178176 [Citrus sinensis]1.6e-4237.15Show/hide
Query:  LSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVPK
        ++ CLVGK+L  + +  +  ++ ++  W+  K+  V+ +GDNIF FKF S  EK RI+  GPW FD +++++  P  I   K   F+ A FWVQ+H++P 
Subjt:  LSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVPK

Query:  FCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKECDSEEGKNGEGSSY
         C+  E IQ +G +IG+V ++ +  +G     FVRIR  V++T PL + L    EDGG+IS L   YEKLP+FCF CGLIGH  +EC   +G++ +  +Y
Subjt:  FCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKECDSEEGKNGEGSSY

Query:  GPWLRTVEFIKPVERRSPGGRGGRGGGRDSSYRNSDEPWCRSGPSEQDFESDI
        G W+R        + R+   R      RD    N D+    + P  Q+ +  I
Subjt:  GPWLRTVEFIKPVERRSPGGRGGRGGGRDSSYRNSDEPWCRSGPSEQDFESDI

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]2.5e-4839.76Show/hide
Query:  DNLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSV
        DN+  C+V K+ + K I+ ++ +S++K++W+VH     + +G NI+   FKS+ EK+R+++ GPW F+ ++L+L SP A  QP DM+F+   FW+Q+H++
Subjt:  DNLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSV

Query:  PKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKECD--SEEGKNGE
        P  C+  E    LGAK+G+V +I+     G    F+R+R ++D++KPLRRG+K    DG +I + P RYEKLP+FC++CG IGH+ +EC+  S+      
Subjt:  PKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKECD--SEEGKNGE

Query:  GSSYGPWLRTVEFIKPVERRSP-----GGRGGRG----GGRD--SSYRNSDEPW
           YG WLR     K V          GGR GRG    GGR     +R  DE W
Subjt:  GSSYGPWLRTVEFIKPVERRSP-----GGRGGRG----GGRD--SSYRNSDEPW

TrEMBL top hitse value%identityAlignment
A0A5C7GU64 CCHC-type domain-containing protein3.8e-4236.86Show/hide
Query:  EDEDVVCDMRDFARFQNNNDNLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIV
        EDED           Q+ ++++  CLVGKVL+ K +N ++FK LI+ IW     + V+ +G+N F F F +   +N++   GPW F  ++++LE P    
Subjt:  EDEDVVCDMRDFARFQNNNDNLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIV

Query:  QPKDMSFSKADFWVQVHSVPKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGL
            + F++ADFWVQ+H +P  C+     + L   IGEV++I  T     W K+VR++ +VDITKPL+R L+       EI+ +  +YE+LP+FCF CG 
Subjt:  QPKDMSFSKADFWVQVHSVPKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGL

Query:  IGHTKKECDSEEGK----NGEGSSYGPWLRTVEFIK
        IGH+ +EC  E+ K    +G+ + +G W+R     K
Subjt:  IGHTKKECDSEEGK----NGEGSSYGPWLRTVEFIK

A0A5C7HA98 CCHC-type domain-containing protein2.6e-4339.15Show/hide
Query:  LSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVPK
        ++ CL GK+LS  L+N  +F+SLI  IWKV   + ++ + +N++ F F+   ++ ++   GPW FD ++++LE P      K + F + DFWVQ+ ++P 
Subjt:  LSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVPK

Query:  FCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKEC---DSEEGKNGEG
         C+  E  + LG  IGEV ++DS P G    KFVR+R  VDITKPLRR L        E + +P +YE+LP FCF CGL+GHT   C   D     N + 
Subjt:  FCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKEC---DSEEGKNGEG

Query:  SSYGPWLRTVEFIKPVERRSPGGRGGRGGGRDSSY
          YG W+R     KPV  R    R   G  R  +Y
Subjt:  SSYGPWLRTVEFIKPVERRSPGGRGGRGGGRDSSY

A0A5C7HB59 Uncharacterized protein1.5e-4333.01Show/hide
Query:  MRLNEDEDVVCDMRDFARFQNNNDNLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESP
        + L E++ VV +M +        D ++ CLVGKVLS K +N ++FK LI+ IW     + V+ +GDN+F F F +  ++NR++  GPW F  ++++LE  
Subjt:  MRLNEDEDVVCDMRDFARFQNNNDNLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESP

Query:  NAIVQPKDMSFSKADFWVQVHSVPKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCF
                + F++ADFWVQ+H +P  C+     + L  +IGEV++I S      W KF+ ++ R+DI KPL+R LK       E++ +  +YEKLP+FC+
Subjt:  NAIVQPKDMSFSKADFWVQVHSVPKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCF

Query:  DCGLIGHTKKECDSEEGK----NGEGSSYGPWLRTVEFIKPVERRSPGGRGGRGGGRDSSYRNSDEPWCRSGPSEQDFESDIGIPKEPQW----RQPMAS
         CG +G+  KEC  EE +     G  + +G WL+     K   R +  G G            S     RS  +  D E D  I  +P +    R+ ++S
Subjt:  DCGLIGHTKKECDSEEGK----NGEGSSYGPWLRTVEFIKPVERRSPGGRGGRGGGRDSSYRNSDEPWCRSGPSEQDFESDIGIPKEPQW----RQPMAS

Query:  SDSSRKQSK
          +++K+ K
Subjt:  SDSSRKQSK

A0A5C7IJL3 CCHC-type domain-containing protein5.0e-4232.89Show/hide
Query:  NLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVP
        ++  CLVGKVLS K +N ++F S+I+++W     + ++ +G+N+F F F++  +++R+   GPW FD ++++LE P    +   + F+KADFWVQ+H +P
Subjt:  NLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVP

Query:  KFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKECDSEEGK----NG
          C+   + + L  +IGEVI+I S      W KF+R++ R+DI++PL+R L+   +  G +  +  +YE++PEFCF CG +GH   EC   E K     G
Subjt:  KFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKECDSEEGK----NG

Query:  EGSSYGPWLRTVEFIKPVERRSPGGRGGRGGGRDSSYRNSDEPWCRSGPSEQDFESDIGIPKEPQWRQPMASSDSSRKQSKWNMPENIRRRSPETISGKE
            +G W+R     KP ++ S    GG    RD S  +S E   R          ++G         P +S       S W        + PET++G  
Subjt:  EGSSYGPWLRTVEFIKPVERRSPGGRGGRGGGRDSSYRNSDEPWCRSGPSEQDFESDIGIPKEPQWRQPMASSDSSRKQSKWNMPENIRRRSPETISGKE

Query:  V
        V
Subjt:  V

A0A6J1D765 uncharacterized protein LOC1110179021.2e-4839.76Show/hide
Query:  DNLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSV
        DN+  C+V K+ + K I+ ++ +S++K++W+VH     + +G NI+   FKS+ EK+R+++ GPW F+ ++L+L SP A  QP DM+F+   FW+Q+H++
Subjt:  DNLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSV

Query:  PKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKECD--SEEGKNGE
        P  C+  E    LGAK+G+V +I+     G    F+R+R ++D++KPLRRG+K    DG +I + P RYEKLP+FC++CG IGH+ +EC+  S+      
Subjt:  PKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKECD--SEEGKNGE

Query:  GSSYGPWLRTVEFIKPVERRSP-----GGRGGRG----GGRD--SSYRNSDEPW
           YG WLR     K V          GGR GRG    GGR     +R  DE W
Subjt:  GSSYGPWLRTVEFIKPVERRSP-----GGRGGRG----GGRD--SSYRNSDEPW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding6.0e-0821.83Show/hide
Query:  FKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVPKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKP
        F F+S      I+  GPW F+  + +++    +    D  F +  FW+Q+  +P   +    I ++G ++G  ++ +                       
Subjt:  FKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVPKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKP

Query:  LRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKEC
                   G ++S L  +YEKL  FC  CG++ H   EC
Subjt:  LRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKEC

AT5G36228.1 nucleic acid binding;zinc ion binding1.8e-1222.16Show/hide
Query:  LVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVPKFCVQ
        L+G++L+ +  +++     +   W +   +    + D  F  +F+S ++    +   PW F+   + L+       P +   +  D WV +  +P   V 
Subjt:  LVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMSFSKADFWVQVHSVPKFCVQ

Query:  IEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLR--RGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKEC
           ++ + + +GEV+ +D          F+R++ R+D T+PLR  R +++ + +   I F    YEKL   C +C  + H    C
Subjt:  IEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLR--RGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGATTGAATGAAGATGAAGATGTTGTATGCGATATGAGGGACTTTGCCCGTTTTCAGAACAACAACGATAATCTGTCTTCTTGCTTGGTCGGGAAGGTTTTGTCTCT
AAAACTAATCAATCTCCAATCTTTCAAATCTCTGATAAAAAATATTTGGAAGGTGCATAAGGATATGGTAGTGGATTGTATTGGAGATAATATCTTTTATTTCAAATTCA
AATCGGTTATGGAGAAGAATAGAATTATAGCGGAGGGCCCGTGGTTTTTTGATGGAGCGATTCTGCTTCTAGAATCGCCAAACGCCATAGTGCAACCAAAGGATATGTCG
TTTTCTAAAGCTGATTTTTGGGTACAGGTTCATAGTGTACCAAAATTTTGTGTGCAAATTGAAGCGATACAAACTCTAGGGGCGAAAATTGGGGAGGTTATTGACATCGA
TAGTACACCGCTTGGAGGAGAGTGGGAAAAATTTGTTCGGATCAGGGCAAGAGTGGATATTACCAAACCTCTGCGTAGAGGGCTAAAGTATTTGGCAGAAGATGGAGGTG
AGATTTCTTTTCTCCCGTGTAGGTATGAGAAATTACCAGAATTTTGCTTTGATTGTGGTCTTATAGGGCATACTAAAAAAGAGTGCGATAGTGAAGAAGGAAAAAATGGT
GAGGGCTCTTCTTATGGACCATGGTTGCGAACTGTGGAGTTTATCAAACCTGTGGAACGACGATCCCCAGGAGGTAGAGGAGGCAGGGGAGGTGGTAGAGATTCTTCATA
TCGGAATTCGGATGAACCATGGTGTAGATCGGGCCCGAGCGAACAAGATTTTGAATCAGATATAGGGATCCCGAAGGAGCCGCAATGGCGTCAACCTATGGCATCTTCTG
ACTCATCCAGAAAACAGAGCAAGTGGAACATGCCAGAAAACATTCGCCGCCGCTCACCGGAGACGATATCAGGAAAGGAGGTGGCGAATCCGAGGGAAGCGAAAAAAGGA
AGCCCCGTTGTTCACGCAAAAACAACTTTAATTGGGGAGAAAATTAAAGATGAGATTTTGCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGATTGAATGAAGATGAAGATGTTGTATGCGATATGAGGGACTTTGCCCGTTTTCAGAACAACAACGATAATCTGTCTTCTTGCTTGGTCGGGAAGGTTTTGTCTCT
AAAACTAATCAATCTCCAATCTTTCAAATCTCTGATAAAAAATATTTGGAAGGTGCATAAGGATATGGTAGTGGATTGTATTGGAGATAATATCTTTTATTTCAAATTCA
AATCGGTTATGGAGAAGAATAGAATTATAGCGGAGGGCCCGTGGTTTTTTGATGGAGCGATTCTGCTTCTAGAATCGCCAAACGCCATAGTGCAACCAAAGGATATGTCG
TTTTCTAAAGCTGATTTTTGGGTACAGGTTCATAGTGTACCAAAATTTTGTGTGCAAATTGAAGCGATACAAACTCTAGGGGCGAAAATTGGGGAGGTTATTGACATCGA
TAGTACACCGCTTGGAGGAGAGTGGGAAAAATTTGTTCGGATCAGGGCAAGAGTGGATATTACCAAACCTCTGCGTAGAGGGCTAAAGTATTTGGCAGAAGATGGAGGTG
AGATTTCTTTTCTCCCGTGTAGGTATGAGAAATTACCAGAATTTTGCTTTGATTGTGGTCTTATAGGGCATACTAAAAAAGAGTGCGATAGTGAAGAAGGAAAAAATGGT
GAGGGCTCTTCTTATGGACCATGGTTGCGAACTGTGGAGTTTATCAAACCTGTGGAACGACGATCCCCAGGAGGTAGAGGAGGCAGGGGAGGTGGTAGAGATTCTTCATA
TCGGAATTCGGATGAACCATGGTGTAGATCGGGCCCGAGCGAACAAGATTTTGAATCAGATATAGGGATCCCGAAGGAGCCGCAATGGCGTCAACCTATGGCATCTTCTG
ACTCATCCAGAAAACAGAGCAAGTGGAACATGCCAGAAAACATTCGCCGCCGCTCACCGGAGACGATATCAGGAAAGGAGGTGGCGAATCCGAGGGAAGCGAAAAAAGGA
AGCCCCGTTGTTCACGCAAAAACAACTTTAATTGGGGAGAAAATTAAAGATGAGATTTTGCCTTAA
Protein sequenceShow/hide protein sequence
MRLNEDEDVVCDMRDFARFQNNNDNLSSCLVGKVLSLKLINLQSFKSLIKNIWKVHKDMVVDCIGDNIFYFKFKSVMEKNRIIAEGPWFFDGAILLLESPNAIVQPKDMS
FSKADFWVQVHSVPKFCVQIEAIQTLGAKIGEVIDIDSTPLGGEWEKFVRIRARVDITKPLRRGLKYLAEDGGEISFLPCRYEKLPEFCFDCGLIGHTKKECDSEEGKNG
EGSSYGPWLRTVEFIKPVERRSPGGRGGRGGGRDSSYRNSDEPWCRSGPSEQDFESDIGIPKEPQWRQPMASSDSSRKQSKWNMPENIRRRSPETISGKEVANPREAKKG
SPVVHAKTTLIGEKIKDEILP