; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG10G019375 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG10G019375
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRetrotransposon protein
Genome locationCG_Chr10:34355115..34356128
RNA-Seq ExpressionClCG10G019375
SyntenyClCG10G019375
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048280.1 retrotransposon protein [Cucumis melo var. makuwa]8.8e-3334.91Show/hide
Query:  SRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCSIQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYDDLA
        S +R  KH WT +E+  LVECLV+ V +G WR DN TF  G+L  + RMM  +IP   +    ++KCI  + E+FD WVKSHP+AK L +KSF  YD+L+
Subjt:  SRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCSIQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYDDLA

Query:  IVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGGRRSTSSMPSKSRRSRSSSIGEYSDVVREGFQLLTK
         VFGKDRATG RA + A++ S      +     +    DF   Y P L        E        RR+ S   S S+R RS    +  D+VR   +   +
Subjt:  IVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGGRRSTSSMPSKSRRSRSSSIGEYSDVVREGFQLLTK

Query:  SIDGIAQ----------RRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQR
         +  IA+            R+E+  +L+ IP L++ D   +   ++ +   +  F++ P   KY YC  +L + R
Subjt:  SIDGIAQ----------RRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQR

KAA0063789.1 retrotransposon protein [Cucumis melo var. makuwa]9.5e-3536.06Show/hide
Query:  SGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCS-IQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYD
        + + ++ATKH WT  ED  LVECL+Q V+ G WR DNETF+ G+L  ++  M +  P  S   W+ ERKCI+ +  +FD WVK H +A+ L +K+F ++ 
Subjt:  SGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCS-IQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYD

Query:  DLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPG--GRRSTSSMPSKSRRSRSSSIGEYSDVVREGF
        DL +VFG+DRAT  R  T  ++ S+   D E +D+  N     E+F IP+   +  P+ ED  +TP      + SS PSK RRS S              
Subjt:  DLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPG--GRRSTSSMPSKSRRSRSSSIGEYSDVVREGF

Query:  QLLTKSIDGIAQRRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQ
          L  +   I                            SLL DP +L  F+D+P +WKY  CM++LG+Q
Subjt:  QLLTKSIDGIAQRRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQ

TYK06362.1 retrotransposon protein [Cucumis melo var. makuwa]1.5e-3233.46Show/hide
Query:  SGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCS-IQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYD
        + + ++ATKH WT  ED +LV CL+Q V+ G WR DN TF+ G+L     + +   P CS   W+ ERKCI+ +  +FD WVK HP A+ L +K F ++ 
Subjt:  SGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCS-IQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYD

Query:  DLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPG--GRRSTSSMPSKSRRSRSSSIGEYSDVVREGF
        DL +VFG+DRATG R                              F IP+   +  P+ ED P+TP      + SS PSK RRS S  +           
Subjt:  DLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPG--GRRSTSSMPSKSRRSRSSSIGEYSDVVREGF

Query:  QLLTKSIDGIAQRRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQ
                                       D      SLL DP +L  F+D+P +WKY  CM++LG+Q
Subjt:  QLLTKSIDGIAQRRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQ

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]5.9e-4542.11Show/hide
Query:  ETSGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCS-IQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSF
        E + + ++ATKH WT  ED +LVECL+Q V+ G WR DN TF+ G+L     + +   P CS   W+  +KCI+ +  +FD WVK HP+A+ L +K F +
Subjt:  ETSGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCS-IQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSF

Query:  YDDLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGG--RRSTSSMPSKSRRSRSSSIGEYSDVVRE
        + DL +VFG+DRATG R  T  E+ S+   D E +D+  N     E+F IP+   +  P+ ED P+TP      + SS PSK RRS S   G+  D  R 
Subjt:  YDDLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGG--RRSTSSMPSKSRRSRSSSIGEYSDVVRE

Query:  GFQLLTKSIDGIA--QRRRRE--------LYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFP
          +  +K I  IA  QR + E        LYAELQ+IP + V D L VA SLL DP +L  F+D+P
Subjt:  GFQLLTKSIDGIA--QRRRRE--------LYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFP

XP_008441954.1 PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo]2.0e-3233.66Show/hide
Query:  SRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCSIQ-------------------------------WDAERKCID
        S +RA KH WT +E+   VECLV+ V SG WR DN TF+ G+LA + RMM +++PG +IQ                               W+ E +CI 
Subjt:  SRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCSIQ-------------------------------WDAERKCID

Query:  YQAEIFDAWVKSHPSAKELQHKSFSFYDDLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGGRRS-
         + ++FD+W+KSHP+AK L HKSF +YDDL+ VFGKDRATG+R+ T   V S  V +  N+ I    S D       D+P + S     +P    G R+ 
Subjt:  YQAEIFDAWVKSHPSAKELQHKSFSFYDDLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGGRRS-

Query:  ----TSSMPSKSRRSRSSSIGEYSDVVREGFQLLTKSIDGIAQ----------RRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKY
              +  S S+R R S   E  +V+R   +   + +  IA             R ++  +LQ IP L  QD   +   L      +  F+  P + K 
Subjt:  ----TSSMPSKSRRSRSSSIGEYSDVVREGFQLLTKSIDGIAQ----------RRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKY

Query:  DYC
        +YC
Subjt:  DYC

TrEMBL top hitse value%identityAlignment
A0A5A7TYQ6 Retrotransposon protein4.3e-3334.91Show/hide
Query:  SRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCSIQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYDDLA
        S +R  KH WT +E+  LVECLV+ V +G WR DN TF  G+L  + RMM  +IP   +    ++KCI  + E+FD WVKSHP+AK L +KSF  YD+L+
Subjt:  SRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCSIQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYDDLA

Query:  IVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGGRRSTSSMPSKSRRSRSSSIGEYSDVVREGFQLLTK
         VFGKDRATG RA + A++ S      +     +    DF   Y P L        E        RR+ S   S S+R RS    +  D+VR   +   +
Subjt:  IVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGGRRSTSSMPSKSRRSRSSSIGEYSDVVREGFQLLTK

Query:  SIDGIAQ----------RRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQR
         +  IA+            R+E+  +L+ IP L++ D   +   ++ +   +  F++ P   KY YC  +L + R
Subjt:  SIDGIAQ----------RRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQR

A0A5A7UME4 Retrotransposon protein9.5e-3333.22Show/hide
Query:  SRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCSIQ------------------------------WDAERKCIDY
        S +R  KH WT +E+  LVECLV+ V +G WR DN TFR G+L  + RMM  +IPG +I                               W+ E+KCI  
Subjt:  SRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCSIQ------------------------------WDAERKCIDY

Query:  QAEIFDAWVKSHPSAKELQHKSFSFYDDLAIVFGKDRATGSRATTTAEVESE--PVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGGRRS
        + E+FD W  SHP+AK L +KSF  YD+L+ VFGKDRATG RA + A++ S   P  D E  D +     DF   Y P L        E        RR+
Subjt:  QAEIFDAWVKSHPSAKELQHKSFSFYDDLAIVFGKDRATGSRATTTAEVESE--PVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGGRRS

Query:  TSSMPSKSRRSRSSSIGEYSDVVREGFQLLTKSIDGIAQ----------RRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCM
         S   S S+R R     +  D+VR   +   + +  IA+          + R+E+   L++IP L++ D   +   L+ +   +  F++ P   KY YC 
Subjt:  TSSMPSKSRRSRSSSIGEYSDVVREGFQLLTKSIDGIAQ----------RRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCM

Query:  QVLGQQR
         +L + +
Subjt:  QVLGQQR

A0A5A7VE44 Retrotransposon protein4.6e-3536.06Show/hide
Query:  SGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCS-IQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYD
        + + ++ATKH WT  ED  LVECL+Q V+ G WR DNETF+ G+L  ++  M +  P  S   W+ ERKCI+ +  +FD WVK H +A+ L +K+F ++ 
Subjt:  SGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCS-IQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYD

Query:  DLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPG--GRRSTSSMPSKSRRSRSSSIGEYSDVVREGF
        DL +VFG+DRAT  R  T  ++ S+   D E +D+  N     E+F IP+   +  P+ ED  +TP      + SS PSK RRS S              
Subjt:  DLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPG--GRRSTSSMPSKSRRSRSSSIGEYSDVVREGF

Query:  QLLTKSIDGIAQRRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQ
          L  +   I                            SLL DP +L  F+D+P +WKY  CM++LG+Q
Subjt:  QLLTKSIDGIAQRRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQ

A0A5D3C542 Retrotransposon protein7.3e-3333.46Show/hide
Query:  SGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCS-IQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYD
        + + ++ATKH WT  ED +LV CL+Q V+ G WR DN TF+ G+L     + +   P CS   W+ ERKCI+ +  +FD WVK HP A+ L +K F ++ 
Subjt:  SGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCS-IQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYD

Query:  DLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPG--GRRSTSSMPSKSRRSRSSSIGEYSDVVREGF
        DL +VFG+DRATG R                              F IP+   +  P+ ED P+TP      + SS PSK RRS S  +           
Subjt:  DLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPG--GRRSTSSMPSKSRRSRSSSIGEYSDVVREGF

Query:  QLLTKSIDGIAQRRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQ
                                       D      SLL DP +L  F+D+P +WKY  CM++LG+Q
Subjt:  QLLTKSIDGIAQRRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQ

A0A5D3C7T4 Uncharacterized protein2.9e-4542.11Show/hide
Query:  ETSGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCS-IQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSF
        E + + ++ATKH WT  ED +LVECL+Q V+ G WR DN TF+ G+L     + +   P CS   W+  +KCI+ +  +FD WVK HP+A+ L +K F +
Subjt:  ETSGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCS-IQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSF

Query:  YDDLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGG--RRSTSSMPSKSRRSRSSSIGEYSDVVRE
        + DL +VFG+DRATG R  T  E+ S+   D E +D+  N     E+F IP+   +  P+ ED P+TP      + SS PSK RRS S   G+  D  R 
Subjt:  YDDLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGG--RRSTSSMPSKSRRSRSSSIGEYSDVVRE

Query:  GFQLLTKSIDGIA--QRRRRE--------LYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFP
          +  +K I  IA  QR + E        LYAELQ+IP + V D L VA SLL DP +L  F+D+P
Subjt:  GFQLLTKSIDGIA--QRRRRE--------LYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30140.1 unknown protein7.0e-0429.84Show/hide
Query:  WDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYDDLAIVFGKDRATGSRATTTAEV---------ESEPVMDEENEDILNNQSPDFENFYIPDLPYV
        WD E K      E++  ++K+HP+ K +Q +S   ++DL I+FG   ATGS A   ++          E     +  N+D    +  +F   +     Y 
Subjt:  WDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYDDLAIVFGKDRATGSRATTTAEV---------ESEPVMDEENEDILNNQSPDFENFYIPDLPYV

Query:  NSPTSEDTPTTPGGRRSTSSMPSK
         SP + D PTT G  RS   +P K
Subjt:  NSPTSEDTPTTPGGRRSTSSMPSK

AT2G24960.1 unknown protein9.2e-0426.26Show/hide
Query:  RHGFLANILRMMQQRIPGCSIQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYDDLAIVF------GKDRATGSRATTTAEVESEPVMDEENED
        R+  L    + M+  +      WD  R  I     ++D+++K HP A+  + KS   Y+DL  +F      G D      A  T+E ++     E+N D
Subjt:  RHGFLANILRMMQQRIPGCSIQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYDDLAIVF------GKDRATGSRATTTAEVESEPVMDEENED

AT2G24960.2 unknown protein9.2e-0426.26Show/hide
Query:  RHGFLANILRMMQQRIPGCSIQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYDDLAIVF------GKDRATGSRATTTAEVESEPVMDEENED
        R+  L    + M+  +      WD  R  I     ++D+++K HP A+  + KS   Y+DL  +F      G D      A  T+E ++     E+N D
Subjt:  RHGFLANILRMMQQRIPGCSIQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYDDLAIVF------GKDRATGSRATTTAEVESEPVMDEENED

AT5G27260.1 unknown protein5.6e-0921.9Show/hide
Query:  RARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETF---------------------RHGFLANILRMMQQRIPGC--------SIQWDAERKCIDYQA
        R +   + W+ +E ++LV+ LV+ + + +WR  N T                       +    + ++ ++ +   C           WD   K      
Subjt:  RARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETF---------------------RHGFLANILRMMQQRIPGC--------SIQWDAERKCIDYQA

Query:  EIFDAWVKSHPSAKELQHKSFSFYDDLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGGRRSTSSM
        E++  ++K+HP+ K+L++ +F F+D+L I+FG+  ATG  A    +  ++ +     E+       DF+N Y  D    +  +    P    G   +  +
Subjt:  EIFDAWVKSHPSAKELQHKSFSFYDDLAIVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGGRRSTSSM

Query:  P--SKSRRSRSSSIGEYSDVVREGFQLLTKSIDGIAQRRRRE
        P   ++R  RS+S  E S ++     + +K +D I QR  R+
Subjt:  P--SKSRRSRSSSIGEYSDVVREGFQLLTKSIDGIAQRRRRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACATCCGGATCACGCGCAAGAGCCACAAAACATATTTGGACGGATAAGGAGGACCGAATCCTCGTGGAGTGTTTGGTCCAATGCGTTCAGTCCGGGCAC
TGGCGAGTTGATAACGAGACTTTCCGACATGGATTCCTAGCGAACATACTGCGGATGATGCAGCAGAGGATTCCAGGGTGTTCCATACAGTGGGATGCGGAGCGC
AAATGTATTGACTATCAGGCAGAGATATTTGACGCGTGGGTCAAGAGTCATCCGAGTGCAAAAGAACTGCAACATAAGTCATTTTCGTTCTATGACGACTTGGCC
ATTGTATTCGGCAAAGATAGAGCCACGGGAAGTCGTGCAACCACCACTGCAGAGGTCGAATCTGAACCTGTTATGGACGAGGAGAACGAGGACATCTTGAATAAC
CAGTCCCCGGACTTTGAGAACTTCTATATTCCTGATCTACCTTATGTCAACTCTCCCACATCAGAGGACACTCCAACTACCCCTGGCGGTAGAAGATCTACAAGT
AGCATGCCATCAAAAAGTAGGAGGTCCCGAAGTTCCTCGATTGGAGAGTACAGCGACGTGGTTCGGGAAGGATTCCAACTTCTGACGAAGTCCATTGACGGCATT
GCACAGCGTCGTCGCCGAGAACTTTACGCCGAGTTGCAATCAATTCCTAGTCTATCGGTGCAAGATGGTTTGACTGTTGCACACTCATTGCTTGCAGATCCGATG
TTGTTAAGCCACTTCATGGACTTCCCACCACAGTGGAAGTACGACTATTGCATGCAAGTTCTTGGGCAACAACGGGATCCAGTCCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACATCCGGATCACGCGCAAGAGCCACAAAACATATTTGGACGGATAAGGAGGACCGAATCCTCGTGGAGTGTTTGGTCCAATGCGTTCAGTCCGGGCAC
TGGCGAGTTGATAACGAGACTTTCCGACATGGATTCCTAGCGAACATACTGCGGATGATGCAGCAGAGGATTCCAGGGTGTTCCATACAGTGGGATGCGGAGCGC
AAATGTATTGACTATCAGGCAGAGATATTTGACGCGTGGGTCAAGAGTCATCCGAGTGCAAAAGAACTGCAACATAAGTCATTTTCGTTCTATGACGACTTGGCC
ATTGTATTCGGCAAAGATAGAGCCACGGGAAGTCGTGCAACCACCACTGCAGAGGTCGAATCTGAACCTGTTATGGACGAGGAGAACGAGGACATCTTGAATAAC
CAGTCCCCGGACTTTGAGAACTTCTATATTCCTGATCTACCTTATGTCAACTCTCCCACATCAGAGGACACTCCAACTACCCCTGGCGGTAGAAGATCTACAAGT
AGCATGCCATCAAAAAGTAGGAGGTCCCGAAGTTCCTCGATTGGAGAGTACAGCGACGTGGTTCGGGAAGGATTCCAACTTCTGACGAAGTCCATTGACGGCATT
GCACAGCGTCGTCGCCGAGAACTTTACGCCGAGTTGCAATCAATTCCTAGTCTATCGGTGCAAGATGGTTTGACTGTTGCACACTCATTGCTTGCAGATCCGATG
TTGTTAAGCCACTTCATGGACTTCCCACCACAGTGGAAGTACGACTATTGCATGCAAGTTCTTGGGCAACAACGGGATCCAGTCCCATGA
Protein sequenceShow/hide protein sequence
METSGSRARATKHIWTDKEDRILVECLVQCVQSGHWRVDNETFRHGFLANILRMMQQRIPGCSIQWDAERKCIDYQAEIFDAWVKSHPSAKELQHKSFSFYDDLA
IVFGKDRATGSRATTTAEVESEPVMDEENEDILNNQSPDFENFYIPDLPYVNSPTSEDTPTTPGGRRSTSSMPSKSRRSRSSSIGEYSDVVREGFQLLTKSIDGI
AQRRRRELYAELQSIPSLSVQDGLTVAHSLLADPMLLSHFMDFPPQWKYDYCMQVLGQQRDPVP