; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy05g014750 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy05g014750
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr05:18165556..18166281
RNA-Seq ExpressionLcy05g014750
SyntenyLcy05g014750
Gene Ontology termsNA
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.6e-4947.47Show/hide
Query:  IDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKTS
        I K E ++  +L +EE YW+ R++ DWLK GD+NTK+FHSK S R +KN +  ++D  G WV+D EGI      +F++LF+S+ P    IS A K +   
Subjt:  IDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKTS

Query:  ITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVYK
        ++++ N  LE  F+  D  RAL +M P+KAPGPDG  A  FQ++W I G+ +TK CL ILN  G +  LN T+I+LIPKV  P K+  FRPISLC VVY+
Subjt:  ITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVYK

Query:  IISKALANRMKKFLTKL
        I++KA+ANR+K  L  +
Subjt:  IISKALANRMKKFLTKL

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]2.0e-4947.49Show/hide
Query:  NLDSNIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATK
        +L   I++  +E+  LL++EE YW  RAK  WLK GDRNTK+FH++ S R K+NT+  + D  G W ++EE I + A+ YF  ++SS+ P    I   T+
Subjt:  NLDSNIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATK

Query:  DIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLC
         I   +TE+ N  L R F++ +   ALK ++P+KAPGPDG  A+ FQ+YW I G+++T + L++LN++  I +LN T ISLIPK N+P++M  FRPISLC
Subjt:  DIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLC

Query:  CVVYKIISKALANRMKKFL
         VVYK+ISK LANR+K  L
Subjt:  CVVYKIISKALANRMKKFL

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]2.4e-5045.61Show/hide
Query:  SANQGNLDSNIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLI
        S   G+L   I+   KE+ +LL+ EE  W+ R++  WL  GDRNTK+FH+K S R ++NT++ + D NG W +  EGI +VAV YF+ ++SS+   P  I
Subjt:  SANQGNLDSNIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLI

Query:  SVATKDIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFR
        S     I T++TE+ N  L + F+R + E AL  M+P+KAPGPDG  A+ FQ+YW+I G+DI  + LD+LN++  + ++N T I+L+PK+ +P KM  FR
Subjt:  SVATKDIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFR

Query:  PISLCCVVYKIISKALANRMKKFLTKLFLLLRRLSFLEG
        PISLC VVYK+ISK LANR+K  L ++ +   + +FL G
Subjt:  PISLCCVVYKIISKALANRMKKFLTKLFLLLRRLSFLEG

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]2.8e-4647.44Show/hide
Query:  NIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKT
        N+  AEK LE+LL  EE+YW+ R++ DWLK GDRNTK+FHSK S R+  N +  L+D +G  V  ++ I  V  +YF K+F++A  D   +S     I T
Subjt:  NIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKT

Query:  SITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVY
        +I+  QN  L + F+R D   ALK M   K+PG DG  AM +Q YW I GD +T+V L++LN  G     N T ++LIPK+  P++M+ FRPISLC V+Y
Subjt:  SITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVY

Query:  KIISKALANRMKKFL
        KIISK LA R+K+ L
Subjt:  KIISKALANRMKKFL

XP_030487384.1 uncharacterized protein LOC115704310 [Cannabis sativa]2.8e-4642.99Show/hide
Query:  IDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKTS
        I + E +L  L+E++E+YWR R++  WL+WGDRNTK+FH K S R KKN +  L+D+ G W +D+E + ++  +Y+ +LF+S++ + D+       I+  
Subjt:  IDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKTS

Query:  ITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVYK
        +T   N DL   F+  +  +A+K+MNP+KAPG DG  A+ +Q++W    +D+   CL++LN   ++  LN T ++LIPKV+ P+++E FRPISLC V+YK
Subjt:  ITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVYK

Query:  IISKALANRMKKFL
        I+SK LANRM+  L
Subjt:  IISKALANRMKKFL

TrEMBL top hitse value%identityAlignment
A0A2N9G3I8 Reverse transcriptase domain-containing protein7.1e-4843.36Show/hide
Query:  ANQGNLDSNIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLIS
        A Q  + +NI+   KEL  LL +EEK+WR R++  WL  GDRNTK+FH + ++R ++N++ +L D  G W E  E I E+ +EY+  LF+++  +P  ++
Subjt:  ANQGNLDSNIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLIS

Query:  VATKDIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRP
         AT +++  +T + N +L R F   + E+A++ M PSKAPGPDG   + +Q+YW + G D+T   L  LN+   +  +N T+I+LIPKV +PEK+  FRP
Subjt:  VATKDIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRP

Query:  ISLCCVVYKIISKALANRMKKFLTKL
        ISLC V+YK++SK LANR+K  L +L
Subjt:  ISLCCVVYKIISKALANRMKKFLTKL

A0A7N2L6Z9 Reverse transcriptase domain-containing protein2.4e-4845.91Show/hide
Query:  GNLDSNIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVAT
        G L   I+   +EL  LL++EE +W  R+K  WLK GDRNTK+FH++ S R K+NT+  + D  G W ED + I   AV YF+ ++S++  +P ++   T
Subjt:  GNLDSNIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVAT

Query:  KDIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISL
          I T ITE+ N +L R F+R +   ALK ++P+K+PGPDG  A+ FQ+YWDI G +++ + L++LN    +  +N T I LIPK ++P++M  FRPISL
Subjt:  KDIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISL

Query:  CCVVYKIISKALANRMKKFL
        C V+YK+ISK LANR+K FL
Subjt:  CCVVYKIISKALANRMKKFL

A0A7N2MR00 Uncharacterized protein6.4e-4947.73Show/hide
Query:  GNLDSNIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVAT
        G +   I+   KE+  LL+ EE  W  R++  W   GDRNTK+FHSKVS+R KKNT+  + D NG W +  E I   A+ YFK LF+++  +P  I   T
Subjt:  GNLDSNIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVAT

Query:  KDIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISL
          I T +T++ N+ L   F+R +   ALK M+P+KAPGPDG  A+ FQ+YWD+ G+DIT + L++LN+D  I  +N T I+LIPK+N P KM  FRPISL
Subjt:  KDIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISL

Query:  CCVVYKIISKALANRMKKFL
        C VVYK++SK +AN +K  L
Subjt:  CCVVYKIISKALANRMKKFL

A0A803P3X4 Uncharacterized protein7.8e-4743.93Show/hide
Query:  IDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKTS
        I + E +L  LLE++E+Y R R++  WLKWGDRNTK+FH K S R KKN +  L+D  G W +D+  + ++  +Y+ +LF+S++ +  +     + ++  
Subjt:  IDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKTS

Query:  ITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVYK
        +T D N DL    +  D  RA+KDMNP+KAPGPDG  A+ +Q++W    +D+  VCL++LN+  ++  LN T ++LIPK++ P+++E FRPISLC V+YK
Subjt:  ITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVYK

Query:  IISKALANRMKKFL
        I+SK LANRM+  L
Subjt:  IISKALANRMKKFL

A0A803QGT2 Uncharacterized protein1.3e-4647.44Show/hide
Query:  NIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKT
        N+  AEK LE+LL  EE+YW+ R++ DWLK GDRNTK+FHSK S R+  N +  L+D +G  V  ++ I  V  +YF K+F++A  D   +S     I T
Subjt:  NIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKT

Query:  SITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVY
        +I+  QN  L + F+R D   ALK M   K+PG DG  AM +Q YW I GD +T+V L++LN  G     N T ++LIPK+  P++M+ FRPISLC V+Y
Subjt:  SITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVY

Query:  KIISKALANRMKKFL
        KIISK LA R+K+ L
Subjt:  KIISKALANRMKKFL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein9.0e-0823.51Show/hide
Query:  SNIDKAEKELEQLLEEEEKY---------WRLRAK------EDWLKWGDRNTKWFHSKVS-----------RRNKKNTLDRLKDANGAWVEDEEGIGEVA
        S ID    +L++L ++E+ +          ++RA+      +  L+  + +  WF  +++           ++ +KN +D +K+  G    D   I    
Subjt:  SNIDKAEKELEQLLEEEEKY---------WRLRAK------EDWLKWGDRNTKWFHSKVS-----------RRNKKNTLDRLKDANGAWVEDEEGIGEVA

Query:  VEYFKKLFSSAKPD-PDLISVATKDIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNS
         EY+K L+++   +  ++ +         + +++   L R  +  +    +  +   K+PGPDG  A  +QRY     +++    L +  +  + G L +
Subjt:  VEYFKKLFSSAKPD-PDLISVATKDIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNS

Query:  TW----ISLIPKVNH-PEKMEGFRPISLCCVVYKIISKALANRMKKFLTKL
        ++    I LIPK      K E FRPISL  +  KI++K LANR+++ + KL
Subjt:  TW----ISLIPKVNH-PEKMEGFRPISLCCVVYKIISKALANRMKKFLTKL

P11369 LINE-1 retrotransposable element ORF2 protein4.0e-0824.65Show/hide
Query:  DRNTKWFHSKVSRRNK-----------KNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKT-SITEDQNRDLERLFSRGDNE
        ++   WF  K+++ +K           K  ++++++  G    D E I      ++K+L+S+   + D +       +   + +DQ   L    S  + E
Subjt:  DRNTKWFHSKVSRRNK-----------KNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKT-SITEDQNRDLERLFSRGDNE

Query:  RALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTW----ISLIPK-VNHPEKMEGFRPISLCCVVYKIISKALANRMKKFL
          +  +   K+PGPDG  A  +Q +     +D+  +   + +     G L +++    I+LIPK    P K+E FRPISL  +  KI++K LANR+++ +
Subjt:  RALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTW----ISLIPK-VNHPEKMEGFRPISLCCVVYKIISKALANRMKKFL

Query:  TKLFLLLRRLSFLEG
         K  +   ++ F+ G
Subjt:  TKLFLLLRRLSFLEG

P14381 Transposon TX1 uncharacterized 149 kDa protein4.2e-1326.34Show/hide
Query:  DRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAP
        DR +++F++   ++  +  +  L   +G  +ED E I + A  +++ LFS     PD        +   ++E +   LE   +  +  +AL+ M  +K+P
Subjt:  DRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIKTSITEDQNRDLERLFSRGDNERALKDMNPSKAP

Query:  GPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVYKIISKALANRMKKFLTKL
        G DG     FQ +WD  G D  +V  +               +SL+PK      ++ +RP+SL    YKI++KA++ R+K  L ++
Subjt:  GPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVYKIISKALANRMKKFLTKL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein8.3e-1731.09Show/hide
Query:  EKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVAT----KDIKTSITEDQNRD-LE
        E ++R +++  WL+ GD NT++FH  +     KN +  L+  +   VE+   + E+ V Y+  L  S   D D+++  +    KDI      D     L 
Subjt:  EKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVAT----KDIKTSITEDQNRD-LE

Query:  RLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVYKIIS
         L S  +   A+  M  +KAPGPD   A  F   W +  D       +       + + N+T I+LIPKV   +++  FRP+S C VVYKII+
Subjt:  RLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVYKIIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTCTGCAAATCAAGGGAATTTAGACAGCAACATCGACAAAGCAGAGAAGGAATTAGAACAGTTGCTTGAAGAAGAAGAGAAGTATTGGAGATTGAGAGCTAAGGA
AGATTGGCTAAAGTGGGGTGATAGAAACACTAAGTGGTTTCATTCCAAAGTCAGCCGAAGAAACAAGAAAAACACTTTGGATAGATTGAAAGATGCGAATGGAGCCTGGG
TGGAAGATGAGGAGGGCATAGGTGAGGTGGCGGTGGAGTACTTCAAAAAGCTTTTCTCTTCAGCCAAGCCCGATCCGGATTTGATCAGTGTAGCCACGAAAGATATCAAA
ACTAGCATTACTGAGGATCAAAACAGGGACCTTGAGAGGCTGTTTTCGAGAGGTGATAATGAAAGGGCGTTGAAAGACATGAATCCTTCTAAGGCACCAGGTCCTGATGG
TGCCCACGCCATGGTCTTTCAGAGGTATTGGGATATTGCCGGCGATGATATTACTAAGGTGTGCTTAGACATTCTTAACAATGATGGTGAGATAGGGCAGTTAAATAGCA
CGTGGATCTCTTTGATTCCCAAGGTGAATCACCCCGAGAAAATGGAAGGTTTTCGGCCTATCAGTTTATGCTGTGTGGTGTACAAAATCATCTCCAAAGCCCTTGCGAAT
AGAATGAAAAAGTTCTTGACAAAGTTATTTCTCCTTCTCAGACGACTTTCATTCCTGGAAGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTCTGCAAATCAAGGGAATTTAGACAGCAACATCGACAAAGCAGAGAAGGAATTAGAACAGTTGCTTGAAGAAGAAGAGAAGTATTGGAGATTGAGAGCTAAGGA
AGATTGGCTAAAGTGGGGTGATAGAAACACTAAGTGGTTTCATTCCAAAGTCAGCCGAAGAAACAAGAAAAACACTTTGGATAGATTGAAAGATGCGAATGGAGCCTGGG
TGGAAGATGAGGAGGGCATAGGTGAGGTGGCGGTGGAGTACTTCAAAAAGCTTTTCTCTTCAGCCAAGCCCGATCCGGATTTGATCAGTGTAGCCACGAAAGATATCAAA
ACTAGCATTACTGAGGATCAAAACAGGGACCTTGAGAGGCTGTTTTCGAGAGGTGATAATGAAAGGGCGTTGAAAGACATGAATCCTTCTAAGGCACCAGGTCCTGATGG
TGCCCACGCCATGGTCTTTCAGAGGTATTGGGATATTGCCGGCGATGATATTACTAAGGTGTGCTTAGACATTCTTAACAATGATGGTGAGATAGGGCAGTTAAATAGCA
CGTGGATCTCTTTGATTCCCAAGGTGAATCACCCCGAGAAAATGGAAGGTTTTCGGCCTATCAGTTTATGCTGTGTGGTGTACAAAATCATCTCCAAAGCCCTTGCGAAT
AGAATGAAAAAGTTCTTGACAAAGTTATTTCTCCTTCTCAGACGACTTTCATTCCTGGAAGGCTGA
Protein sequenceShow/hide protein sequence
MDSANQGNLDSNIDKAEKELEQLLEEEEKYWRLRAKEDWLKWGDRNTKWFHSKVSRRNKKNTLDRLKDANGAWVEDEEGIGEVAVEYFKKLFSSAKPDPDLISVATKDIK
TSITEDQNRDLERLFSRGDNERALKDMNPSKAPGPDGAHAMVFQRYWDIAGDDITKVCLDILNNDGEIGQLNSTWISLIPKVNHPEKMEGFRPISLCCVVYKIISKALAN
RMKKFLTKLFLLLRRLSFLEG