CuGenDBv2

Lag0016403 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0016403
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: chr12:37355785..37360862
RNA-Seq Expression: Lag0016403
Synteny: Lag0016403
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001969 - Aspartic peptidase, active site
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (e value, % identity, alignment)
RVW54082.1 Retrovirus-related Pol polyprotein from transposon 297 [Vitis vinifera]; e value 5.9e-68; identity 34.98%
Query:  RKLEVPIFKGEDEEDPDGWLHRVERRENIDEHLEGIPKVIVGTISTD---------------------VPRRSVCSFNEVATGNHREGISPA--------
        R++E+P+F G   E+PDGW+    R   I   L    K++   +S D                     + RR +  F     G+  E             
Subjt:  RKLEVPIFKGEDEEDPDGWLHRVERRENIDEHLEGIPKVIVGTISTD---------------------VPRRSVCSFNEVATGNHREGISPA--------

Query:  ----FEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQCDWVV
            FE  A  LK +S  V+ES F  GL  EI++E+   QP GL   M MAQ +ED N+     R         S K  ++SN               VV
Subjt:  ----FEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQCDWVV

Query:  NNGDSNARRGSKGVTNPMLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANESEVAALSVN
             + RR         +KR+T++ELQ +REKGLC+KC+EK++PG RC KKEL+++++  +E+ED    D        +  T E       ++  LS+N
Subjt:  NNGDSNARRGSKGVTNPMLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANESEVAALSVN

Query:  SMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADVILDIT
        S+VG+ +P TMK+KGTI  + V++L+DSGA+HNF+S E+VQ+L L ++ + SYG+++GTG  V G G+C+GV +++  LT++ DFLPL LG+ DVIL + 
Subjt:  SMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADVILDIT

Query:  WLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVH---SELKEVLYKFAAVFE
        WL TLG ++ ++++  M  ++G   + L+GD SL +++VSLK+M ++     QGV ++L    + +          +SEGV      +KEVL +   +FE
Subjt:  WLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVH---SELKEVLYKFAAVFE

Query:  SQMTVPPSRNRDHGIELEPGVGAVNV
            +PPSR+ DH I+L PG   VNV
Subjt:  SQMTVPPSRNRDHGIELEPGVGAVNV

XP_022848903.1 uncharacterized protein LOC111371244 [Olea europaea var. sylvestris]; e value 6.5e-75; identity 35.08%
Query:  EKKRVWRQSWTYSSGTANDDEKKATSSDG------IQAPSSREVP-----------LFDMRLRKLEVPIFKGEDEEDPDGWLHRVERRENIDEHLEGIPK
        EK  +    W    G A+D + K  +         +++PS+ E P             D R R+LE+P+F G   E+PDGWL + ER  +I+ H     K
Subjt:  EKKRVWRQSWTYSSGTANDDEKKATSSDG------IQAPSSREVP-----------LFDMRLRKLEVPIFKGEDEEDPDGWLHRVERRENIDEHLEGIPK

Query:  VIVGTIS---------------------TDVPRRSVCSFNEVATGNHRE------------GISPAFEQYAATLKDVSDGVLESKFECGLKEEIQSEMRK
        +    +                       D+    +  F     G + E                 FE  AA L  V + +LE  F  GLK  I++E+R 
Subjt:  VIVGTIS---------------------TDVPRRSVCSFNEVATGNHRE------------GISPAFEQYAATLKDVSDGVLESKFECGLKEEIQSEMRK

Query:  FQPVGLKAKMLMAQLIEDDNVV----QEKKRTGKAQGQNASPKNNTN----SNGASGGSSSLAGQCDWVVNNGDSNARRGSKGVTNPMLKRVTDNELQRK
         +P GL   M +AQ +ED N +    Q     GK +  + SP  +       N    GS S             S+ R    G      K+++D ELQ K
Subjt:  FQPVGLKAKMLMAQLIEDDNVV----QEKKRTGKAQGQNASPKNNTN----SNGASGGSSSLAGQCDWVVNNGDSNARRGSKGVTNPMLKRVTDNELQRK

Query:  REKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANESEVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGA
        RE+GLCY+CDEK+ PG +C+ KEL ++V+QGEE+ +  G  Q++E            +    EV  LS+NS+VG+++PK+MK+KGTI G  V+VLID GA
Subjt:  REKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANESEVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGA

Query:  SHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRG
        +HNFIS ++  +L +   P+  YGI++GTG  V G GVCKGV++++  + ++ DFLPL LG  DVIL + WLET+GK+Q D+    M  ++G   V L+G
Subjt:  SHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRG

Query:  DRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVHSELKEVLYKFAAVFESQMTVPPSRNRDHGIELEPGVGAVNV
        D SL K+Q S+K+M+K+F  GDQGVLI+L +L     +     P        S   E+L ++  VFE   T+PPSR+RDH I L+     VNV
Subjt:  DRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVHSELKEVLYKFAAVFESQMTVPPSRNRDHGIELEPGVGAVNV

XP_031737572.1 uncharacterized protein LOC116402461 [Cucumis sativus]; e value 5.3e-77; identity 52.55%
Query:  GSKGVTNPMLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANESEVAALSVNSMVGIDSPK
        G+K   +   +++TDNE++ K+EKG C++CD+K++P  RCK++EL I+V+Q  ED   E TDQ+ E  E    TN + + N +E+A LS+NS+VG++S K
Subjt:  GSKGVTNPMLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANESEVAALSVNSMVGIDSPK

Query:  TMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADVILDITWLETLGKIQ
        T+K+KG I+G+ VVVLID GA+HNFI+ EVV++L ++V    +YG+VLGTGG V   GVCK V L I++L+I H+FLPLPLGS DVIL +TWLETLGK+ 
Subjt:  TMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADVILDITWLETLGKIQ

Query:  FDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVHSELKEVLYKFAAVFESQMTVPPSRNRD
        FDY+LSEM+F  G W V L+GDRSLV+SQVSLKSMMK+F   DQGVLI+LS +E     E+          +  E+++VL  F +VFE    +PP  + D
Subjt:  FDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVHSELKEVLYKFAAVFESQMTVPPSRNRD

Query:  HGIELEPGVGAVNV
        H IELEPG  A+NV
Subjt:  HGIELEPGVGAVNV

XP_034697296.1 uncharacterized protein K02A2.6-like [Vitis riparia]; e value 2.5e-66; identity 34.14%
Query:  DMRLRKLEVPIFKGEDEEDPDGWLHRVER-----RENIDEHLEGIPKVIVGT----ISTDVPRRSVCSFNEV------------ATGNHREGIS------
        + R RKLE+P+F G    +PDGW+ + ER     R   +E LE       G        +  +RS+  + E+            A   H + ++      
Subjt:  DMRLRKLEVPIFKGEDEEDPDGWLHRVER-----RENIDEHLEGIPKVIVGT----ISTDVPRRSVCSFNEV------------ATGNHREGIS------

Query:  -----PAFEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKR---------TGKAQGQNASPKNNTNSNGASGG
               F + +A L+++SD V    F  GLK EI+ E+R  +P  L   M +AQ IE+   + +  +         TG ++G     + + +   A+  
Subjt:  -----PAFEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKR---------TGKAQGQNASPKNNTNSNGASGG

Query:  SSSLAGQCDWVVNNGDSNARRGSKGVTNPMLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQ
         + +AG+                       ++R++D+ELQ+KREKGLC++CDEKW PG RCKKKEL +++I   ++E+    D  EE          D++
Subjt:  SSSLAGQCDWVVNNGDSNARRGSKGVTNPMLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQ

Query:  ANE-SEVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLP
        A E ++VA +S++S+VG+ +PKTMK+KG +  Q VVVLID GA+HNFIS ++V+++ L +  S  YG+ +GTG  V G G+C+GV L +  + ++ +FLP
Subjt:  ANE-SEVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLP

Query:  LPLGSADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVHSELKE
        L LGSADVIL I WLETLG    +++   M F++G+ +V LRGD SL K+ VSLK+MM++      G+L++L+ LE    ++S   P          L +
Subjt:  LPLGSADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVHSELKE

Query:  VLYKFAAVFESQMTVPPSRNRDHGIELEPGVGAVNV
        +L  +A VF+  M +PP R  +H I L+     ++V
Subjt:  VLYKFAAVFESQMTVPPSRNRDHGIELEPGVGAVNV

XP_038904464.1 uncharacterized protein LOC120090832 [Benincasa hispida]; e value 2.0e-87; identity 37.91%
Query:  ERWKEGCRVGREM--AEIAVKQVDLEVNLGSHMAEMGEKKRVWRQSWTYSSGTANDDEKKATSSDGIQAPSSREVPLFDMRLRKLEVPIFKGEDEEDPDG
        +R++E  ++   M  A+I      LE  +   +    +K  + ++  + SS     D  K  + + +     REV LFDMRLRKLE+PIFKGE  EDP G
Subjt:  ERWKEGCRVGREM--AEIAVKQVDLEVNLGSHMAEMGEKKRVWRQSWTYSSGTANDDEKKATSSDGIQAPSSREVPLFDMRLRKLEVPIFKGEDEEDPDG

Query:  WLHRVER---------RENIDEHL---------------EGIPKVIVGTISTDVPRRSV--------CSFNEVATGNHREGISPAFEQYAATLKDVSDGV
        W HRVER         ++ I+  +               E  P          +  R +          F  +            FEQ   +LKD+SD +
Subjt:  WLHRVER---------RENIDEHL---------------EGIPKVIVGTISTDVPRRSV--------CSFNEVATGNHREGISPAFEQYAATLKDVSDGV

Query:  LESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQCDWVVNNGDSNARR----------
        LESKF  GLK +IQ EMR F+ +GLK KM MAQ+IED     E+ R  K  G   +P  +T ++ ++ G ++  GQ     N    +  R          
Subjt:  LESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQCDWVVNNGDSNARR----------

Query:  ------------GSKGVTNPMLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEE---VCEPEPVTNED--VQANES
                      +  T    KR++DN++Q +R+KGLCY+C+EK+ PG RCK+KEL I++   EE    EGT++ EE   V  P   T E   ++  + 
Subjt:  ------------GSKGVTNPMLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEE---VCEPEPVTNED--VQANES

Query:  EVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGS
        E A LS+NS+  IDSP+T+KV+G I  + VVVLIDSGASHNFI  E+V  L L   P+ SYGI+LG G  V   GVCKGVIL +S+LTII+D  PLPLG+
Subjt:  EVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGS

Query:  ADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVHSELKEVLYKF
         DV+L + WL TLG+++ D+  SEM+FQIG W V L+G+R+L+K+Q+SLKSMMK      QG+L++LS+L          L  +  +G   +++E+L K+
Subjt:  ADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVHSELKEVLYKF

Query:  AAVFESQMTVPP
         +VF++   +PP
Subjt:  AAVFESQMTVPP

TrEMBL top hits (e value, % identity, alignment)
A0A438F372 Retrovirus-related Pol polyprotein from transposon 297; e value 2.9e-68; identity 34.98%
Query:  RKLEVPIFKGEDEEDPDGWLHRVERRENIDEHLEGIPKVIVGTISTD---------------------VPRRSVCSFNEVATGNHREGISPA--------
        R++E+P+F G   E+PDGW+    R   I   L    K++   +S D                     + RR +  F     G+  E             
Subjt:  RKLEVPIFKGEDEEDPDGWLHRVERRENIDEHLEGIPKVIVGTISTD---------------------VPRRSVCSFNEVATGNHREGISPA--------

Query:  ----FEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQCDWVV
            FE  A  LK +S  V+ES F  GL  EI++E+   QP GL   M MAQ +ED N+     R         S K  ++SN               VV
Subjt:  ----FEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQCDWVV

Query:  NNGDSNARRGSKGVTNPMLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANESEVAALSVN
             + RR         +KR+T++ELQ +REKGLC+KC+EK++PG RC KKEL+++++  +E+ED    D        +  T E       ++  LS+N
Subjt:  NNGDSNARRGSKGVTNPMLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANESEVAALSVN

Query:  SMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADVILDIT
        S+VG+ +P TMK+KGTI  + V++L+DSGA+HNF+S E+VQ+L L ++ + SYG+++GTG  V G G+C+GV +++  LT++ DFLPL LG+ DVIL + 
Subjt:  SMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADVILDIT

Query:  WLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVH---SELKEVLYKFAAVFE
        WL TLG ++ ++++  M  ++G   + L+GD SL +++VSLK+M ++     QGV ++L    + +          +SEGV      +KEVL +   +FE
Subjt:  WLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVH---SELKEVLYKFAAVFE

Query:  SQMTVPPSRNRDHGIELEPGVGAVNV
            +PPSR+ DH I+L PG   VNV
Subjt:  SQMTVPPSRNRDHGIELEPGVGAVNV

A0A438HNN1 Retrovirus-related Pol polyprotein from transposon 17.6; e value 4.7e-63; identity 37.68%
Query:  FEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQCDWVVNNGD
        FE  A  LK + D V+ES F  GL  EI++E+   QP  L   M MAQ +ED N+     R   AQ    +PK                           
Subjt:  FEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQCDWVVNNGD

Query:  SNARRGSKGVTNPMLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANESEVAALSVNSMVG
                   +  ++R+T++ELQ  REKGLC+KC+EK++PG RC KKELQ++++  +E+ED    +Q +     EP   E   A E     LS+NS+VG
Subjt:  SNARRGSKGVTNPMLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANESEVAALSVNSMVG

Query:  IDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADVILDITWLET
        + +  TMK+KGTI  + V++L+DSGA+HNF+S E+VQ+L L ++ + SYG+++GTG  V G G+C+GV + +  LT++ DFLPL LG+ DVIL + WL T
Subjt:  IDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADVILDITWLET

Query:  LGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVH---SELKEVLYKFAAVFESQMT
        LG ++ ++++  M  ++G   + L+GD SL +++VSLK+M ++     QGV ++L    + +         ++SEGV      +KEVL +   +F     
Subjt:  LGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVH---SELKEVLYKFAAVFESQMT

Query:  VPPSRNRDHGIELEPGVGAVNV
        +PPSR+ DH I+L PG   VNV
Subjt:  VPPSRNRDHGIELEPGVGAVNV

A0A5C7IJS7 Uncharacterized protein; e value 9.9e-61; identity 35.11%
Query:  DMRLRKLEVPIFKGEDEEDPDGWLHRVE-----RRENIDEHLEGIPKVIVGT---------------ISTDVPRRSVCSFNEVATGNHREGI--------
        D R RKLE+P+F G    +PDGW+ + E     +R N +E LE       G                +  ++    +  F     G+  E          
Subjt:  DMRLRKLEVPIFKGEDEEDPDGWLHRVE-----RRENIDEHLEGIPKVIVGT---------------ISTDVPRRSVCSFNEVATGNHREGI--------

Query:  ----SPAFEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQCD
               F ++ A L +VSD +  S+F  GL  EI++E+R   P+ L   M +AQ IE   +      T K  G  ++    T  +G +   SS  G   
Subjt:  ----SPAFEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQCD

Query:  WVVNNGDSNARRGSKGVTNPM------LKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANE
                 A R + G T P       L+R+TD+ELQ KR  GLCY+CDEKW+PG +CKKKEL +++   EEDE+     +   V   EPV      +  
Subjt:  WVVNNGDSNARRGSKGVTNPM------LKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANE

Query:  SEVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLG
        +E   +S+NS+VG+ +PKTMK+KG +  Q VV LID GA+HNFIS ++VQKL L ++ + +YG+ +GTG  V G G+CKGV L +  + I+ +FLPL LG
Subjt:  SEVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLG

Query:  SADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKL--------SNLESNNYKESEPLPLKVSEGVH-
        S+DVIL I WL TLG    +++L  M FQ+G+ +V LRGD SL K+ VSLK+MM++    + G+L++L        S LE + + +     +      H 
Subjt:  SADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKL--------SNLESNNYKESEPLPLKVSEGVH-

Query:  --SELKEVLY--------KFAAVFESQMTVPPSRNRDHGIELEP
          + L+EV+Y        K   V  S  T+PP  + D  + + P
Subjt:  --SELKEVLY--------KFAAVFESQMTVPPSRNRDHGIELEP

A5B2I6 Reverse transcriptase domain-containing protein; e value 1.7e-60; identity 32.33%
Query:  FGIRAVIFGEWWERWKEG---CRVGREMAEIAVKQVDLEVNLGSHMAEMGEKKRVWRQSWTYSSGTANDDEKKATSSDGIQAPSSREVPLFDMR------
        FGIRA I  +  ++  EG   C + +E+ EI  +   L     + + E   +K V   S   ++ TA   E +A    G+  PS   +   +MR      
Subjt:  FGIRAVIFGEWWERWKEG---CRVGREMAEIAVKQVDLEVNLGSHMAEMGEKKRVWRQSWTYSSGTANDDEKKATSSDGIQAPSSREVPLFDMR------

Query:  ---LRKLEVPIFKGEDEEDPDGWLHRVERRENIDEHLEGIPKVIVGTISTD---------------------VPRRSVCSFNEVATGN--------HREG
            R++E+P+F G   E+PDGW+ R + R      L    K++   +S D                     + RR +  F     G+         ++G
Subjt:  ---LRKLEVPIFKGEDEEDPDGWLHRVERRENIDEHLEGIPKVIVGTISTD---------------------VPRRSVCSFNEVATGN--------HREG

Query:  ISPA----FEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQC
           A    FE     LK +S+ V+ES F  GL  EI++E R  QP GL   M MAQ +ED N+        +A  +   PK+    + A+ G        
Subjt:  ISPA----FEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQC

Query:  DWVVNNGDSNARR----GSKGVTNPM---LKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQA
        +W +  G++   R    G K ++      +KR+T++ELQ +REKGL +KC+EK++PG RC KKEL+++++  +E+ED    +Q ++    EP   E   A
Subjt:  DWVVNNGDSNARR----GSKGVTNPM---LKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQA

Query:  NESEVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLP
         E     LS+NS+VG+ +P TMK+KGTI  + V++L+DSGA+HNF+S E+VQ+L L ++ + SYG+++GTG  V G G+C+GV +++  LT++ DFLPL 
Subjt:  NESEVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLP

Query:  LGSADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVHSELKEVL
        LG+ DVIL + WL TLG ++ ++++  M  ++G   + L+GD SL +++ S  S +       +GV                       + V   +KEVL
Subjt:  LGSADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVHSELKEVL

Query:  YKFAAVFESQMTVPPSRNRDHGIELEPGVGAVNV
         +   +FE    +PPSR+ DH I+L  G   VNV
Subjt:  YKFAAVFESQMTVPPSRNRDHGIELEPGVGAVNV

J3SDF5 Ty3/gypsy retrotransposon protein; e value 7.3e-64; identity 32.9%
Query:  RLRKLEVPIFKGEDEEDPDGWLHRVER---------RENIDEHLEGIPKVIVGTISTDVPRRS-----------VCSFNEVATGN-HREGISPA------
        R +KL++P F   D+ DPDGW+ R ER          E ++  +  +    +     +  RR            +  F  +  G+ H + +S        
Subjt:  RLRKLEVPIFKGEDEEDPDGWLHRVER---------RENIDEHLEGIPKVIVGTISTDVPRRS-----------VCSFNEVATGN-HREGISPA------

Query:  -----FEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSN-------GASGGSSSL
             F + AA L  + + +L  KF  GL  E+QSE+R   P  L   M +A  +E+ N V   +RTG   G  +      NSN       G+ GGS+  
Subjt:  -----FEQYAATLKDVSDGVLESKFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSN-------GASGGSSSL

Query:  AGQCDWVVNNGDSNARRGSKGVTNP---------MLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVT
        A    W +N   SNA + S     P          ++R+T+ ELQ KR KGLC+KCDEKW  G +C++KEL ++ ++  E+++ EG     E   P P  
Subjt:  AGQCDWVVNNGDSNARRGSKGVTNP---------MLKRVTDNELQRKREKGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVT

Query:  NEDVQANESEVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTI-SDLTII
               E     +S+NS++G+ +PKTMK+ G I    VVV+ID GA+HNF+S + + KLG+ V+ S  +G+ LG G  V G G+C+ V L +   L ++
Subjt:  NEDVQANESEVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTI-SDLTII

Query:  HDFLPLPLGSADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVH
         DFLPL LG++DVIL + WLETLG +  +++  +M FQ+G     L GD +L +S+VSLK+M+++      G+ ++ + +E+           KV + + 
Subjt:  HDFLPLPLGSADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGDQGVLIKLSNLESNNYKESEPLPLKVSEGVH

Query:  SELKEVLYKFAAVFESQMTVPPSRNRDHGIELEPGVGAVNV
          L+E++ +F  VFE+ + +PP R  +H I L+ G   V V
Subjt:  SELKEVLYKFAAVFESQMTVPPSRNRDHGIELEPGVGAVNV

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 6.9e-11; identity 50.7%
Query:  HACSQNGIVERKHRHIVDMGLTLLSHASLSLDFWDNAFSTAVYIINRLPTIVHQQPIDQSNRPLSPSLATT
        H    NG+ ERKHRHIV+ GLTLLSHAS+   +W  AF+ AVY+INRLPT     P+ Q   P      T+
Subjt:  HACSQNGIVERKHRHIVDMGLTLLSHASLSLDFWDNAFSTAVYIINRLPTIVHQQPIDQSNRPLSPSLATT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 4.8e-12; identity 68%
Query:  HACSQNGIVERKHRHIVDMGLTLLSHASLSLDFWDNAFSTAVYIINRLPT
        H    NG+ ERKHRHIV+MGLTLLSHAS+   +W  AFS AVY+INRLPT
Subjt:  HACSQNGIVERKHRHIVDMGLTLLSHASLSLDFWDNAFSTAVYIINRLPT

Arabidopsis top hits (e value, % identity, alignment)
AT3G29750.1 Eukaryotic aspartyl protease family protein; e value 6.0e-10; identity 31.08%
Query:  MVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLG--SADVILDI
        ++ +   K M+  G I    VVV IDSGA+ NFI  E+   L L  S +    ++LG    +   G C G+ L + ++ I  +FL L L     DVIL  
Subjt:  MVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLG--SADVILDI

Query:  TWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKS
         WL  LG+   +++  +  F      + L  +   ++ QV+ K  MKS
Subjt:  TWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKS

AT3G30770.1 Eukaryotic aspartyl protease family protein; e value 1.1e-06; identity 32.21%
Query:  KTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADV----------ILD
        K M+  G I    VVV+IDSGA++NFIS E+   L L  S +    ++LG    +   G C G+ L + ++ I  +FL L L   DV           L+
Subjt:  KTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQKLGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADV----------ILD

Query:  ITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKS
          WL  L +  F +      F    W      D+ L   QV+ K  MKS
Subjt:  ITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase); e value 6.4e-04; identity 44.62%
Query:  SKVGIFK--PK--VYLTEYIEVEPPNVKEALKCDHWIQAMKEEYNALLHNDTWSLEEVPLTKGSL
        SK GI K  PK  + +T  I+ EP +V  ALK   W QAM+EE +AL  N TW L   P+ +  L
Subjt:  SKVGIFK--PK--VYLTEYIEVEPPNVKEALKCDHWIQAMKEEYNALLHNDTWSLEEVPLTKGSL


Sequences
CDS sequence
ATGCCTCCTTGCAACGAGTGTATTCCATGTTACTACCCAAGAAAGTCGAATCCAGCGACATACCTCTTCAACTGTGAACAGCGATGGTTCTTTACCATCAGTGAACTTGA
CTCAGTCAGAATACAGCAGCCGAATAATAATGCTAATGATGCTGCCGGAAGAACTCTAATCAAGGGAATCAAAATTATCAAGGAGAATCAGTGGTATCCAGATTCAGGGG
CATCAAATCATGTCACCATGATGCTTCAAATCTTTCATTTGGGACAGAGTATCAAGGAGAGAATAAAGTCCACATTGGCAATGGTGCAGGATCTTTCCAATGGCCAAATA
CTTCTCCAAGGCAAACTAACTGATGGACTCTACAGTTTCAATCTGGAAAAGGCTGATTCTCCCTGTTTTTCGTCTTTTGATGGTTTCTCTTCCAAGGCCATTACTCCACA
AGTATTGACTGCAAACTTACTTGGGCATTCTGCCATTGCTACTGTTAAGAATGTTCTTTCTCTTTGTAAAATCACTTCATCAAATAAGAATCCACATTTGTGTCATGCTT
GTTCGCAGAATGGGATAGTAGAGAGAAAACACCGTCATATAGTAGATATGGGTCTTACACTCTTATCTCATGCTTCTCTTTCTTTGGACTTCTGGGATAATGCCTTTTCC
ACTGCAGTTTACATTATCAACCGTCTTCCTACTATAGTCCATCAGCAGCCCATTGATCAGTCAAATCGTCCTTTATCACCATCCTTGGCTACTACTTCAAGCAACCCATC
TTCTTTGCCTAATTCACCCTCCCAGCATTCTGTCTCACCAAATGCTTTTCAACCTTCTTCTTCTAATATTTTGAATACCAGTCCTGCACTTGGGCTAGATGAACTTGCTG
CTACAAATGTTCATTCAAGCAAAGTCGGAATTTTCAAACCGAAAGTATATCTTACTGAATATATTGAGGTTGAACCACCAAATGTCAAAGAGGCCCTTAAGTGTGATCAT
TGGATTCAAGCAATGAAAGAGGAGTACAATGCTCTGCTACATAATGATACATGGTCCCTGGAAGAAGTCCCTCTGACAAAAGGATCATTGGCTGCAAGTGGGTTGAGGCC
TATCAATTTTGGTATCAGAGCAGTGATTTTTGGGGAATGGTGGGAAAGATGGAAGGAAGGGTGTCGCGTTGGAAGAGAAATGGCCGAGATAGCTGTTAAGCAGGTCGATT
TGGAGGTGAACTTGGGATCTCATATGGCGGAAATGGGAGAGAAAAAACGAGTGTGGAGGCAAAGCTGGACTTACAGCTCCGGAACCGCGAATGATGACGAGAAGAAGGCC
ACGAGTAGTGACGGAATCCAGGCTCCTAGTAGTCGTGAAGTGCCCCTGTTCGACATGCGCCTAAGGAAGTTAGAGGTGCCCATATTTAAGGGGGAAGATGAGGAAGACCC
AGATGGTTGGTTGCATCGGGTGGAGCGAAGAGAGAACATCGATGAGCACTTGGAAGGAATTCCGAAGGTTATTGTTGGAACGATTTCGACCGACGTCCCAAGGAGATCGG
TATGCTCGTTTAATGAAGTTGCAACAGGAAACCACCGTGAGGGAATATCGCCGGCGTTTGAGCAATACGCGGCGACACTCAAGGACGTGAGTGACGGCGTGCTCGAGAGT
AAGTTCGAATGTGGGTTGAAGGAGGAGATCCAAAGTGAGATGAGGAAGTTTCAGCCCGTGGGCCTGAAGGCGAAGATGTTGATGGCCCAATTGATTGAAGATGACAACGT
CGTCCAAGAAAAGAAAAGAACGGGAAAAGCCCAAGGCCAAAACGCAAGCCCAAAAAATAACACAAACTCGAATGGGGCAAGCGGTGGATCCAGTAGTCTGGCGGGTCAAT
GCGACTGGGTCGTCAACAACGGTGACTCCAATGCGAGACGCGGCAGTAAAGGGGTCACAAATCCGATGCTTAAACGAGTGACGGATAATGAATTGCAAAGGAAGAGGGAG
AAGGGATTATGTTACAAATGTGATGAAAAGTGGAACCCGGGTCCTCGATGTAAAAAGAAGGAATTACAAATCATGGTAATTCAAGGTGAGGAAGATGAGGACCGAGAGGG
AACAGACCAGATGGAGGAAGTCTGTGAACCGGAGCCTGTCACGAATGAGGATGTTCAGGCCAACGAATCTGAAGTAGCAGCCTTGTCTGTAAATTCTATGGTTGGAATTG
ATTCACCAAAGACGATGAAAGTCAAGGGAACAATCCAAGGGCAAGGAGTCGTAGTGTTGATCGACAGCGGGGCTTCGCACAATTTTATTTCAACGGAGGTCGTTCAGAAA
TTAGGGCTCACGGTGTCGCCATCCGCCAGCTATGGCATTGTTTTGGGCACTGGAGGTCCAGTTTCGGGAGCGGGGGTGTGCAAAGGTGTGATCCTCACTATCTCTGATTT
AACTATTATTCACGATTTCCTTCCTCTGCCCTTGGGCAGTGCAGATGTAATTTTGGACATCACATGGTTGGAAACGCTGGGAAAAATACAATTCGATTACCGGCTGTCGG
AAATGGATTTTCAAATTGGGAGTTGGACGGTTCAGCTACGAGGGGATCGTAGTCTGGTAAAATCCCAGGTCTCTCTCAAGTCGATGATGAAATCTTTTGGGATGGGGGAT
CAAGGCGTATTAATCAAGTTGAGCAATTTAGAGTCAAACAATTACAAAGAAAGTGAGCCGTTACCACTGAAGGTGAGTGAAGGAGTTCATTCCGAATTGAAAGAGGTTTT
ATATAAGTTTGCTGCTGTGTTCGAATCACAGATGACAGTGCCTCCGAGCCGGAATAGAGATCACGGCATCGAGCTGGAGCCAGGGGTCGGCGCCGTCAATGTTTGA
mRNA sequence
ATGCCTCCTTGCAACGAGTGTATTCCATGTTACTACCCAAGAAAGTCGAATCCAGCGACATACCTCTTCAACTGTGAACAGCGATGGTTCTTTACCATCAGTGAACTTGA
CTCAGTCAGAATACAGCAGCCGAATAATAATGCTAATGATGCTGCCGGAAGAACTCTAATCAAGGGAATCAAAATTATCAAGGAGAATCAGTGGTATCCAGATTCAGGGG
CATCAAATCATGTCACCATGATGCTTCAAATCTTTCATTTGGGACAGAGTATCAAGGAGAGAATAAAGTCCACATTGGCAATGGTGCAGGATCTTTCCAATGGCCAAATA
CTTCTCCAAGGCAAACTAACTGATGGACTCTACAGTTTCAATCTGGAAAAGGCTGATTCTCCCTGTTTTTCGTCTTTTGATGGTTTCTCTTCCAAGGCCATTACTCCACA
AGTATTGACTGCAAACTTACTTGGGCATTCTGCCATTGCTACTGTTAAGAATGTTCTTTCTCTTTGTAAAATCACTTCATCAAATAAGAATCCACATTTGTGTCATGCTT
GTTCGCAGAATGGGATAGTAGAGAGAAAACACCGTCATATAGTAGATATGGGTCTTACACTCTTATCTCATGCTTCTCTTTCTTTGGACTTCTGGGATAATGCCTTTTCC
ACTGCAGTTTACATTATCAACCGTCTTCCTACTATAGTCCATCAGCAGCCCATTGATCAGTCAAATCGTCCTTTATCACCATCCTTGGCTACTACTTCAAGCAACCCATC
TTCTTTGCCTAATTCACCCTCCCAGCATTCTGTCTCACCAAATGCTTTTCAACCTTCTTCTTCTAATATTTTGAATACCAGTCCTGCACTTGGGCTAGATGAACTTGCTG
CTACAAATGTTCATTCAAGCAAAGTCGGAATTTTCAAACCGAAAGTATATCTTACTGAATATATTGAGGTTGAACCACCAAATGTCAAAGAGGCCCTTAAGTGTGATCAT
TGGATTCAAGCAATGAAAGAGGAGTACAATGCTCTGCTACATAATGATACATGGTCCCTGGAAGAAGTCCCTCTGACAAAAGGATCATTGGCTGCAAGTGGGTTGAGGCC
TATCAATTTTGGTATCAGAGCAGTGATTTTTGGGGAATGGTGGGAAAGATGGAAGGAAGGGTGTCGCGTTGGAAGAGAAATGGCCGAGATAGCTGTTAAGCAGGTCGATT
TGGAGGTGAACTTGGGATCTCATATGGCGGAAATGGGAGAGAAAAAACGAGTGTGGAGGCAAAGCTGGACTTACAGCTCCGGAACCGCGAATGATGACGAGAAGAAGGCC
ACGAGTAGTGACGGAATCCAGGCTCCTAGTAGTCGTGAAGTGCCCCTGTTCGACATGCGCCTAAGGAAGTTAGAGGTGCCCATATTTAAGGGGGAAGATGAGGAAGACCC
AGATGGTTGGTTGCATCGGGTGGAGCGAAGAGAGAACATCGATGAGCACTTGGAAGGAATTCCGAAGGTTATTGTTGGAACGATTTCGACCGACGTCCCAAGGAGATCGG
TATGCTCGTTTAATGAAGTTGCAACAGGAAACCACCGTGAGGGAATATCGCCGGCGTTTGAGCAATACGCGGCGACACTCAAGGACGTGAGTGACGGCGTGCTCGAGAGT
AAGTTCGAATGTGGGTTGAAGGAGGAGATCCAAAGTGAGATGAGGAAGTTTCAGCCCGTGGGCCTGAAGGCGAAGATGTTGATGGCCCAATTGATTGAAGATGACAACGT
CGTCCAAGAAAAGAAAAGAACGGGAAAAGCCCAAGGCCAAAACGCAAGCCCAAAAAATAACACAAACTCGAATGGGGCAAGCGGTGGATCCAGTAGTCTGGCGGGTCAAT
GCGACTGGGTCGTCAACAACGGTGACTCCAATGCGAGACGCGGCAGTAAAGGGGTCACAAATCCGATGCTTAAACGAGTGACGGATAATGAATTGCAAAGGAAGAGGGAG
AAGGGATTATGTTACAAATGTGATGAAAAGTGGAACCCGGGTCCTCGATGTAAAAAGAAGGAATTACAAATCATGGTAATTCAAGGTGAGGAAGATGAGGACCGAGAGGG
AACAGACCAGATGGAGGAAGTCTGTGAACCGGAGCCTGTCACGAATGAGGATGTTCAGGCCAACGAATCTGAAGTAGCAGCCTTGTCTGTAAATTCTATGGTTGGAATTG
ATTCACCAAAGACGATGAAAGTCAAGGGAACAATCCAAGGGCAAGGAGTCGTAGTGTTGATCGACAGCGGGGCTTCGCACAATTTTATTTCAACGGAGGTCGTTCAGAAA
TTAGGGCTCACGGTGTCGCCATCCGCCAGCTATGGCATTGTTTTGGGCACTGGAGGTCCAGTTTCGGGAGCGGGGGTGTGCAAAGGTGTGATCCTCACTATCTCTGATTT
AACTATTATTCACGATTTCCTTCCTCTGCCCTTGGGCAGTGCAGATGTAATTTTGGACATCACATGGTTGGAAACGCTGGGAAAAATACAATTCGATTACCGGCTGTCGG
AAATGGATTTTCAAATTGGGAGTTGGACGGTTCAGCTACGAGGGGATCGTAGTCTGGTAAAATCCCAGGTCTCTCTCAAGTCGATGATGAAATCTTTTGGGATGGGGGAT
CAAGGCGTATTAATCAAGTTGAGCAATTTAGAGTCAAACAATTACAAAGAAAGTGAGCCGTTACCACTGAAGGTGAGTGAAGGAGTTCATTCCGAATTGAAAGAGGTTTT
ATATAAGTTTGCTGCTGTGTTCGAATCACAGATGACAGTGCCTCCGAGCCGGAATAGAGATCACGGCATCGAGCTGGAGCCAGGGGTCGGCGCCGTCAATGTTTGA
Protein sequence
MPPCNECIPCYYPRKSNPATYLFNCEQRWFFTISELDSVRIQQPNNNANDAAGRTLIKGIKIIKENQWYPDSGASNHVTMMLQIFHLGQSIKERIKSTLAMVQDLSNGQI
LLQGKLTDGLYSFNLEKADSPCFSSFDGFSSKAITPQVLTANLLGHSAIATVKNVLSLCKITSSNKNPHLCHACSQNGIVERKHRHIVDMGLTLLSHASLSLDFWDNAFS
TAVYIINRLPTIVHQQPIDQSNRPLSPSLATTSSNPSSLPNSPSQHSVSPNAFQPSSSNILNTSPALGLDELAATNVHSSKVGIFKPKVYLTEYIEVEPPNVKEALKCDH
WIQAMKEEYNALLHNDTWSLEEVPLTKGSLAASGLRPINFGIRAVIFGEWWERWKEGCRVGREMAEIAVKQVDLEVNLGSHMAEMGEKKRVWRQSWTYSSGTANDDEKKA
TSSDGIQAPSSREVPLFDMRLRKLEVPIFKGEDEEDPDGWLHRVERRENIDEHLEGIPKVIVGTISTDVPRRSVCSFNEVATGNHREGISPAFEQYAATLKDVSDGVLES
KFECGLKEEIQSEMRKFQPVGLKAKMLMAQLIEDDNVVQEKKRTGKAQGQNASPKNNTNSNGASGGSSSLAGQCDWVVNNGDSNARRGSKGVTNPMLKRVTDNELQRKRE
KGLCYKCDEKWNPGPRCKKKELQIMVIQGEEDEDREGTDQMEEVCEPEPVTNEDVQANESEVAALSVNSMVGIDSPKTMKVKGTIQGQGVVVLIDSGASHNFISTEVVQK
LGLTVSPSASYGIVLGTGGPVSGAGVCKGVILTISDLTIIHDFLPLPLGSADVILDITWLETLGKIQFDYRLSEMDFQIGSWTVQLRGDRSLVKSQVSLKSMMKSFGMGD
QGVLIKLSNLESNNYKESEPLPLKVSEGVHSELKEVLYKFAAVFESQMTVPPSRNRDHGIELEPGVGAVNV