CuGenDBv2

Lag0019852 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0019852
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr5:46093685..46096518
RNA-Seq Expression: Lag0019852
Synteny: Lag0019852
Gene Ontology terms:
  GO:0090304 - nucleic acid metabolic process (biological process)
  GO:0110165 - cellular anatomical structure (cellular component)
  GO:0003824 - catalytic activity (molecular function)
InterPro domains:
  IPR005135 - Endonuclease/exonuclease/phosphatase
  IPR025558 - Domain of unknown function DUF4283
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
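
The genome location string above can be split into a chromosome and a coordinate range. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this record does not state explicitly); the function name `parse_location` is hypothetical and not part of the database.

```python
import re


def parse_location(loc: str):
    """Split a 'chrom:start..end' string into its parts and the span length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    chrom, start, end, span = parse_location("chr5:46093685..46096518")
    print(chrom, start, end, span)
```

Under that assumption the gene spans 2834 bp on chr5.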


Homology
GenBank top hits | e value | % identity | Alignment
KAA0056838.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 1.6e-83 | 27.38
Query:  LRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIK---GWYNVGKYQVRFLPWSAENMHC--KPVPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIE
        LRK      S    Q DKA+L   + D A+ L + K   GW  VG YQV+F  W + N+H     +PSYGGW++ R +PL  W+ +TF+ IG  CGG+++
Subjt:  LRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIK---GWYNVGKYQVRFLPWSAENMHC--KPVPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIE

Query:  TASKTLSRMDMMEIGIKVQENDTGFIPGEIPI-PSSSHSPVVVRIDPF---FVEDHNIGYKASIHGKIPASPLHRDNSRATIAEVKT-NGCDNQGPRACD
         A +T+    +++  IKV+ N TGF+P  I I  +   + +V  + P    ++ + N+     +HG          +    +AE  T NG     P    
Subjt:  TASKTLSRMDMMEIGIKVQENDTGFIPGEIPI-PSSSHSPVVVRIDPF---FVEDHNIGYKASIHGKIPASPLHRDNSRATIAEVKT-NGCDNQGPRACD

Query:  FTRPCTKKENPPLSYRCDNQSAPNDVLFNSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTSQPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPTHQSPD
             T+       +  D  S          + + ++Y      LS    E    K+      Q+             H        +   +K +  SP 
Subjt:  FTRPCTKKENPPLSYRCDNQSAPNDVLFNSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTSQPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPTHQSPD

Query:  PLSYHNQPHTHRPGPTTFDPSTAVQIGRKK---PIMINNKETFLLTGTISSTNTEFHLSDSE-GVLSSPCTTVMDVSPTQSPQKAIIPAASPPSIRNLFE
         +  ++          + + ST      K+         K T+ +      +  + +LS  E G  S      +D+ P  SP +++I + +   +  L  
Subjt:  PLSYHNQPHTHRPGPTTFDPSTAVQIGRKK---PIMINNKETFLLTGTISSTNTEFHLSDSE-GVLSSPCTTVMDVSPTQSPQKAIIPAASPPSIRNLFE

Query:  SQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDLDEDTDEKDPAVFLPYLFPWLAEHGMCIMPMPSRQKITTAAKKKIKWVNELN
           + +    +       E  +L +      ++         +   KD    ++ +    F   L  WL E+ + + P               K+ N++ 
Subjt:  SQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDLDEDTDEKDPAVFLPYLFPWLAEHGMCIMPMPSRQKITTAAKKKIKWVNELN

Query:  NLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYG
        +             +S  P +V                   S +++  A     G  GGI +LW++  F V ++  G +S+++++   +G ++W+T VYG
Subjt:  NLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYG

Query:  PNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFD
        P  Y DR   W ELE LQ+LC PNW++ GDFNI+RW  E +  +   R M  FN FI   +L D PL N  FTWS+ R NPT + ++RFL+S+     F 
Subjt:  PNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFD

Query:  YVTARKLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDI
          T+R LER  SDHFPI L   +  WGP PF+  N  L    F     +WW ++  +G+P + FIQ L  L + +K+W  N        + +L +E+  I
Subjt:  YVTARKLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDI

Query:  DNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKLYSKST
        D  E +GE+S     +R  +K+ L S+  N   +W QR + +W L GD N ++FHRIC  N+RK  I  I   +G+SL +  DI   F+S F+ +Y+K  
Subjt:  DNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKLYSKST

Query:  TE
         E
Subjt:  TE

RVX12042.1 Splicing factor 3A subunit 2 [Vitis vinifera] | 2.4e-82 | 39.95
Query:  KRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYGPNSYR
        KR  ++  +++ NP +V+LQETK    D++++ S+W  +S+ W A+ A G+SGGI ILW+   F+  E + G FS+TV L+  +  SFW+T VYGPN   
Subjt:  KRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYGPNSYR

Query:  DRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFDYVTAR
         R+ FW EL+DL  L  P W VGGDFN+IR   EK  ++     MR F+ FI E  L D PL N  FTWS+ + +P    ++RFL S +  + F      
Subjt:  DRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFDYVTAR

Query:  KLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDIDNRED
         L R TSDH PI L     +WGP PF+F N+WL H  F      WW+     GW  H F++KLK +K  LK+WN  VFG+ +E++  +  +L  ID  E 
Subjt:  KLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDIDNRED

Query:  RGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKLYSK
         G L+ +  + R   + +L  L + +E+ W+Q+ ++KW  EGD N+ FFHR+    R +  I  +++  G +L N   I  E V+FF  LYSK
Subjt:  RGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKLYSK

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 2.7e-86 | 28.18
Query:  LRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIKGWYNVGKYQVRFLPWSAENMHCKP--VPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIETAS
        LRK      + ++   +KAL+       A  L   KGW  VGKY VRF  WS    H  P  +PSYGGW   R +PL  W++ TF+ IG  C G I+ A 
Subjt:  LRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIKGWYNVGKYQVRFLPWSAENMHCKP--VPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIETAS

Query:  KTLSRMDMMEIGIKVQENDTGFIPGEIPIPSSSHSPVVVRIDPFFVEDHNIGYKASIHGKIP-ASPLHRDNSRATIAEVKTNGCDNQGPRACDFTRPCTK
        +T S  +++E  IKV+ N +GF+P  + I  +  +   V++         I     +HG     +    D+      +    G +   P     +    K
Subjt:  KTLSRMDMMEIGIKVQENDTGFIPGEIPIPSSSHSPVVVRIDPFFVEDHNIGYKASIHGKIP-ASPLHRDNSRATIAEVKTNGCDNQGPRACDFTRPCTK

Query:  KENPPLSYRCDNQSAPNDVLF--NSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTS----------QPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPT
           P      D  SA   V+   +     P+   E   + S   A  N  K    S            Q +   + P S  ++   ++      P  K  
Subjt:  KENPPLSYRCDNQSAPNDVLF--NSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTS----------QPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPT

Query:  HQSPDPLSYHNQPHTHRPGPTTFDPSTAVQIGRKKPIMINNKETFLLTGTISSTNTEFHLSDSEGV-LSSPCTTVM-DVSPTQSPQKAIIPAASPPSIRN
          +PD     + P  H   P+   P    ++ R++ I   +          SST      + ++GV ++ P   V  D    +      +     P++  
Subjt:  HQSPDPLSYHNQPHTHRPGPTTFDPSTAVQIGRKKPIMINNKETFLLTGTISSTNTEFHLSDSEGV-LSSPCTTVM-DVSPTQSPQKAIIPAASPPSIRN

Query:  LFESQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDL--------DEDTDEKDP--AVFLPYLFPWLAEHGMCIMPMPSRQKITT
            +   +    E   +   E +    +  + + E +    +    + K +         ++  EKDP    F   L  WL ++G+ +         TT
Subjt:  LFESQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDL--------DEDTDEKDP--AVFLPYLFPWLAEHGMCIMPMPSRQKITT

Query:  AAKKKIKWVNELNNLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSL
        +    +   N++N                          + L   +++IIKSLW S SI W A +ASGSSGGI ILW+  + S+L   EG+FSL+ +  L
Subjt:  AAKKKIKWVNELNNLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSL

Query:  ADGYSFWVTGVYGPNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLIN
         +  S+W+TG+YGP   R+R  FW EL +LQ L    WI+GGD N+IR   E ++  + +   R  N FI    L D PLTN +FTWS+ R  PT + I+
Subjt:  ADGYSFWVTGVYGPNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLIN

Query:  RFLISEQISTKFDYVTARKLERVTSDHFPISL--SLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGN
        RFL +      F   T R L R TSDHFP+    S  K  WGP+PF+  ++ L+   F   +  WW+++  +G+P   FIQ+LK L   +K W +    +
Subjt:  RFLISEQISTKFDYVTARKLERVTSDHFPISL--SLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGN

Query:  QKEQRVSLERELTDIDNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIE
            + ++ RE+  ID +E    L++++  RR  +KA L  L++ +   W QR K  W  EGD N++FFHRIC++ +++  I+EI    GS    +  I 
Subjt:  QKEQRVSLERELTDIDNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIE

Query:  LEFVSFFKKLYSKSTTEPP
          F+ FF ++Y  ST   P
Subjt:  LEFVSFFKKLYSKSTTEPP

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 5.3e-82 | 27.27
Query:  LRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIK---GWYNVGKYQVRFLPWSAENMHC--KPVPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIE
        LRK      S    Q DKA+L   + D A+ L + K   GW  VG YQV+F  W + N+H     +PSYGGW++ R +PL  W+ +TF+ IG  CGG+++
Subjt:  LRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIK---GWYNVGKYQVRFLPWSAENMHC--KPVPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIE

Query:  TASKTLSRMDMMEIGIKVQENDTGFIPGEIPI-PSSSHSPVVVRIDPF---FVEDHNIGYKASIHGKIPASPLHRDNSRATIAEVKT-NGCDNQGPRACD
         A +T+    +++  IKV+ N  GF+P  I I  +   + +V  + P    ++ + N+     +HG          +    +AE  T NG     P    
Subjt:  TASKTLSRMDMMEIGIKVQENDTGFIPGEIPI-PSSSHSPVVVRIDPF---FVEDHNIGYKASIHGKIPASPLHRDNSRATIAEVKT-NGCDNQGPRACD

Query:  FTRPCTKKENPPLSYRCDNQSAPNDVLFNSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTSQPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPTHQSPD
             T+       +  D  S          + + ++Y      LS    E          Q                H        +   +K +  SP 
Subjt:  FTRPCTKKENPPLSYRCDNQSAPNDVLFNSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTSQPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPTHQSPD

Query:  PLSYHNQPHTHRPGPTTFDPSTAVQIGRKK---PIMINNKETFLLTGTISSTNTEFHLSDSE-GVLSSPCTTVMDVSPTQSPQKAIIPAASPPSIRNLFE
         +  ++          + + ST      K+         K T+ +      +  +  LS  E G  S      +D+ P  SP +++I + +   + + F 
Subjt:  PLSYHNQPHTHRPGPTTFDPSTAVQIGRKK---PIMINNKETFLLTGTISSTNTEFHLSDSE-GVLSSPCTTVMDVSPTQSPQKAIIPAASPPSIRNLFE

Query:  SQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDLDEDTDEKDPAVFLPYLFPWLAEHGMCIMPMPSRQKITTAAKKKIKWVNELN
        +Q        + T     + + + ++   +  ++A       +   KD    ++ +    F   L  WL E+ + + P               K+ N++ 
Subjt:  SQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDLDEDTDEKDPAVFLPYLFPWLAEHGMCIMPMPSRQKITTAAKKKIKWVNELN

Query:  NLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYG
        +             +S+ P +V                   S +++  A     G  GGI +LW++ +F V ++  G +S+++++   +G ++W+T VYG
Subjt:  NLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYG

Query:  PNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFD
        P  Y DR   W ELE LQ+LC PNW++ GDFNI+RW  E +  +   R M  FN FI   +L D P  N  FTWS+ R NPT + ++RFL+S+     F 
Subjt:  PNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFD

Query:  YVTARKLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDI
          T+R LER  SDHFPI L   +  WGP PF+  N  L    F     +WW S+  +G+P + FIQ L  L + +K+W  N        + +L +E+  I
Subjt:  YVTARKLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDI

Query:  DNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKLYSKST
        D  E +GE+S     +R  +K+ L S+  N   +W QR + +W L GD N ++FHRIC  N+RK  I  I   +G+SL +  DI   F+S F+ +Y+K +
Subjt:  DNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKLYSKST

Query:  TE
         E
Subjt:  TE

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia] | 6.4e-96 | 45.34
Query:  KRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYGPNSYR
        K A IK  I+  NP++VILQETKL+ +D  I+KSLWS+  I W+A+DASG + GI ILWN+      E+IEG+FSLT++  L+DG+ FWV+G+YGP++  
Subjt:  KRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYGPNSYR

Query:  DRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFDYVTAR
           +FW+EL DL  LC+ +WI+ GDFN+ RW+WEKS      ++M  FN FIE+  L D+PLTNG+ TWS    N + +LI+ FL++     K     A+
Subjt:  DRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFDYVTAR

Query:  KLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDIDNRED
        ++ R TSDHFPI L  G+N WG  PF+F N+WL+H +F   +E+WW + P  GWP HG + KLK LK  +K W    F     Q+  L   +  +D+ E 
Subjt:  KLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDIDNRED

Query:  RGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSF
           ++      R   K  L S+   +E  W+QRCK KW  EGD NT FFHR  A  RR+  I EIL+  G  LT   DIE EF+ F
Subjt:  RGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSF

TrEMBL top hits | e value | % identity | Alignment
A0A438JSU9 Splicing factor 3A subunit 2 | 1.1e-82 | 39.95
Query:  KRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYGPNSYR
        KR  ++  +++ NP +V+LQETK    D++++ S+W  +S+ W A+ A G+SGGI ILW+   F+  E + G FS+TV L+  +  SFW+T VYGPN   
Subjt:  KRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYGPNSYR

Query:  DRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFDYVTAR
         R+ FW EL+DL  L  P W VGGDFN+IR   EK  ++     MR F+ FI E  L D PL N  FTWS+ + +P    ++RFL S +  + F      
Subjt:  DRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFDYVTAR

Query:  KLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDIDNRED
         L R TSDH PI L     +WGP PF+F N+WL H  F      WW+     GW  H F++KLK +K  LK+WN  VFG+ +E++  +  +L  ID  E 
Subjt:  KLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDIDNRED

Query:  RGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKLYSK
         G L+ +  + R   + +L  L + +E+ W+Q+ ++KW  EGD N+ FFHR+    R +  I  +++  G +L N   I  E V+FF  LYSK
Subjt:  RGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKLYSK

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein | 2.6e-82 | 27.27
Query:  LRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIK---GWYNVGKYQVRFLPWSAENMHC--KPVPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIE
        LRK      S    Q DKA+L   + D A+ L + K   GW  VG YQV+F  W + N+H     +PSYGGW++ R +PL  W+ +TF+ IG  CGG+++
Subjt:  LRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIK---GWYNVGKYQVRFLPWSAENMHC--KPVPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIE

Query:  TASKTLSRMDMMEIGIKVQENDTGFIPGEIPI-PSSSHSPVVVRIDPF---FVEDHNIGYKASIHGKIPASPLHRDNSRATIAEVKT-NGCDNQGPRACD
         A +T+    +++  IKV+ N  GF+P  I I  +   + +V  + P    ++ + N+     +HG          +    +AE  T NG     P    
Subjt:  TASKTLSRMDMMEIGIKVQENDTGFIPGEIPI-PSSSHSPVVVRIDPF---FVEDHNIGYKASIHGKIPASPLHRDNSRATIAEVKT-NGCDNQGPRACD

Query:  FTRPCTKKENPPLSYRCDNQSAPNDVLFNSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTSQPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPTHQSPD
             T+       +  D  S          + + ++Y      LS    E          Q                H        +   +K +  SP 
Subjt:  FTRPCTKKENPPLSYRCDNQSAPNDVLFNSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTSQPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPTHQSPD

Query:  PLSYHNQPHTHRPGPTTFDPSTAVQIGRKK---PIMINNKETFLLTGTISSTNTEFHLSDSE-GVLSSPCTTVMDVSPTQSPQKAIIPAASPPSIRNLFE
         +  ++          + + ST      K+         K T+ +      +  +  LS  E G  S      +D+ P  SP +++I + +   + + F 
Subjt:  PLSYHNQPHTHRPGPTTFDPSTAVQIGRKK---PIMINNKETFLLTGTISSTNTEFHLSDSE-GVLSSPCTTVMDVSPTQSPQKAIIPAASPPSIRNLFE

Query:  SQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDLDEDTDEKDPAVFLPYLFPWLAEHGMCIMPMPSRQKITTAAKKKIKWVNELN
        +Q        + T     + + + ++   +  ++A       +   KD    ++ +    F   L  WL E+ + + P               K+ N++ 
Subjt:  SQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDLDEDTDEKDPAVFLPYLFPWLAEHGMCIMPMPSRQKITTAAKKKIKWVNELN

Query:  NLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYG
        +             +S+ P +V                   S +++  A     G  GGI +LW++ +F V ++  G +S+++++   +G ++W+T VYG
Subjt:  NLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYG

Query:  PNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFD
        P  Y DR   W ELE LQ+LC PNW++ GDFNI+RW  E +  +   R M  FN FI   +L D P  N  FTWS+ R NPT + ++RFL+S+     F 
Subjt:  PNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFD

Query:  YVTARKLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDI
          T+R LER  SDHFPI L   +  WGP PF+  N  L    F     +WW S+  +G+P + FIQ L  L + +K+W  N        + +L +E+  I
Subjt:  YVTARKLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDI

Query:  DNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKLYSKST
        D  E +GE+S     +R  +K+ L S+  N   +W QR + +W L GD N ++FHRIC  N+RK  I  I   +G+SL +  DI   F+S F+ +Y+K +
Subjt:  DNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKLYSKST

Query:  TE
         E
Subjt:  TE

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein | 7.9e-84 | 27.38
Query:  LRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIK---GWYNVGKYQVRFLPWSAENMHC--KPVPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIE
        LRK      S    Q DKA+L   + D A+ L + K   GW  VG YQV+F  W + N+H     +PSYGGW++ R +PL  W+ +TF+ IG  CGG+++
Subjt:  LRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIK---GWYNVGKYQVRFLPWSAENMHC--KPVPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIE

Query:  TASKTLSRMDMMEIGIKVQENDTGFIPGEIPI-PSSSHSPVVVRIDPF---FVEDHNIGYKASIHGKIPASPLHRDNSRATIAEVKT-NGCDNQGPRACD
         A +T+    +++  IKV+ N TGF+P  I I  +   + +V  + P    ++ + N+     +HG          +    +AE  T NG     P    
Subjt:  TASKTLSRMDMMEIGIKVQENDTGFIPGEIPI-PSSSHSPVVVRIDPF---FVEDHNIGYKASIHGKIPASPLHRDNSRATIAEVKT-NGCDNQGPRACD

Query:  FTRPCTKKENPPLSYRCDNQSAPNDVLFNSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTSQPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPTHQSPD
             T+       +  D  S          + + ++Y      LS    E    K+      Q+             H        +   +K +  SP 
Subjt:  FTRPCTKKENPPLSYRCDNQSAPNDVLFNSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTSQPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPTHQSPD

Query:  PLSYHNQPHTHRPGPTTFDPSTAVQIGRKK---PIMINNKETFLLTGTISSTNTEFHLSDSE-GVLSSPCTTVMDVSPTQSPQKAIIPAASPPSIRNLFE
         +  ++          + + ST      K+         K T+ +      +  + +LS  E G  S      +D+ P  SP +++I + +   +  L  
Subjt:  PLSYHNQPHTHRPGPTTFDPSTAVQIGRKK---PIMINNKETFLLTGTISSTNTEFHLSDSE-GVLSSPCTTVMDVSPTQSPQKAIIPAASPPSIRNLFE

Query:  SQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDLDEDTDEKDPAVFLPYLFPWLAEHGMCIMPMPSRQKITTAAKKKIKWVNELN
           + +    +       E  +L +      ++         +   KD    ++ +    F   L  WL E+ + + P               K+ N++ 
Subjt:  SQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDLDEDTDEKDPAVFLPYLFPWLAEHGMCIMPMPSRQKITTAAKKKIKWVNELN

Query:  NLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYG
        +             +S  P +V                   S +++  A     G  GGI +LW++  F V ++  G +S+++++   +G ++W+T VYG
Subjt:  NLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYG

Query:  PNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFD
        P  Y DR   W ELE LQ+LC PNW++ GDFNI+RW  E +  +   R M  FN FI   +L D PL N  FTWS+ R NPT + ++RFL+S+     F 
Subjt:  PNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFD

Query:  YVTARKLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDI
          T+R LER  SDHFPI L   +  WGP PF+  N  L    F     +WW ++  +G+P + FIQ L  L + +K+W  N        + +L +E+  I
Subjt:  YVTARKLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDI

Query:  DNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKLYSKST
        D  E +GE+S     +R  +K+ L S+  N   +W QR + +W L GD N ++FHRIC  N+RK  I  I   +G+SL +  DI   F+S F+ +Y+K  
Subjt:  DNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKLYSKST

Query:  TE
         E
Subjt:  TE

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein | 1.3e-86 | 28.18
Query:  LRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIKGWYNVGKYQVRFLPWSAENMHCKP--VPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIETAS
        LRK      + ++   +KAL+       A  L   KGW  VGKY VRF  WS    H  P  +PSYGGW   R +PL  W++ TF+ IG  C G I+ A 
Subjt:  LRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIKGWYNVGKYQVRFLPWSAENMHCKP--VPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIETAS

Query:  KTLSRMDMMEIGIKVQENDTGFIPGEIPIPSSSHSPVVVRIDPFFVEDHNIGYKASIHGKIP-ASPLHRDNSRATIAEVKTNGCDNQGPRACDFTRPCTK
        +T S  +++E  IKV+ N +GF+P  + I  +  +   V++         I     +HG     +    D+      +    G +   P     +    K
Subjt:  KTLSRMDMMEIGIKVQENDTGFIPGEIPIPSSSHSPVVVRIDPFFVEDHNIGYKASIHGKIP-ASPLHRDNSRATIAEVKTNGCDNQGPRACDFTRPCTK

Query:  KENPPLSYRCDNQSAPNDVLF--NSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTS----------QPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPT
           P      D  SA   V+   +     P+   E   + S   A  N  K    S            Q +   + P S  ++   ++      P  K  
Subjt:  KENPPLSYRCDNQSAPNDVLF--NSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTS----------QPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPT

Query:  HQSPDPLSYHNQPHTHRPGPTTFDPSTAVQIGRKKPIMINNKETFLLTGTISSTNTEFHLSDSEGV-LSSPCTTVM-DVSPTQSPQKAIIPAASPPSIRN
          +PD     + P  H   P+   P    ++ R++ I   +          SST      + ++GV ++ P   V  D    +      +     P++  
Subjt:  HQSPDPLSYHNQPHTHRPGPTTFDPSTAVQIGRKKPIMINNKETFLLTGTISSTNTEFHLSDSEGV-LSSPCTTVM-DVSPTQSPQKAIIPAASPPSIRN

Query:  LFESQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDL--------DEDTDEKDP--AVFLPYLFPWLAEHGMCIMPMPSRQKITT
            +   +    E   +   E +    +  + + E +    +    + K +         ++  EKDP    F   L  WL ++G+ +         TT
Subjt:  LFESQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDL--------DEDTDEKDP--AVFLPYLFPWLAEHGMCIMPMPSRQKITT

Query:  AAKKKIKWVNELNNLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSL
        +    +   N++N                          + L   +++IIKSLW S SI W A +ASGSSGGI ILW+  + S+L   EG+FSL+ +  L
Subjt:  AAKKKIKWVNELNNLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSL

Query:  ADGYSFWVTGVYGPNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLIN
         +  S+W+TG+YGP   R+R  FW EL +LQ L    WI+GGD N+IR   E ++  + +   R  N FI    L D PLTN +FTWS+ R  PT + I+
Subjt:  ADGYSFWVTGVYGPNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLIN

Query:  RFLISEQISTKFDYVTARKLERVTSDHFPISL--SLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGN
        RFL +      F   T R L R TSDHFP+    S  K  WGP+PF+  ++ L+   F   +  WW+++  +G+P   FIQ+LK L   +K W +    +
Subjt:  RFLISEQISTKFDYVTARKLERVTSDHFPISL--SLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGN

Query:  QKEQRVSLERELTDIDNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIE
            + ++ RE+  ID +E    L++++  RR  +KA L  L++ +   W QR K  W  EGD N++FFHRIC++ +++  I+EI    GS    +  I 
Subjt:  QKEQRVSLERELTDIDNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIE

Query:  LEFVSFFKKLYSKSTTEPP
          F+ FF ++Y  ST   P
Subjt:  LEFVSFFKKLYSKSTTEPP

A0A6J1E2G6 uncharacterized protein LOC111025405 | 3.1e-96 | 45.34
Query:  KRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYGPNSYR
        K A IK  I+  NP++VILQETKL+ +D  I+KSLWS+  I W+A+DASG + GI ILWN+      E+IEG+FSLT++  L+DG+ FWV+G+YGP++  
Subjt:  KRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIAILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYGPNSYR

Query:  DRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFDYVTAR
           +FW+EL DL  LC+ +WI+ GDFN+ RW+WEKS      ++M  FN FIE+  L D+PLTNG+ TWS    N + +LI+ FL++     K     A+
Subjt:  DRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFRPNPTMTLINRFLISEQISTKFDYVTAR

Query:  KLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDIDNRED
        ++ R TSDHFPI L  G+N WG  PF+F N+WL+H +F   +E+WW + P  GWP HG + KLK LK  +K W    F     Q+  L   +  +D+ E 
Subjt:  KLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGNQKEQRVSLERELTDIDNRED

Query:  RGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSF
           ++      R   K  L S+   +E  W+QRCK KW  EGD NT FFHR  A  RR+  I EIL+  G  LT   DIE EF+ F
Subjt:  RGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSF

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 1.1e-13 | 26.07
Query:  IVGGDFNIIRWT---WEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFR-PNPTMTLINRFLISEQISTKFDYVTARKLERVTSDHFPISLSL
        I+ GDF+ I  T   +     + P R +  F   + + DL DIP     +TWS+ +  NP +  ++R + +    + F    A       SDH P  + L
Subjt:  IVGGDFNIIRWT---WEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGKFTWSSFR-PNPTMTLINRFLISEQISTKFDYVTARKLERVTSDHFPISLSL

Query:  GKNL--WGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGN----QKEQRVSLERELTDIDNREDRGELSKQDFT
         +NL       F++ +   TH +F  ++   W+     G       + LK  K+  K  N+  FGN     KE   SLE   + +           +   
Subjt:  GKNL--WGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQNVFGN----QKEQRVSLERELTDIDNREDRGELSKQDFT

Query:  RRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKL
        R+   K   F+ A+  E  ++Q+ ++KW  +GD NT FFH++  AN+ K  I          L  D D+ +E V+  K++
Subjt:  RRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSFFKKL
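
The % identity column in the hit tables can be related to the alignment blocks through the middle line of each block, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a rough sketch of that calculation for a single block, assuming the denominator is the aligned length including gaps; the exact convention used by the database is not stated here, and the function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate % identity for one alignment block.

    query   : the aligned query row (gaps shown as '-')
    midline : the row printed between query and subject; letters mark
              identical positions, '+' marks similar residues, and
              spaces mark mismatches or gaps.

    Assumption: identity = identical positions / aligned length,
    where aligned length includes gap columns.
    """
    aligned_length = len(query)
    if aligned_length == 0:
        return 0.0
    # Trailing spaces of the midline are often trimmed, so only the
    # columns that are present are inspected.
    identical = sum(1 for ch in midline[:aligned_length] if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


if __name__ == "__main__":
    # Toy block, not taken from the record above.
    print(percent_identity("LRKDLS", "LRK  S"))
```

The reported value for a hit covers the whole alignment, so per-block counts would need to be summed before dividing.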


Sequences
CDS sequence
ATGCGAGTTTTACGGAAAGACCTATCCGCCTTCTCATCTGTCAGCTCCATTCAGCCAGACAAAGCGCTTCTTGCTTGCGAAGATGAAGATCAAGCACGTACTTTAGCCAA
CATCAAAGGGTGGTATAACGTTGGGAAATATCAGGTTCGTTTTTTACCATGGAGTGCGGAAAATATGCATTGCAAGCCTGTTCCATCTTATGGGGGATGGATTAAAGTCA
GAAACCTTCCCTTGGATAAATGGTCCCTCGATACCTTCAAGCTTATTGGTGATGAATGTGGAGGATATATCGAAACAGCTAGCAAAACCCTCTCCCGAATGGATATGATG
GAAATAGGTATCAAAGTTCAGGAAAACGATACTGGGTTCATCCCGGGAGAAATCCCCATCCCATCGTCGTCCCACAGCCCCGTTGTGGTTAGAATAGACCCGTTCTTTGT
AGAAGACCACAACATAGGATACAAAGCAAGTATCCATGGAAAGATTCCGGCCTCACCGTTGCATAGGGACAATTCACGCGCCACCATCGCCGAAGTAAAGACAAACGGAT
GCGACAACCAAGGTCCACGCGCCTGCGATTTCACAAGGCCATGTACAAAAAAGGAAAATCCACCTTTATCGTACAGATGCGATAACCAAAGCGCCCCCAATGATGTGCTC
TTCAACTCTTTGGATGTGACCCCTGCTGATTATCCAGAGGCGGGCCCACATTTATCCCCATCTTTCGCCGAACCCAATTCCCCAAAATCCTCCAGTACAAGCCAACCTCA
ATCCATAGCCCCCAACATAGACCCAAAAAGTCAGCCGTCTATCCACGCTAGACGACAAACCACACCTTCCCAGTACCCCCCTCAAAAGCCCACTCATCAAAGCCCAGACC
CACTATCCTACCACAACCAGCCACATACTCACAGGCCCGGACCAACAACTTTTGACCCATCTACTGCAGTCCAAATAGGCCGAAAAAAACCCATCATGATTAACAATAAA
GAGACATTTCTCCTTACGGGAACAATTAGCTCCACTAATACAGAATTCCATTTATCGGATTCCGAAGGTGTTTTATCATCTCCATGTACAACAGTAATGGATGTATCCCC
AACACAGTCTCCACAAAAGGCCATTATACCAGCAGCCTCTCCCCCGTCCATTCGCAACCTTTTTGAATCCCAAGCTGAACAGCATCCATATCTAGAGGAACCCACTCCCC
TGTGTAGAGAAGAACCAATCCACCTGTGTATACAGAACCCCCTAAACTTGGAGGAAACTGCCCTCATTGAAGTAGACATTGAAGATGAAAGGGACAAGGACCTTGACGAA
GACACAGATGAAAAAGACCCAGCGGTCTTCTTGCCTTATCTCTTCCCTTGGTTGGCTGAACATGGCATGTGCATTATGCCAATGCCGAGTAGACAAAAAATCACGACTGC
TGCAAAAAAGAAGATCAAATGGGTGAACGAGCTCAATAATTTGCACTCCACAAAGAGAGCTTTCATCAAGGATCTCATCACTTCCCATAATCCCTCTTTGGTGATTCTCC
AAGAAACAAAGTTAGCATCCATCGACAGAAAGATCATTAAATCTCTTTGGAGTTCCAGAAGCATTGTTTGGGCTGCCGTTGATGCCTCTGGATCCTCGGGTGGCATAGCC
ATTTTATGGAATGAGGCATCTTTCTCTGTGTTGGAGGTGATTGAAGGTATCTTTTCTTTAACCGTGCACCTTTCTCTTGCTGACGGTTACTCTTTTTGGGTTACAGGGGT
CTATGGCCCAAATTCTTATCGGGATAGAAAGATTTTTTGGAGAGAGTTGGAGGACCTTCAAGCCCTATGTCAGCCAAACTGGATTGTGGGAGGGGACTTTAACATCATTA
GATGGACTTGGGAAAAATCCACTAACACAACTCCCAACCGAGCCATGAGGAGGTTCAACCGATTCATTGAAGAAGTTGATCTCCAGGATATCCCCCTCACTAATGGAAAA
TTTACTTGGTCTAGCTTTCGGCCCAATCCTACCATGACCCTCATCAACAGGTTTCTTATTTCTGAGCAGATATCTACCAAATTTGATTATGTTACAGCCCGAAAACTAGA
AAGAGTCACCTCCGACCACTTTCCCATCAGTTTATCATTGGGGAAAAACCTTTGGGGCCCTGTTCCTTTTAAATTCCTTAATGTTTGGCTTACCCACCACTCTTTTGCTG
CCACAGTCGAATCGTGGTGGAAATCTAATCCTTCATCTGGCTGGCCGAGACATGGGTTTATCCAAAAATTAAAGGGCCTCAAGAGAGATCTAAAGCAGTGGAATCAGAAT
GTTTTTGGAAACCAAAAGGAACAGCGTGTTAGTTTGGAGCGGGAACTCACTGACATTGATAATAGAGAAGACAGAGGTGAGCTTTCTAAGCAAGATTTTACCAGACGAAC
AGATATTAAAGCCAAATTGTTCTCTCTGGCGGTGAATGATGAAATTTTGTGGCAACAAAGATGCAAGCTCAAATGGTTTTTGGAAGGTGATGTTAACACAACCTTTTTCC
ACCGAATATGCGCTGCTAACAGAAGGAAGTGTTCCATTAATGAGATCTTAGCAGCTTCTGGAAGCAGTCTCACCAACGATGCAGATATTGAACTGGAGTTTGTATCCTTC
TTCAAAAAGCTATACTCAAAGTCCACCACAGAGCCCCCCTTCCAGCCATTGCAGAATGGAACCCCATCAGTGTTGAGCAGAGAATTGCTTTAG
mRNA sequence
ATGCGAGTTTTACGGAAAGACCTATCCGCCTTCTCATCTGTCAGCTCCATTCAGCCAGACAAAGCGCTTCTTGCTTGCGAAGATGAAGATCAAGCACGTACTTTAGCCAA
CATCAAAGGGTGGTATAACGTTGGGAAATATCAGGTTCGTTTTTTACCATGGAGTGCGGAAAATATGCATTGCAAGCCTGTTCCATCTTATGGGGGATGGATTAAAGTCA
GAAACCTTCCCTTGGATAAATGGTCCCTCGATACCTTCAAGCTTATTGGTGATGAATGTGGAGGATATATCGAAACAGCTAGCAAAACCCTCTCCCGAATGGATATGATG
GAAATAGGTATCAAAGTTCAGGAAAACGATACTGGGTTCATCCCGGGAGAAATCCCCATCCCATCGTCGTCCCACAGCCCCGTTGTGGTTAGAATAGACCCGTTCTTTGT
AGAAGACCACAACATAGGATACAAAGCAAGTATCCATGGAAAGATTCCGGCCTCACCGTTGCATAGGGACAATTCACGCGCCACCATCGCCGAAGTAAAGACAAACGGAT
GCGACAACCAAGGTCCACGCGCCTGCGATTTCACAAGGCCATGTACAAAAAAGGAAAATCCACCTTTATCGTACAGATGCGATAACCAAAGCGCCCCCAATGATGTGCTC
TTCAACTCTTTGGATGTGACCCCTGCTGATTATCCAGAGGCGGGCCCACATTTATCCCCATCTTTCGCCGAACCCAATTCCCCAAAATCCTCCAGTACAAGCCAACCTCA
ATCCATAGCCCCCAACATAGACCCAAAAAGTCAGCCGTCTATCCACGCTAGACGACAAACCACACCTTCCCAGTACCCCCCTCAAAAGCCCACTCATCAAAGCCCAGACC
CACTATCCTACCACAACCAGCCACATACTCACAGGCCCGGACCAACAACTTTTGACCCATCTACTGCAGTCCAAATAGGCCGAAAAAAACCCATCATGATTAACAATAAA
GAGACATTTCTCCTTACGGGAACAATTAGCTCCACTAATACAGAATTCCATTTATCGGATTCCGAAGGTGTTTTATCATCTCCATGTACAACAGTAATGGATGTATCCCC
AACACAGTCTCCACAAAAGGCCATTATACCAGCAGCCTCTCCCCCGTCCATTCGCAACCTTTTTGAATCCCAAGCTGAACAGCATCCATATCTAGAGGAACCCACTCCCC
TGTGTAGAGAAGAACCAATCCACCTGTGTATACAGAACCCCCTAAACTTGGAGGAAACTGCCCTCATTGAAGTAGACATTGAAGATGAAAGGGACAAGGACCTTGACGAA
GACACAGATGAAAAAGACCCAGCGGTCTTCTTGCCTTATCTCTTCCCTTGGTTGGCTGAACATGGCATGTGCATTATGCCAATGCCGAGTAGACAAAAAATCACGACTGC
TGCAAAAAAGAAGATCAAATGGGTGAACGAGCTCAATAATTTGCACTCCACAAAGAGAGCTTTCATCAAGGATCTCATCACTTCCCATAATCCCTCTTTGGTGATTCTCC
AAGAAACAAAGTTAGCATCCATCGACAGAAAGATCATTAAATCTCTTTGGAGTTCCAGAAGCATTGTTTGGGCTGCCGTTGATGCCTCTGGATCCTCGGGTGGCATAGCC
ATTTTATGGAATGAGGCATCTTTCTCTGTGTTGGAGGTGATTGAAGGTATCTTTTCTTTAACCGTGCACCTTTCTCTTGCTGACGGTTACTCTTTTTGGGTTACAGGGGT
CTATGGCCCAAATTCTTATCGGGATAGAAAGATTTTTTGGAGAGAGTTGGAGGACCTTCAAGCCCTATGTCAGCCAAACTGGATTGTGGGAGGGGACTTTAACATCATTA
GATGGACTTGGGAAAAATCCACTAACACAACTCCCAACCGAGCCATGAGGAGGTTCAACCGATTCATTGAAGAAGTTGATCTCCAGGATATCCCCCTCACTAATGGAAAA
TTTACTTGGTCTAGCTTTCGGCCCAATCCTACCATGACCCTCATCAACAGGTTTCTTATTTCTGAGCAGATATCTACCAAATTTGATTATGTTACAGCCCGAAAACTAGA
AAGAGTCACCTCCGACCACTTTCCCATCAGTTTATCATTGGGGAAAAACCTTTGGGGCCCTGTTCCTTTTAAATTCCTTAATGTTTGGCTTACCCACCACTCTTTTGCTG
CCACAGTCGAATCGTGGTGGAAATCTAATCCTTCATCTGGCTGGCCGAGACATGGGTTTATCCAAAAATTAAAGGGCCTCAAGAGAGATCTAAAGCAGTGGAATCAGAAT
GTTTTTGGAAACCAAAAGGAACAGCGTGTTAGTTTGGAGCGGGAACTCACTGACATTGATAATAGAGAAGACAGAGGTGAGCTTTCTAAGCAAGATTTTACCAGACGAAC
AGATATTAAAGCCAAATTGTTCTCTCTGGCGGTGAATGATGAAATTTTGTGGCAACAAAGATGCAAGCTCAAATGGTTTTTGGAAGGTGATGTTAACACAACCTTTTTCC
ACCGAATATGCGCTGCTAACAGAAGGAAGTGTTCCATTAATGAGATCTTAGCAGCTTCTGGAAGCAGTCTCACCAACGATGCAGATATTGAACTGGAGTTTGTATCCTTC
TTCAAAAAGCTATACTCAAAGTCCACCACAGAGCCCCCCTTCCAGCCATTGCAGAATGGAACCCCATCAGTGTTGAGCAGAGAATTGCTTTAG
Protein sequence
MRVLRKDLSAFSSVSSIQPDKALLACEDEDQARTLANIKGWYNVGKYQVRFLPWSAENMHCKPVPSYGGWIKVRNLPLDKWSLDTFKLIGDECGGYIETASKTLSRMDMM
EIGIKVQENDTGFIPGEIPIPSSSHSPVVVRIDPFFVEDHNIGYKASIHGKIPASPLHRDNSRATIAEVKTNGCDNQGPRACDFTRPCTKKENPPLSYRCDNQSAPNDVL
FNSLDVTPADYPEAGPHLSPSFAEPNSPKSSSTSQPQSIAPNIDPKSQPSIHARRQTTPSQYPPQKPTHQSPDPLSYHNQPHTHRPGPTTFDPSTAVQIGRKKPIMINNK
ETFLLTGTISSTNTEFHLSDSEGVLSSPCTTVMDVSPTQSPQKAIIPAASPPSIRNLFESQAEQHPYLEEPTPLCREEPIHLCIQNPLNLEETALIEVDIEDERDKDLDE
DTDEKDPAVFLPYLFPWLAEHGMCIMPMPSRQKITTAAKKKIKWVNELNNLHSTKRAFIKDLITSHNPSLVILQETKLASIDRKIIKSLWSSRSIVWAAVDASGSSGGIA
ILWNEASFSVLEVIEGIFSLTVHLSLADGYSFWVTGVYGPNSYRDRKIFWRELEDLQALCQPNWIVGGDFNIIRWTWEKSTNTTPNRAMRRFNRFIEEVDLQDIPLTNGK
FTWSSFRPNPTMTLINRFLISEQISTKFDYVTARKLERVTSDHFPISLSLGKNLWGPVPFKFLNVWLTHHSFAATVESWWKSNPSSGWPRHGFIQKLKGLKRDLKQWNQN
VFGNQKEQRVSLERELTDIDNREDRGELSKQDFTRRTDIKAKLFSLAVNDEILWQQRCKLKWFLEGDVNTTFFHRICAANRRKCSINEILAASGSSLTNDADIELEFVSF
FKKLYSKSTTEPPFQPLQNGTPSVLSRELL
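
The protein sequence above corresponds to a codon-by-codon translation of the CDS sequence. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; the `translate` helper is hypothetical and written for illustration, not the tool the database used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in the
# conventional T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can
    be pasted as a multi-line block. Codons containing characters
    other than A, C, G, T are translated as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the CDS shown in this record.
    print(translate("ATGCGAGTTTTACGGAAAGAC"))
```

Applied to the first 21 bases of the CDS this gives `MRVLRKD`, which matches the start of the listed protein sequence.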