CuGenDBv2

Lag0038493 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038493
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr2:18699998..18706990
RNA-Seq Expression: Lag0038493
Synteny: Lag0038493
Gene Ontology terms:
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits | e value | % identity
KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa] | 2.7e-90 | 35.74
Query:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK
        M+ TRFE+ KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE +K  M+  AY  ++L LS+ VLR V +  T   +W KL  LY  K + NK
Subjt:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK

Query:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE--------
        ++++E+FF YKMD SK L ENLD+F+++  +  N+GEK+ DEN+A +LLNSLPET++EVKAA+KYG D +T  I++ A++T++LE++ ++K+        
Subjt:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE--------

Query:  -----------------------------------------SSEEAAVGENSIT--YSDALATSDQCSNEHPTVEKYD------------WVIDSGCSFH
                                                  S EA+  E ++T  Y+ A  T D C +     E  +            W++DSGC+FH
Subjt:  -----------------------------------------SSEEAAVGENSIT--YSDALATSDQCSNEHPTVEKYD------------WVIDSGCSFH

Query:  MTPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLW---------------------------RECWDHRDQKDSRTVLIGTKINGLYV
        MTP + + + +++ DGG V +G+N TC V G GSV +   DG + +                             E    +  K S   L GT  +GLYV
Subjt:  MTPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLW---------------------------RECWDHRDQKDSRTVLIGTKINGLYV

Query:  LKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-
        L+   +  +A   +        LWHKRL+H+S +GLQ L+ QG+L       L FCE+C++GK+ R  F K + TTKGIL+YI+SDLWGP    S+ GS 
Subjt:  LKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-

Query:  ----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG
                                                      RTDNGLEF N  F+ FCK  GI RH TV +TPQQNG
Subjt:  ----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG

KAA0050070.1 putative polyprotein [Cucumis melo var. makuwa] | 1.7e-81 | 38.72
Query:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFG-IWTKLNKLYEIKDVHN
        M+ TRFE+ KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE +K  M+   Y  ++L LS+ VLR ++DE T  G +W KL  LY  K + N
Subjt:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFG-IWTKLNKLYEIKDVHN

Query:  KMFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE-------
        K++++E+FF YKMD SK L ENLD+F+++  +  N+GEK+ DEN+A +LLNSLPE ++EVKAA+KYGRD +   I++ A++T++LE++ ++K+       
Subjt:  KMFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE-------

Query:  --SSEEAAVGENSITYSDALATSDQC-----------------SNEHPTVEKYDWVIDSGCSFHMTPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVS
          S +++  G+   + S +   S +C                 S E  T E    V D   S  +T   G+ S    ++   V M ++   R++G     
Subjt:  --SSEEAAVGENSITYSDALATSDQC-----------------SNEHPTVEKYDWVIDSGCSFHMTPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVS

Query:  LKLSDGWLYLWREC---WDHRDQKDSRTVLI---GTKINGLYVLKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCE
              W+     C   +++R  K ++  L+   GT  +GLYVL+   +  +A   +   +    LWH+RL+H+S +GLQ L+ QG+L       L FCE
Subjt:  LKLSDGWLYLWREC---WDHRDQKDSRTVLI---GTKINGLYVLKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCE

Query:  YCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-----RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG
        +C++GK+ R  F K + TTKGIL+Y++SDLWGP     + G      R DNGLEF N  F+NFCK   I RH TV +TPQQNG
Subjt:  YCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-----RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] | 4.1e-91 | 35.74
Query:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK
        M+ TRFE+ KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE +K  M+  AY  ++L LS+ VLR V +  T   +W KL  LY  K + NK
Subjt:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK

Query:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE--------
        ++++E+FF YKMD SK L ENLD+F+++  +  N+GEK+ DEN+A +LLNSLPET++EVKAA+KYGRD +T  I++ A++T++LE++ ++K+        
Subjt:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE--------

Query:  -----------------------------------------SSEEAAVGENSIT--YSDALATS--DQCSNEHPTVE---------KYDWVIDSGCSFHM
                                                  S EA+  E ++T  Y+ A  T   D     + + E         +  W++DSGC+FHM
Subjt:  -----------------------------------------SSEEAAVGENSIT--YSDALATS--DQCSNEHPTVE---------KYDWVIDSGCSFHM

Query:  TPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLW---------------------------RECWDHRDQKDSRTVLIGTKINGLYVL
        TP + + + +++ DGG V +G+N TC V G GSV +   DG + +                             E    +  K S   L GT  +GLYVL
Subjt:  TPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLW---------------------------RECWDHRDQKDSRTVLIGTKINGLYVL

Query:  KNVEMIQTALSVTENSLTECD-LWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-
        +   +  +A ++    +T+   LWHKRL+H+S +GLQ L+ QG+L       L FCE+C++GK+ R  F K + TTKGIL+Y++SDLWGP    S+ GS 
Subjt:  KNVEMIQTALSVTENSLTECD-LWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-

Query:  ----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG
                                                      RTDNGLEF N  F+ FCK  GI RH TV +TPQQNG
Subjt:  ----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG

KAA0051442.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 1.3e-100 | 42.83
Query:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK
        M+ T+FEIEKFDG GDF LW  +I AILG QKAL+AL+DP +LP  +T+ ++ET+   AY  LI+N+++NVLRQVI+E T F  W KL  LYE KD+ NK
Subjt:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK

Query:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKESS------
        MF++E+ F++K + +K L ENLD+FK++T+     GEK+G ENEA +L+NS+ +T+KEVK  LKYGR+ IT + +I+ +++K+LEL+++ K S+      
Subjt:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKESS------

Query:  ------------------------------------EEAAVGENSITYSDALATSDQCSNEHPTVEKYDWVIDSGCSFHMTPSKGWFSTYREWDGGIVYM
                                            +   VG  +  Y++ LA +++ + E  T E+ D V+DSGC++HMT  K WF  Y+  +G  VYM
Subjt:  ------------------------------------EEAAVGENSITYSDALATSDQCSNEHPTVEKYDWVIDSGCSFHMTPSKGWFSTYREWDGGIVYM

Query:  GNNNTCRVVGIGSVSLKLSDGWLYLWRECWDHRDQKDSRTVLIGTKINGLYVLKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQG
        GNN  C ++G+GSV LKLS       RE      ++  R +L   K+ GLY +KNV   + AL        E +LWH+RLSHIS KGL  L  QG++   
Subjt:  GNNNTCRVVGIGSVSLKLSDGWLYLWRECWDHRDQKDSRTVLIGTKINGLYVLKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQG

Query:  VGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGSRT
          + L FCE+C+ GK+KR  F+K +  +K  L+Y++ DLWGPA T+S  GSRT
Subjt:  VGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGSRT

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] | 5.4e-91 | 35.74
Query:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK
        M+ TRFE+ KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE +K  M+  AY  ++L LS+ VLR V +  T   +W KL  LY  K + NK
Subjt:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK

Query:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE--------
        ++++E+FF YKMD SK L ENLD+F+++  +  N+GEK+ DEN+A +LLNSLPET++EVKAA+KYGRD +T  I++ A++T++LE++ ++K+        
Subjt:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE--------

Query:  -----------------------------------------SSEEAAVGENSIT--YSDALATS--DQCSNEHPTVE---------KYDWVIDSGCSFHM
                                                  S EA+  E ++T  Y+ A  T   D     + + E         +  W++DSGC+FHM
Subjt:  -----------------------------------------SSEEAAVGENSIT--YSDALATS--DQCSNEHPTVE---------KYDWVIDSGCSFHM

Query:  TPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLW---------------------------RECWDHRDQKDSRTVLIGTKINGLYVL
        TP + + + +++ DGG V +G+N TC V G GSV +   DG + +                             E    +  K S   L GT  +GLYVL
Subjt:  TPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLW---------------------------RECWDHRDQKDSRTVLIGTKINGLYVL

Query:  KNVEMIQTALSVTENSLTECD-LWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-
        +   +  +A ++    +T+   LWHKRL+H+S +GLQ L+ QG+L       L FCE+C++GK+ R  F K + TTKGIL+Y++SDLWGP    S+ GS 
Subjt:  KNVEMIQTALSVTENSLTECD-LWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-

Query:  ----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG
                                                      RTDNGLEF N  F+ FCK  GI RH TV +TPQQNG
Subjt:  ----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG

TrEMBL top hits | e value | % identity
A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class | 1.3e-90 | 35.74
Query:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK
        M+ TRFE+ KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE +K  M+  AY  ++L LS+ VLR V +  T   +W KL  LY  K + NK
Subjt:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK

Query:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE--------
        ++++E+FF YKMD SK L ENLD+F+++  +  N+GEK+ DEN+A +LLNSLPET++EVKAA+KYG D +T  I++ A++T++LE++ ++K+        
Subjt:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE--------

Query:  -----------------------------------------SSEEAAVGENSIT--YSDALATSDQCSNEHPTVEKYD------------WVIDSGCSFH
                                                  S EA+  E ++T  Y+ A  T D C +     E  +            W++DSGC+FH
Subjt:  -----------------------------------------SSEEAAVGENSIT--YSDALATSDQCSNEHPTVEKYD------------WVIDSGCSFH

Query:  MTPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLW---------------------------RECWDHRDQKDSRTVLIGTKINGLYV
        MTP + + + +++ DGG V +G+N TC V G GSV +   DG + +                             E    +  K S   L GT  +GLYV
Subjt:  MTPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLW---------------------------RECWDHRDQKDSRTVLIGTKINGLYV

Query:  LKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-
        L+   +  +A   +        LWHKRL+H+S +GLQ L+ QG+L       L FCE+C++GK+ R  F K + TTKGIL+YI+SDLWGP    S+ GS 
Subjt:  LKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-

Query:  ----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG
                                                      RTDNGLEF N  F+ FCK  GI RH TV +TPQQNG
Subjt:  ----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG

A0A5A7U459 Putative polyprotein | 8.4e-82 | 38.72
Query:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFG-IWTKLNKLYEIKDVHN
        M+ TRFE+ KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE +K  M+   Y  ++L LS+ VLR ++DE T  G +W KL  LY  K + N
Subjt:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFG-IWTKLNKLYEIKDVHN

Query:  KMFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE-------
        K++++E+FF YKMD SK L ENLD+F+++  +  N+GEK+ DEN+A +LLNSLPE ++EVKAA+KYGRD +   I++ A++T++LE++ ++K+       
Subjt:  KMFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE-------

Query:  --SSEEAAVGENSITYSDALATSDQC-----------------SNEHPTVEKYDWVIDSGCSFHMTPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVS
          S +++  G+   + S +   S +C                 S E  T E    V D   S  +T   G+ S    ++   V M ++   R++G     
Subjt:  --SSEEAAVGENSITYSDALATSDQC-----------------SNEHPTVEKYDWVIDSGCSFHMTPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVS

Query:  LKLSDGWLYLWREC---WDHRDQKDSRTVLI---GTKINGLYVLKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCE
              W+     C   +++R  K ++  L+   GT  +GLYVL+   +  +A   +   +    LWH+RL+H+S +GLQ L+ QG+L       L FCE
Subjt:  LKLSDGWLYLWREC---WDHRDQKDSRTVLI---GTKINGLYVLKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCE

Query:  YCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-----RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG
        +C++GK+ R  F K + TTKGIL+Y++SDLWGP     + G      R DNGLEF N  F+NFCK   I RH TV +TPQQNG
Subjt:  YCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-----RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG

A0A5A7U6R2 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 6.2e-101 | 42.83
Query:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK
        M+ T+FEIEKFDG GDF LW  +I AILG QKAL+AL+DP +LP  +T+ ++ET+   AY  LI+N+++NVLRQVI+E T F  W KL  LYE KD+ NK
Subjt:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK

Query:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKESS------
        MF++E+ F++K + +K L ENLD+FK++T+     GEK+G ENEA +L+NS+ +T+KEVK  LKYGR+ IT + +I+ +++K+LEL+++ K S+      
Subjt:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKESS------

Query:  ------------------------------------EEAAVGENSITYSDALATSDQCSNEHPTVEKYDWVIDSGCSFHMTPSKGWFSTYREWDGGIVYM
                                            +   VG  +  Y++ LA +++ + E  T E+ D V+DSGC++HMT  K WF  Y+  +G  VYM
Subjt:  ------------------------------------EEAAVGENSITYSDALATSDQCSNEHPTVEKYDWVIDSGCSFHMTPSKGWFSTYREWDGGIVYM

Query:  GNNNTCRVVGIGSVSLKLSDGWLYLWRECWDHRDQKDSRTVLIGTKINGLYVLKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQG
        GNN  C ++G+GSV LKLS       RE      ++  R +L   K+ GLY +KNV   + AL        E +LWH+RLSHIS KGL  L  QG++   
Subjt:  GNNNTCRVVGIGSVSLKLSDGWLYLWRECWDHRDQKDSRTVLIGTKINGLYVLKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQG

Query:  VGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGSRT
          + L FCE+C+ GK+KR  F+K +  +K  L+Y++ DLWGPA T+S  GSRT
Subjt:  VGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGSRT

A0A5A7UB25 Putative gag-pol polyprotein | 2.0e-91 | 35.74
Query:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK
        M+ TRFE+ KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE +K  M+  AY  ++L LS+ VLR V +  T   +W KL  LY  K + NK
Subjt:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK

Query:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE--------
        ++++E+FF YKMD SK L ENLD+F+++  +  N+GEK+ DEN+A +LLNSLPET++EVKAA+KYGRD +T  I++ A++T++LE++ ++K+        
Subjt:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE--------

Query:  -----------------------------------------SSEEAAVGENSIT--YSDALATS--DQCSNEHPTVE---------KYDWVIDSGCSFHM
                                                  S EA+  E ++T  Y+ A  T   D     + + E         +  W++DSGC+FHM
Subjt:  -----------------------------------------SSEEAAVGENSIT--YSDALATS--DQCSNEHPTVE---------KYDWVIDSGCSFHM

Query:  TPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLW---------------------------RECWDHRDQKDSRTVLIGTKINGLYVL
        TP + + + +++ DGG V +G+N TC V G GSV +   DG + +                             E    +  K S   L GT  +GLYVL
Subjt:  TPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLW---------------------------RECWDHRDQKDSRTVLIGTKINGLYVL

Query:  KNVEMIQTALSVTENSLTECD-LWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-
        +   +  +A ++    +T+   LWHKRL+H+S +GLQ L+ QG+L       L FCE+C++GK+ R  F K + TTKGIL+Y++SDLWGP    S+ GS 
Subjt:  KNVEMIQTALSVTENSLTECD-LWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-

Query:  ----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG
                                                      RTDNGLEF N  F+ FCK  GI RH TV +TPQQNG
Subjt:  ----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG

A0A5D3DNU1 Putative gag-pol polyprotein | 2.6e-91 | 35.74
Query:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK
        M+ TRFE+ KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE +K  M+  AY  ++L LS+ VLR V +  T   +W KL  LY  K + NK
Subjt:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK

Query:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE--------
        ++++E+FF YKMD SK L ENLD+F+++  +  N+GEK+ DEN+A +LLNSLPET++EVKAA+KYGRD +T  I++ A++T++LE++ ++K+        
Subjt:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQKKE--------

Query:  -----------------------------------------SSEEAAVGENSIT--YSDALATS--DQCSNEHPTVE---------KYDWVIDSGCSFHM
                                                  S EA+  E ++T  Y+ A  T   D     + + E         +  W++DSGC+FHM
Subjt:  -----------------------------------------SSEEAAVGENSIT--YSDALATS--DQCSNEHPTVE---------KYDWVIDSGCSFHM

Query:  TPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLW---------------------------RECWDHRDQKDSRTVLIGTKINGLYVL
        TP + + + +++ DGG V +G+N TC V G GSV +   DG + +                             E    +  K S   L GT  +GLYVL
Subjt:  TPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLW---------------------------RECWDHRDQKDSRTVLIGTKINGLYVL

Query:  KNVEMIQTALSVTENSLTECD-LWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-
        +   +  +A ++    +T+   LWHKRL+H+S +GLQ L+ QG+L       L FCE+C++GK+ R  F K + TTKGIL+Y++SDLWGP    S+ GS 
Subjt:  KNVEMIQTALSVTENSLTECD-LWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGS-

Query:  ----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG
                                                      RTDNGLEF N  F+ FCK  GI RH TV +TPQQNG
Subjt:  ----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG

SwissProt top hits | e value | % identity
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.5e-48 | 24.62
Query:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK
        MS  ++E+ KF+G   F  W+ +++ +L QQ   + L   S+ P  +  E    ++  A   + L+LS++V+  +IDE+T  GIWT+L  LY  K + NK
Subjt:  MSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEETPFGIWTKLNKLYEIKDVHNK

Query:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDII-----TTDIIISAIRTKDLELQSQ------
        ++++++ +   M        +L+ F  + ++  N+G KI +E++A +LLNSLP ++  +   + +G+  I     T+ ++++    K  E Q Q      
Subjt:  MFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDII-----TTDIIISAIRTKDLELQSQ------

Query:  ---------------------------------------------------------KKESSEEAAVGENSITYSDALATSDQCSNEHPTVEKYDWVIDS
                                                                 +K     AA+ +N+      +   ++C   H +  + +WV+D+
Subjt:  ---------------------------------------------------------KKESSEEAAVGENSITYSDALATSDQCSNEHPTVEKYDWVIDS

Query:  GCSFHMTPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLS----------------------------DGW-LYLWRECWDHRDQKDSRTVLIGT
          S H TP +  F  Y   D G V MGN +  ++ GIG + +K +                            DG+  Y   + W  R  K S  +  G 
Subjt:  GCSFHMTPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLS----------------------------DGW-LYLWRECWDHRDQKDSRTVLIGT

Query:  KINGLYVLKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATT
            LY   N E+ Q  L+  ++ ++  DLWHKR+ H+S KGLQ+LA + ++    G  +  C+YC+ GK  R SF  S      IL+ +YSD+ GP   
Subjt:  KINGLYVLKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATT

Query:  NSLSGS-----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG
         S+ G+                                               R+DNG E+ +  F+ +C  HGI   +TV  TPQ NG
Subjt:  NSLSGS-----------------------------------------------RTDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNG

P93293 Uncharacterized mitochondrial protein AtMg00300 | 9.1e-17 | 40.19
Query:  KDSRTVLIGTKINGLYVLKNVEMIQTALS-VTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEY
        K  RT+L G + + LY+L+    ++T  S + E +  E  LWH RL+H+S +G+++L  +G L      +L FCE C+ GK  R +F+  Q TTK  L+Y
Subjt:  KDSRTVLIGTKINGLYVLKNVEMIQTALS-VTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEY

Query:  IYSDLWG
        ++SDLWG
Subjt:  IYSDLWG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.3e-04 | 36.36
Query:  WHKRLSHISTKGL-QVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLW
        WH RL H S   L  V++N  +        L  C  C + K+ +  F+ S +T+   LEYIYSD+W
Subjt:  WHKRLSHISTKGL-QVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLW

Arabidopsis top hits | e value | % identity
ATMG00300.1 Gag-Pol-related retrotransposon family protein | 6.5e-18 | 40.19
Query:  KDSRTVLIGTKINGLYVLKNVEMIQTALS-VTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEY
        K  RT+L G + + LY+L+    ++T  S + E +  E  LWH RL+H+S +G+++L  +G L      +L FCE C+ GK  R +F+  Q TTK  L+Y
Subjt:  KDSRTVLIGTKINGLYVLKNVEMIQTALS-VTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEY

Query:  IYSDLWG
        ++SDLWG
Subjt:  IYSDLWG


Sequences
CDS sequence
ATGCTTAAGGTAGAATCGGGCAAATCTGCACATAAGCCTGGACGTCTGCGAGCAAATAGTACAGAACTGCTGGTCAGCCTCATGCCAAGGTCGAGGCCGACCATCCCCCT
TCGAAAAGGCCAGGAATTAACTTTTCAATGCAGGGTTGTGAAGTTTTCTATTTTTCCTATTGTTAATTATGGAATGAAGCACTCCGAACGACCCAGTACGGAGGCCGAAG
CCGACCAAAGAGGCCCTCCGGCCACTTCTGTGATTTTGGACCACACAGATGGACAAGGAGCTGACGAGGACAATCGGGCAGAGGTAGGATCAAGAGACCGACCCAGAGGA
AGACCGGACCAAAGGGTCGGGCCAAAATGGCCCGACCCATATGGTCGGCCTCGGCAAAAGGCCGAGGCCGACCATCCGGCCCGTTTGCACGGGCCGAGCCCGGTGACCTC
TTTTCGGTCCCTAATGCCCCGAATCGCCCCGGTTTCGCCTGGTTCGCCCCGAAACGCCACCGAATTCCTAAAAAACCCTAGGAGGACAAACAGGCATCGGAGGCGGTGTG
GCCTACACCACGCCGGTGTGCAGCGGTTTTTGCTGGTTTTGCAGGTCACGTCTTCCCCAGTTCCTACAAATTCACTGTTGGTGTCACGTGAAGGTCGGGTGATTTTGGAC
CACACAGATGGACAAGGAGCTGACGAGGACAATCGGGCAGAGGTAGGATCAAGAGACCGACCCAGAGGAAGACCGGACCAAAGGGTCGGGCCAAAATGGCCCGACCCATA
TGGTCGGCCTCGGCAAAAGGCCGAGGCCGACCATCCGGCCCGTTTGCACGGGCCGAGCCCGGTGACCTCTTTTCGGTCCCTAATGCCCCGAATCGCCCCGGTTTCGCCTG
GTTCGCCCCGAAACGCCACCGAATTCCTAAAAAACCCTAGGAGGACAAACAGGCATCGGAGGCGGTGTGGCCTACACCACGCCGGTGTGCAGCGGTTTTTGCTGGTTTTG
CAGGTCACGTCTTCCCCAGTTCCTACAAATTCACTGTTGGTGTCACGTGAAGGTCGGATGAGGGGAAGACGTGGTCTGCAAGACGGTAAACCTGCACACCGGTGTGGTGC
TGTCACACCGCCTCCGATGCTTAAGAATTCGGAGGCATTTCAGGACGAACCAGGCGAAACTGGGGTGGCTAGAGGCAGTAGGGATCGAGCGGAGTTGGAGGAACTCGGCC
CATGGCCGAGGCCGACCATGGGCCTCGGCCTCGGCCGAGTAGGGAGTCGGGCTTCTTGGTCCGACCTATTGGACAACACAGTGCCAGTCCGAGGATGCGACCACGAAGTG
AAGCCTGAAGGAAAGTCAGACGAAAGGGGTTGGGCCAGGCTTGGCCTATGCTCGGTCTCGGTCGAGGCCGAGTCCTTCCATCTCTGTTCGGTTCTTGGAGTCTTTGGTTG
CCCCGGTTCAACCCAGATCAATCCTAAGATGCCTAGAAACCCTAAAATAGGAAAATGTCATGTCTTCCTCCCTCTTCAAACAAATTTACCATTGGTGTCACGTGAAGTGG
ATGTAGGCCATGGATGCCGAACCACTATAATCACCTGTCTTTGTTTCTTCTTCCTTGAATCTCTATTTTTCTTTCTGCATTTTGGTGTGAGTTCTTCGTCTGCAAATCGG
CGGGTTTGGGGTGACTGTTCTTCGACTGAAAAAGTAACAGTGGTATCAGAGCTCAGAAATCAGTCAAGAACTCAATTCTTGAAGTCAAGAATGTCGATGACAAGGTTTGA
AATTGAGAAGTTTGATGGGAAGGGGGACTTTGATCTTTGGAAAGCCAAGATCAAAGCTATTCTTGGGCAACAAAAGGCTTTGCAAGCCTTGCAAGATCCTTCCCAGCTAC
CAACCATCGTGACTGAAGAACAGAAGGAGACTATGAATGTGACAGCTTATGGGATGCTAATTCTGAATCTAAGTAATAATGTACTGCGACAAGTAATCGATGAGGAAACG
CCTTTCGGTATATGGACAAAGCTCAACAAGTTGTATGAGATTAAAGATGTCCATAATAAAATGTTCATGAGGGAGAGGTTCTTTACTTACAAAATGGATCCCTCAAAGCC
TCTGACTGAAAACTTGGATGACTTCAAGAGAATGACTTCAGAATTCAAGAATGTGGGAGAGAAAATTGGGGACGAAAATGAAGCCTTTGTCCTACTTAACTCACTACCAG
AGACCTTTAAAGAAGTGAAGGCTGCTTTGAAATATGGCAGGGACATAATTACAACTGACATAATTATCTCTGCCATTAGAACCAAAGATCTTGAATTACAATCTCAGAAA
AAGGAGTCATCTGAAGAGGCTGCTGTTGGAGAGAACTCCATAACCTATTCTGATGCCTTAGCAACATCAGACCAGTGCAGTAATGAACATCCAACAGTTGAGAAGTACGA
TTGGGTGATAGACTCAGGCTGCTCATTCCATATGACTCCCTCCAAAGGTTGGTTCAGCACTTACCGAGAATGGGATGGAGGAATAGTCTATATGGGCAATAACAACACTT
GCAGAGTTGTTGGGATAGGTTCAGTATCCTTGAAGCTTTCAGACGGCTGGTTGTACTTATGGAGGGAGTGCTGGGACCATAGAGATCAAAAGGACTCAAGGACAGTCCTA
ATTGGAACCAAGATAAATGGTCTGTATGTACTGAAGAATGTAGAAATGATACAGACAGCTCTTAGTGTGACAGAAAATAGCCTAACCGAATGTGATTTATGGCACAAGAG
GCTATCACACATCAGCACTAAAGGGCTTCAAGTTTTAGCCAACCAAGGGATATTACCTCAAGGAGTTGGAGAAAATTTAAACTTCTGTGAATACTGTGTAGTTGGCAAAG
CAAAGAGGCACAGTTTCAACAAATCACAACTAACAACCAAAGGAATTCTTGAGTATATATATTCGGACCTATGGGGACCTGCTACAACTAACTCTCTCAGTGGTTCCAGG
ACCGACAATGGTCTAGAATTCTGTAATGAATCCTTTGACAATTTCTGTAAAGAACATGGTATAGTGAGGCACCGAACAGTCAGGCACACACCTCAGCAAAATGGGTGGCT
GAGAGATTAA
mRNA sequence
ATGCTTAAGGTAGAATCGGGCAAATCTGCACATAAGCCTGGACGTCTGCGAGCAAATAGTACAGAACTGCTGGTCAGCCTCATGCCAAGGTCGAGGCCGACCATCCCCCT
TCGAAAAGGCCAGGAATTAACTTTTCAATGCAGGGTTGTGAAGTTTTCTATTTTTCCTATTGTTAATTATGGAATGAAGCACTCCGAACGACCCAGTACGGAGGCCGAAG
CCGACCAAAGAGGCCCTCCGGCCACTTCTGTGATTTTGGACCACACAGATGGACAAGGAGCTGACGAGGACAATCGGGCAGAGGTAGGATCAAGAGACCGACCCAGAGGA
AGACCGGACCAAAGGGTCGGGCCAAAATGGCCCGACCCATATGGTCGGCCTCGGCAAAAGGCCGAGGCCGACCATCCGGCCCGTTTGCACGGGCCGAGCCCGGTGACCTC
TTTTCGGTCCCTAATGCCCCGAATCGCCCCGGTTTCGCCTGGTTCGCCCCGAAACGCCACCGAATTCCTAAAAAACCCTAGGAGGACAAACAGGCATCGGAGGCGGTGTG
GCCTACACCACGCCGGTGTGCAGCGGTTTTTGCTGGTTTTGCAGGTCACGTCTTCCCCAGTTCCTACAAATTCACTGTTGGTGTCACGTGAAGGTCGGGTGATTTTGGAC
CACACAGATGGACAAGGAGCTGACGAGGACAATCGGGCAGAGGTAGGATCAAGAGACCGACCCAGAGGAAGACCGGACCAAAGGGTCGGGCCAAAATGGCCCGACCCATA
TGGTCGGCCTCGGCAAAAGGCCGAGGCCGACCATCCGGCCCGTTTGCACGGGCCGAGCCCGGTGACCTCTTTTCGGTCCCTAATGCCCCGAATCGCCCCGGTTTCGCCTG
GTTCGCCCCGAAACGCCACCGAATTCCTAAAAAACCCTAGGAGGACAAACAGGCATCGGAGGCGGTGTGGCCTACACCACGCCGGTGTGCAGCGGTTTTTGCTGGTTTTG
CAGGTCACGTCTTCCCCAGTTCCTACAAATTCACTGTTGGTGTCACGTGAAGGTCGGATGAGGGGAAGACGTGGTCTGCAAGACGGTAAACCTGCACACCGGTGTGGTGC
TGTCACACCGCCTCCGATGCTTAAGAATTCGGAGGCATTTCAGGACGAACCAGGCGAAACTGGGGTGGCTAGAGGCAGTAGGGATCGAGCGGAGTTGGAGGAACTCGGCC
CATGGCCGAGGCCGACCATGGGCCTCGGCCTCGGCCGAGTAGGGAGTCGGGCTTCTTGGTCCGACCTATTGGACAACACAGTGCCAGTCCGAGGATGCGACCACGAAGTG
AAGCCTGAAGGAAAGTCAGACGAAAGGGGTTGGGCCAGGCTTGGCCTATGCTCGGTCTCGGTCGAGGCCGAGTCCTTCCATCTCTGTTCGGTTCTTGGAGTCTTTGGTTG
CCCCGGTTCAACCCAGATCAATCCTAAGATGCCTAGAAACCCTAAAATAGGAAAATGTCATGTCTTCCTCCCTCTTCAAACAAATTTACCATTGGTGTCACGTGAAGTGG
ATGTAGGCCATGGATGCCGAACCACTATAATCACCTGTCTTTGTTTCTTCTTCCTTGAATCTCTATTTTTCTTTCTGCATTTTGGTGTGAGTTCTTCGTCTGCAAATCGG
CGGGTTTGGGGTGACTGTTCTTCGACTGAAAAAGTAACAGTGGTATCAGAGCTCAGAAATCAGTCAAGAACTCAATTCTTGAAGTCAAGAATGTCGATGACAAGGTTTGA
AATTGAGAAGTTTGATGGGAAGGGGGACTTTGATCTTTGGAAAGCCAAGATCAAAGCTATTCTTGGGCAACAAAAGGCTTTGCAAGCCTTGCAAGATCCTTCCCAGCTAC
CAACCATCGTGACTGAAGAACAGAAGGAGACTATGAATGTGACAGCTTATGGGATGCTAATTCTGAATCTAAGTAATAATGTACTGCGACAAGTAATCGATGAGGAAACG
CCTTTCGGTATATGGACAAAGCTCAACAAGTTGTATGAGATTAAAGATGTCCATAATAAAATGTTCATGAGGGAGAGGTTCTTTACTTACAAAATGGATCCCTCAAAGCC
TCTGACTGAAAACTTGGATGACTTCAAGAGAATGACTTCAGAATTCAAGAATGTGGGAGAGAAAATTGGGGACGAAAATGAAGCCTTTGTCCTACTTAACTCACTACCAG
AGACCTTTAAAGAAGTGAAGGCTGCTTTGAAATATGGCAGGGACATAATTACAACTGACATAATTATCTCTGCCATTAGAACCAAAGATCTTGAATTACAATCTCAGAAA
AAGGAGTCATCTGAAGAGGCTGCTGTTGGAGAGAACTCCATAACCTATTCTGATGCCTTAGCAACATCAGACCAGTGCAGTAATGAACATCCAACAGTTGAGAAGTACGA
TTGGGTGATAGACTCAGGCTGCTCATTCCATATGACTCCCTCCAAAGGTTGGTTCAGCACTTACCGAGAATGGGATGGAGGAATAGTCTATATGGGCAATAACAACACTT
GCAGAGTTGTTGGGATAGGTTCAGTATCCTTGAAGCTTTCAGACGGCTGGTTGTACTTATGGAGGGAGTGCTGGGACCATAGAGATCAAAAGGACTCAAGGACAGTCCTA
ATTGGAACCAAGATAAATGGTCTGTATGTACTGAAGAATGTAGAAATGATACAGACAGCTCTTAGTGTGACAGAAAATAGCCTAACCGAATGTGATTTATGGCACAAGAG
GCTATCACACATCAGCACTAAAGGGCTTCAAGTTTTAGCCAACCAAGGGATATTACCTCAAGGAGTTGGAGAAAATTTAAACTTCTGTGAATACTGTGTAGTTGGCAAAG
CAAAGAGGCACAGTTTCAACAAATCACAACTAACAACCAAAGGAATTCTTGAGTATATATATTCGGACCTATGGGGACCTGCTACAACTAACTCTCTCAGTGGTTCCAGG
ACCGACAATGGTCTAGAATTCTGTAATGAATCCTTTGACAATTTCTGTAAAGAACATGGTATAGTGAGGCACCGAACAGTCAGGCACACACCTCAGCAAAATGGGTGGCT
GAGAGATTAA
Protein sequence
MLKVESGKSAHKPGRLRANSTELLVSLMPRSRPTIPLRKGQELTFQCRVVKFSIFPIVNYGMKHSERPSTEAEADQRGPPATSVILDHTDGQGADEDNRAEVGSRDRPRG
RPDQRVGPKWPDPYGRPRQKAEADHPARLHGPSPVTSFRSLMPRIAPVSPGSPRNATEFLKNPRRTNRHRRRCGLHHAGVQRFLLVLQVTSSPVPTNSLLVSREGRVILD
HTDGQGADEDNRAEVGSRDRPRGRPDQRVGPKWPDPYGRPRQKAEADHPARLHGPSPVTSFRSLMPRIAPVSPGSPRNATEFLKNPRRTNRHRRRCGLHHAGVQRFLLVL
QVTSSPVPTNSLLVSREGRMRGRRGLQDGKPAHRCGAVTPPPMLKNSEAFQDEPGETGVARGSRDRAELEELGPWPRPTMGLGLGRVGSRASWSDLLDNTVPVRGCDHEV
KPEGKSDERGWARLGLCSVSVEAESFHLCSVLGVFGCPGSTQINPKMPRNPKIGKCHVFLPLQTNLPLVSREVDVGHGCRTTIITCLCFFFLESLFFFLHFGVSSSSANR
RVWGDCSSTEKVTVVSELRNQSRTQFLKSRMSMTRFEIEKFDGKGDFDLWKAKIKAILGQQKALQALQDPSQLPTIVTEEQKETMNVTAYGMLILNLSNNVLRQVIDEET
PFGIWTKLNKLYEIKDVHNKMFMRERFFTYKMDPSKPLTENLDDFKRMTSEFKNVGEKIGDENEAFVLLNSLPETFKEVKAALKYGRDIITTDIIISAIRTKDLELQSQK
KESSEEAAVGENSITYSDALATSDQCSNEHPTVEKYDWVIDSGCSFHMTPSKGWFSTYREWDGGIVYMGNNNTCRVVGIGSVSLKLSDGWLYLWRECWDHRDQKDSRTVL
IGTKINGLYVLKNVEMIQTALSVTENSLTECDLWHKRLSHISTKGLQVLANQGILPQGVGENLNFCEYCVVGKAKRHSFNKSQLTTKGILEYIYSDLWGPATTNSLSGSR
TDNGLEFCNESFDNFCKEHGIVRHRTVRHTPQQNGWLRD