; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027556 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027556
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr8:2012362..2019125
RNA-Seq ExpressionLag0027556
SyntenyLag0027556
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa]3.3e-15045.23Show/hide
Query:  IEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERF
        + KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE  K  ++  AY T++L LS+ VLR V +  T  ++W KL +LY TK   NK+Y++E+F
Subjt:  IEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERF

Query:  FTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKES----SEGFFVKGKR
        F +KMD +K L ENLDEF+++  +  N+GEK+SDEN+A +LLNSLP+ Y+EVK  +KYG +++T  +++ A++T+ LEI+ ++K+     + G   K   
Subjt:  FTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKES----SEGFFVKGKR

Query:  KGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSIT--YSDALATSDQCSNDQSSFEKHD------------WVI
        KGK+   +   K K++ +C  CHKEGH K++C               NK +EA+  E ++T  Y+ A  T D C + ++ +E  +            W++
Subjt:  KGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSIT--YSDALATSDQCSNDQSSFEKHD------------WVI

Query:  DSGCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGV
        DSGC+FHMT  + + + + + DG  V +G+N TC V G  SV +   DG V++L NVR+VP LKRNLISLG LD  GCT    +G++++ +   + L G 
Subjt:  DSGCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGV

Query:  KINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPAST
          +GLYV++   +  SA + S    + S LWHKRL+H+ E+GLQ L++QG+L       L F EHC++GK+ R  F K + T+KGIL+Y+HSDLWGP   
Subjt:  KINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPAST

Query:  NSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIME
         S+ GSRYF+S IDDFSRK W+Y LK KD+AF KF EWK  +E QT R VK LRTDNGLE+ N  F+ FCK+ G+ RH TV +TPQQNG+AER NRTIME
Subjt:  NSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIME

Query:  RVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT
        R RC LT+A L  ++W EAA    Y +NR   T++N  T
Subjt:  RVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]6.7e-15145.54Show/hide
Query:  IEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERF
        + KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE  K  ++  AY T++L LS+ VLR V +  T  ++W KL +LY TK   NK+Y++E+F
Subjt:  IEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERF

Query:  FTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKES----SEGFFVKGKR
        F +KMD +KSL ENLDEF+++  +  N+GEK+SDEN+A +LLNSLP+ Y+EVK  +KYGR+++T  +++ A++T+ LEI+ ++K+     + G   K   
Subjt:  FTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKES----SEGFFVKGKR

Query:  KGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSIT--YSDALATSDQCSNDQSSFEKHD------------WVI
        KGK+   +   K K++ +C  CHKEGH K++C               NK +EA+  E ++T  Y+ A  T D   + ++ +E  +            W++
Subjt:  KGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSIT--YSDALATSDQCSNDQSSFEKHD------------WVI

Query:  DSGCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGV
        DSGC+FHMT  + + + + + DG  V +G+N TC V G  SV +   DG V++L NVR+VP LKRNLISLG LD  GCT    +G++++ +   + L G 
Subjt:  DSGCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGV

Query:  KINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPAST
          +GLYV++   +  SA + S      S LWHKRL+H+ E+GLQ L++QG+L       L F EHC++GK+ R  F K + T+KGIL+YVHSDLWGP   
Subjt:  KINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPAST

Query:  NSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIME
         S+ GSRYF+S IDDFSRK W+Y LK KD+AF KF EWK  +E QT R VK LRTDNGLE+ N  F+ FCK+ G+ RH TV +TPQQNG+AER NRTIME
Subjt:  NSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIME

Query:  RVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT
        R RC LT+A L  ++W EAA    Y +NR   T++N  T
Subjt:  RVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT

KAD3641560.1 hypothetical protein E3N88_30784 [Mikania micrantha]1.4e-13242.22Show/hide
Query:  KANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMR
        K +IEKF+GK DF LW+ K++A+L  +  + AL     LP  L++  K+ +   A+  +IL+L + VLR+V  E T   +WTKL +LY TK   N++Y++
Subjt:  KANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMR

Query:  ERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKK-ESSEGFFVKGKR
        +R +TF+M + KSL ++ DEF ++  + +N+  ++ +E+ A + L+SLP  Y+    TL +GRE+++ E +++A+ ++EL+ +   K E S+G FV+G+ 
Subjt:  ERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKK-ESSEGFFVKGKR

Query:  KGKDNKHQPD--EKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEA-AVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGCSFHMTHF
        + +D K +     KSK K+RC  C+ + HLKR C   K+K   +  + K K Q + +  E S+   D+    D  +    S    +WV+DSGCSFHMT  
Subjt:  KGKDNKHQPD--EKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEA-AVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGCSFHMTHF

Query:  KGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKINGLYVVKDI
        K +F      D   V++G+N  C+V G+ +VS KL +GS+  L+ VR++P LKRNLISLGM +S G       G +++ +   +VL+G + N      D 
Subjt:  KGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKINGLYVVKDI

Query:  EMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLSGSRYFLS
        E+   ++ V+E G S++ LWH R+ HI   GLQ L +QG+L        GF EHCV+GKA R  F +S   +KGIL+Y+H+DLWGP+   SL G+RYFLS
Subjt:  EMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLSGSRYFLS

Query:  FIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAIL
         +D +SR+ WV+ LKSKD+ F KFKEWK M+E QT R VK LRTDNGLE+CN+ FD FCK  G+ RH +V  TPQQNG+ ER+N T++ +VRC L +A L
Subjt:  FIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAIL

Query:  VERYWAEAASYTVYTLNRCTHTS
         +++WAEA S  V+ +N C+ +S
Subjt:  VERYWAEAASYTVYTLNRCTHTS

PNX96445.1 copia LTR rider [Trifolium pratense]8.0e-13642.99Show/hide
Query:  KANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMR
        K  IEKF G  DF LW+ K+KA+L Q+  L+AL   + +   LT   K  +   A+  ++L+L + VLRQV  E T   +W KL +LY TK   N++Y++
Subjt:  KANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMR

Query:  ERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKESS--EGFFVKGK
        +  ++FKM   K L+E LD F ++  + +N+  KI DE++A +LL +LP ++   K TL YGRE++T E + SA+ +K+L  + + K S+  EG  VKGK
Subjt:  ERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKESS--EGFFVKGK

Query:  RKGKDNKHQPDEKSKAK--------VRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQ-EAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGC
           K+ K     KS++K        +RC +C KEGH ++ C          +R K +     AA+ ++    SD L  S       SS  + +W++DSGC
Subjt:  RKGKDNKHQPDEKSKAK--------VRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQ-EAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGC

Query:  SFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKING
        ++HMT  K  F   C+ DG  V +GNN  C++ G+ SV  KL D S++LL  VR+VP+LKRNL+SLG  D  G  + G   ++ + +  K VL GVK  G
Subjt:  SFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKING

Query:  LYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLS
        LY ++   +  S  VVS    S++++WH RL H+ E+GL  L +Q +L  +    L F E CV GK+ R  F K +Q + G L+Y+H+DLWGPA   S S
Subjt:  LYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLS

Query:  GSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRC
        G+RYFLS +DD+SRK WV+  K+KD+ FE FK WKT++E QT R VK LRTDNGLE+CN  FD FC   G+ RH T   TPQQNG+AER NRTI+ERVRC
Subjt:  GSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRC

Query:  QLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT
         LT A L + +WAEA S   Y +NRC  T+++  T
Subjt:  QLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]8.8e-15145.54Show/hide
Query:  IEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERF
        + KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE  K  ++  AY T++L LS+ VLR V +  T  ++W KL +LY TK   NK+Y++E+F
Subjt:  IEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERF

Query:  FTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKES----SEGFFVKGKR
        F +KMD +KSL ENLDEF+++  +  N+GEK+SDEN+A +LLNSLP+ Y+EVK  +KYGR+++T  +++ A++T+ LEI+ ++K+     + G   K   
Subjt:  FTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKES----SEGFFVKGKR

Query:  KGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSIT--YSDALATSDQCSNDQSSFEKHD------------WVI
        KGK+   +   K K++ +C  CHKEGH K++C               NK +EA+  E ++T  Y+ A  T D   + ++ +E  +            W++
Subjt:  KGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSIT--YSDALATSDQCSNDQSSFEKHD------------WVI

Query:  DSGCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGV
        DSGC+FHMT  + + + + + DG  V +G+N TC V G  SV +   DG V++L NVR+VP LKRNLISLG LD  GCT    +G++++ +   + L G 
Subjt:  DSGCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGV

Query:  KINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPAST
          +GLYV++   +  SA + S      S LWHKRL+H+ E+GLQ L++QG+L       L F EHC++GK+ R  F K + T+KGIL+YVHSDLWGP   
Subjt:  KINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPAST

Query:  NSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIME
         S+ GSRYF+S IDDFSRK W+Y LK KD+AF KF EWK  +E QT R VK LRTDNGLE+ N  F+ FCK+ G+ RH TV +TPQQNG+AER NRTIME
Subjt:  NSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIME

Query:  RVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT
        R RC LT+A L  ++W EAA    Y +NR   T++N  T
Subjt:  RVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT

TrEMBL top hitse value%identityAlignment
A0A2K3N065 Copia LTR rider3.9e-13642.99Show/hide
Query:  KANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMR
        K  IEKF G  DF LW+ K+KA+L Q+  L+AL   + +   LT   K  +   A+  ++L+L + VLRQV  E T   +W KL +LY TK   N++Y++
Subjt:  KANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMR

Query:  ERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKESS--EGFFVKGK
        +  ++FKM   K L+E LD F ++  + +N+  KI DE++A +LL +LP ++   K TL YGRE++T E + SA+ +K+L  + + K S+  EG  VKGK
Subjt:  ERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKESS--EGFFVKGK

Query:  RKGKDNKHQPDEKSKAK--------VRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQ-EAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGC
           K+ K     KS++K        +RC +C KEGH ++ C          +R K +     AA+ ++    SD L  S       SS  + +W++DSGC
Subjt:  RKGKDNKHQPDEKSKAK--------VRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQ-EAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGC

Query:  SFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKING
        ++HMT  K  F   C+ DG  V +GNN  C++ G+ SV  KL D S++LL  VR+VP+LKRNL+SLG  D  G  + G   ++ + +  K VL GVK  G
Subjt:  SFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKING

Query:  LYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLS
        LY ++   +  S  VVS    S++++WH RL H+ E+GL  L +Q +L  +    L F E CV GK+ R  F K +Q + G L+Y+H+DLWGPA   S S
Subjt:  LYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLS

Query:  GSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRC
        G+RYFLS +DD+SRK WV+  K+KD+ FE FK WKT++E QT R VK LRTDNGLE+CN  FD FC   G+ RH T   TPQQNG+AER NRTI+ERVRC
Subjt:  GSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRC

Query:  QLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT
         LT A L + +WAEA S   Y +NRC  T+++  T
Subjt:  QLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class1.6e-15045.23Show/hide
Query:  IEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERF
        + KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE  K  ++  AY T++L LS+ VLR V +  T  ++W KL +LY TK   NK+Y++E+F
Subjt:  IEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERF

Query:  FTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKES----SEGFFVKGKR
        F +KMD +K L ENLDEF+++  +  N+GEK+SDEN+A +LLNSLP+ Y+EVK  +KYG +++T  +++ A++T+ LEI+ ++K+     + G   K   
Subjt:  FTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKES----SEGFFVKGKR

Query:  KGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSIT--YSDALATSDQCSNDQSSFEKHD------------WVI
        KGK+   +   K K++ +C  CHKEGH K++C               NK +EA+  E ++T  Y+ A  T D C + ++ +E  +            W++
Subjt:  KGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSIT--YSDALATSDQCSNDQSSFEKHD------------WVI

Query:  DSGCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGV
        DSGC+FHMT  + + + + + DG  V +G+N TC V G  SV +   DG V++L NVR+VP LKRNLISLG LD  GCT    +G++++ +   + L G 
Subjt:  DSGCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGV

Query:  KINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPAST
          +GLYV++   +  SA + S    + S LWHKRL+H+ E+GLQ L++QG+L       L F EHC++GK+ R  F K + T+KGIL+Y+HSDLWGP   
Subjt:  KINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPAST

Query:  NSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIME
         S+ GSRYF+S IDDFSRK W+Y LK KD+AF KF EWK  +E QT R VK LRTDNGLE+ N  F+ FCK+ G+ RH TV +TPQQNG+AER NRTIME
Subjt:  NSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIME

Query:  RVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT
        R RC LT+A L  ++W EAA    Y +NR   T++N  T
Subjt:  RVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT

A0A5A7UB25 Putative gag-pol polyprotein3.3e-15145.54Show/hide
Query:  IEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERF
        + KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE  K  ++  AY T++L LS+ VLR V +  T  ++W KL +LY TK   NK+Y++E+F
Subjt:  IEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERF

Query:  FTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKES----SEGFFVKGKR
        F +KMD +KSL ENLDEF+++  +  N+GEK+SDEN+A +LLNSLP+ Y+EVK  +KYGR+++T  +++ A++T+ LEI+ ++K+     + G   K   
Subjt:  FTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKES----SEGFFVKGKR

Query:  KGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSIT--YSDALATSDQCSNDQSSFEKHD------------WVI
        KGK+   +   K K++ +C  CHKEGH K++C               NK +EA+  E ++T  Y+ A  T D   + ++ +E  +            W++
Subjt:  KGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSIT--YSDALATSDQCSNDQSSFEKHD------------WVI

Query:  DSGCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGV
        DSGC+FHMT  + + + + + DG  V +G+N TC V G  SV +   DG V++L NVR+VP LKRNLISLG LD  GCT    +G++++ +   + L G 
Subjt:  DSGCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGV

Query:  KINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPAST
          +GLYV++   +  SA + S      S LWHKRL+H+ E+GLQ L++QG+L       L F EHC++GK+ R  F K + T+KGIL+YVHSDLWGP   
Subjt:  KINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPAST

Query:  NSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIME
         S+ GSRYF+S IDDFSRK W+Y LK KD+AF KF EWK  +E QT R VK LRTDNGLE+ N  F+ FCK+ G+ RH TV +TPQQNG+AER NRTIME
Subjt:  NSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIME

Query:  RVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT
        R RC LT+A L  ++W EAA    Y +NR   T++N  T
Subjt:  RVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT

A0A5D3DNU1 Putative gag-pol polyprotein4.3e-15145.54Show/hide
Query:  IEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERF
        + KF+G GDF LW+ KI+AIL Q K  + L D  +LP  +TE  K  ++  AY T++L LS+ VLR V +  T  ++W KL +LY TK   NK+Y++E+F
Subjt:  IEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERF

Query:  FTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKES----SEGFFVKGKR
        F +KMD +KSL ENLDEF+++  +  N+GEK+SDEN+A +LLNSLP+ Y+EVK  +KYGR+++T  +++ A++T+ LEI+ ++K+     + G   K   
Subjt:  FTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKES----SEGFFVKGKR

Query:  KGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSIT--YSDALATSDQCSNDQSSFEKHD------------WVI
        KGK+   +   K K++ +C  CHKEGH K++C               NK +EA+  E ++T  Y+ A  T D   + ++ +E  +            W++
Subjt:  KGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSIT--YSDALATSDQCSNDQSSFEKHD------------WVI

Query:  DSGCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGV
        DSGC+FHMT  + + + + + DG  V +G+N TC V G  SV +   DG V++L NVR+VP LKRNLISLG LD  GCT    +G++++ +   + L G 
Subjt:  DSGCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGV

Query:  KINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPAST
          +GLYV++   +  SA + S      S LWHKRL+H+ E+GLQ L++QG+L       L F EHC++GK+ R  F K + T+KGIL+YVHSDLWGP   
Subjt:  KINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPAST

Query:  NSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIME
         S+ GSRYF+S IDDFSRK W+Y LK KD+AF KF EWK  +E QT R VK LRTDNGLE+ N  F+ FCK+ G+ RH TV +TPQQNG+AER NRTIME
Subjt:  NSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIME

Query:  RVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT
        R RC LT+A L  ++W EAA    Y +NR   T++N  T
Subjt:  RVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLT

A0A5N6MMT4 Integrase catalytic domain-containing protein6.8e-13342.22Show/hide
Query:  KANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMR
        K +IEKF+GK DF LW+ K++A+L  +  + AL     LP  L++  K+ +   A+  +IL+L + VLR+V  E T   +WTKL +LY TK   N++Y++
Subjt:  KANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMR

Query:  ERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKK-ESSEGFFVKGKR
        +R +TF+M + KSL ++ DEF ++  + +N+  ++ +E+ A + L+SLP  Y+    TL +GRE+++ E +++A+ ++EL+ +   K E S+G FV+G+ 
Subjt:  ERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKK-ESSEGFFVKGKR

Query:  KGKDNKHQPD--EKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEA-AVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGCSFHMTHF
        + +D K +     KSK K+RC  C+ + HLKR C   K+K   +  + K K Q + +  E S+   D+    D  +    S    +WV+DSGCSFHMT  
Subjt:  KGKDNKHQPD--EKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEA-AVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGCSFHMTHF

Query:  KGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKINGLYVVKDI
        K +F      D   V++G+N  C+V G+ +VS KL +GS+  L+ VR++P LKRNLISLGM +S G       G +++ +   +VL+G + N      D 
Subjt:  KGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKINGLYVVKDI

Query:  EMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLSGSRYFLS
        E+   ++ V+E G S++ LWH R+ HI   GLQ L +QG+L        GF EHCV+GKA R  F +S   +KGIL+Y+H+DLWGP+   SL G+RYFLS
Subjt:  EMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLSGSRYFLS

Query:  FIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAIL
         +D +SR+ WV+ LKSKD+ F KFKEWK M+E QT R VK LRTDNGLE+CN+ FD FCK  G+ RH +V  TPQQNG+ ER+N T++ +VRC L +A L
Subjt:  FIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAIL

Query:  VERYWAEAASYTVYTLNRCTHTS
         +++WAEA S  V+ +N C+ +S
Subjt:  VERYWAEAASYTVYTLNRCTHTS

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-5028.16Show/hide
Query:  DLWKANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKM
        D  K NI+ FDG+  + +WK +I+A+L ++  L+ +     +P  + +  K+     A  T+I  LS+S L     + T  +I   L+ +YE K   +++
Subjt:  DLWKANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKM

Query:  YMRERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLK-YGRENITTEVIISAIRTKELEIQSQKKESSE-----
         +R+R  + K+ S  SL  +   F  + SE    G KI + ++   LL +LP  Y  + T ++    EN+T   + + +  +E++I++   ++S+     
Subjt:  YMRERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLK-YGRENITTEVIISAIRTKELEIQSQKKESSE-----

Query:  ------GFFVKGKRKGKDNKHQPDEK--SKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSITYSDALATSDQCSNDQSSFEKHDW
                +     K +  K +   K  SK KV+C++C +EGH+K+DC+  KR   N     KNK  E  V       S  +A   +  N+ S  +   +
Subjt:  ------GFFVKGKRKGKDNKHQPDEK--SKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSITYSDALATSDQCSNDQSSFEKHDW

Query:  VIDSGCSFHMTHFKGWFS----------IYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIE
        V+DSG S H+ + +  ++          I     GE +Y       R+     ++L+          +V        NL+S+  L   G +       IE
Subjt:  VIDSGCSFHMTHFKGWFS----------IYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIE

Query:  IKRDLKIVLSGVKI--NGLYVVKDIEMLKSALVV-----SENGPSESD--LWHKRLSHIGEKGLQVLARQGILPKEA-----GNTLGFYEHCVIGKAKRQ
          +      SGV I  NGL VVK+  ML +  V+     S N   +++  LWH+R  HI +  L  + R+ +   ++       +    E C+ GK  R 
Subjt:  IKRDLKIVLSGVKI--NGLYVVKDIEMLKSALVV-----SENGPSESD--LWHKRLSHIGEKGLQVLARQGILPKEA-----GNTLGFYEHCVIGKAKRQ

Query:  CF--TKSQQTSKGILEYVHSDLWGPASTNSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKT
         F   K +   K  L  VHSD+ GP +  +L    YF+ F+D F+     Y +K K   F  F+++    E   N  V  L  DNG EY +     FC  
Subjt:  CF--TKSQQTSKGILEYVHSDLWGPASTNSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKT

Query:  CGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAILVERYWAEAASYTVYTLNR
         G+  H+TV HTPQ NGV+ER+ RTI E+ R  ++ A L + +W EA     Y +NR
Subjt:  CGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAILVERYWAEAASYTVYTLNR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-9433.07Show/hide
Query:  KANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMR
        K  + KF+G   F  W+ +++ +L Q+   + L   SK P T+  E    ++  A   + L+LS+ V+  + DE+T   IWT+L +LY +K   NK+Y++
Subjt:  KANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMR

Query:  ERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKESSEGFFVKGKRK
        ++ +   M    +   +L+ F  + ++  N+G KI +E++A +LLNSLP +Y  + TT+ +G+  I  + + SA+   E   + +KK  ++G  +  + +
Subjt:  ERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKESSEGFFVKGKRK

Query:  GKDNKHQPD------------EKSKAKVR-CNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDS
        G+  +   +             +SK++VR C  C++ GH KRDC +  RK + +   +KN    AA+ +N+      +   ++C +   S  + +WV+D+
Subjt:  GKDNKHQPD------------EKSKAKVR-CNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDS

Query:  GCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKI
          S H T  +  F  Y   D   V MGN +  ++ GI  + +K   G   +L++VRHVP+L+ NLIS   LD  G     ++    + +   ++  GV  
Subjt:  GCSFHMTHFKGWFSIYCEWDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKI

Query:  NGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNS
          LY   + E+ +  L  +++  S  DLWHKR+ H+ EKGLQ+LA++ ++    G T+   ++C+ GK  R  F  S +    IL+ V+SD+ GP    S
Subjt:  NGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNS

Query:  LSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERV
        + G++YF++FIDD SRK WVY LK+KDQ F+ F+++  ++E++T R +K LR+DNG EY +  F+ +C + G+    TV  TPQ NGVAER+NRTI+E+V
Subjt:  LSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERV

Query:  RCQLTDAILVERYWAEAASYTVYTLNRCTHTSINF
        R  L  A L + +W EA     Y +NR     + F
Subjt:  RCQLTDAILVERYWAEAASYTVYTLNRCTHTSINF

Q12490 Transposon Ty1-BL Gag-Pol polyprotein2.7e-1726.53Show/hide
Query:  VRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKINGLYVVKDIEMLKSAL-VVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKE
        V H PN+  +L+SL  L ++  T   +  ++E + D  ++   VK    Y V    +L S + V + N    S+   K       + L     Q I    
Subjt:  VRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKINGLYVVKDIEMLKSAL-VVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKE

Query:  AGNTLGFYEH---------------CVIGKAKRQCFTKSQ----QTSKGILEYVHSDLWGPASTNSLSGSRYFLSFIDDFSRKSWVYFL--KSKDQAFEK
          NT+ ++                 C+IGK+ +    K      Q S    +Y+H+D++GP      S   YF+SF D+ ++  WVY L  + +D   + 
Subjt:  AGNTLGFYEH---------------CVIGKAKRQCFTKSQ----QTSKGILEYVHSDLWGPASTNSLSGSRYFLSFIDDFSRKSWVYFL--KSKDQAFEK

Query:  FKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAILVERYWAEAASYTVYTLN
        F      I+ Q   SV  ++ D G EY N     F +  G+    T     + +GVAERLNRT+++  R QL  + L    W  A  ++    N
Subjt:  FKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAILVERYWAEAASYTVYTLN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.0e-2523.28Show/hide
Query:  YGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEV
        Y  ++  +S SV   V    T  +IW  L  +Y      +   +R +   +    TK++ + +        +   +G+ +  + +   +L +LP+ YK V
Subjt:  YGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEV

Query:  ----------KTTLKYGRENITTEVIISAIRTKEL-----EIQSQKKESSEGFFVKGKRKGK-DNK----------------HQPDEKSKAKV-RCNYCH
                   T  +     +  E  I A+ +  +        S +  ++      G R  + DN+                H  + +SK  + +C  C 
Subjt:  ----------KTTLKYGRENITTEVIISAIRTKEL-----EIQSQKKESSEGFFVKGKRKGK-DNK----------------HQPDEKSKAKV-RCNYCH

Query:  KEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGCSFHMTHFKGWFSIYCEW-DGEIVYMGNNNTCRV
         +GH      S KR +Q Q        Q+             LA         S +  ++W++DSG + H+T      S++  +  G+ V + + +T  +
Subjt:  KEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGCSFHMTHFKGWFSIYCEW-DGEIVYMGNNNTCRV

Query:  IGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGML---DSIGCTYGGSSGMIEIKRDLKIVLSGVKINGLYVVKDIEMLKSALVVSENGPSESDLWHK
            S SL  K   +  L N+ +VPN+ +NLIS+  L   + +   +  +S  ++       +L G   + LY          +L  S +  +    WH 
Subjt:  IGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGML---DSIGCTYGGSSGMIEIKRDLKIVLSGVKINGLYVVKDIEMLKSALVVSENGPSESDLWHK

Query:  RLSHIGEKGL-QVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAF
        RL H     L  V++   +      +       C+I K+ +  F++S   S   LEY++SD+W  +   S    RY++ F+D F+R +W+Y LK K Q  
Subjt:  RLSHIGEKGL-QVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAF

Query:  EKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAILVERYWAEAASYTVYTLNR
        E F  +K ++E +    +    +DNG E+  AL++ F +  G+    +  HTP+ NG++ER +R I+E     L+ A + + YW  A +  VY +NR
Subjt:  EKFKEWKTMIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAILVERYWAEAASYTVYTLNR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-2523.01Show/hide
Query:  YGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGE---KISDENEAFVLLNSLPDAY
        Y  ++  +S SV   V    T  +IW  L  +Y      +   +R   F  + D    L + +D  +++    +N+ +    + D+  A     SL + +
Subjt:  YGTLILNLSNSVLRQVFDEETPLKIWTKLNTLYETKDAHNKMYMRERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGE---KISDENEAFVLLNSLPDAY

Query:  KE-VKTTLKYGRENITTEVIISAIRTKELEIQSQKKESSEGFFVK-GKRKGKDNKHQPDEKSKAK---------VRCNYCHKEGHLKRDCYSLKRKNQNQ
        +  +    K    N    V I+A         + + +++ G          + N  QP                 RC  C  +GH  + C  L       
Subjt:  KE-VKTTLKYGRENITTEVIISAIRTKELEIQSQKKESSEGFFVK-GKRKGKDNKHQPDEKSKAK---------VRCNYCHKEGHLKRDCYSLKRKNQNQ

Query:  QRFKKNKPQEAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGCSFHMTHFKGWFSIYCEW-DGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLR
         +F+    Q+    +++  ++     ++   N  S +  ++W++DSG + H+T      S +  +  G+ V + + +T  +    S SL     S+  L 
Subjt:  QRFKKNKPQEAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGCSFHMTHFKGWFSIYCEW-DGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLR

Query:  NVRHVPNLKRNLISLGML---DSIGCTYGGSSGMIEIKRDLKIVLSGVKINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGL-QVLARQGI
         V +VPN+ +NLIS+  L   + +   +  +S  ++       +L G   + LY          ++  S    +    WH RL H     L  V++   +
Subjt:  NVRHVPNLKRNLISLGML---DSIGCTYGGSSGMIEIKRDLKIVLSGVKINGLYVVKDIEMLKSALVVSENGPSESDLWHKRLSHIGEKGL-QVLARQGI

Query:  LPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVK
              + L     C I K+ +  F+ S  TS   LEY++SD+W  +   S+   RY++ F+D F+R +W+Y LK K Q  + F  +K+++E +    + 
Subjt:  LPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKTMIEKQTNRSVK

Query:  CLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAILVERYWAEAASYTVYTLNR
         L +DNG E+   +   +    G+    +  HTP+ NG++ER +R I+E     L+ A + + YW  A S  VY +NR
Subjt:  CLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAILVERYWAEAASYTVYTLNR

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein2.0e-0425.47Show/hide
Query:  KSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGCSFHMTHFKGWFSIYCEWDGEI
        KSK++  C  C+K  H + DC           +F+ +  +E    E+ I     L T         +++   W+I      +MT +  +F+         
Subjt:  KSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGCSFHMTHFKGWFSIYCEWDGEI

Query:  VYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGM
        V   +     V G   V +++K+G  K +RNV  VP L RN++S G +  +   Y  S+GM
Subjt:  VYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGM

ATMG00300.1 Gag-Pol-related retrotransposon family protein1.2e-1735.59Show/hide
Query:  SSGMIEIKRDLKIVLSGVKINGLYVVK-DIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQ
        S G++++ +  + +L G + + LY+++  +E  +S L  +E    E+ LWH RL+H+ ++G+++L ++G L     ++L F E C+ GK  R  F+  Q 
Subjt:  SSGMIEIKRDLKIVLSGVKINGLYVVK-DIEMLKSALVVSENGPSESDLWHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQ

Query:  TSKGILEYVHSDLWGPAS
        T+K  L+YVHSDLWG  S
Subjt:  TSKGILEYVHSDLWGPAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTAAGGTAGGGTCAGGCAATTCTACACATAAGCCTAGCTGTCGAGCAAACAAACAGTATAGAGAAGCTGGTCAGCCTCATGCCAAGGCCGAGGCCGACCATCCACC
TTCAAAAGGCCAGGGAGCGTCTACTTCTATCCTATTTTGCAGGATGACAACAAAAGCTACACATAAGCCAATCCAAGGCAAGGCGTACGAGGAGCTGACGGGACAACCGG
GAAGAGATAGGACCAAGAAAGGGACCCAGAGGGAGACCATACCGACGGGCCGGGCCAACGTGGGTCGACCCGTACGGTCGGCCTTGGCCCAAGGCCGAGGCCGACCACTC
GGCCCGTTTGCGCGGGCCGAGTCCGTTTGCCTCTGCTCGGCCCCTACCGCTTCCAGCTGCCTCGGTCCAGCCTGCTTCGTCCCAGAACGCCTCCAAACCCTAGGAGTCCG
AGCAGCTGCTCAGTTTCCTAACTTAGGCATCGGAGGCAGTGTGGCCTACACCACACCGGTGTCCAGCGATTCTTGCTGGTCTTGCAGGTCACGTCTTCCCCAGCTTCTAC
AAATTCACTGTTGGTGTCACGTGAAGGGCAGCTCCAACATATGCAATTCATGGGACATTGATATGTCTGTAAGCATCCATAAGAACTTCATCCTGCTTCTCCAAATGCGT
GCACAACGAATTCAGAAGTTAATGAACAACATCCAAGCTCAGAAAGCCAAAATTAAGGGGGATTTTGACTTGTGGAAAGCCAATATCGAGAAATTCGATGGGAAGGGAGA
TTTTGACTTGTGGAAGGCCAAAATTAAGGCAATTCTTGGCCAGAAGAAAGCTCTACAAGCACTAACTGATCCTTCAAAATTACCTACCACTCTGACAGAAGAACATAAGG
AAACTATAAACACAACAGCCTATGGGACGTTGATTTTAAACTTGAGCAACAGTGTATTAAGACAGGTATTCGATGAAGAGACGCCTTTGAAAATCTGGACTAAACTCAAC
ACACTTTATGAAACCAAGGATGCTCATAACAAGATGTACATGAGGGAGCGTTTCTTTACCTTTAAGATGGATTCAACCAAATCTTTATCAGAAAACTTGGATGAATTCAA
AAGGATGACCTCTGAGTTCAAAAATATGGGAGAAAAGATTAGTGATGAAAATGAGGCCTTTGTCCTCTTGAACTCCTTACCAGACGCCTATAAAGAAGTAAAGACGACAC
TTAAATATGGAAGGGAGAATATCACTACTGAGGTCATAATCTCAGCTATTAGAACCAAAGAACTGGAAATTCAATCCCAGAAGAAAGAATCCAGTGAAGGCTTTTTCGTT
AAAGGTAAAAGAAAAGGCAAAGACAATAAACATCAACCTGATGAAAAGAGCAAAGCTAAGGTTCGATGCAATTATTGTCACAAGGAGGGTCACCTAAAGAGGGATTGTTA
CTCTCTTAAAAGGAAAAATCAAAATCAACAAAGATTCAAGAAGAACAAACCGCAAGAAGCTGCTGTGGGTGAAAATTCAATCACTTATTCAGATGCTTTAGCTACTTCGG
ATCAATGTAGCAATGACCAATCATCATTCGAAAAACATGATTGGGTGATTGATTCAGGTTGCTCCTTCCATATGACTCATTTCAAAGGCTGGTTCAGTATATACTGTGAG
TGGGATGGAGAAATTGTTTACATGGGGAACAACAATACATGTAGAGTCATTGGAATCAGATCTGTTTCACTAAAATTGAAAGATGGTTCTGTCAAGCTGCTGCGCAATGT
AAGACATGTTCCAAATCTTAAAAGGAATCTTATTTCCTTAGGGATGTTGGACTCGATTGGTTGCACCTATGGTGGAAGTAGTGGGATGATTGAAATTAAAAGGGATTTGA
AAATTGTATTGTCTGGCGTGAAAATCAATGGTCTCTATGTTGTAAAGGATATTGAGATGCTAAAGTCAGCACTTGTGGTCTCTGAGAATGGCCCATCAGAGAGTGATCTT
TGGCACAAAAGGTTGTCCCACATCGGTGAGAAAGGGTTACAAGTACTGGCAAGACAAGGTATTCTACCTAAGGAAGCTGGAAACACACTTGGTTTCTATGAACACTGTGT
GATTGGGAAAGCAAAAAGACAGTGCTTCACAAAGTCACAGCAAACCTCAAAGGGGATCCTTGAGTATGTTCACTCAGATTTGTGGGGTCCTGCATCCACTAACTCACTCA
GTGGTTCGAGGTATTTCCTGTCTTTTATTGATGATTTTTCAAGAAAAAGTTGGGTCTATTTTCTGAAATCTAAAGACCAAGCCTTCGAAAAATTTAAAGAATGGAAGACG
ATGATTGAAAAACAAACAAATAGGTCTGTCAAATGCCTTAGAACAGATAACGGACTTGAATATTGCAATGCACTTTTTGATAGTTTTTGTAAAACTTGTGGCATGGTAAG
ACACATGACTGTTAGACATACTCCTCAGCAAAATGGAGTAGCTGAAAGGTTAAACCGAACGATCATGGAAAGGGTGAGATGTCAGCTAACTGATGCAATTCTAGTAGAGA
GATATTGGGCAGAGGCAGCAAGCTATACAGTCTATACTCTGAATAGATGCACACACACCTCAATAAACTTCTTAACTCTCATTTTTAAGTTTAATTCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTAAGGTAGGGTCAGGCAATTCTACACATAAGCCTAGCTGTCGAGCAAACAAACAGTATAGAGAAGCTGGTCAGCCTCATGCCAAGGCCGAGGCCGACCATCCACC
TTCAAAAGGCCAGGGAGCGTCTACTTCTATCCTATTTTGCAGGATGACAACAAAAGCTACACATAAGCCAATCCAAGGCAAGGCGTACGAGGAGCTGACGGGACAACCGG
GAAGAGATAGGACCAAGAAAGGGACCCAGAGGGAGACCATACCGACGGGCCGGGCCAACGTGGGTCGACCCGTACGGTCGGCCTTGGCCCAAGGCCGAGGCCGACCACTC
GGCCCGTTTGCGCGGGCCGAGTCCGTTTGCCTCTGCTCGGCCCCTACCGCTTCCAGCTGCCTCGGTCCAGCCTGCTTCGTCCCAGAACGCCTCCAAACCCTAGGAGTCCG
AGCAGCTGCTCAGTTTCCTAACTTAGGCATCGGAGGCAGTGTGGCCTACACCACACCGGTGTCCAGCGATTCTTGCTGGTCTTGCAGGTCACGTCTTCCCCAGCTTCTAC
AAATTCACTGTTGGTGTCACGTGAAGGGCAGCTCCAACATATGCAATTCATGGGACATTGATATGTCTGTAAGCATCCATAAGAACTTCATCCTGCTTCTCCAAATGCGT
GCACAACGAATTCAGAAGTTAATGAACAACATCCAAGCTCAGAAAGCCAAAATTAAGGGGGATTTTGACTTGTGGAAAGCCAATATCGAGAAATTCGATGGGAAGGGAGA
TTTTGACTTGTGGAAGGCCAAAATTAAGGCAATTCTTGGCCAGAAGAAAGCTCTACAAGCACTAACTGATCCTTCAAAATTACCTACCACTCTGACAGAAGAACATAAGG
AAACTATAAACACAACAGCCTATGGGACGTTGATTTTAAACTTGAGCAACAGTGTATTAAGACAGGTATTCGATGAAGAGACGCCTTTGAAAATCTGGACTAAACTCAAC
ACACTTTATGAAACCAAGGATGCTCATAACAAGATGTACATGAGGGAGCGTTTCTTTACCTTTAAGATGGATTCAACCAAATCTTTATCAGAAAACTTGGATGAATTCAA
AAGGATGACCTCTGAGTTCAAAAATATGGGAGAAAAGATTAGTGATGAAAATGAGGCCTTTGTCCTCTTGAACTCCTTACCAGACGCCTATAAAGAAGTAAAGACGACAC
TTAAATATGGAAGGGAGAATATCACTACTGAGGTCATAATCTCAGCTATTAGAACCAAAGAACTGGAAATTCAATCCCAGAAGAAAGAATCCAGTGAAGGCTTTTTCGTT
AAAGGTAAAAGAAAAGGCAAAGACAATAAACATCAACCTGATGAAAAGAGCAAAGCTAAGGTTCGATGCAATTATTGTCACAAGGAGGGTCACCTAAAGAGGGATTGTTA
CTCTCTTAAAAGGAAAAATCAAAATCAACAAAGATTCAAGAAGAACAAACCGCAAGAAGCTGCTGTGGGTGAAAATTCAATCACTTATTCAGATGCTTTAGCTACTTCGG
ATCAATGTAGCAATGACCAATCATCATTCGAAAAACATGATTGGGTGATTGATTCAGGTTGCTCCTTCCATATGACTCATTTCAAAGGCTGGTTCAGTATATACTGTGAG
TGGGATGGAGAAATTGTTTACATGGGGAACAACAATACATGTAGAGTCATTGGAATCAGATCTGTTTCACTAAAATTGAAAGATGGTTCTGTCAAGCTGCTGCGCAATGT
AAGACATGTTCCAAATCTTAAAAGGAATCTTATTTCCTTAGGGATGTTGGACTCGATTGGTTGCACCTATGGTGGAAGTAGTGGGATGATTGAAATTAAAAGGGATTTGA
AAATTGTATTGTCTGGCGTGAAAATCAATGGTCTCTATGTTGTAAAGGATATTGAGATGCTAAAGTCAGCACTTGTGGTCTCTGAGAATGGCCCATCAGAGAGTGATCTT
TGGCACAAAAGGTTGTCCCACATCGGTGAGAAAGGGTTACAAGTACTGGCAAGACAAGGTATTCTACCTAAGGAAGCTGGAAACACACTTGGTTTCTATGAACACTGTGT
GATTGGGAAAGCAAAAAGACAGTGCTTCACAAAGTCACAGCAAACCTCAAAGGGGATCCTTGAGTATGTTCACTCAGATTTGTGGGGTCCTGCATCCACTAACTCACTCA
GTGGTTCGAGGTATTTCCTGTCTTTTATTGATGATTTTTCAAGAAAAAGTTGGGTCTATTTTCTGAAATCTAAAGACCAAGCCTTCGAAAAATTTAAAGAATGGAAGACG
ATGATTGAAAAACAAACAAATAGGTCTGTCAAATGCCTTAGAACAGATAACGGACTTGAATATTGCAATGCACTTTTTGATAGTTTTTGTAAAACTTGTGGCATGGTAAG
ACACATGACTGTTAGACATACTCCTCAGCAAAATGGAGTAGCTGAAAGGTTAAACCGAACGATCATGGAAAGGGTGAGATGTCAGCTAACTGATGCAATTCTAGTAGAGA
GATATTGGGCAGAGGCAGCAAGCTATACAGTCTATACTCTGAATAGATGCACACACACCTCAATAAACTTCTTAACTCTCATTTTTAAGTTTAATTCGTAA
Protein sequenceShow/hide protein sequence
MLKVGSGNSTHKPSCRANKQYREAGQPHAKAEADHPPSKGQGASTSILFCRMTTKATHKPIQGKAYEELTGQPGRDRTKKGTQRETIPTGRANVGRPVRSALAQGRGRPL
GPFARAESVCLCSAPTASSCLGPACFVPERLQTLGVRAAAQFPNLGIGGSVAYTTPVSSDSCWSCRSRLPQLLQIHCWCHVKGSSNICNSWDIDMSVSIHKNFILLLQMR
AQRIQKLMNNIQAQKAKIKGDFDLWKANIEKFDGKGDFDLWKAKIKAILGQKKALQALTDPSKLPTTLTEEHKETINTTAYGTLILNLSNSVLRQVFDEETPLKIWTKLN
TLYETKDAHNKMYMRERFFTFKMDSTKSLSENLDEFKRMTSEFKNMGEKISDENEAFVLLNSLPDAYKEVKTTLKYGRENITTEVIISAIRTKELEIQSQKKESSEGFFV
KGKRKGKDNKHQPDEKSKAKVRCNYCHKEGHLKRDCYSLKRKNQNQQRFKKNKPQEAAVGENSITYSDALATSDQCSNDQSSFEKHDWVIDSGCSFHMTHFKGWFSIYCE
WDGEIVYMGNNNTCRVIGIRSVSLKLKDGSVKLLRNVRHVPNLKRNLISLGMLDSIGCTYGGSSGMIEIKRDLKIVLSGVKINGLYVVKDIEMLKSALVVSENGPSESDL
WHKRLSHIGEKGLQVLARQGILPKEAGNTLGFYEHCVIGKAKRQCFTKSQQTSKGILEYVHSDLWGPASTNSLSGSRYFLSFIDDFSRKSWVYFLKSKDQAFEKFKEWKT
MIEKQTNRSVKCLRTDNGLEYCNALFDSFCKTCGMVRHMTVRHTPQQNGVAERLNRTIMERVRCQLTDAILVERYWAEAASYTVYTLNRCTHTSINFLTLIFKFNS