CuGenDBv2

Pay0001637 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0001637
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Gag-pol polyprotein
Genome location: chr12:6772738..6775913
RNA-Seq Expression: Pay0001637
Synteny: Pay0001637
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
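
The genome location of this gene is written in a chromosome:start..end notation. As a minimal sketch of how such a string might be split into its parts, the Python snippet below parses it and reports the span length. The function name parse_location is hypothetical, and the 1-based inclusive coordinate convention is an assumption that this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr12:6772738..6775913")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr12 6772738 6775913 3176
```

Under that assumption the genomic span is longer than the CDS listed further down, which would be consistent with the gene containing non-coding sequence.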


Homology
GenBank top hits (e value, % identity, alignment)
KAA0051798.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 0.0e+00, % identity: 73.33)
Query:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI
        MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFE TSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI

Query:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
        TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS

Query:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARK-------------------------------RNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLIL
        IYDQENTVNQSGNEANQDESITLLTKQFSKMARK                               RNSDHGKKKEDVGRSFRCRECDG            
Subjt:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARK-------------------------------RNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLIL

Query:  EDKRQIICSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYG
               CSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYG
Subjt:  EDKRQIICSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYG

Query:  LGFDTSTRGVKIIPEVKFVPASVEETTDPSCKK-------------------RGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRV
        LGFDTSTRGVKIIPEVKFVPASVEETTDPSCKK                   RGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRV
Subjt:  LGFDTSTRGVKIIPEVKFVPASVEETTDPSCKK-------------------RGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRV

Query:  KASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQR
        KASEKCNVAFTTVQTHVDA                                                                          LCLNLQR
Subjt:  KASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQR

Query:  EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEA-------------------------
        EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHH+FV             RNITLQEMARVMIHAKNLPLNFWAEA                         
Subjt:  EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEA-------------------------

Query:  -------------------------------------------NSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKG
                                                   NSRAYRVFNIKSG VMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKG
Subjt:  -------------------------------------------NSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKG

Query:  ESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAEITTRRKEMV
        ESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSA ITTRRKEMV
Subjt:  ESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAEITTRRKEMV

KAA0059847.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 1.7e-267, % identity: 73)
Query:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI
        MIFFIK LDGKAWR +V GYEP MIT+NGVSVPKPEIDWTDAEE+ASVGNARAINA+FNGV+L++FKLINS  TAKEAWKILEVA+EGTSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI

Query:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
        TSKFEALKMTEDETVSEYNERVLEIANDSLLL EKIPESKIV KVLRSLPRKFDMKVTAIEEAQDI TLKLDELFGSLLTFEMAISDRESKKGK IAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS

Query:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARKRNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIICSDIDEDKELTLEELKILRKEDSE
        +YDQENTVNQS   +  D                                                           S+I+ED+ELTLEELKILRKEDSE
Subjt:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARKRNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIICSDIDEDKELTLEELKILRKEDSE

Query:  AKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYGLGFDTSTRGVKIIPEVKFVPASVEETTDPSC
        A+ IQKERIQDLMDENERLMGIISSLKVKLK+VQNVYDQTIKS KMLN GTDSLDSILS GQNGSSKY        R  +++                  
Subjt:  AKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYGLGFDTSTRGVKIIPEVKFVPASVEETTDPSC

Query:  KKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTF
        +K+GHIRSFCYKLLRDRRHQQRPK  N+QN+Y TIKRN+DVRGTHWIWRV  S KCNVAFTTVQTHVDAWYFDSGCSR MTGNRSFFTELEEC   HVTF
Subjt:  KKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTF

Query:  GDGAKGKIIAKGNIDK-------IVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV-------
         DGAKGKIIAKGNIDK       +VVDDYS+FTWV+FL GKSDT KLCISLCLNLQREKGQKIIRIR+DH KEFDNEDLNN CQT GIHHEF        
Subjt:  GDGAKGKIIAKGNIDK-------IVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV-------

Query:  ------RNITLQEMARVMIHAKNLPLNFWAEA----------NSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGE
              +N TLQEMARVMIHAK+LPLNFWAEA          NSRAYRVFNIKS  VMETINVVVNDFESN+NQFNIEDDET+VTPEVTSTPL EMPK E
Subjt:  ------RNITLQEMARVMIHAKNLPLNFWAEA----------NSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGE

KAA0067564.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 2.1e-246, % identity: 62.76)
Query:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI
        MIFFIKTLDGKAWR LV GYEP M+T+N VSVPKPEIDWTDAEEQASVGNARAINAIF GV+L+VFKLINSC TAKEA KIL+VA+EGTSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI

Query:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
        TSKFEALKMTEDE+VSEYNERVLEIAND LLLGEKI ESKIV KVLRSLPRKFDMKV AIEEAQDI TLKLDELFGSLLTFEMA+SD ESKKGKGIAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS

Query:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARK----RNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIICSDIDE----DKELTLEELK
         YDQE TVNQSGNE NQDESI LLTKQFSKMARK      ++  KK ED+                    ++   + ++I S  +E    D+ELTLEELK
Subjt:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARK----RNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIICSDIDE----DKELTLEELK

Query:  ILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYGLGFDTSTRGVKIIPEVKFVPASV
        +LRKEDSEA+ I+KERIQDL+                           IKS KMLNSGT+SLDSIL+ GQNGSSKY LGFDTS+RGVKI PEVKFVPASV
Subjt:  ILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYGLGFDTSTRGVKIIPEVKFVPASV

Query:  EETTDPSCKKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEE
        +ETTDPSCKK                                                       ++  TVQTHVDAWYFDS CSRHMTGNRS FTELEE
Subjt:  EETTDPSCKKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEE

Query:  CALRHVTFGDGAKGKIIAKGNIDK--------------------------IVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGKE
        CA  HVTFGDGAKGKIIAKGNIDK                          +VVDDYS+FTWV+FL GKSDTVKLCISLCLNL  EKGQKIIRI SDHGKE
Subjt:  CALRHVTFGDGAKGKIIAKGNIDK--------------------------IVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGKE

Query:  FDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEANSRAYRVFN---IKSGIVMETI-------------------
        FDNEDLNNFCQT GIH+EF              +N TLQEM RVMIHAKNLPLNFWAEA + A  + N    +SG                         
Subjt:  FDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEANSRAYRVFN---IKSGIVMETI-------------------

Query:  ----------------NVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAE
                        +VVVNDFESNVNQFNIEDDETHVTP+V+ST LDEMPKG+SQ  SAKT+S+ITDEVINNE +LVPSAHVKKNH SSSII DPSA 
Subjt:  ----------------NVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAE

Query:  ITTRRKEMVDY
        ITT+ KE  +Y
Subjt:  ITTRRKEMVDY

TYK21443.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 0.0e+00, % identity: 73.13)
Query:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI
        MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFE TSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI

Query:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
        TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS

Query:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARK-------------------------------RNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLIL
        IYDQENTVNQSGNEANQDESITLLTKQFSKMARK                               RNSDHGKKKEDVGRSFRCRECDG            
Subjt:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARK-------------------------------RNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLIL

Query:  EDKRQIICSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYG
               CSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYG
Subjt:  EDKRQIICSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYG

Query:  LGFDTSTRGVKIIPEVKFVPASVEETTDPSCKK-------------------RGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRV
        LGFDTSTRGVKIIPEVKFVPASVEETTDPSCKK                   RGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRV
Subjt:  LGFDTSTRGVKIIPEVKFVPASVEETTDPSCKK-------------------RGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRV

Query:  KASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQR
        KASEKCNVAFTTVQTHVDA                                                                          LCLNLQR
Subjt:  KASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQR

Query:  EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEA-------------------------
        EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHH+FV             RNITLQEMARVMIHAKNLPLNFWAEA                         
Subjt:  EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEA-------------------------

Query:  -------------------------------------------NSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKG
                                                   NSRAYRVFNIKSG VMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKG
Subjt:  -------------------------------------------NSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKG

Query:  ESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAEITTRRKEMVD
        ESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSA ITTRRKEM++
Subjt:  ESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAEITTRRKEMVD

XP_016903608.1 PREDICTED: uncharacterized protein LOC107992254 [Cucumis melo] (e value: 2.3e-293, % identity: 76.32)
Query:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI
        MIFFIK LDGKAWR +V GYEP MIT+NGVSVPKPEIDWTDAEE+ASVGNARAINA+FNGV+L++FKLINS  TAKEAWKILEVA+EGTSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI

Query:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
        TSKFEALKMTEDETVSEYNERVLEIANDSLLL EKIPESKIV KVLRSLPRKFDMKVTAIEEAQDI TLKLDELFGSLLTFEMAISDRESKKGK IAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS

Query:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARKRNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIIC-----------------------
        +YDQENTVNQSGNEANQDES+ LLTKQFSKMARKRNSDH KKKEDVG SFRCREC+G GHYQAECP  L  +++  C                       
Subjt:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARKRNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIIC-----------------------

Query:  ------------SDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGS
                    S+I+ED+ELTLEELKILRKEDSEA+ IQKERIQDLMDENERLMGIISSLKVKLK+VQNVYDQTIKS KMLN GTDSLDSILS GQNGS
Subjt:  ------------SDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGS

Query:  SKYGLGFDTSTRGVKIIPEVKFVPASVEETTDPSCKKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKASEKCNVAFTTVQT
        SKY        R  +++                  +K+GHIRSFCYKLLRDRRHQQRPK  N+QN+Y TIKRN+DVRGTHWIWRV  S KCNVAFTTVQT
Subjt:  SKYGLGFDTSTRGVKIIPEVKFVPASVEETTDPSCKKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKASEKCNVAFTTVQT

Query:  HVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGK
        HVDAWYFDSGCSR MTGNRSFFTELEEC   HVTF DGAKGKIIAKGNIDK                 KSDT KLCISLCLNLQREKGQKIIRIR+DH K
Subjt:  HVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGK

Query:  EFDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEANSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDET
        EFDNEDLNN CQT GIHHEF              +N TLQEMARVMIHAK+LPLNFWAEANSRAYRVFNIKS  VMETINVVVNDFESN+NQFNIEDDET
Subjt:  EFDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEANSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDET

Query:  HVTPEVTSTPLDEMPKGE
        +VTPEVTSTPL EMPK E
Subjt:  HVTPEVTSTPLDEMPKGE

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E5V5 uncharacterized protein LOC107992254 (e value: 1.1e-293, % identity: 76.32)
Query:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI
        MIFFIK LDGKAWR +V GYEP MIT+NGVSVPKPEIDWTDAEE+ASVGNARAINA+FNGV+L++FKLINS  TAKEAWKILEVA+EGTSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI

Query:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
        TSKFEALKMTEDETVSEYNERVLEIANDSLLL EKIPESKIV KVLRSLPRKFDMKVTAIEEAQDI TLKLDELFGSLLTFEMAISDRESKKGK IAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS

Query:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARKRNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIIC-----------------------
        +YDQENTVNQSGNEANQDES+ LLTKQFSKMARKRNSDH KKKEDVG SFRCREC+G GHYQAECP  L  +++  C                       
Subjt:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARKRNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIIC-----------------------

Query:  ------------SDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGS
                    S+I+ED+ELTLEELKILRKEDSEA+ IQKERIQDLMDENERLMGIISSLKVKLK+VQNVYDQTIKS KMLN GTDSLDSILS GQNGS
Subjt:  ------------SDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGS

Query:  SKYGLGFDTSTRGVKIIPEVKFVPASVEETTDPSCKKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKASEKCNVAFTTVQT
        SKY        R  +++                  +K+GHIRSFCYKLLRDRRHQQRPK  N+QN+Y TIKRN+DVRGTHWIWRV  S KCNVAFTTVQT
Subjt:  SKYGLGFDTSTRGVKIIPEVKFVPASVEETTDPSCKKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKASEKCNVAFTTVQT

Query:  HVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGK
        HVDAWYFDSGCSR MTGNRSFFTELEEC   HVTF DGAKGKIIAKGNIDK                 KSDT KLCISLCLNLQREKGQKIIRIR+DH K
Subjt:  HVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGK

Query:  EFDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEANSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDET
        EFDNEDLNN CQT GIHHEF              +N TLQEMARVMIHAK+LPLNFWAEANSRAYRVFNIKS  VMETINVVVNDFESN+NQFNIEDDET
Subjt:  EFDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEANSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDET

Query:  HVTPEVTSTPLDEMPKGE
        +VTPEVTSTPL EMPK E
Subjt:  HVTPEVTSTPLDEMPKGE

A0A5A7U931 Gag-pol polyprotein (e value: 0.0e+00, % identity: 73.33)
Query:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI
        MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFE TSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI

Query:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
        TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS

Query:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARK-------------------------------RNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLIL
        IYDQENTVNQSGNEANQDESITLLTKQFSKMARK                               RNSDHGKKKEDVGRSFRCRECDG            
Subjt:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARK-------------------------------RNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLIL

Query:  EDKRQIICSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYG
               CSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYG
Subjt:  EDKRQIICSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYG

Query:  LGFDTSTRGVKIIPEVKFVPASVEETTDPSCKK-------------------RGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRV
        LGFDTSTRGVKIIPEVKFVPASVEETTDPSCKK                   RGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRV
Subjt:  LGFDTSTRGVKIIPEVKFVPASVEETTDPSCKK-------------------RGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRV

Query:  KASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQR
        KASEKCNVAFTTVQTHVDA                                                                          LCLNLQR
Subjt:  KASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQR

Query:  EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEA-------------------------
        EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHH+FV             RNITLQEMARVMIHAKNLPLNFWAEA                         
Subjt:  EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEA-------------------------

Query:  -------------------------------------------NSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKG
                                                   NSRAYRVFNIKSG VMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKG
Subjt:  -------------------------------------------NSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKG

Query:  ESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAEITTRRKEMV
        ESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSA ITTRRKEMV
Subjt:  ESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAEITTRRKEMV

A0A5D3BBI3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.0e-246, % identity: 62.76)
Query:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI
        MIFFIKTLDGKAWR LV GYEP M+T+N VSVPKPEIDWTDAEEQASVGNARAINAIF GV+L+VFKLINSC TAKEA KIL+VA+EGTSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI

Query:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
        TSKFEALKMTEDE+VSEYNERVLEIAND LLLGEKI ESKIV KVLRSLPRKFDMKV AIEEAQDI TLKLDELFGSLLTFEMA+SD ESKKGKGIAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS

Query:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARK----RNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIICSDIDE----DKELTLEELK
         YDQE TVNQSGNE NQDESI LLTKQFSKMARK      ++  KK ED+                    ++   + ++I S  +E    D+ELTLEELK
Subjt:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARK----RNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIICSDIDE----DKELTLEELK

Query:  ILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYGLGFDTSTRGVKIIPEVKFVPASV
        +LRKEDSEA+ I+KERIQDL+                           IKS KMLNSGT+SLDSIL+ GQNGSSKY LGFDTS+RGVKI PEVKFVPASV
Subjt:  ILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYGLGFDTSTRGVKIIPEVKFVPASV

Query:  EETTDPSCKKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEE
        +ETTDPSCKK                                                       ++  TVQTHVDAWYFDS CSRHMTGNRS FTELEE
Subjt:  EETTDPSCKKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEE

Query:  CALRHVTFGDGAKGKIIAKGNIDK--------------------------IVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGKE
        CA  HVTFGDGAKGKIIAKGNIDK                          +VVDDYS+FTWV+FL GKSDTVKLCISLCLNL  EKGQKIIRI SDHGKE
Subjt:  CALRHVTFGDGAKGKIIAKGNIDK--------------------------IVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGKE

Query:  FDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEANSRAYRVFN---IKSGIVMETI-------------------
        FDNEDLNNFCQT GIH+EF              +N TLQEM RVMIHAKNLPLNFWAEA + A  + N    +SG                         
Subjt:  FDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEANSRAYRVFN---IKSGIVMETI-------------------

Query:  ----------------NVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAE
                        +VVVNDFESNVNQFNIEDDETHVTP+V+ST LDEMPKG+SQ  SAKT+S+ITDEVINNE +LVPSAHVKKNH SSSII DPSA 
Subjt:  ----------------NVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAE

Query:  ITTRRKEMVDY
        ITT+ KE  +Y
Subjt:  ITTRRKEMVDY

A0A5D3DCZ8 Gag-pol polyprotein (e value: 0.0e+00, % identity: 73.13)
Query:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI
        MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFE TSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI

Query:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
        TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS

Query:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARK-------------------------------RNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLIL
        IYDQENTVNQSGNEANQDESITLLTKQFSKMARK                               RNSDHGKKKEDVGRSFRCRECDG            
Subjt:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARK-------------------------------RNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLIL

Query:  EDKRQIICSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYG
               CSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYG
Subjt:  EDKRQIICSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYG

Query:  LGFDTSTRGVKIIPEVKFVPASVEETTDPSCKK-------------------RGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRV
        LGFDTSTRGVKIIPEVKFVPASVEETTDPSCKK                   RGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRV
Subjt:  LGFDTSTRGVKIIPEVKFVPASVEETTDPSCKK-------------------RGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRV

Query:  KASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQR
        KASEKCNVAFTTVQTHVDA                                                                          LCLNLQR
Subjt:  KASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQR

Query:  EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEA-------------------------
        EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHH+FV             RNITLQEMARVMIHAKNLPLNFWAEA                         
Subjt:  EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV-------------RNITLQEMARVMIHAKNLPLNFWAEA-------------------------

Query:  -------------------------------------------NSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKG
                                                   NSRAYRVFNIKSG VMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKG
Subjt:  -------------------------------------------NSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKG

Query:  ESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAEITTRRKEMVD
        ESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSA ITTRRKEM++
Subjt:  ESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAEITTRRKEMVD

A0A5D3DMG6 Gag-pol polyprotein (e value: 8.1e-268, % identity: 73)
Query:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI
        MIFFIK LDGKAWR +V GYEP MIT+NGVSVPKPEIDWTDAEE+ASVGNARAINA+FNGV+L++FKLINS  TAKEAWKILEVA+EGTSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLI

Query:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS
        TSKFEALKMTEDETVSEYNERVLEIANDSLLL EKIPESKIV KVLRSLPRKFDMKVTAIEEAQDI TLKLDELFGSLLTFEMAISDRESKKGK IAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKS

Query:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARKRNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIICSDIDEDKELTLEELKILRKEDSE
        +YDQENTVNQS   +  D                                                           S+I+ED+ELTLEELKILRKEDSE
Subjt:  IYDQENTVNQSGNEANQDESITLLTKQFSKMARKRNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIICSDIDEDKELTLEELKILRKEDSE

Query:  AKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYGLGFDTSTRGVKIIPEVKFVPASVEETTDPSC
        A+ IQKERIQDLMDENERLMGIISSLKVKLK+VQNVYDQTIKS KMLN GTDSLDSILS GQNGSSKY        R  +++                  
Subjt:  AKTIQKERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYGLGFDTSTRGVKIIPEVKFVPASVEETTDPSC

Query:  KKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTF
        +K+GHIRSFCYKLLRDRRHQQRPK  N+QN+Y TIKRN+DVRGTHWIWRV  S KCNVAFTTVQTHVDAWYFDSGCSR MTGNRSFFTELEEC   HVTF
Subjt:  KKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTF

Query:  GDGAKGKIIAKGNIDK-------IVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV-------
         DGAKGKIIAKGNIDK       +VVDDYS+FTWV+FL GKSDT KLCISLCLNLQREKGQKIIRIR+DH KEFDNEDLNN CQT GIHHEF        
Subjt:  GDGAKGKIIAKGNIDK-------IVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV-------

Query:  ------RNITLQEMARVMIHAKNLPLNFWAEA----------NSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGE
              +N TLQEMARVMIHAK+LPLNFWAEA          NSRAYRVFNIKS  VMETINVVVNDFESN+NQFNIEDDET+VTPEVTSTPL EMPK E
Subjt:  ------RNITLQEMARVMIHAKNLPLNFWAEA----------NSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMPKGE

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.3e-04, % identity: 19.34)
Query:  DWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLITSKFEALKMTEDET----VSEYNERVLEIANDSLLLG
        DW D +E       RA +AI   ++  V   I    TA+  W  LE  +   SK   ++L L   +  AL M+E       ++ +N  + ++AN    LG
Subjt:  DWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLITSKFEALKMTEDET----VSEYNERVLEIANDSLLLG

Query:  EKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKSIYDQENTVNQSGNEANQDESITLLTKQFSKMAR
         KI E      +L SLP  +D   T I   +   T++L ++  +LL                            +N+   +  +++   L+T+   +  +
Subjt:  EKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKSIYDQENTVNQSGNEANQDESITLLTKQFSKMAR

Query:  KRNSDHGK-------KKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIICSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSL
        + ++++G+       K     R   C  C+  GH++ +CP   + K +      D++         +++  D+    I +E     +   E    + ++ 
Subjt:  KRNSDHGK-------KKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIICSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSL

Query:  KVKLKEVQNVYDQTIKSG-KMLNSGTDSLDSILSLGQNGSSKYGLGFDTSTRGVKIIPEVK-----------------FVPASVEETTDPSCKKRGHIRS
              V++++ + +      +  G  S   I  +G +   K  +G     + V+ +P+++                 F       T       +G  R 
Subjt:  KVKLKEVQNVYDQTIKSG-KMLNSGTDSLDSILSLGQNGSSKYGLGFDTSTRGVKIIPEVK-----------------FVPASVEETTDPSCKKRGHIRS

Query:  FCYKL--------LRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKAS--EKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHV
          Y+         L   + +    L +++  +M+ K    +     I   K +  + C+      Q  V    F +   R +      ++++  C    +
Subjt:  FCYKL--------LRDRRHQQRPKLVNQQNKYMTIKRNDDVRGTHWIWRVKAS--EKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHV

Query:  TFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV------------
            G K  +          +DD S+  WV  L  K    ++       ++RE G+K+ R+RSD+G E+ + +   +C + GI HE              
Subjt:  TFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQREKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFV------------

Query:  -RNITLQEMARVMIHAKNLPLNFWAEANSRAYRVFN
          N T+ E  R M+    LP +FW EA   A  + N
Subjt:  -RNITLQEMARVMIHAKNLPLNFWAEANSRAYRVFN

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATATTTTTCATTAAAACCTTAGATGGAAAAGCATGGAGAGCACTTGTTAGTGGTTATGAGCCTTCAATGATCACTATGAATGGAGTATCAGTGCCAAAACCAGAAAT
TGACTGGACAGATGCTGAAGAACAAGCTTCGGTTGGAAATGCAAGAGCCATAAATGCTATCTTCAACGGTGTCAATTTAAGCGTATTCAAACTTATTAATTCCTGCATTA
CTGCTAAAGAGGCGTGGAAAATACTGGAAGTTGCATTTGAAGGAACTTCTAAAGTGAAGATATCCAGACTGCAGTTGATAACTTCAAAATTCGAAGCCTTGAAAATGACT
GAAGATGAGACAGTCTCTGAATACAATGAGAGGGTCCTGGAGATAGCTAATGATTCGTTACTACTTGGTGAAAAGATTCCAGAGTCTAAGATTGTTAGCAAAGTGTTGCG
CTCCTTACCCAGAAAGTTTGACATGAAGGTCACTGCCATAGAAGAAGCCCAAGATATAATGACGTTAAAACTTGACGAACTATTTGGGTCGCTACTTACGTTTGAAATGG
CTATTTCGGATAGAGAAAGTAAGAAAGGTAAGGGGATAGCATTCAAATCAATTTATGATCAAGAGAACACTGTAAATCAGTCTGGTAATGAAGCTAATCAAGATGAGTCA
ATAACTCTCCTAACGAAGCAATTCTCGAAGATGGCCAGAAAAAGAAATAGCGACCATGGAAAGAAAAAAGAGGATGTAGGGAGGTCGTTTAGATGTAGAGAATGTGACGG
GTTTGGTCATTATCAGGCCGAATGCCCACTTATCTTAGAAGACAAAAGACAAATTATTTGTTCCGATATTGATGAAGATAAAGAGCTAACTCTTGAAGAACTCAAAATCC
TGAGGAAGGAAGACTCAGAAGCCAAAACTATTCAAAAAGAAAGAATTCAAGATTTAATGGATGAAAATGAACGTTTGATGGGGATTATATCATCTCTAAAAGTAAAGTTG
AAAGAAGTACAAAATGTGTATGATCAGACAATTAAGTCTGGGAAAATGTTGAATTCTGGAACTGACAGCTTAGACTCAATCCTGAGTTTAGGGCAAAATGGTTCAAGTAA
ATATGGCCTCGGATTTGATACTTCAACTAGGGGTGTTAAGATTATTCCAGAAGTAAAATTTGTTCCAGCCTCAGTAGAAGAAACAACTGACCCCAGTTGTAAAAAAAGAG
GTCATATACGGTCATTCTGCTACAAATTACTGAGAGATAGGAGACATCAGCAAAGACCAAAACTTGTAAACCAGCAAAATAAGTATATGACCATCAAAAGGAATGATGAT
GTAAGGGGAACTCACTGGATCTGGAGGGTGAAGGCTTCTGAGAAGTGCAATGTAGCATTTACAACAGTCCAAACCCATGTTGATGCTTGGTACTTTGACAGTGGATGCTC
AAGACATATGACTGGCAATCGATCCTTCTTTACTGAGTTAGAAGAATGTGCCTTAAGACATGTCACCTTTGGAGATGGGGCCAAAGGAAAAATTATTGCAAAAGGAAACA
TTGACAAAATTGTTGTGGATGACTACTCCAAATTCACCTGGGTTCAGTTCTTAAATGGAAAATCAGATACTGTTAAACTATGTATTAGTCTATGTTTGAACTTGCAACGT
GAGAAGGGGCAAAAGATAATCAGGATTCGTAGTGATCATGGGAAGGAATTTGATAATGAAGATCTGAATAACTTCTGTCAGACTGGAGGAATCCATCATGAATTTGTAAG
AAACATAACGTTACAAGAAATGGCTCGAGTTATGATACATGCCAAAAATTTGCCTTTGAATTTTTGGGCAGAAGCTAATAGTCGAGCGTACAGAGTCTTCAATATTAAAT
CTGGAATAGTCATGGAAACAATCAATGTTGTGGTTAATGATTTTGAGTCTAATGTCAATCAGTTTAATATTGAGGATGATGAGACCCATGTGACACCTGAAGTTACTTCT
ACTCCTCTTGACGAAATGCCTAAAGGTGAATCACAGCTACACAGTGCTAAGACCGATTCAAGCATAACTGATGAGGTCATAAACAATGAAACTATGCTTGTCCCTTCTGC
ACATGTGAAAAAGAATCATCCATCAAGTTCCATAATAAGCGATCCTTCAGCCGAAATTACTACCAGAAGAAAAGAAATGGTAGATTATACGAAAATGATTGCTAATTTTG
TTGGAGTTTATGTCCTAAAAATCATACTTTGTTATTTGATTCGATAA
mRNA sequence
ATGATATTTTTCATTAAAACCTTAGATGGAAAAGCATGGAGAGCACTTGTTAGTGGTTATGAGCCTTCAATGATCACTATGAATGGAGTATCAGTGCCAAAACCAGAAAT
TGACTGGACAGATGCTGAAGAACAAGCTTCGGTTGGAAATGCAAGAGCCATAAATGCTATCTTCAACGGTGTCAATTTAAGCGTATTCAAACTTATTAATTCCTGCATTA
CTGCTAAAGAGGCGTGGAAAATACTGGAAGTTGCATTTGAAGGAACTTCTAAAGTGAAGATATCCAGACTGCAGTTGATAACTTCAAAATTCGAAGCCTTGAAAATGACT
GAAGATGAGACAGTCTCTGAATACAATGAGAGGGTCCTGGAGATAGCTAATGATTCGTTACTACTTGGTGAAAAGATTCCAGAGTCTAAGATTGTTAGCAAAGTGTTGCG
CTCCTTACCCAGAAAGTTTGACATGAAGGTCACTGCCATAGAAGAAGCCCAAGATATAATGACGTTAAAACTTGACGAACTATTTGGGTCGCTACTTACGTTTGAAATGG
CTATTTCGGATAGAGAAAGTAAGAAAGGTAAGGGGATAGCATTCAAATCAATTTATGATCAAGAGAACACTGTAAATCAGTCTGGTAATGAAGCTAATCAAGATGAGTCA
ATAACTCTCCTAACGAAGCAATTCTCGAAGATGGCCAGAAAAAGAAATAGCGACCATGGAAAGAAAAAAGAGGATGTAGGGAGGTCGTTTAGATGTAGAGAATGTGACGG
GTTTGGTCATTATCAGGCCGAATGCCCACTTATCTTAGAAGACAAAAGACAAATTATTTGTTCCGATATTGATGAAGATAAAGAGCTAACTCTTGAAGAACTCAAAATCC
TGAGGAAGGAAGACTCAGAAGCCAAAACTATTCAAAAAGAAAGAATTCAAGATTTAATGGATGAAAATGAACGTTTGATGGGGATTATATCATCTCTAAAAGTAAAGTTG
AAAGAAGTACAAAATGTGTATGATCAGACAATTAAGTCTGGGAAAATGTTGAATTCTGGAACTGACAGCTTAGACTCAATCCTGAGTTTAGGGCAAAATGGTTCAAGTAA
ATATGGCCTCGGATTTGATACTTCAACTAGGGGTGTTAAGATTATTCCAGAAGTAAAATTTGTTCCAGCCTCAGTAGAAGAAACAACTGACCCCAGTTGTAAAAAAAGAG
GTCATATACGGTCATTCTGCTACAAATTACTGAGAGATAGGAGACATCAGCAAAGACCAAAACTTGTAAACCAGCAAAATAAGTATATGACCATCAAAAGGAATGATGAT
GTAAGGGGAACTCACTGGATCTGGAGGGTGAAGGCTTCTGAGAAGTGCAATGTAGCATTTACAACAGTCCAAACCCATGTTGATGCTTGGTACTTTGACAGTGGATGCTC
AAGACATATGACTGGCAATCGATCCTTCTTTACTGAGTTAGAAGAATGTGCCTTAAGACATGTCACCTTTGGAGATGGGGCCAAAGGAAAAATTATTGCAAAAGGAAACA
TTGACAAAATTGTTGTGGATGACTACTCCAAATTCACCTGGGTTCAGTTCTTAAATGGAAAATCAGATACTGTTAAACTATGTATTAGTCTATGTTTGAACTTGCAACGT
GAGAAGGGGCAAAAGATAATCAGGATTCGTAGTGATCATGGGAAGGAATTTGATAATGAAGATCTGAATAACTTCTGTCAGACTGGAGGAATCCATCATGAATTTGTAAG
AAACATAACGTTACAAGAAATGGCTCGAGTTATGATACATGCCAAAAATTTGCCTTTGAATTTTTGGGCAGAAGCTAATAGTCGAGCGTACAGAGTCTTCAATATTAAAT
CTGGAATAGTCATGGAAACAATCAATGTTGTGGTTAATGATTTTGAGTCTAATGTCAATCAGTTTAATATTGAGGATGATGAGACCCATGTGACACCTGAAGTTACTTCT
ACTCCTCTTGACGAAATGCCTAAAGGTGAATCACAGCTACACAGTGCTAAGACCGATTCAAGCATAACTGATGAGGTCATAAACAATGAAACTATGCTTGTCCCTTCTGC
ACATGTGAAAAAGAATCATCCATCAAGTTCCATAATAAGCGATCCTTCAGCCGAAATTACTACCAGAAGAAAAGAAATGGTAGATTATACGAAAATGATTGCTAATTTTG
TTGGAGTTTATGTCCTAAAAATCATACTTTGTTATTTGATTCGATAA
Protein sequence
MIFFIKTLDGKAWRALVSGYEPSMITMNGVSVPKPEIDWTDAEEQASVGNARAINAIFNGVNLSVFKLINSCITAKEAWKILEVAFEGTSKVKISRLQLITSKFEALKMT
EDETVSEYNERVLEIANDSLLLGEKIPESKIVSKVLRSLPRKFDMKVTAIEEAQDIMTLKLDELFGSLLTFEMAISDRESKKGKGIAFKSIYDQENTVNQSGNEANQDES
ITLLTKQFSKMARKRNSDHGKKKEDVGRSFRCRECDGFGHYQAECPLILEDKRQIICSDIDEDKELTLEELKILRKEDSEAKTIQKERIQDLMDENERLMGIISSLKVKL
KEVQNVYDQTIKSGKMLNSGTDSLDSILSLGQNGSSKYGLGFDTSTRGVKIIPEVKFVPASVEETTDPSCKKRGHIRSFCYKLLRDRRHQQRPKLVNQQNKYMTIKRNDD
VRGTHWIWRVKASEKCNVAFTTVQTHVDAWYFDSGCSRHMTGNRSFFTELEECALRHVTFGDGAKGKIIAKGNIDKIVVDDYSKFTWVQFLNGKSDTVKLCISLCLNLQR
EKGQKIIRIRSDHGKEFDNEDLNNFCQTGGIHHEFVRNITLQEMARVMIHAKNLPLNFWAEANSRAYRVFNIKSGIVMETINVVVNDFESNVNQFNIEDDETHVTPEVTS
TPLDEMPKGESQLHSAKTDSSITDEVINNETMLVPSAHVKKNHPSSSIISDPSAEITTRRKEMVDYTKMIANFVGVYVLKIILCYLIR
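
The protein sequence above corresponds to a translation of the CDS sequence. As a minimal sketch of that relationship, the Python snippet below translates a nucleotide string with the standard genetic code and checks it on the first 30 and last 21 nucleotides of the CDS. The function name translate and the use of the standard code table are assumptions for illustration; the page does not say which tool or table produced its protein sequence.

```python
from itertools import product

# Standard genetic code, codons ordered by product("TCAG", repeat=3).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed on this page.
print(translate("ATGATATTTTTCATTAAAACCTTAGATGGA"))  # MIFFIKTLDG
# Last 21 nucleotides of the CDS, ending in the TAA stop codon.
print(translate("CTTTGTTATTTGATTCGATAA"))  # LCYLIR
```

Both fragments agree with the start and end of the protein sequence shown above; the full CDS could be checked the same way after joining its lines into one string.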