; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G011350 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G011350
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr10:9481278..9483344
RNA-Seq ExpressionCmoCh10G011350
SyntenyCmoCh10G011350
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0019538 - protein metabolic process (biological process)
GO:0019684 - photosynthesis, light reaction (biological process)
GO:0009579 - thylakoid (cellular component)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022933017.1 uncharacterized protein LOC111439681 [Cucurbita moschata]0.0e+0088.17Show/hide
Query:  MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE
        MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE
Subjt:  MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE

Query:  KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR
        KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR
Subjt:  KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR

Query:  EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGST
        EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVT                   
Subjt:  EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGST

Query:  HSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG
                     FELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG
Subjt:  HSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG

Query:  TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHD
        TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHD
Subjt:  TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHD

Query:  LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVM
        LLDKGFIRPSV                                                EATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVM
Subjt:  LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVM

Query:  SFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEFGYDKSLS
        SFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEFGYDKSLS
Subjt:  SFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEFGYDKSLS

XP_022933231.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111440131 [Cucurbita moschata]1.3e-24264.62Show/hide
Query:  MWIAAVETTFETMECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFA
        MWIA +ETTFE+M CP+   V CAT+VLQKDAE+WW DNK  +NP GG   WE FKEAFLK YYPK  R+K+QQEF  L QG  TV++Y+++F +L+RFA
Subjt:  MWIAAVETTFETMECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFA

Query:  PSMVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNR
        PS+ DTEEK TEKFVLGL P+ RRMLEAFNPKTYEEALRTAKALE+P +EK+ E  V IG+KRP E    +  PP++R R  +RP        PP     
Subjt:  PSMVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNR

Query:  NPRNQDGARGRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANP---PRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLS
         P  +  A    +  C  CG+ H GRC+AGS  CY CG  GH+A  C   +      P   P + E T Q +P   Q +AY +TS + G S  VVT TLS
Subjt:  NPRNQDGARGRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANP---PRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLS

Query:  ILGHFAFTLFDSGSTHSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKE
        ILGHFA TLFDSGSTHSF+  PF+ QAGF +EPL+H +SV TPAGVDLV++DRV+D QV+I  QT+ VDL VV+MTDFDVILGMDWLAEN A+IDC KKE
Subjt:  ILGHFAFTLFDSGSTHSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKE

Query:  VKFSPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYR
        V F+PP G TFKFKGT+TG TPK++SMMKA+RL+QQGGWA LA AV+ +GKE+ +  +P+VNEF DVFP+DLP IPPSR VDF I+LE GTGPI KAPYR
Subjt:  VKFSPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYR

Query:  MAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPK
        MAPAELKELK QL DLLD                    KD SMRLCI Y+ELNKRT+KNKYPLPRIEDLFDQLR ATVFSKIDLRSGYHQI+I  +D+PK
Subjt:  MAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPK

Query:  TAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEF
        TAFRTRYGHYEFVVMSFG TNAPA+FMELMN+VFKECLD+FVIVFIDDILIYS+TDLKH+EHL K LT +RE+KLYA F+KCEF
Subjt:  TAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEF

XP_022937452.1 uncharacterized protein LOC111443856 [Cucurbita moschata]9.7e-25470.25Show/hide
Query:  MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE
        MEC ENQ VACATFVLQKDAEIWWRDNKTLLNPEGGP+NWERFKEAFLKEYYPKSERLK+QQEFAHLVQGGLT EKYNREFN+LKRFAPSMVDTEEKMTE
Subjt:  MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE

Query:  KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR
        KFVLGL PRIRRMLEAFNPKTYEEALRTAKALEKPKD+KR EE V+IGQKRPHESGG DRPPPA RHRSNNRP PRWDER PPR T+RNPRNQDGARGRR
Subjt:  KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR

Query:  EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGST
        EEGCTICGRLH GRCMAGSRACYRCGQEGHIAVNCTA NA AQAN PRVVE+TDQPAPPRAQARAYASTSKDT RSDAVVT                   
Subjt:  EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGST

Query:  HSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG
                                                                                                            
Subjt:  HSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG

Query:  TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHD
                                       DVRGKE+T VNVPIVN+FPDVFPDDLPRIPPSRAVD+VIEL+PGT PI KAPYRM PAELKELKAQL D
Subjt:  TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHD

Query:  LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVM
        LL+KGFIRPSVSPWG PVLF++KKDGSMRLCIDY+ELNKRTIKN Y L RIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVM
Subjt:  LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVM

Query:  SFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEF
        SFG TNAPA+ MELMN VFKECLDMF+IVFIDDI+IYSRTDL+HEEHL KVLTT+REHKLYAKFSKCEF
Subjt:  SFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEF

XP_022938339.1 uncharacterized protein LOC111444469 [Cucurbita moschata]6.5e-21879.8Show/hide
Query:  MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE
        MECPENQ VACATFVLQKDAEIWWRDNKTL+NPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE
Subjt:  MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE

Query:  KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR
        KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKR EEPV+I QKRPHESGGSDRPPPARRHRSNNRP PRWDER PPRHT+RNPRNQDGARGRR
Subjt:  KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR

Query:  EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGST
        EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNA AQANP RVVE+TDQPAPPRAQARAYASTSKDTGRSD VV                    
Subjt:  EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGST

Query:  HSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG
                                                                    TDFDVILGMDWLAENRASIDC +KEVKFSPPIGPTFKFKG
Subjt:  HSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG

Query:  TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELK
        TNTGITP+VVSMMKAK+LVQQGGWA+LACAVDVRGKEETLVNVPIVNEFPDVFPDDLP IPPSRAVDFVIELEPGTGPI KAPY MAPAELKELK
Subjt:  TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELK

XP_023522446.1 uncharacterized protein LOC111786377 [Cucurbita pepo subsp. pepo]0.0e+0092.36Show/hide
Query:  MWIAAVETTFETMECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFA
        MWIAAVETTFETMECPENQ VACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFA
Subjt:  MWIAAVETTFETMECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFA

Query:  PSMVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNR
        PSMVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKR EEPV+IGQK PHESGGSDRPPP  RHRSNNR  PRWDERHPPR T+R
Subjt:  PSMVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNR

Query:  NPRNQDGARGRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILG
        NPRNQDGARGRREEGCTICGRLH GRCMAGSRACYRCGQEGHIAVNCTAGNA AQANPPRVVE+TDQPAPPRAQARAY STSKDTGRSDAVVT TLSILG
Subjt:  NPRNQDGARGRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILG

Query:  HFAFTLFDSGSTHSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKF
        HF FTLFDS STHSFI MPFVVQAGFELEPLLHEMSVSTP GVDLVSR RVKD QVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDC KK+VKF
Subjt:  HFAFTLFDSGSTHSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKF

Query:  SPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAP
        SPP GP FKFKGT+TGITPKVVSMMKAKRLVQQ GWA+L+C VDVRGKE TLVNVPIVNEFPDVF DDLP IPPSRAV+FVIEL+ GTGPI KAPYRMAP
Subjt:  SPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAP

Query:  AELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAF
        AELKELKAQL DLLDKGFIRPSVSP GAPVLFVKKKDGSMRLCIDY+ELNKR+IKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAF
Subjt:  AELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAF

Query:  RTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEF
        RTRYGHYEFVVMSFG TN+ A+FMELMN+VFKECLDMF+IVFIDDILIYS+T L+HEEHL KVLTT+REHKLYAKFSKCEF
Subjt:  RTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEF

TrEMBL top hitse value%identityAlignment
A0A6J1EYH9 Reverse transcriptase6.3e-24364.62Show/hide
Query:  MWIAAVETTFETMECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFA
        MWIA +ETTFE+M CP+   V CAT+VLQKDAE+WW DNK  +NP GG   WE FKEAFLK YYPK  R+K+QQEF  L QG  TV++Y+++F +L+RFA
Subjt:  MWIAAVETTFETMECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFA

Query:  PSMVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNR
        PS+ DTEEK TEKFVLGL P+ RRMLEAFNPKTYEEALRTAKALE+P +EK+ E  V IG+KRP E    +  PP++R R  +RP        PP     
Subjt:  PSMVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNR

Query:  NPRNQDGARGRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANP---PRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLS
         P  +  A    +  C  CG+ H GRC+AGS  CY CG  GH+A  C   +      P   P + E T Q +P   Q +AY +TS + G S  VVT TLS
Subjt:  NPRNQDGARGRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANP---PRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLS

Query:  ILGHFAFTLFDSGSTHSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKE
        ILGHFA TLFDSGSTHSF+  PF+ QAGF +EPL+H +SV TPAGVDLV++DRV+D QV+I  QT+ VDL VV+MTDFDVILGMDWLAEN A+IDC KKE
Subjt:  ILGHFAFTLFDSGSTHSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKE

Query:  VKFSPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYR
        V F+PP G TFKFKGT+TG TPK++SMMKA+RL+QQGGWA LA AV+ +GKE+ +  +P+VNEF DVFP+DLP IPPSR VDF I+LE GTGPI KAPYR
Subjt:  VKFSPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYR

Query:  MAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPK
        MAPAELKELK QL DLLD                    KD SMRLCI Y+ELNKRT+KNKYPLPRIEDLFDQLR ATVFSKIDLRSGYHQI+I  +D+PK
Subjt:  MAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPK

Query:  TAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEF
        TAFRTRYGHYEFVVMSFG TNAPA+FMELMN+VFKECLD+FVIVFIDDILIYS+TDLKH+EHL K LT +RE+KLYA F+KCEF
Subjt:  TAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEF

A0A6J1EYK0 uncharacterized protein LOC1114396810.0e+0088.17Show/hide
Query:  MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE
        MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE
Subjt:  MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE

Query:  KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR
        KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR
Subjt:  KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR

Query:  EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGST
        EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVT                   
Subjt:  EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGST

Query:  HSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG
                     FELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG
Subjt:  HSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG

Query:  TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHD
        TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHD
Subjt:  TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHD

Query:  LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVM
        LLDKGFIRPSV                                                EATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVM
Subjt:  LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVM

Query:  SFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEFGYDKSLS
        SFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEFGYDKSLS
Subjt:  SFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEFGYDKSLS

A0A6J1FB91 uncharacterized protein LOC1114438564.7e-25470.25Show/hide
Query:  MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE
        MEC ENQ VACATFVLQKDAEIWWRDNKTLLNPEGGP+NWERFKEAFLKEYYPKSERLK+QQEFAHLVQGGLT EKYNREFN+LKRFAPSMVDTEEKMTE
Subjt:  MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE

Query:  KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR
        KFVLGL PRIRRMLEAFNPKTYEEALRTAKALEKPKD+KR EE V+IGQKRPHESGG DRPPPA RHRSNNRP PRWDER PPR T+RNPRNQDGARGRR
Subjt:  KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR

Query:  EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGST
        EEGCTICGRLH GRCMAGSRACYRCGQEGHIAVNCTA NA AQAN PRVVE+TDQPAPPRAQARAYASTSKDT RSDAVVT                   
Subjt:  EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGST

Query:  HSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG
                                                                                                            
Subjt:  HSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG

Query:  TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHD
                                       DVRGKE+T VNVPIVN+FPDVFPDDLPRIPPSRAVD+VIEL+PGT PI KAPYRM PAELKELKAQL D
Subjt:  TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHD

Query:  LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVM
        LL+KGFIRPSVSPWG PVLF++KKDGSMRLCIDY+ELNKRTIKN Y L RIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVM
Subjt:  LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVM

Query:  SFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEF
        SFG TNAPA+ MELMN VFKECLDMF+IVFIDDI+IYSRTDL+HEEHL KVLTT+REHKLYAKFSKCEF
Subjt:  SFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEF

A0A6J1FDT1 uncharacterized protein LOC1114444693.2e-21879.8Show/hide
Query:  MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE
        MECPENQ VACATFVLQKDAEIWWRDNKTL+NPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE
Subjt:  MECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTE

Query:  KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR
        KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKR EEPV+I QKRPHESGGSDRPPPARRHRSNNRP PRWDER PPRHT+RNPRNQDGARGRR
Subjt:  KFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARGRR

Query:  EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGST
        EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNA AQANP RVVE+TDQPAPPRAQARAYASTSKDTGRSD VV                    
Subjt:  EEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGST

Query:  HSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG
                                                                    TDFDVILGMDWLAENRASIDC +KEVKFSPPIGPTFKFKG
Subjt:  HSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKG

Query:  TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELK
        TNTGITP+VVSMMKAK+LVQQGGWA+LACAVDVRGKEETLVNVPIVNEFPDVFPDDLP IPPSRAVDFVIELEPGTGPI KAPY MAPAELKELK
Subjt:  TNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELK

A0A6J1GK52 Reverse transcriptase3.0e-21665.84Show/hide
Query:  GGLTVEKYNREFNKLKRFAPSMVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRS
        G  TV++Y+++F +L+RFAPS+ DT+EK TEKFVLGL  + RR+LEAFNPKTYEEALRTAKALEKP +EK+ E  V   +KRP E   ++  PP +R   
Subjt:  GGLTVEKYNREFNKLKRFAPSMVDTEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRS

Query:  NNRPTPRWDERHPPRHTNRNPRNQDGARGRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANP---PRVVEKTDQPAPPRAQARAY
             PR+  R P      +P          +  C  CG+ H GRC+AGS  CY CG  GH+A  C   +      P   P + E T Q  P   Q +AY
Subjt:  NNRPTPRWDERHPPRHTNRNPRNQDGARGRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANP---PRVVEKTDQPAPPRAQARAY

Query:  ASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGSTHSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVI
         +TSK+ G S  VVT TLSILGHFA TLF+S STHSF+ +PFV QAGF +EPL+H +SV TPAGVDLV+++RVKD QV+I  QT+ +DL VV+M  FD I
Subjt:  ASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGSTHSFIFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVI

Query:  LGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAV
        LGMDWLAEN A+IDC KKEV F+PP   TFKFKGT+TG TPK++SMMKA+RL+QQG  A LACAV+ +GKE+ +  VP+VNEF DVFP+DLP IPPS+ V
Subjt:  LGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKGTNTGITPKVVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAV

Query:  DFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSK
        DF I+LEP TGPI KAPYRMAPAELKELK QL DLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDY+ELNKRT+KNKYPLPRIEDLFDQLR ATVFSK
Subjt:  DFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSK

Query:  IDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSK
        IDLR GYHQI+I  +D+PKTAFRTRYGHYEFVVMSFG TNAPA+FMELMN+VFKECLD FVIVFIDDILIYS+TDL+H+EHL K LT +RE+KLYAKF++
Subjt:  IDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSK

Query:  CEF
        CEF
Subjt:  CEF

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.0e-3635.24Show/hide
Query:  IVNEFPDVFPD-DLPRIP-PSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTI
        I  EF D+  + +  ++P P + ++F +EL      +    Y + P +++ +  +++  L  G IR S +    PV+FV KK+G++R+ +DYK LNK   
Subjt:  IVNEFPDVFPD-DLPRIP-PSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTI

Query:  KNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDL
         N YPLP IE L  +++ +T+F+K+DL+S YH IR+ + D  K AFR   G +E++VM +G + APA F   +N +  E  +  V+ ++DDILI+S+++ 
Subjt:  KNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDL

Query:  KHEEHLLKVLTTIREHKLYAKFSKCEF
        +H +H+  VL  ++   L    +KCEF
Subjt:  KHEEHLLKVLTTIREHKLYAKFSKCEF

P0CT35 Transposon Tf2-2 polyprotein2.0e-3635.24Show/hide
Query:  IVNEFPDVFPD-DLPRIP-PSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTI
        I  EF D+  + +  ++P P + ++F +EL      +    Y + P +++ +  +++  L  G IR S +    PV+FV KK+G++R+ +DYK LNK   
Subjt:  IVNEFPDVFPD-DLPRIP-PSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTI

Query:  KNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDL
         N YPLP IE L  +++ +T+F+K+DL+S YH IR+ + D  K AFR   G +E++VM +G + APA F   +N +  E  +  V+ ++DDILI+S+++ 
Subjt:  KNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDL

Query:  KHEEHLLKVLTTIREHKLYAKFSKCEF
        +H +H+  VL  ++   L    +KCEF
Subjt:  KHEEHLLKVLTTIREHKLYAKFSKCEF

P0CT41 Transposon Tf2-12 polyprotein2.0e-3635.24Show/hide
Query:  IVNEFPDVFPD-DLPRIP-PSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTI
        I  EF D+  + +  ++P P + ++F +EL      +    Y + P +++ +  +++  L  G IR S +    PV+FV KK+G++R+ +DYK LNK   
Subjt:  IVNEFPDVFPD-DLPRIP-PSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTI

Query:  KNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDL
         N YPLP IE L  +++ +T+F+K+DL+S YH IR+ + D  K AFR   G +E++VM +G + APA F   +N +  E  +  V+ ++DDILI+S+++ 
Subjt:  KNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDL

Query:  KHEEHLLKVLTTIREHKLYAKFSKCEF
        +H +H+  VL  ++   L    +KCEF
Subjt:  KHEEHLLKVLTTIREHKLYAKFSKCEF

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.5e-4239.09Show/hide
Query:  EETLVNVPI--VNEFPDVFPDDLPRIPP---SRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLC
        ++T   +P+    ++ ++  +DLP  P    +  V   IE++PG       PY +     +E+   +  LLD  FI PS SP  +PV+ V KKDG+ RLC
Subjt:  EETLVNVPI--VNEFPDVFPDDLPRIPP---SRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLC

Query:  IDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFI
        +DY+ LNK TI + +PLPRI++L  ++  A +F+ +DL SGYHQI +  KD  KTAF T  G YE+ VM FG  NAP+ F   M   F++    FV V++
Subjt:  IDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFI

Query:  DDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEFGYDKS
        DDILI+S +  +H +HL  VL  ++   L  K  KC+F  +++
Subjt:  DDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEFGYDKS

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.1e-3725.1Show/hide
Query:  MNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTEKFVLGLEPRIRRMLEAFNPKT----YEEALRTAKALE
        + ++ F +   + +Y   +  K       L +  L +E+ N+ F K+    P    TE+     +   L      ++    P+T     EEA +T    E
Subjt:  MNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVDTEEKMTEKFVLGLEPRIRRMLEAFNPKT----YEEALRTAKALE

Query:  K--PKDEKRHEEPVVIG-QKRPHESGGSDRPPPARRHRSNNRPTPRWDERH-PPRHTNRNPRNQDGARGRREEGCTICGRLHGGRCMAGSRACYRCGQEG
        +  P  E   +   +IG      E   SD        ++    T R    +  P   +RN RN + +R    E C              +R C+ C +EG
Subjt:  K--PKDEKRHEEPVVIG-QKRPHESGGSDRPPPARRHRSNNRPTPRWDERH-PPRHTNRNPRNQDGARGRREEGCTICGRLHGGRCMAGSRACYRCGQEG

Query:  HIAVNCTAGNATAQANPPRVVEKTDQPAP--PRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGSTHSFIFMPFVVQAGFEL---EPLLHEMS
        H    C A  A         +E  DQ  P         Y +  +     D     T+ I      TLFDSGS  SFI    V    +E+    PL     
Subjt:  HIAVNCTAGNATAQANPPRVVEKTDQPAP--PRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGSTHSFIFMPFVVQAGFEL---EPLLHEMS

Query:  VSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILG----------MDWLAENRASIDCRKKEVKFSPPIGPTFKFKGTNTG----------
        V+T +    V+ + V  D + I +  +++   +++  D+ +++G          +  +   R S D  K +   S  +     +   N G          
Subjt:  VSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILG----------MDWLAENRASIDCRKKEVKFSPPIGPTFKFKGTNTG----------

Query:  ---------------------------ITPKVVSMM--------------------KAKRLVQQGGWAVLACAVDV-------RGKEETLVNVPI--VNE
                                    TP  +  +                    +A  L + G ++ +   +            ++T   +P+    +
Subjt:  ---------------------------ITPKVVSMM--------------------KAKRLVQQGGWAVLACAVDV-------RGKEETLVNVPI--VNE

Query:  FPDVFPDDLPRIPP---SRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNK
        + ++  +DLP  P    +  V   IE++PG       PY +     +E+   +  LLD  FI PS SP  +PV+ V KKDG+ RLC+DY+ LNK TI + 
Subjt:  FPDVFPDDLPRIPP---SRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYKELNKRTIKNK

Query:  YPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHE
        +PLPRI++L  ++  A +F+ +DL SGYHQI +  KD  KTAF T  G YE+ VM FG  NAP+ F   M   F++    FV V++DDILI+S +  +H 
Subjt:  YPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQVFKECLDMFVIVFIDDILIYSRTDLKHE

Query:  EHLLKVLTTIREHKLYAKFSKCEFGYDKS
        +HL  VL  ++   L  K  KC+F  +++
Subjt:  EHLLKVLTTIREHKLYAKFSKCEFGYDKS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGATTGCAGCGGTGGAAACCACCTTTGAGACTATGGAGTGCCCAGAGAACCAAAATGTCGCCTGTGCAACCTTCGTCCTACAAAAGGACGCAGAGATATGG
TGGAGAGATAACAAAACCCTCCTTAACCCAGAAGGGGGACCAATGAACTGGGAACGATTTAAGGAGGCCTTCCTTAAAGAATATTATCCTAAGTCAGAGCGACTT
AAAAGGCAGCAGGAGTTCGCCCACCTAGTACAGGGAGGACTCACAGTGGAGAAGTACAACAGAGAGTTTAATAAACTCAAGAGATTTGCACCGTCCATGGTGGAC
ACTGAGGAAAAGATGACAGAGAAATTTGTATTGGGTTTGGAACCAAGAATCCGCCGCATGTTGGAGGCATTCAACCCAAAGACCTATGAGGAAGCCTTGAGAACT
GCCAAGGCTTTAGAGAAACCAAAGGATGAGAAACGACATGAAGAGCCAGTTGTAATTGGGCAGAAGCGTCCTCATGAATCAGGAGGCTCTGACCGTCCACCACCA
GCACGTAGGCACCGTTCCAATAACAGACCCACTCCTAGATGGGATGAGCGACACCCTCCCCGACATACTAATAGGAACCCCAGGAATCAAGATGGGGCCAGAGGG
AGGAGAGAGGAAGGGTGCACTATCTGTGGAAGACTACACGGTGGGAGGTGCATGGCTGGCAGCCGAGCGTGTTATAGATGTGGTCAAGAGGGGCACATCGCTGTG
AACTGCACGGCCGGAAATGCTACAGCACAAGCAAACCCGCCCAGAGTAGTAGAGAAAACGGATCAACCAGCACCACCGCGAGCTCAAGCTAGGGCATACGCGTCA
ACCAGCAAGGACACTGGGAGGTCCGACGCCGTGGTGACAAGTACACTATCCATTTTAGGTCATTTCGCTTTTACCTTATTTGATTCTGGTTCCACGCATTCCTTT
ATTTTCATGCCTTTTGTTGTACAAGCAGGGTTCGAATTAGAACCCTTATTACATGAAATGTCTGTAAGCACCCCTGCGGGGGTAGACTTAGTATCTAGGGATAGA
GTAAAGGATGACCAAGTAATCATAGGGAACCAAACTTTAAGCGTTGACCTGATGGTGGTAAACATGACAGATTTCGACGTCATACTAGGCATGGATTGGTTAGCT
GAAAATCGAGCTAGTATAGACTGTCGCAAAAAGGAAGTAAAATTTTCACCACCGATAGGACCTACCTTTAAATTTAAAGGCACAAATACCGGGATTACCCCCAAG
GTAGTCTCGATGATGAAAGCAAAGAGGTTAGTCCAACAAGGTGGATGGGCTGTATTAGCATGTGCTGTAGACGTGAGAGGAAAGGAAGAGACCCTAGTAAACGTG
CCAATAGTAAACGAGTTCCCGGATGTATTTCCGGATGACTTACCTAGAATACCCCCTTCCCGAGCGGTCGACTTCGTCATCGAACTCGAGCCGGGAACTGGGCCT
ATTTTCAAAGCACCCTATCGCATGGCGCCAGCAGAGTTGAAGGAACTCAAGGCGCAACTGCATGACTTACTAGATAAAGGATTCATTCGACCTAGCGTGTCCCCC
TGGGGTGCGCCAGTGTTGTTTGTTAAGAAGAAAGATGGATCGATGCGTCTGTGCATCGATTACAAAGAGCTAAACAAGAGAACCATAAAAAACAAATATCCTCTG
CCTAGAATAGAAGACTTGTTTGATCAACTCAGAGAAGCAACAGTATTCTCTAAGATAGATCTTCGGTCCGGTTACCACCAAATTAGGATTAATGAAAAAGACGTA
CCAAAAACAGCGTTTAGGACAAGGTACGGTCACTACGAGTTTGTGGTGATGTCATTTGGCTTCACCAACGCCCCAGCTATATTTATGGAGTTAATGAACCAGGTA
TTCAAAGAATGCCTAGACATGTTTGTGATTGTGTTCATTGACGACATCCTCATATACTCGAGAACCGACCTAAAGCACGAGGAACACCTCTTAAAAGTCCTAACC
ACCATAAGAGAGCACAAGTTGTACGCCAAGTTCTCCAAATGCGAATTTGGTTACGACAAGTCTCTTTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGATTGCAGCGGTGGAAACCACCTTTGAGACTATGGAGTGCCCAGAGAACCAAAATGTCGCCTGTGCAACCTTCGTCCTACAAAAGGACGCAGAGATATGG
TGGAGAGATAACAAAACCCTCCTTAACCCAGAAGGGGGACCAATGAACTGGGAACGATTTAAGGAGGCCTTCCTTAAAGAATATTATCCTAAGTCAGAGCGACTT
AAAAGGCAGCAGGAGTTCGCCCACCTAGTACAGGGAGGACTCACAGTGGAGAAGTACAACAGAGAGTTTAATAAACTCAAGAGATTTGCACCGTCCATGGTGGAC
ACTGAGGAAAAGATGACAGAGAAATTTGTATTGGGTTTGGAACCAAGAATCCGCCGCATGTTGGAGGCATTCAACCCAAAGACCTATGAGGAAGCCTTGAGAACT
GCCAAGGCTTTAGAGAAACCAAAGGATGAGAAACGACATGAAGAGCCAGTTGTAATTGGGCAGAAGCGTCCTCATGAATCAGGAGGCTCTGACCGTCCACCACCA
GCACGTAGGCACCGTTCCAATAACAGACCCACTCCTAGATGGGATGAGCGACACCCTCCCCGACATACTAATAGGAACCCCAGGAATCAAGATGGGGCCAGAGGG
AGGAGAGAGGAAGGGTGCACTATCTGTGGAAGACTACACGGTGGGAGGTGCATGGCTGGCAGCCGAGCGTGTTATAGATGTGGTCAAGAGGGGCACATCGCTGTG
AACTGCACGGCCGGAAATGCTACAGCACAAGCAAACCCGCCCAGAGTAGTAGAGAAAACGGATCAACCAGCACCACCGCGAGCTCAAGCTAGGGCATACGCGTCA
ACCAGCAAGGACACTGGGAGGTCCGACGCCGTGGTGACAAGTACACTATCCATTTTAGGTCATTTCGCTTTTACCTTATTTGATTCTGGTTCCACGCATTCCTTT
ATTTTCATGCCTTTTGTTGTACAAGCAGGGTTCGAATTAGAACCCTTATTACATGAAATGTCTGTAAGCACCCCTGCGGGGGTAGACTTAGTATCTAGGGATAGA
GTAAAGGATGACCAAGTAATCATAGGGAACCAAACTTTAAGCGTTGACCTGATGGTGGTAAACATGACAGATTTCGACGTCATACTAGGCATGGATTGGTTAGCT
GAAAATCGAGCTAGTATAGACTGTCGCAAAAAGGAAGTAAAATTTTCACCACCGATAGGACCTACCTTTAAATTTAAAGGCACAAATACCGGGATTACCCCCAAG
GTAGTCTCGATGATGAAAGCAAAGAGGTTAGTCCAACAAGGTGGATGGGCTGTATTAGCATGTGCTGTAGACGTGAGAGGAAAGGAAGAGACCCTAGTAAACGTG
CCAATAGTAAACGAGTTCCCGGATGTATTTCCGGATGACTTACCTAGAATACCCCCTTCCCGAGCGGTCGACTTCGTCATCGAACTCGAGCCGGGAACTGGGCCT
ATTTTCAAAGCACCCTATCGCATGGCGCCAGCAGAGTTGAAGGAACTCAAGGCGCAACTGCATGACTTACTAGATAAAGGATTCATTCGACCTAGCGTGTCCCCC
TGGGGTGCGCCAGTGTTGTTTGTTAAGAAGAAAGATGGATCGATGCGTCTGTGCATCGATTACAAAGAGCTAAACAAGAGAACCATAAAAAACAAATATCCTCTG
CCTAGAATAGAAGACTTGTTTGATCAACTCAGAGAAGCAACAGTATTCTCTAAGATAGATCTTCGGTCCGGTTACCACCAAATTAGGATTAATGAAAAAGACGTA
CCAAAAACAGCGTTTAGGACAAGGTACGGTCACTACGAGTTTGTGGTGATGTCATTTGGCTTCACCAACGCCCCAGCTATATTTATGGAGTTAATGAACCAGGTA
TTCAAAGAATGCCTAGACATGTTTGTGATTGTGTTCATTGACGACATCCTCATATACTCGAGAACCGACCTAAAGCACGAGGAACACCTCTTAAAAGTCCTAACC
ACCATAAGAGAGCACAAGTTGTACGCCAAGTTCTCCAAATGCGAATTTGGTTACGACAAGTCTCTTTCCTAG
Protein sequenceShow/hide protein sequence
MWIAAVETTFETMECPENQNVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKRQQEFAHLVQGGLTVEKYNREFNKLKRFAPSMVD
TEEKMTEKFVLGLEPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRHEEPVVIGQKRPHESGGSDRPPPARRHRSNNRPTPRWDERHPPRHTNRNPRNQDGARG
RREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNATAQANPPRVVEKTDQPAPPRAQARAYASTSKDTGRSDAVVTSTLSILGHFAFTLFDSGSTHSF
IFMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDDQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCRKKEVKFSPPIGPTFKFKGTNTGITPK
VVSMMKAKRLVQQGGWAVLACAVDVRGKEETLVNVPIVNEFPDVFPDDLPRIPPSRAVDFVIELEPGTGPIFKAPYRMAPAELKELKAQLHDLLDKGFIRPSVSP
WGAPVLFVKKKDGSMRLCIDYKELNKRTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGFTNAPAIFMELMNQV
FKECLDMFVIVFIDDILIYSRTDLKHEEHLLKVLTTIREHKLYAKFSKCEFGYDKSLS