; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004160 (gene) of Snake gourd v1 genome

Gene IDTan0004160
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG11:37209436..37213138
RNA-Seq ExpressionTan0004160
SyntenyTan0004160
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023520282.1 uncharacterized protein LOC111783592 [Cucurbita pepo subsp. pepo]1.8e-12076.11Show/hide
Query:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE
        EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D              EDP ++SLV RRAL+T IKE
Subjt:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE

Query:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP
        D LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+I VK+LNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKYVDDVLCDVVSMH GDLLLGRP
Subjt:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP

Query:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSL
        WQFDRRV+YDG+ANRYSFT+NGRKTTLVPLSPKDVFID CKLEKKRQE DAKA+     IEKE  EK SLSEKQES+ +PR K E KAK +SL
Subjt:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSL

XP_023520835.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111784339 [Cucurbita pepo subsp. pepo]3.4e-14364.99Show/hide
Query:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE
        EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D              EDP ++SLV RRAL+T IKE
Subjt:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE

Query:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP
        D LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+I VK+LNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKYVDDVLCDVVSMH GDLLLGRP
Subjt:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP

Query:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSL-------
        WQFDRRV+YDG+ANRYSFT+NGRKTTLVPLSPKDVFID CKLEKKRQE DAKA+     IEKE  EK SLSEKQES+ +PR K E KAK +SL       
Subjt:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSL-------

Query:  ---------LLDPIVKG--YLT-----------------FFDI-------------GIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVR
                 +L  + KG  Y T                 F D+             GI    DFI    +          KE EEIQRQVSELLAKGYVR
Subjt:  ---------LLDPIVKG--YLT-----------------FFDI-------------GIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVR

Query:  ESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITINFK
        ESLSPCS+PVI VPKKDGSW MC+DCRAINKITI ++
Subjt:  ESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITINFK

XP_023521183.1 uncharacterized protein LOC111784872 [Cucurbita pepo subsp. pepo]3.4e-14364.99Show/hide
Query:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE
        EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D              EDP ++SLV RRAL+T IKE
Subjt:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE

Query:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP
        D LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+I VK+LNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKYVDDVLCDVVSMH GDLLLGRP
Subjt:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP

Query:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSL-------
        WQFDRRV+YDG+ANRYSFT+NGRKTTLVPLSPKDVFID CKLEKKRQE DAKA+     IEKE  EK SLSEKQES+ +PR K E KAK +SL       
Subjt:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSL-------

Query:  ---------LLDPIVKG--YLT-----------------FFDI-------------GIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVR
                 +L  + KG  Y T                 F D+             GI    DFI    +          KE EEIQRQVSELLAKGYVR
Subjt:  ---------LLDPIVKG--YLT-----------------FFDI-------------GIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVR

Query:  ESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITINFK
        ESLSPCS+PVI VPKKDGSW MC+DCRAINKITI ++
Subjt:  ESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITINFK

XP_023530046.1 uncharacterized protein LOC111792716 [Cucurbita pepo subsp. pepo]7.2e-14169.41Show/hide
Query:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE
        EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D              EDP ++SLV RRAL+T IKE
Subjt:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE

Query:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP
        D LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+I VK+LNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKYVDDVLCDVVSMH GDLLLGRP
Subjt:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP

Query:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLLDPIVK
        WQFDRRV+YDG+ANRYSFT+NGRKTTLVPLSPKDVFID CKLEKKRQE DAKA+     IEKE  EK SLSEKQE         +  ++ M   L P+  
Subjt:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLLDPIVK

Query:  GYLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITINFK
                GI    DFI    +          KE EEIQRQVSELLAKGYVRESLSPCS+PVI VPKKDGSW MC+DCRAINKITI ++
Subjt:  GYLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITINFK

XP_023553652.1 uncharacterized protein LOC111811140 [Cucurbita pepo subsp. pepo]3.4e-14364.99Show/hide
Query:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE
        EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D              EDP ++SLV RRAL+T IKE
Subjt:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE

Query:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP
        D LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+I VK+LNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKYVDDVLCDVVSMH GDLLLGRP
Subjt:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP

Query:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSL-------
        WQFDRRV+YDG+ANRYSFT+NGRKTTLVPLSPKDVFID CKLEKKRQE DAKA+     IEKE  EK SLSEKQES+ +PR K E KAK +SL       
Subjt:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSL-------

Query:  ---------LLDPIVKG--YLT-----------------FFDI-------------GIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVR
                 +L  + KG  Y T                 F D+             GI    DFI    +          KE EEIQRQVSELLAKGYVR
Subjt:  ---------LLDPIVKG--YLT-----------------FFDI-------------GIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVR

Query:  ESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITINFK
        ESLSPCS+PVI VPKKDGSW MC+DCRAINKITI ++
Subjt:  ESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITINFK

TrEMBL top hitse value%identityAlignment
A0A6J1EQJ1 uncharacterized protein LOC1114365308.1e-9855.08Show/hide
Query:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE
        EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D              EDP ++SLV RRAL+T IKE
Subjt:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE

Query:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP
        D LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+I VK+LNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKYVDDVLCDVVSMH GDLLLGRP
Subjt:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP

Query:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLLDPIVK
        WQFDRR                                                            +   SE+  SS  P    E K   +     P   
Subjt:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLLDPIVK

Query:  GYLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHM
         Y T                            KE EEIQRQVSELLAKGYVRESLSPCS+PVI VPKKDGSW M
Subjt:  GYLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHM

A0A6J1EVV9 uncharacterized protein LOC1114364631.5e-9654.81Show/hide
Query:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE
        EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+ QGVGHYSRDCPN RIMTI+EGEIVTDDE  D              EDP ++SLV RRAL+T IKE
Subjt:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE

Query:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP
        D LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+I VK+LNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKYVDDVLCDVVSMH GDLLLGRP
Subjt:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP

Query:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLLDPIVK
        WQFDRR                                                            +   SE+  SS  P    E K   +     P   
Subjt:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLLDPIVK

Query:  GYLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHM
         Y T                            KE EEIQRQVSELLAKGYVRESLSPCS+PVI VPKKDGSW M
Subjt:  GYLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHM

A0A6J1G2Q3 uncharacterized protein LOC1114502868.1e-9855.08Show/hide
Query:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE
        EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D              EDP ++SLV RRAL+T IKE
Subjt:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE

Query:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP
        D LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+I VK+LNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKYVDDVLCDVVSMH GDLLLGRP
Subjt:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP

Query:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLLDPIVK
        WQFDRR                                                            +   SE+  SS  P    E K   +     P   
Subjt:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLLDPIVK

Query:  GYLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHM
         Y T                            KE EEIQRQVSELLAKGYVRESLSPCS+PVI VPKKDGSW M
Subjt:  GYLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHM

A0A6J1I622 LOW QUALITY PROTEIN: uncharacterized protein LOC1114699476.2e-9855.35Show/hide
Query:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE
        EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D              EDP ++SLV R AL+T IKE
Subjt:  EKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKE

Query:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP
        D LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+I VK+LNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKYVDDVLCDVVSMH GDLLLGRP
Subjt:  DSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRP

Query:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLLDPIVK
        WQFDRR                          +D+F                                  SE++ SS  P    E K   +     P   
Subjt:  WQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLLDPIVK

Query:  GYLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHM
         Y T                            KE EEIQRQVSELLAKGYVRESLSPCS+PVI VPKKDGSW M
Subjt:  GYLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHM

A0A6J1I8S0 uncharacterized protein LOC1114724892.8e-9855.5Show/hide
Query:  KPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKED
        KP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D              EDP ++SLV RRAL+T IKED
Subjt:  KPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEED--------------EDPANLSLVARRALSTQIKED

Query:  SLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRPW
         LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+I VK+LNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKYVDDVLCDVVSMH GDLLLGRPW
Subjt:  SLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLCDVVSMHAGDLLLGRPW

Query:  QFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLLDPIVKG
        QFDRR                          +D+F                                  SE++ SS  P    E K   +     P    
Subjt:  QFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLLDPIVKG

Query:  YLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHM
        Y T                            KE EEIQRQVSELLAKGYVRESLSPCS+PVI VPKKDGSW M
Subjt:  YLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHM

SwissProt top hitse value%identityAlignment
O93209 Pro-Pol polyprotein6.5e-0437.5Show/hide
Query:  LTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHMCIDCRAINKIT
        +  K   +IQ  +++LL +G + +  S  + PV  VPK +G W M +D RA+NK+T
Subjt:  LTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHMCIDCRAINKIT

P10266 Endogenous retrovirus group K member 10 Pol protein7.7e-0536.36Show/hide
Query:  LTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITINFKLLQVGL
        L ++++E +    +E L KG++  S SP + PV  + KK G WH   D RA+N +      LQ GL
Subjt:  LTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITINFKLLQVGL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein5.2e-0948.28Show/hide
Query:  LTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITIN
        +TEK  +EI + V +LL   ++  S SPCS PV+ VPKKDG++ +C+D R +NK TI+
Subjt:  LTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITIN

Q99315 Transposon Ty3-G Gag-Pol polyprotein5.2e-0948.28Show/hide
Query:  LTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITIN
        +TEK  +EI + V +LL   ++  S SPCS PV+ VPKKDG++ +C+D R +NK TI+
Subjt:  LTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITIN

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGAATGATTCGTGGAATAGAAGAGTTATCGGAGCGTATACGGAGGTTGGAGGTTCAAAACCACGATGACCATGACAACGAATATGAGGGAGGAAGTTACGATCA
ACTAGAAGATGACCAAGTTACATTAATAGCAATGTATGAGAAACCTAAAGCGAATGTAGAGAAAGGGGAGAGTTCAAAAAAGGGGAAAGAGAAGATAGATGAATCTAATG
TGCGAAATAGGGATTTGAAATGCTGGAAATGTCAAGGGGTAGGTCACTATAGTAGAGATTGCCCTAATAGGAGAATTATGACCATTAGAGAGGGAGAGATTGTGACTGAT
GATGAAGAGGAAGATGAGGATCCTGCAAACTTGTCCTTAGTTGCTAGGAGAGCTTTAAGCACCCAAATTAAGGAGGATAGTCTAGACCAAAGAGAGAACTTGTTTCACAC
TAGGTGCCTTATTCAATCTATACCTTGTAGTGTGGTCATTGATAGCGGTAGTTGCACCAATGTTGTGAGTACAATTCCGGTCAAGAAGCTTAATTTAGAGACCAAACCAC
ATCCTAGACCATATAAACTTCAATGGTTGAATGATTGTGTGCAAGTAAGGGTGAGTAAGCAAGCTCTTGTTTCTTTTACCATTGGAAAGTATGTTGATGATGTTTTGTGT
GATGTTGTATCTATGCATGCTGGAGATTTATTGTTGGGGAGGCCTTGGCAATTTGATCGTAGGGTAGTATATGATGGGTTTGCAAATCGTTACTCTTTTACTTATAATGG
TAGAAAAACTACTCTTGTTCCATTGTCTCCAAAAGATGTATTTATTGATCAATGCAAACTTGAAAAGAAAAGGCAAGAGGTTGATGCAAAAGCAAAAAGTGAAAAAGAAA
TAATAGAAAAAGAATTGAGAGAAAAGAAGAGTTTGAGTGAAAAGCAAGAGAGTAGCAATCGGCCTAGAGGAAAAAATGAGGGAAAAGCCAAAATAATGAGTTTGTTGTTA
GATCCAATTGTCAAGGGTTACTTAACATTTTTTGACATCGGAATCTTTGGCTTCTTTGATTTCATTTTGATTTGGGGTTTGGGGGTTTTAGATCTAAAGGTCTTGACTGA
AAAGGAGGTCGAAGAGATACAAAGGCAAGTGAGTGAACTCCTTGCTAAAGGGTATGTGCGTGAGAGTTTGAGTCCATGTTCTATTCCAGTTATTCGTGTGCCTAAGAAAG
ATGGTTCATGGCATATGTGTATTGATTGTAGGGCTATAAACAAGATAACTATTAACTTCAAGCTACTTCAAGTGGGGCTTCATTTATTCATGGATGATTACAAAGTATTG
ATAGGCTTGATAGAAGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGAATGATTCGTGGAATAGAAGAGTTATCGGAGCGTATACGGAGGTTGGAGGTTCAAAACCACGATGACCATGACAACGAATATGAGGGAGGAAGTTACGATCA
ACTAGAAGATGACCAAGTTACATTAATAGCAATGTATGAGAAACCTAAAGCGAATGTAGAGAAAGGGGAGAGTTCAAAAAAGGGGAAAGAGAAGATAGATGAATCTAATG
TGCGAAATAGGGATTTGAAATGCTGGAAATGTCAAGGGGTAGGTCACTATAGTAGAGATTGCCCTAATAGGAGAATTATGACCATTAGAGAGGGAGAGATTGTGACTGAT
GATGAAGAGGAAGATGAGGATCCTGCAAACTTGTCCTTAGTTGCTAGGAGAGCTTTAAGCACCCAAATTAAGGAGGATAGTCTAGACCAAAGAGAGAACTTGTTTCACAC
TAGGTGCCTTATTCAATCTATACCTTGTAGTGTGGTCATTGATAGCGGTAGTTGCACCAATGTTGTGAGTACAATTCCGGTCAAGAAGCTTAATTTAGAGACCAAACCAC
ATCCTAGACCATATAAACTTCAATGGTTGAATGATTGTGTGCAAGTAAGGGTGAGTAAGCAAGCTCTTGTTTCTTTTACCATTGGAAAGTATGTTGATGATGTTTTGTGT
GATGTTGTATCTATGCATGCTGGAGATTTATTGTTGGGGAGGCCTTGGCAATTTGATCGTAGGGTAGTATATGATGGGTTTGCAAATCGTTACTCTTTTACTTATAATGG
TAGAAAAACTACTCTTGTTCCATTGTCTCCAAAAGATGTATTTATTGATCAATGCAAACTTGAAAAGAAAAGGCAAGAGGTTGATGCAAAAGCAAAAAGTGAAAAAGAAA
TAATAGAAAAAGAATTGAGAGAAAAGAAGAGTTTGAGTGAAAAGCAAGAGAGTAGCAATCGGCCTAGAGGAAAAAATGAGGGAAAAGCCAAAATAATGAGTTTGTTGTTA
GATCCAATTGTCAAGGGTTACTTAACATTTTTTGACATCGGAATCTTTGGCTTCTTTGATTTCATTTTGATTTGGGGTTTGGGGGTTTTAGATCTAAAGGTCTTGACTGA
AAAGGAGGTCGAAGAGATACAAAGGCAAGTGAGTGAACTCCTTGCTAAAGGGTATGTGCGTGAGAGTTTGAGTCCATGTTCTATTCCAGTTATTCGTGTGCCTAAGAAAG
ATGGTTCATGGCATATGTGTATTGATTGTAGGGCTATAAACAAGATAACTATTAACTTCAAGCTACTTCAAGTGGGGCTTCATTTATTCATGGATGATTACAAAGTATTG
ATAGGCTTGATAGAAGTTTAA
Protein sequenceShow/hide protein sequence
MERMIRGIEELSERIRRLEVQNHDDHDNEYEGGSYDQLEDDQVTLIAMYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTD
DEEEDEDPANLSLVARRALSTQIKEDSLDQRENLFHTRCLIQSIPCSVVIDSGSCTNVVSTIPVKKLNLETKPHPRPYKLQWLNDCVQVRVSKQALVSFTIGKYVDDVLC
DVVSMHAGDLLLGRPWQFDRRVVYDGFANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEVDAKAKSEKEIIEKELREKKSLSEKQESSNRPRGKNEGKAKIMSLLL
DPIVKGYLTFFDIGIFGFFDFILIWGLGVLDLKVLTEKEVEEIQRQVSELLAKGYVRESLSPCSIPVIRVPKKDGSWHMCIDCRAINKITINFKLLQVGLHLFMDDYKVL
IGLIEV