; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005871 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005871
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr6:32619364..32620495
RNA-Seq ExpressionLag0005871
SyntenyLag0005871
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]3.8e-2137.22Show/hide
Query:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI
        +GNG KL I   G+S     + +L L DIL V  I KNL+SVSKLA DN I +EF    C VKD  +G+ +L  +LKD LYQL   K N    P+   S+
Subjt:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI

Query:  MEKSRREEIDPS----VFVLSNTSVNV-------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQLI----
         E   R    P+      VL +  V V             Y K H LPF S +S A  P ELVH+D+WG API+++ GF       D F   T +     
Subjt:  MEKSRREEIDPS----VFVLSNTSVNV-------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQLI----

Query:  -NDLALVVLQGKSPIELLFNRKL
         ++     +Q K+  E  FN+++
Subjt:  -NDLALVVLQGKSPIELLFNRKL

KAG8475861.1 hypothetical protein CXB51_032757 [Gossypium anomalum]1.1e-2031.58Show/hide
Query:  VRIGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEI--VKTNMVDQPNL
        V +GNG  ++IA + +S    G   L+LK++L V  I KNL+SV + A+DN I+ EFH  FC V D  + +T+L+  + + LY+ +      +     +L
Subjt:  VRIGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEI--VKTNMVDQPNL

Query:  EDSIMEKSRREEIDPSVFVLSNTSVNVYEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTE-------------------GFYAIKALW--------
          S+   +   ++ PS  +  +  V    K+H LPFSS  +    PFELV SD+WG A I S+                    G ++   LW        
Subjt:  EDSIMEKSRREEIDPSVFVLSNTSVNVYEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTE-------------------GFYAIKALW--------

Query:  ------------------DTFLTATQLINDLALVVLQGKSPIELLFNRKLDYASLRTFGCAYFPLL
                            F  A  L+N L    LQ KSP E+L N +  Y+ LR FGCA FP L
Subjt:  ------------------DTFLTATQLINDLALVVLQGKSPIELLFNRKLDYASLRTFGCAYFPLL

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]1.3e-2128.57Show/hide
Query:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI
        +GNG  L I   G+S       +L LKDIL V +I KNL+S+SKL  DN I++EFH   C VKD  +G+ +L   +KD LYQL    T+   +P++  SI
Subjt:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI

Query:  MEKSRREEIDPSVFVLSNTS--VNV---------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGF----------------YAIKA
         E   R+   P+  VL+      N+               + K+H LPF +  S A  P +LVHSD+WG API S  GF                Y +K 
Subjt:  MEKSRREEIDPSVFVLSNTS--VNV---------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGF----------------YAIKA

Query:  --------------------------------------------------------------------------------------LWDTFLTATQLIND
                                                                                               W+ F TA  LIN 
Subjt:  --------------------------------------------------------------------------------------LWDTFLTATQLIND

Query:  LALVVLQGKSPIELLFNRKLDYASLRTFGCAYFPLL
        L   V++ KSP + LF++  DY +++TFGCA +P L
Subjt:  LALVVLQGKSPIELLFNRKLDYASLRTFGCAYFPLL

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]3.5e-2237.22Show/hide
Query:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI
        +GNG KL I   G++     + TL L D+L V +I KNL+SVSKL  DN IF+EF    C VKD  +GQT+L   LKD LYQL  V       P +  S+
Subjt:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI

Query:  MEKSRREEIDPS----VFVLSNTSVNV-------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQLI----
         E   R+   P+      VL + +V +             + K H LPF S +S    P  L+HSD+WG APILS  GF       D F   T +     
Subjt:  MEKSRREEIDPS----VFVLSNTSVNV-------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQLI----

Query:  -NDLALVVLQGKSPIELLFNRKL
         +D     +Q K+  E  FN+K+
Subjt:  -NDLALVVLQGKSPIELLFNRKL

RZB67542.1 Retrovirus-related Pol polyprotein from transposon RE1 [Glycine soja]1.5e-2034.53Show/hide
Query:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI
        +GNG +L+I   G++   N    L L ++L V EI KNL+SVSKL  DN   +EF    C VKD  +G+ +L   L+D LYQL  VK+ +   P    S+
Subjt:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI

Query:  MEKSRREEIDPSVFVLSNTSVNV-----------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQLI----
         E   R+   P+  VL     N                  + K H LPF   +S A  P +L+HSD+WG APILS   F       D F   T +     
Subjt:  MEKSRREEIDPSVFVLSNTSVNV-----------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQLI----

Query:  -NDLALVVLQGKSPIELLFNRKL
         ++     +Q K+ +E  FNRK+
Subjt:  -NDLALVVLQGKSPIELLFNRKL

TrEMBL top hitse value%identityAlignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-946.4e-2228.57Show/hide
Query:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI
        +GNG  L I   G+S       +L LKDIL V +I KNL+S+SKL  DN I++EFH   C VKD  +G+ +L   +KD LYQL    T+   +P++  SI
Subjt:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI

Query:  MEKSRREEIDPSVFVLSNTS--VNV---------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGF----------------YAIKA
         E   R+   P+  VL+      N+               + K+H LPF +  S A  P +LVHSD+WG API S  GF                Y +K 
Subjt:  MEKSRREEIDPSVFVLSNTS--VNV---------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGF----------------YAIKA

Query:  --------------------------------------------------------------------------------------LWDTFLTATQLIND
                                                                                               W+ F TA  LIN 
Subjt:  --------------------------------------------------------------------------------------LWDTFLTATQLIND

Query:  LALVVLQGKSPIELLFNRKLDYASLRTFGCAYFPLL
        L   V++ KSP + LF++  DY +++TFGCA +P L
Subjt:  LALVVLQGKSPIELLFNRKLDYASLRTFGCAYFPLL

A0A2K3NEN7 Copia-like polyprotein (Fragment)1.7e-2237.22Show/hide
Query:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI
        +GNG KL I   G++     + TL L D+L V +I KNL+SVSKL  DN IF+EF    C VKD  +GQT+L   LKD LYQL  V       P +  S+
Subjt:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI

Query:  MEKSRREEIDPS----VFVLSNTSVNV-------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQLI----
         E   R+   P+      VL + +V +             + K H LPF S +S    P  L+HSD+WG APILS  GF       D F   T +     
Subjt:  MEKSRREEIDPS----VFVLSNTSVNV-------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQLI----

Query:  -NDLALVVLQGKSPIELLFNRKL
         +D     +Q K+  E  FN+K+
Subjt:  -NDLALVVLQGKSPIELLFNRKL

A0A2Z6MBG6 Integrase catalytic domain-containing protein1.9e-2137.22Show/hide
Query:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI
        +GNG KL I   G+S     + +L L DIL V  I KNL+SVSKLA DN I +EF    C VKD  +G+ +L  +LKD LYQL   K N    P+   S+
Subjt:  IGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSI

Query:  MEKSRREEIDPS----VFVLSNTSVNV-------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQLI----
         E   R    P+      VL +  V V             Y K H LPF S +S A  P ELVH+D+WG API+++ GF       D F   T +     
Subjt:  MEKSRREEIDPS----VFVLSNTSVNV-------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQLI----

Query:  -NDLALVVLQGKSPIELLFNRKL
         ++     +Q K+  E  FN+++
Subjt:  -NDLALVVLQGKSPIELLFNRKL

A0A803PEH4 Uncharacterized protein2.6e-2336.65Show/hide
Query:  KGVRIGNGNKLNIAYVGNSDF-MNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKT-------
        + V +GNG+KL I ++GN    +     L LKD+L V +IAKNLVSVSKLA DN + IEF+  FCLVKD  + + +L  +LKD LYQL+   T       
Subjt:  KGVRIGNGNKLNIAYVGNSDF-MNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKT-------

Query:  ----------------NMVDQPNLEDSIMEKSRREEIDPSVFVLSNT--SVNV---------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAP
                        N     +L  S M+   R    PS+ VL++   SVNV               Y K+HALPF S  +RA +  +L+H+DLWG AP
Subjt:  ----------------NMVDQPNLEDSIMEKSRREEIDPSVFVLSNT--SVNV---------------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAP

Query:  ILST-EGFYAIKALWD----TFLTATQLINDLALVVLQGKSPIELLFNRKL
        I S     Y I  + D    T+L   +L +D     +Q K+ +E  F +K+
Subjt:  ILST-EGFYAIKALWD----TFLTATQLINDLALVVLQGKSPIELLFNRKL

A0A803QE35 Uncharacterized protein2.5e-2636.6Show/hide
Query:  IGNGNKLNIAYVGNSDF-MNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQ-LEIVKTNMVDQPNLED
        +G+GN+LNI Y+G+ +   N     +L D+L V +IAKNL+S+SKL  DN++F+EF    C VKD  +   VL   LKD LYQ    V    V + ++  
Subjt:  IGNGNKLNIAYVGNSDF-MNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQ-LEIVKTNMVDQPNLED

Query:  SIMEKSRREEIDPSVFVLSNTSVNV-----------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILS---------------TEGFYAIKALWDTF
          +     + I+  V  LSN  V+            + KSHALPF S +SRA    +L+H+DLWG AP++S                +    +K   D F
Subjt:  SIMEKSRREEIDPSVFVLSNTSVNV-----------YEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILS---------------TEGFYAIKALWDTF

Query:  LTATQLINDLALVVLQGKSPIELLFNRKLDYASLR
         T+  LIN L  +VL+GKS  E LF ++ DY  L+
Subjt:  LTATQLINDLALVVLQGKSPIELLFNRKLDYASLR

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.2e-1128.36Show/hide
Query:  VRIGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVD------
        V + +G+ + I++ G++        L L +IL V  I KNL+SV +L   N + +EF      VKD ++G  +L    KD LY+  I  +  V       
Subjt:  VRIGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVD------

Query:  ---------------QPNLEDSIMEKSRREEIDPSVFVLSNTSVNVYEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQ
                        P++ +S++       ++PS   LS  S  +  KS+ +PFS  T  +  P E ++SD+W S+PILS + +       D F   T 
Subjt:  ---------------QPNLEDSIMEKSRREEIDPSVFVLSNTSVNVYEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQ

Query:  L
        L
Subjt:  L

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.9e-1329.35Show/hide
Query:  VRIGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVD------
        V I +G+ + I + G++       +L L  +L V  I KNL+SV +L   NR+ +EF      VKD ++G  +L    KD LY+  I  +  V       
Subjt:  VRIGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVD------

Query:  -------------QPNLE--DSIMEKSRREEIDPSVFVLSNTSVNVYEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQ
                      P+L   +S++       ++PS  +LS +   +  KSH +PFS+ T  +  P E ++SD+W S+PILS + +       D F   T 
Subjt:  -------------QPNLE--DSIMEKSRREEIDPSVFVLSNTSVNVYEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQ

Query:  L
        L
Subjt:  L

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGTAATGAAAGGGGTAAGGATAGGTAATGGAAATAAGTTGAACATTGCTTATGTTGGCAATTCTGATTTTATGAATGGCGTCACTACTCTTAAACTTAAAGATAT
TTTGTGTGTACTTGAAATAGCGAAAAATCTTGTAAGTGTATCAAAATTAGCCCAAGACAACAGAATATTTATTGAGTTTCATGGTGGTTTTTGTCTTGTTAAGGATAATG
ATTCGGGCCAAACTGTGCTAATAGCAATGCTTAAAGACAGTTTATATCAACTTGAGATTGTGAAGACTAACATGGTAGATCAACCAAACTTGGAGGATTCAATAATGGAG
AAGTCAAGAAGGGAAGAGATTGATCCCTCAGTTTTTGTTTTATCAAATACTAGTGTCAATGTGTATGAAAAGTCACATGCTCTTCCCTTCTCCTCTTTTACATCTAGAGC
AATTGCTCCTTTTGAACTTGTTCACTCCGATCTTTGGGGGTCAGCACCAATTTTGTCTACTGAAGGCTTCTATGCCATTAAAGCACTATGGGATACATTTCTTACAGCAA
CCCAACTCATCAATGACTTAGCATTAGTGGTGCTTCAAGGTAAGTCTCCCATTGAACTTTTATTTAATCGAAAGCTTGATTATGCTTCTCTTCGAACATTTGGGTGTGCC
TACTTCCCTTTACTTACGACTTTATCAGGAGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGGTAATGAAAGGGGTAAGGATAGGTAATGGAAATAAGTTGAACATTGCTTATGTTGGCAATTCTGATTTTATGAATGGCGTCACTACTCTTAAACTTAAAGATAT
TTTGTGTGTACTTGAAATAGCGAAAAATCTTGTAAGTGTATCAAAATTAGCCCAAGACAACAGAATATTTATTGAGTTTCATGGTGGTTTTTGTCTTGTTAAGGATAATG
ATTCGGGCCAAACTGTGCTAATAGCAATGCTTAAAGACAGTTTATATCAACTTGAGATTGTGAAGACTAACATGGTAGATCAACCAAACTTGGAGGATTCAATAATGGAG
AAGTCAAGAAGGGAAGAGATTGATCCCTCAGTTTTTGTTTTATCAAATACTAGTGTCAATGTGTATGAAAAGTCACATGCTCTTCCCTTCTCCTCTTTTACATCTAGAGC
AATTGCTCCTTTTGAACTTGTTCACTCCGATCTTTGGGGGTCAGCACCAATTTTGTCTACTGAAGGCTTCTATGCCATTAAAGCACTATGGGATACATTTCTTACAGCAA
CCCAACTCATCAATGACTTAGCATTAGTGGTGCTTCAAGGTAAGTCTCCCATTGAACTTTTATTTAATCGAAAGCTTGATTATGCTTCTCTTCGAACATTTGGGTGTGCC
TACTTCCCTTTACTTACGACTTTATCAGGAGCATAA
Protein sequenceShow/hide protein sequence
MGVMKGVRIGNGNKLNIAYVGNSDFMNGVTTLKLKDILCVLEIAKNLVSVSKLAQDNRIFIEFHGGFCLVKDNDSGQTVLIAMLKDSLYQLEIVKTNMVDQPNLEDSIME
KSRREEIDPSVFVLSNTSVNVYEKSHALPFSSFTSRAIAPFELVHSDLWGSAPILSTEGFYAIKALWDTFLTATQLINDLALVVLQGKSPIELLFNRKLDYASLRTFGCA
YFPLLTTLSGA