; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0042123 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0042123
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr13:36857355..36857930
RNA-Seq ExpressionLag0042123
SyntenyLag0042123
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]2.7e-5756.91Show/hide
Query:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY
        M+CVE+V ++VL+NG P   F PNRG RQGDPLSPYLF++CAEGLS+++  EE   N   L+IN+ CP ISHLF+ADD L+F +A+  +C +IK IL  Y
Subjt:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY

Query:  EQA-SGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILI
        E+A SGQ IN +KSMF+ SKN K+  +G +   L V H+ES+G YLG+PSQ GRNK  +F+ +KD+VWKA+QGWK  LFS+GG+E+L+
Subjt:  EQA-SGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILI

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]6.1e-4950.26Show/hide
Query:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY
        + C+++V YS L+NG  Q    P+RG RQGDPLSPYLFLICAEGLS +L  EEL+ + +GL+I++  PS+SHLFFADDS++FCRAN++    I   L  Y
Subjt:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY

Query:  EQASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII
         +ASGQVIN EK +   S+N ++ +     ++LG+        YLG+PS +G+NK  LF  + DK+WK +  WKE+LFS GGKE+L+KA++
Subjt:  EQASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII

XP_030939568.1 uncharacterized protein LOC115964386 [Quercus lobata]7.3e-5051.05Show/hide
Query:  SCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMYE
        SC+ TV +SVL+NG P   F PNRG RQGDPLSPYLFL+CAEGL S++ + E   + +G+ + +  P +SHLFFADDSL+FCRAN+KD NTI  IL+ YE
Subjt:  SCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMYE

Query:  QASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII
        +ASGQ IN EK+    S N        + + +GV  +  I  YLG+P+  GR K   FS +++++W  +QGWKE L S GG+E+LIKA++
Subjt:  QASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII

XP_030939975.1 uncharacterized protein LOC115964883 [Quercus lobata]4.7e-4950Show/hide
Query:  SCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMYE
        SC+ +V +SVLVNG P  +F PNRG RQGDPLSPYLFL+CAEGL S++ + E+S   KG+ +    P +SHLFFADDSL+FCRAN ++ ++I  IL  YE
Subjt:  SCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMYE

Query:  QASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII
        +ASGQ IN EK+    S N        +  +LGV  + +   YLG+PS  GR K   F  +++++W  +QGWKE L S GG+E+LIKA++
Subjt:  QASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII

XP_030946812.1 uncharacterized protein LOC115971195 [Quercus lobata]1.2e-4950.53Show/hide
Query:  SCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMYE
        SC+ +V +SVLVNG P  +F PNRG RQGDPLSPYLFL+CAEGL S++ + E+S + KG+ +    P +SHLFFADDSL+FCRAN ++ ++I  IL  YE
Subjt:  SCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMYE

Query:  QASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII
        +ASGQ IN EK+    S N        +  +LGV  + +   YLG+PS  GR K   F+ ++++VW+ +QGWKE L S GG+E+LIKA++
Subjt:  QASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII

TrEMBL top hitse value%identityAlignment
A0A2N9G656 Reverse transcriptase domain-containing protein2.7e-5050.79Show/hide
Query:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY
        M CV TV YSVLVNG P    KP+RG RQGDPLSPYLFLICAEGL +++ +   + +  G+ + +  P I+HLFFADDSL+FC+A  ++CN I+ IL+ Y
Subjt:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY

Query:  EQASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII
        E+ASGQ +N  K+    S N  +++   L N+LGV        YLG+PS  GR+K   F+ +K++VW+ +QGWKE L S  GKEILIKA++
Subjt:  EQASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII

A0A2N9HJV7 Reverse transcriptase domain-containing protein1.1e-5152.11Show/hide
Query:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY
        M C+ TV YSVL+NG P+    P+RG RQGDPLSPY+FL+CAEGL ++L+R E     +G+++ +  P+ISHLFFADDS++FC+A  + C TI+ IL+ Y
Subjt:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY

Query:  EQASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAI
        E ASGQ+IN EK+    SKN K ++   L N L V+       YLG+PS  GR+K  +FS +K++VWK +QGWKE L S  GKE+LIKA+
Subjt:  EQASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAI

A0A2N9HV89 Reverse transcriptase domain-containing protein2.7e-5050.79Show/hide
Query:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY
        M CV TV YSVLVNG P    KP+RG RQGDPLSPYLFLICAEGL +++ +   + +  G+ + +  P I+HLFFADDSL+FC+A  ++CN I+ IL+ Y
Subjt:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY

Query:  EQASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII
        E+ASGQ +N  K+    S N  +++   L N+LGV        YLG+PS  GR+K   F+ +K++VW+ +QGWKE L S  GKEILIKA++
Subjt:  EQASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII

A0A2N9J7Z5 Reverse transcriptase domain-containing protein2.7e-5050.79Show/hide
Query:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY
        M CV TV YSVLVNG P    KP+RG RQGDPLSPYLFLICAEGL +++ +   + +  G+ + +  P I+HLFFADDSL+FC+A  ++CN I+ IL+ Y
Subjt:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY

Query:  EQASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII
        E+ASGQ +N  K+    S N  +++   L N+LGV        YLG+PS  GR+K   F+ +K++VW+ +QGWKE L S  GKEILIKA++
Subjt:  EQASGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII

A0A6J1DUG8 uncharacterized protein LOC1110241351.3e-5756.91Show/hide
Query:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY
        M+CVE+V ++VL+NG P   F PNRG RQGDPLSPYLF++CAEGLS+++  EE   N   L+IN+ CP ISHLF+ADD L+F +A+  +C +IK IL  Y
Subjt:  MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMY

Query:  EQA-SGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILI
        E+A SGQ IN +KSMF+ SKN K+  +G +   L V H+ES+G YLG+PSQ GRNK  +F+ +KD+VWKA+QGWK  LFS+GG+E+L+
Subjt:  EQA-SGQVINFEKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILI

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein4.9e-0927.57Show/hide
Query:  SVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMYEQASGQVIN
        ++ VNG    +     G RQG PLSPYLF I  E L+  + +++     KG+QI K    IS L  ADD +V+    +     +  ++N + +  G  IN
Subjt:  SVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMYEQASGQVIN

Query:  FEKSM---FMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII
          KSM   +  +K  +K+   T    +   + + +G  L    ++  +KN  F  +K ++ + ++ WK+   S  G+  ++K  I
Subjt:  FEKSM---FMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII

P92555 Uncharacterized mitochondrial protein AtMg012501.5e-1347.06Show/hide
Query:  LVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDS
        ++NG PQ    P+RG RQGDPLSPYLF++C E LS +  R +      G++++ + P I+HL FADD+
Subjt:  LVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDS

Arabidopsis top hitse value%identityAlignment
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.1e-1447.06Show/hide
Query:  LVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDS
        ++NG PQ    P+RG RQGDPLSPYLF++C E LS +  R +      G++++ + P I+HL FADD+
Subjt:  LVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTGTGTGGAAACAGTGGAGTATTCGGTTTTGGTGAATGGCAACCCTCAAGCCTCCTTCAAACCAAACAGGGGGCACAGGCAAGGAGACCCTCTATCTCCATACTT
GTTTCTGATATGTGCAGAAGGACTATCGAGCATCCTAATCAGGGAAGAACTCTCTTCAAACTTTAAAGGGTTACAGATTAATAAACATTGTCCTTCTATTTCTCACTTGT
TTTTCGCTGATGATAGCTTGGTTTTTTGTAGGGCAAATGAAAAGGACTGTAACACCATCAAAATGATCCTCAATATGTATGAGCAAGCTTCAGGGCAAGTTATAAATTTT
GAGAAGTCAATGTTTATGGCTAGTAAAAATATAAAGAAAGATAAAATGGGGACCTTGAGTAACATTCTGGGCGTGAGGCATTCGGAGTCTATTGGTCATTATCTGGGAAT
GCCCTCGCAAAATGGGAGAAACAAGAATGCTCTGTTCAGTAAAGTTAAAGACAAAGTGTGGAAAGCAGTTCAAGGGTGGAAGGAGAACTTATTCTCTTTGGGAGGAAAGG
AAATTCTCATAAAAGCCATAATATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTGTGTGGAAACAGTGGAGTATTCGGTTTTGGTGAATGGCAACCCTCAAGCCTCCTTCAAACCAAACAGGGGGCACAGGCAAGGAGACCCTCTATCTCCATACTT
GTTTCTGATATGTGCAGAAGGACTATCGAGCATCCTAATCAGGGAAGAACTCTCTTCAAACTTTAAAGGGTTACAGATTAATAAACATTGTCCTTCTATTTCTCACTTGT
TTTTCGCTGATGATAGCTTGGTTTTTTGTAGGGCAAATGAAAAGGACTGTAACACCATCAAAATGATCCTCAATATGTATGAGCAAGCTTCAGGGCAAGTTATAAATTTT
GAGAAGTCAATGTTTATGGCTAGTAAAAATATAAAGAAAGATAAAATGGGGACCTTGAGTAACATTCTGGGCGTGAGGCATTCGGAGTCTATTGGTCATTATCTGGGAAT
GCCCTCGCAAAATGGGAGAAACAAGAATGCTCTGTTCAGTAAAGTTAAAGACAAAGTGTGGAAAGCAGTTCAAGGGTGGAAGGAGAACTTATTCTCTTTGGGAGGAAAGG
AAATTCTCATAAAAGCCATAATATAA
Protein sequenceShow/hide protein sequence
MSCVETVEYSVLVNGNPQASFKPNRGHRQGDPLSPYLFLICAEGLSSILIREELSSNFKGLQINKHCPSISHLFFADDSLVFCRANEKDCNTIKMILNMYEQASGQVINF
EKSMFMASKNIKKDKMGTLSNILGVRHSESIGHYLGMPSQNGRNKNALFSKVKDKVWKAVQGWKENLFSLGGKEILIKAII