; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G011590 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G011590
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmo_Chr14:9261716..9267900
RNA-Seq ExpressionCmoCh14G011590
SyntenyCmoCh14G011590
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022930200.1 uncharacterized protein LOC111436716 [Cucurbita moschata]4.4e-15390.57Show/hide
Query:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL
        AYKLQDHKS+HQIAQVLVT FTGQLKDWWDKYLDETTRQQILNHYVVRPTTQ+IKEEGPSTRTE+QHERVEDAVNTL Y LIEFFVGDPLKYQERSAEIL
Subjt:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL

Query:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK
        MNLK PTLGDFRWYKD+YFSKVLIRTD SLEFWK NFVNGLPKHFSRRIKDGLKTKYNGTI WQTLSY +IASFIIEEGLRLCNESKIQNKLNSSISNRK
Subjt:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK

Query:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIE
        ELGRF DQYGCK  EAPSTSRRKKVKTH K Y+SYR RE YRNKPVQSQKPTYSRR+Y PTK H+GKK +TCFKCREEGHYAN CPIRGKINELDIE
Subjt:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIE

XP_022933349.1 uncharacterized protein LOC111440658 [Cucurbita moschata]7.6e-12979.07Show/hide
Query:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL
        AYKLQ HKS+HQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYV+RPTTQ+IKEEGPSTRTE+QHERVEDAVNTLIY LIEFFVGDPLKYQERSAEIL
Subjt:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL

Query:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK
        MNLK PTLGDFRWYKDMYFSKVLIRTD SLEFWK NFVNGLPKHFSRRIKDGLKTK                                           K
Subjt:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK

Query:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL
        ELGRF DQYGCKG EAPSTSRRKKVKTH K Y+SYR RE YR+KPVQSQKPTYSRRKYIPTKTHRGKK QTCFKCREEGHYAN CPIRGKINELDI+QEL
Subjt:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL

Query:  K
        K
Subjt:  K

XP_023520850.1 uncharacterized protein LOC111784362 [Cucurbita pepo subsp. pepo]4.5e-15891.36Show/hide
Query:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL
        AYKLQ HKS+HQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYV+RPTTQ+IKEEGPSTRTE+QHERVEDAVNTLIY LIEFFVGDPLKYQERSAEIL
Subjt:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL

Query:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK
        MNLK PTLGDFRWYKDMYFSKVLIRTD SLEFWK NFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYG+IASFIIEEGLRLCNESKIQNKLNSSISNRK
Subjt:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK

Query:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL
        ELGRF DQYGCKG EAP TSR+KKVKTH K Y+SYR RE YR+KPVQSQKPTYSRRKYIPTKTHRGKK QTCFKCR EGHYA  CPI+GKINELDI+QEL
Subjt:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL

Query:  K
        K
Subjt:  K

XP_023521035.1 uncharacterized protein LOC111784623 [Cucurbita pepo subsp. pepo]1.2e-13983.39Show/hide
Query:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL
        AYKLQ HKS+HQIAQVLVTGFTGQLKDWWDKYLDE TRQQIL+HYV+RPTTQ+IKEEGPSTRTE+QHERVEDAVNTLIY LIEFFVGDPLKYQERSAEIL
Subjt:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL

Query:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK
        MNLK PTLGDFRWYKDMYFSKVLIRTD SLEFWK NFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYG                              K
Subjt:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK

Query:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL
        ELGRF DQYGCKG EAPSTSRRKKVKTH K YNSYR REKYRNKPVQSQKPTYSRRKY PTKTHRGKK QTCFKCREEGHYAN CPIRGKINELDI+QEL
Subjt:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL

Query:  K
        K
Subjt:  K

XP_023522280.1 uncharacterized protein LOC111786173, partial [Cucurbita pepo subsp. pepo]1.1e-15691.69Show/hide
Query:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL
        AYKLQ HKS+HQIAQVLVTGFTGQLKDWWDKYLDE TRQQIL+HYV+RPTTQ+IKEEGPSTRTE+QHERVEDAVNTLIY LIEFFVG+PLKYQERSAEIL
Subjt:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL

Query:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK
        MNLK PTLGDFRWYK MY SKVLIRTD SLEFWK NFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYG+IASFIIEEGLRLCNESKIQNKL SSISNRK
Subjt:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK

Query:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL
        ELGRF DQYGCKG EAPSTSRRKKVKTH K YNSYR REKYRNKPVQSQKPTYSRRKY PTKTHRGKK QTCFKCREEGHYAN CPIRGKINELDI+QEL
Subjt:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL

Query:  K
        K
Subjt:  K

TrEMBL top hitse value%identityAlignment
A0A6J1EW44 uncharacterized protein LOC1114366182.0e-9890.05Show/hide
Query:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK
        MNLK PTLGDF+WYKDMYFSKVLIRTD SLEFWK NFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYG+IAS IIEEGLRLCNESKIQNKLNSSISNRK
Subjt:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK

Query:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL
        ELGRF DQYGCKG EAPSTSRRKKVKTH K Y+SYR RE Y NKPVQSQKP YSRRKYIPTKTHRGKK QT FKCREEGHY N CPIRGKINELDI+QEL
Subjt:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL

Query:  K
        K
Subjt:  K

A0A6J1EWB4 uncharacterized protein LOC1114367162.1e-15390.57Show/hide
Query:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL
        AYKLQDHKS+HQIAQVLVT FTGQLKDWWDKYLDETTRQQILNHYVVRPTTQ+IKEEGPSTRTE+QHERVEDAVNTL Y LIEFFVGDPLKYQERSAEIL
Subjt:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL

Query:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK
        MNLK PTLGDFRWYKD+YFSKVLIRTD SLEFWK NFVNGLPKHFSRRIKDGLKTKYNGTI WQTLSY +IASFIIEEGLRLCNESKIQNKLNSSISNRK
Subjt:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK

Query:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIE
        ELGRF DQYGCK  EAPSTSRRKKVKTH K Y+SYR RE YRNKPVQSQKPTYSRR+Y PTK H+GKK +TCFKCREEGHYAN CPIRGKINELDIE
Subjt:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIE

A0A6J1EYM2 uncharacterized protein LOC1114397305.7e-9890.05Show/hide
Query:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK
        MNLK PTLGDFRWYKDMYFSKVLIRTD SLEFWK NFVNGLPKHFSRRIKDGLKTK+NGTIPWQTLSYG+IASFIIEEGLRL NESKIQNKLNSSISNRK
Subjt:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK

Query:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL
        ELGRF DQYGCKG EAPSTS R KVKTH K Y+SYR RE YRNKPVQSQKP YSRRKYIPTKTH GKK QTCFKCREEGHYAN CPIRGKINELDI+QEL
Subjt:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL

Query:  K
        K
Subjt:  K

A0A6J1EZI3 uncharacterized protein LOC1114406583.7e-12979.07Show/hide
Query:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL
        AYKLQ HKS+HQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYV+RPTTQ+IKEEGPSTRTE+QHERVEDAVNTLIY LIEFFVGDPLKYQERSAEIL
Subjt:  AYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHYVVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEIL

Query:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK
        MNLK PTLGDFRWYKDMYFSKVLIRTD SLEFWK NFVNGLPKHFSRRIKDGLKTK                                           K
Subjt:  MNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRK

Query:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL
        ELGRF DQYGCKG EAPSTSRRKKVKTH K Y+SYR RE YR+KPVQSQKPTYSRRKYIPTKTHRGKK QTCFKCREEGHYAN CPIRGKINELDI+QEL
Subjt:  ELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPIRGKINELDIEQEL

Query:  K
        K
Subjt:  K

A0A6J1IFV1 uncharacterized protein LOC1114729522.0e-9581.78Show/hide
Query:  DPLKYQERSAEILMNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESK
        D +  +E S EILMNLK PTLG FRWYKDMYF+KVLIRT+ SLEFWK NFVNGLPK+FSRRIKDGLKTKYNGTIPWQT SY +I SFII EGLRLCNESK
Subjt:  DPLKYQERSAEILMNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKTKYNGTIPWQTLSYGTIASFIIEEGLRLCNESK

Query:  IQNKLNSSISNRKELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPI
        IQNKL+SSIS++KELGRF DQYGCKG +APSTSRRKKVKT+QK YNSYR REKYR+KP+QSQKPTYS+++Y PTKTHRGKK Q CFKCREEGHYAN CPI
Subjt:  IQNKLNSSISNRKELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHRGKKNQTCFKCREEGHYANMCPI

Query:  RGKINELDIEQELK
        +GKINELDI+QELK
Subjt:  RGKINELDIEQELK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-1748.31Show/hide
Query:  IGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLAARNSRNVYQLDVKSAFLYGELKEAVFVEQPQG
        +  RWVF  K NE G   +YKARLVA+G+ Q++ IDY E FAP+AR  + R  ++L  + +  V+Q+DVK+AFL G LKE +++  PQG
Subjt:  IGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLAARNSRNVYQLDVKSAFLYGELKEAVFVEQPQG

P04146 Copia protein3.3e-1031.37Show/hide
Query:  EFVAAGSCACQGVWMRGVLEKLGHFQDKWITMLCDNSSTIKLSKNPIRHGCSKHIDVRFHFLRDLTRNGVIELKHCVTQEQGANIMTKPLKLNAYIKLQA
        E++A      + +W++ +L  +    +  I +  DN   I ++ NP  H  +KHID+++HF R+  +N VI L++  T+ Q A+I TKPL    +++L+ 
Subjt:  EFVAAGSCACQGVWMRGVLEKLGHFQDKWITMLCDNSSTIKLSKNPIRHGCSKHIDVRFHFLRDLTRNGVIELKHCVTQEQGANIMTKPLKLNAYIKLQA

Query:  SL
         L
Subjt:  SL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-1847.73Show/hide
Query:  RWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLAARNSRNVYQLDVKSAFLYGELKEAVFVEQPQGYE
        +WVFK K + + ++ +YKARLV KG+ Q+ GID+ E+F+P+ +  +IR  ++LAA     V QLDVK+AFL+G+L+E +++EQP+G+E
Subjt:  RWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLAARNSRNVYQLDVKSAFLYGELKEAVFVEQPQGYE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.7e-1132.63Show/hide
Query:  EFVAAGSCACQGVWMRGVLEKLGHFQDKWITMLCDNSSTIKLSKNPIRHGCSKHIDVRFHFLRDLTRNGVIELKHCVTQEQGANIMTKPLKLNAY
        E++AA     + +W++  L++LG  Q +++ + CD+ S I LSKN + H  +KHIDVR+H++R++  +  +++    T E  A+++TK +  N +
Subjt:  EFVAAGSCACQGVWMRGVLEKLGHFQDKWITMLCDNSSTIKLSKNPIRHGCSKHIDVRFHFLRDLTRNGVIELKHCVTQEQGANIMTKPLKLNAY

P92520 Uncharacterized mitochondrial protein AtMg008206.2e-0949.12Show/hide
Query:  IGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLA
        +G +WVFKTKL+ +G +D+ KARLVAKG+ Q+ GI + E ++P+ R  TIR  + +A
Subjt:  IGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.1e-1744.68Show/hide
Query:  IGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLAARNSRNVYQLDVKSAFLYGELKEAVFVEQPQGYEKKE
        +G RW+F  K N +G +++YKARLVAKGY Q+ G+DY E F+P+ +  +IR+ + +A   S  + QLDV +AFL G L + V++ QP G+  K+
Subjt:  IGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLAARNSRNVYQLDVKSAFLYGELKEAVFVEQPQGYEKKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.4e-0829.47Show/hide
Query:  EFVAAGSCACQGVWMRGVLEKLGHFQDKWITMLCDNSSTIKLSKNPIRHGCSKHIDVRFHFLRDLTRNGVIELKHCVTQEQGANIMTKPLKLNAY
        E+ +  + + +  W+  +L +LG    +   + CDN     L  NP+ H   KHI + +HF+R+  ++G + + H  T +Q A+ +TKPL   A+
Subjt:  EFVAAGSCACQGVWMRGVLEKLGHFQDKWITMLCDNSSTIKLSKNPIRHGCSKHIDVRFHFLRDLTRNGVIELKHCVTQEQGANIMTKPLKLNAY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.6e-1844.68Show/hide
Query:  IGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLAARNSRNVYQLDVKSAFLYGELKEAVFVEQPQGYEKKE
        +G RW+F  K N +G +++YKARLVAKGY Q+ G+DY E F+P+ +  +IR+ + +A   S  + QLDV +AFL G L + V++ QP G+  K+
Subjt:  IGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLAARNSRNVYQLDVKSAFLYGELKEAVFVEQPQGYEKKE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-0729.47Show/hide
Query:  EFVAAGSCACQGVWMRGVLEKLGHFQDKWITMLCDNSSTIKLSKNPIRHGCSKHIDVRFHFLRDLTRNGVIELKHCVTQEQGANIMTKPLKLNAY
        E+ +  + + +  W+  +L +LG        + CDN     L  NP+ H   KHI + +HF+R+  ++G + + H  T +Q A+ +TKPL   A+
Subjt:  EFVAAGSCACQGVWMRGVLEKLGHFQDKWITMLCDNSSTIKLSKNPIRHGCSKHIDVRFHFLRDLTRNGVIELKHCVTQEQGANIMTKPLKLNAY

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.7e-2041.67Show/hide
Query:  KKIGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLAARNSRNVYQLDVKSAFLYGELKEAVFVEQPQGYEKKE
        K IG +WV+K K N +G +++YKARLVAKGY QQ GID+ E F+P+ +  ++++ + ++A  +  ++QLD+ +AFL G+L E ++++ P GY  ++
Subjt:  KKIGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLAARNSRNVYQLDVKSAFLYGELKEAVFVEQPQGYEKKE

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.4e-1049.12Show/hide
Query:  IGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLA
        +G +WVFKTKL+ +G +D+ KARLVAKG+ Q+ GI + E ++P+ R  TIR  + +A
Subjt:  IGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCAAGACGAGAAAAATGGAAGGGAGATCAATGATCTAGTTGAAGAAGTCCAGAAGAATGAGGAGGAACTGGAAGATGGGATTCATTTGGCACTTATAACAAGAAG
ACTTGTTAAAACTCAAGTTACGGAGAATGATGTTGATGATCAAAGAGACAACCTGTTCCACACGCGGTGTTTCGTCAAGGGGACTGCTTGTAGTCTTGTCATTGATAGTG
GAAGTTGCACCAATGTTGTAAGTACCATGTTGATCAAAAGATTACAAATTCCAACCCAAAGTCATCCTAAACCTTACAAGCTCCAATGGCTCAATGATAGTGGAACAATG
AAGCTGGAGATTTATTACTTGGCCGGCCATGGCAATTTGACAAGTATAAGGCTAGGTTGGCAATTTGACAAGATAAAGTTCTTTGTGATGTATTATTACCAATGCAAGCT
GAAGAAGATTGGAGGAAGGTGGGTTTTCAAAACCAAACTCAACGAAAATGGCGAAGTTGACAAGTATAAGGCTAGGTTGGTAGCAAAAGGTTATGCACAACAACATGGTA
TAGACTATACCGAGGTGTTTGCACCTTTGGCTAGGTGGGATACTATTCGAATGACAATTACTTTGGCAGCTCGAAATAGTCGAAACGTGTATCAGCTTGACGTGAAAAGT
GCTTTCTTATATGGAGAGTTAAAAGAAGCCGTCTTTGTTGAGCAACCACAAGGTTATGAGAAAAAAGAATTTGTGGCAGCTGGGTCTTGTGCTTGTCAAGGGGTGTGGAT
GAGGGGAGTTTTGGAAAAACTTGGTCATTTTCAAGACAAGTGGATCACTATGTTATGTGATAACAGTTCCACTATTAAGCTATCCAAGAATCCAATCAGGCATGGATGTA
GCAAACACATTGATGTGAGGTTTCATTTCTTGCGTGATTTGACCAGAAATGGAGTTATTGAGCTGAAGCATTGTGTCACACAAGAACAAGGTGCAAATATTATGACAAAA
CCACTGAAGCTGAATGCATACATAAAACTACAGGCATCTCTTGCCGGCTTCTTTCTTCTCCCTTGGTTGTATCTCTATCCTTGCGATTTACAAAGAAACAAGGAAGATGA
TAAGAAAATTCAGATGAATTTGGAGTATGTGATAGCTGAGCTTATAGCCCTTGTGTTCCAGGTGGTAAGACGCGGGAAGCGTAGCGCGTATAAATTACAGGACCATAAAT
CTAACCATCAAATAGCTCAAGTTTTAGTAACCGGATTTACCGGGCAACTCAAAGACTGGTGGGACAAATATCTCGACGAAACAACTCGCCAACAGATATTAAACCACTAT
GTCGTTAGGCCAACTACTCAAGTCATCAAAGAAGAAGGTCCATCAACTAGGACTGAAATACAACATGAAAGGGTAGAGGATGCTGTCAACACCCTTATTTATATCCTCAT
AGAATTCTTCGTCGGCGACCCTCTGAAATACCAGGAGAGATCCGCCGAGATACTCATGAACCTGAAATACCCTACCTTAGGTGATTTTAGGTGGTACAAAGACATGTACT
TCAGCAAAGTTCTTATTAGAACAGATATTTCGTTGGAATTCTGGAAAGGGAATTTTGTCAATGGACTACCAAAACACTTCTCAAGGAGGATCAAAGACGGTCTGAAGACG
AAGTATAATGGAACAATTCCATGGCAGACTTTGTCATATGGAACGATAGCATCCTTCATCATAGAAGAAGGACTCAGACTTTGTAATGAGTCAAAGATTCAAAACAAGCT
CAATTCTTCAATATCAAACAGAAAAGAGCTTGGAAGATTTCGTGACCAATATGGATGCAAGGGGACAGAAGCTCCCTCAACCTCTAGAAGAAAGAAGGTTAAGACGCATC
AAAAATCTTATAATTCATACAGGACCAGAGAAAAGTATCGGAATAAACCTGTGCAATCTCAAAAGCCAACGTATTCAAGAAGGAAGTATATTCCTACGAAAACCCATAGA
GGAAAGAAAAATCAAACTTGCTTCAAATGTCGTGAAGAAGGACACTATGCTAACATGTGTCCGATCAGAGGAAAGATCAATGAGTTGGATATAGAACAAGAGCTGAAACT
CAGCTATTACGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACCAAGACGAGAAAAATGGAAGGGAGATCAATGATCTAGTTGAAGAAGTCCAGAAGAATGAGGAGGAACTGGAAGATGGGATTCATTTGGCACTTATAACAAGAAG
ACTTGTTAAAACTCAAGTTACGGAGAATGATGTTGATGATCAAAGAGACAACCTGTTCCACACGCGGTGTTTCGTCAAGGGGACTGCTTGTAGTCTTGTCATTGATAGTG
GAAGTTGCACCAATGTTGTAAGTACCATGTTGATCAAAAGATTACAAATTCCAACCCAAAGTCATCCTAAACCTTACAAGCTCCAATGGCTCAATGATAGTGGAACAATG
AAGCTGGAGATTTATTACTTGGCCGGCCATGGCAATTTGACAAGTATAAGGCTAGGTTGGCAATTTGACAAGATAAAGTTCTTTGTGATGTATTATTACCAATGCAAGCT
GAAGAAGATTGGAGGAAGGTGGGTTTTCAAAACCAAACTCAACGAAAATGGCGAAGTTGACAAGTATAAGGCTAGGTTGGTAGCAAAAGGTTATGCACAACAACATGGTA
TAGACTATACCGAGGTGTTTGCACCTTTGGCTAGGTGGGATACTATTCGAATGACAATTACTTTGGCAGCTCGAAATAGTCGAAACGTGTATCAGCTTGACGTGAAAAGT
GCTTTCTTATATGGAGAGTTAAAAGAAGCCGTCTTTGTTGAGCAACCACAAGGTTATGAGAAAAAAGAATTTGTGGCAGCTGGGTCTTGTGCTTGTCAAGGGGTGTGGAT
GAGGGGAGTTTTGGAAAAACTTGGTCATTTTCAAGACAAGTGGATCACTATGTTATGTGATAACAGTTCCACTATTAAGCTATCCAAGAATCCAATCAGGCATGGATGTA
GCAAACACATTGATGTGAGGTTTCATTTCTTGCGTGATTTGACCAGAAATGGAGTTATTGAGCTGAAGCATTGTGTCACACAAGAACAAGGTGCAAATATTATGACAAAA
CCACTGAAGCTGAATGCATACATAAAACTACAGGCATCTCTTGCCGGCTTCTTTCTTCTCCCTTGGTTGTATCTCTATCCTTGCGATTTACAAAGAAACAAGGAAGATGA
TAAGAAAATTCAGATGAATTTGGAGTATGTGATAGCTGAGCTTATAGCCCTTGTGTTCCAGGTGGTAAGACGCGGGAAGCGTAGCGCGTATAAATTACAGGACCATAAAT
CTAACCATCAAATAGCTCAAGTTTTAGTAACCGGATTTACCGGGCAACTCAAAGACTGGTGGGACAAATATCTCGACGAAACAACTCGCCAACAGATATTAAACCACTAT
GTCGTTAGGCCAACTACTCAAGTCATCAAAGAAGAAGGTCCATCAACTAGGACTGAAATACAACATGAAAGGGTAGAGGATGCTGTCAACACCCTTATTTATATCCTCAT
AGAATTCTTCGTCGGCGACCCTCTGAAATACCAGGAGAGATCCGCCGAGATACTCATGAACCTGAAATACCCTACCTTAGGTGATTTTAGGTGGTACAAAGACATGTACT
TCAGCAAAGTTCTTATTAGAACAGATATTTCGTTGGAATTCTGGAAAGGGAATTTTGTCAATGGACTACCAAAACACTTCTCAAGGAGGATCAAAGACGGTCTGAAGACG
AAGTATAATGGAACAATTCCATGGCAGACTTTGTCATATGGAACGATAGCATCCTTCATCATAGAAGAAGGACTCAGACTTTGTAATGAGTCAAAGATTCAAAACAAGCT
CAATTCTTCAATATCAAACAGAAAAGAGCTTGGAAGATTTCGTGACCAATATGGATGCAAGGGGACAGAAGCTCCCTCAACCTCTAGAAGAAAGAAGGTTAAGACGCATC
AAAAATCTTATAATTCATACAGGACCAGAGAAAAGTATCGGAATAAACCTGTGCAATCTCAAAAGCCAACGTATTCAAGAAGGAAGTATATTCCTACGAAAACCCATAGA
GGAAAGAAAAATCAAACTTGCTTCAAATGTCGTGAAGAAGGACACTATGCTAACATGTGTCCGATCAGAGGAAAGATCAATGAGTTGGATATAGAACAAGAGCTGAAACT
CAGCTATTACGACTGA
Protein sequenceShow/hide protein sequence
MNQDEKNGREINDLVEEVQKNEEELEDGIHLALITRRLVKTQVTENDVDDQRDNLFHTRCFVKGTACSLVIDSGSCTNVVSTMLIKRLQIPTQSHPKPYKLQWLNDSGTM
KLEIYYLAGHGNLTSIRLGWQFDKIKFFVMYYYQCKLKKIGGRWVFKTKLNENGEVDKYKARLVAKGYAQQHGIDYTEVFAPLARWDTIRMTITLAARNSRNVYQLDVKS
AFLYGELKEAVFVEQPQGYEKKEFVAAGSCACQGVWMRGVLEKLGHFQDKWITMLCDNSSTIKLSKNPIRHGCSKHIDVRFHFLRDLTRNGVIELKHCVTQEQGANIMTK
PLKLNAYIKLQASLAGFFLLPWLYLYPCDLQRNKEDDKKIQMNLEYVIAELIALVFQVVRRGKRSAYKLQDHKSNHQIAQVLVTGFTGQLKDWWDKYLDETTRQQILNHY
VVRPTTQVIKEEGPSTRTEIQHERVEDAVNTLIYILIEFFVGDPLKYQERSAEILMNLKYPTLGDFRWYKDMYFSKVLIRTDISLEFWKGNFVNGLPKHFSRRIKDGLKT
KYNGTIPWQTLSYGTIASFIIEEGLRLCNESKIQNKLNSSISNRKELGRFRDQYGCKGTEAPSTSRRKKVKTHQKSYNSYRTREKYRNKPVQSQKPTYSRRKYIPTKTHR
GKKNQTCFKCREEGHYANMCPIRGKINELDIEQELKLSYYD