; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g17080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g17080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr2:12825523..12831186
RNA-Seq ExpressionMoc02g17080
SyntenyMoc02g17080
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW38706.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.5e-5435.84Show/hide
Query:  KDAERFHKHVEKERIYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGDYWPPPPT--NTPSSRTSS
        KD +RF   + KERIYDFLAGL  +LD VR R+LA K +  I+E+FAEVR E + KRVM+  T T  L  S ++SAL ARG  +  P    +   +R++S
Subjt:  KDAERFHKHVEKERIYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGDYWPPPPT--NTPSSRTSS

Query:  SGYQVGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTP-----SSSFVAQRGIFSAALT-SQQHSDQWILDSGATDHMTVFHDMFTMYSPNPIQTH
           +V  + S   +         K QLEQLY+LLTP V +       ++SF+AQ+  F  AL+ + +  D WI+DS A++HMT +   F+ Y+ +     
Subjt:  SGYQVGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTP-----SSSFVAQRGIFSAALT-SQQHSDQWILDSGATDHMTVFHDMFTMYSPNPIQTH

Query:  VKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYFRGPSLRNKQVLQGETEPI
        VK ADGS   +   G+I+L+P+ITL+ VLHVPKL CNL+S+ KLT DL C   F  S C FQ++ +G  IG+A    GLYYF   +  NKQ         
Subjt:  VKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYFRGPSLRNKQVLQGETEPI

Query:  TSSLDGNFWEIDDLNTRIESPQSKIPEIDGLNIESPQPKIPILIVPITQIEESV---PIISCNNEDDQVNPNRSDK-QPETLVYS--RRPTVQRGVEPPQ
         SS+  N           +   SK+ +        P    P  IV  +Q++ +    P++     D  + P  + + Q E  VYS   RP         Q
Subjt:  TSSLDGNFWEIDDLNTRIESPQSKIPEIDGLNIESPQPKIPILIVPITQIEESV---PIISCNNEDDQVNPNRSDK-QPETLVYS--RRPTVQRGVEPPQ

Query:  PQQQSHESISSLGTEQSTLVPQDNTNDLDLPIALRKGDEI----KPNDGHETLNAYEERYVYMIEI
        P Q+             T++P+   NDL+LPI +RKG  +     P + +E LN  E +   + E+
Subjt:  PQQQSHESISSLGTEQSTLVPQDNTNDLDLPIALRKGDEI----KPNDGHETLNAYEERYVYMIEI

RVW98618.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.1e-5537.32Show/hide
Query:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLN---------YAIRHVPYDKNS-------------------KDAERFHKHVEKER
        +V+AWLINSME  I +  +F  TAKD+W A+   +SD +NS+Q+ +            R V    N                     D+ R  K  E +R
Subjt:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLN---------YAIRHVPYDKNS-------------------KDAERFHKHVEKER

Query:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARG-------------DYWPPP-------------P
        +Y FLAGL   LD+VRGR+L  KP+ +I E+F+EVR E +R++VM+ D   K  +  +ESSAL ++G             D+   P             P
Subjt:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARG-------------DYWPPP-------------P

Query:  TNTPSSRTSSS-GYQVGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTVFHDMFTMYS
         N      S    +Q   + S    +    P F K QL  LY+L   P  S PS S   Q     AAL+S + +    WI+DSGATDHMT    +F+ Y 
Subjt:  TNTPSSRTSSS-GYQVGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTVFHDMFTMYS

Query:  PNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYFRGPSLRNK--Q
        P      +K+ADGS + I G GS+ +SP++TLH+VLHVP L CNL+S+ K+T D +CQA F  S C FQ+L +G TIGNA    GLY+F   S   K  Q
Subjt:  PNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYFRGPSLRNK--Q

Query:  VLQGETEPITS--SLDGN
         + G+   + +  SLDG+
Subjt:  VLQGETEPITS--SLDGN

XP_022159153.1 uncharacterized protein LOC111025577 [Momordica charantia]5.4e-14575.13Show/hide
Query:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNY-----AIRHVPYD-----------------------KNSKDAERFHKHVEKER
        MVMAWLINSMEEDIKE FIF STAKDLWNALTMAFSDFDNSAQL        ++R    D                       +NSKDAERF KHVEKER
Subjt:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNY-----AIRHVPYD-----------------------KNSKDAERFHKHVEKER

Query:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGDYWPPPPTNTPSSRTSSSGYQVGPSVSNSQDLA
        IYDFLAGLRPELDDVRGRLLATKPI AIDEIFAEV WESSRKRVMMGDTHTKPLSLSLESSALAARG    PPP+  P                      
Subjt:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGDYWPPPPTNTPSSRTSSSGYQVGPSVSNSQDLA

Query:  ISLPPFPKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTVFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPN
                AQLEQLYRLLT PVESTPSSSFVAQRGI SAALT QQHSDQWILDSGATDHMT FHDMFTMYSPNPIQTHVKLADGSSAIIKGFGS+ILSPN
Subjt:  ISLPPFPKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTVFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPN

Query:  ITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYFRGPSLRNKQVLQGETEPITSSL
        ITLHSVL VPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITG TIG+ADGFEGLYYFRGPSLRNKQVLQG T   +S +
Subjt:  ITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYFRGPSLRNKQVLQGETEPITSSL

XP_024044151.1 uncharacterized protein LOC18046468 isoform X1 [Citrus clementina]1.2e-5438.38Show/hide
Query:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNYAIRHVPYDKNSK----------------------------DAERFHKHVEKER
        M+M+WL+NSME++I + ++F  TAKDLW+A+T  +SD  NSAQ+ +   R     + S+                            D+ ++ K +EKER
Subjt:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNYAIRHVPYDKNSK----------------------------DAERFHKHVEKER

Query:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTH------------------TKPLSLSLESSAL---------AARGDYW---
        +++FLAGL  +LD+VRGR+L  +P+ +  E+F+ VR E SRK VMMG +                   TK L  S E   +           R   W   
Subjt:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTH------------------TKPLSLSLESSAL---------AARGDYW---

Query:  --PPPPTNTPSSRTSSSGYQ-VGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTVFH
          PP   N   S   S G+Q VG +   +         F K QLEQLYR L         SSF  +AQ+G  F+A     +  D WI+DSGATDHMT   
Subjt:  --PPPPTNTPSSRTSSSGYQ-VGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTVFH

Query:  DMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYF
         +F+ Y P      +K+ADGS + + G GSI +S N+ L SVLHVP L CNL+SV K+T DL C A F+ S C FQDL +G  IG+A   +GLYYF
Subjt:  DMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYF

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina]1.2e-5438.38Show/hide
Query:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNYAIRHVPYDKNSK----------------------------DAERFHKHVEKER
        M+M+WL+NSME++I + ++F  TAKDLW+A+T  +SD  NSAQ+ +   R     + S+                            D+ ++ K +EKER
Subjt:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNYAIRHVPYDKNSK----------------------------DAERFHKHVEKER

Query:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTH------------------TKPLSLSLESSAL---------AARGDYW---
        +++FLAGL  +LD+VRGR+L  +P+ +  E+F+ VR E SRK VMMG +                   TK L  S E   +           R   W   
Subjt:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTH------------------TKPLSLSLESSAL---------AARGDYW---

Query:  --PPPPTNTPSSRTSSSGYQ-VGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTVFH
          PP   N   S   S G+Q VG +   +         F K QLEQLYR L         SSF  +AQ+G  F+A     +  D WI+DSGATDHMT   
Subjt:  --PPPPTNTPSSRTSSSGYQ-VGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTVFH

Query:  DMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYF
         +F+ Y P      +K+ADGS + + G GSI +S N+ L SVLHVP L CNL+SV K+T DL C A F+ S C FQDL +G  IG+A   +GLYYF
Subjt:  DMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYF

TrEMBL top hitse value%identityAlignment
A0A2Z7D6C0 Beta-galactosidase1.1e-5334.83Show/hide
Query:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNYAIR-------------------------HVPYDKN---SKDAERFHKHVEKER
        MV AWLINSME  I   F+F   A+++W+A+   +SD +NS+Q+ +   R                          + YD       D+ ++HK +E +R
Subjt:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNYAIR-------------------------HVPYDKN---SKDAERFHKHVEKER

Query:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAAR-------------------------------GDY
        +Y FLAGL  +LD+VRGR+L   P+ ++ E+FAEVR E  R+++M+    +  LS + E+SA+ ++                                D 
Subjt:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAAR-------------------------------GDY

Query:  WPPPPTNTPSSRTSSSG-----YQVGPSVSNSQDLAISLPPFPKAQLEQLYRLL--TPPVESTPSSSFVAQRGIFSAAL---TSQQHSDQWILDSGATDH
           PP   P S     G     YQ G     + + +  +  F K Q++QLY+L   T    S P S  +A +G +  ++    S      WI+DSGATDH
Subjt:  WPPPPTNTPSSRTSSSG-----YQVGPSVSNSQDLAISLPPFPKAQLEQLYRLL--TPPVESTPSSSFVAQRGIFSAAL---TSQQHSDQWILDSGATDH

Query:  MTVFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYY
        MT    +F  Y P      +K+ADG+ + I G G+I++S  ITLH+VLHVP L CNL+S+ KLT DLKC A F+ + C+FQ+L +G TIG+A    GLYY
Subjt:  MTVFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYY

Query:  FRGPSLRNKQVLQGETEPITSS
        F   S    Q  Q    PI+SS
Subjt:  FRGPSLRNKQVLQGETEPITSS

A0A438DT81 Retrovirus-related Pol polyprotein from transposon RE17.4e-5535.84Show/hide
Query:  KDAERFHKHVEKERIYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGDYWPPPPT--NTPSSRTSS
        KD +RF   + KERIYDFLAGL  +LD VR R+LA K +  I+E+FAEVR E + KRVM+  T T  L  S ++SAL ARG  +  P    +   +R++S
Subjt:  KDAERFHKHVEKERIYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGDYWPPPPT--NTPSSRTSS

Query:  SGYQVGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTP-----SSSFVAQRGIFSAALT-SQQHSDQWILDSGATDHMTVFHDMFTMYSPNPIQTH
           +V  + S   +         K QLEQLY+LLTP V +       ++SF+AQ+  F  AL+ + +  D WI+DS A++HMT +   F+ Y+ +     
Subjt:  SGYQVGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTP-----SSSFVAQRGIFSAALT-SQQHSDQWILDSGATDHMTVFHDMFTMYSPNPIQTH

Query:  VKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYFRGPSLRNKQVLQGETEPI
        VK ADGS   +   G+I+L+P+ITL+ VLHVPKL CNL+S+ KLT DL C   F  S C FQ++ +G  IG+A    GLYYF   +  NKQ         
Subjt:  VKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYFRGPSLRNKQVLQGETEPI

Query:  TSSLDGNFWEIDDLNTRIESPQSKIPEIDGLNIESPQPKIPILIVPITQIEESV---PIISCNNEDDQVNPNRSDK-QPETLVYS--RRPTVQRGVEPPQ
         SS+  N           +   SK+ +        P    P  IV  +Q++ +    P++     D  + P  + + Q E  VYS   RP         Q
Subjt:  TSSLDGNFWEIDDLNTRIESPQSKIPEIDGLNIESPQPKIPILIVPITQIEESV---PIISCNNEDDQVNPNRSDK-QPETLVYS--RRPTVQRGVEPPQ

Query:  PQQQSHESISSLGTEQSTLVPQDNTNDLDLPIALRKGDEI----KPNDGHETLNAYEERYVYMIEI
        P Q+             T++P+   NDL+LPI +RKG  +     P + +E LN  E +   + E+
Subjt:  PQQQSHESISSLGTEQSTLVPQDNTNDLDLPIALRKGDEI----KPNDGHETLNAYEERYVYMIEI

A0A438IPH4 Retrovirus-related Pol polyprotein from transposon RE15.1e-5637.32Show/hide
Query:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLN---------YAIRHVPYDKNS-------------------KDAERFHKHVEKER
        +V+AWLINSME  I +  +F  TAKD+W A+   +SD +NS+Q+ +            R V    N                     D+ R  K  E +R
Subjt:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLN---------YAIRHVPYDKNS-------------------KDAERFHKHVEKER

Query:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARG-------------DYWPPP-------------P
        +Y FLAGL   LD+VRGR+L  KP+ +I E+F+EVR E +R++VM+ D   K  +  +ESSAL ++G             D+   P             P
Subjt:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARG-------------DYWPPP-------------P

Query:  TNTPSSRTSSS-GYQVGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTVFHDMFTMYS
         N      S    +Q   + S    +    P F K QL  LY+L   P  S PS S   Q     AAL+S + +    WI+DSGATDHMT    +F+ Y 
Subjt:  TNTPSSRTSSS-GYQVGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMTVFHDMFTMYS

Query:  PNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYFRGPSLRNK--Q
        P      +K+ADGS + I G GS+ +SP++TLH+VLHVP L CNL+S+ K+T D +CQA F  S C FQ+L +G TIGNA    GLY+F   S   K  Q
Subjt:  PNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYFRGPSLRNK--Q

Query:  VLQGETEPITS--SLDGN
         + G+   + +  SLDG+
Subjt:  VLQGETEPITS--SLDGN

A0A6A2WU09 60S ribosomal protein L386.2e-5437.32Show/hide
Query:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNYAIR---------------------------HVPYDKN-SKDAERFHKHVEKER
        M+M+WLINSM+  +   ++F  TA D+WNA+   +SD  N+ Q      R                              Y+ + +KD   F K VEKER
Subjt:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNYAIR---------------------------HVPYDKN-SKDAERFHKHVEKER

Query:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAA---------RGDYWPP-------------------
        +++FL GL  ELD+VRGR+L  +P+ +  E+F+EVR E SR+ VM+G     P     ESSAL +         R D   P                   
Subjt:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAA---------RGDYWPP-------------------

Query:  --PPTNTPSSRTSSSG---YQVGPSVSNSQDLAISLPPFPKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHM
          P  N  ++R S +    +      S     A  L  F K QLEQLY+L+       TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHM
Subjt:  --PPTNTPSSRTSSSG---YQVGPSVSNSQDLAISLPPFPKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHM

Query:  TVFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYF
        T    +F+ Y P      VK+ADGS   I G GSII+SP++TL +VLHVPKL CNLISV ++ HD KC A  T +   FQD  +G  IGNA   +GLY+ 
Subjt:  TVFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYF

Query:  RGPSLRNKQV
           +  NKQV
Subjt:  RGPSLRNKQV

A0A6J1DY12 uncharacterized protein LOC1110255772.6e-14575.13Show/hide
Query:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNY-----AIRHVPYD-----------------------KNSKDAERFHKHVEKER
        MVMAWLINSMEEDIKE FIF STAKDLWNALTMAFSDFDNSAQL        ++R    D                       +NSKDAERF KHVEKER
Subjt:  MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNY-----AIRHVPYD-----------------------KNSKDAERFHKHVEKER

Query:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGDYWPPPPTNTPSSRTSSSGYQVGPSVSNSQDLA
        IYDFLAGLRPELDDVRGRLLATKPI AIDEIFAEV WESSRKRVMMGDTHTKPLSLSLESSALAARG    PPP+  P                      
Subjt:  IYDFLAGLRPELDDVRGRLLATKPISAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGDYWPPPPTNTPSSRTSSSGYQVGPSVSNSQDLA

Query:  ISLPPFPKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTVFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPN
                AQLEQLYRLLT PVESTPSSSFVAQRGI SAALT QQHSDQWILDSGATDHMT FHDMFTMYSPNPIQTHVKLADGSSAIIKGFGS+ILSPN
Subjt:  ISLPPFPKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTVFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPN

Query:  ITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYFRGPSLRNKQVLQGETEPITSSL
        ITLHSVL VPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITG TIG+ADGFEGLYYFRGPSLRNKQVLQG T   +S +
Subjt:  ITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLYYFRGPSLRNKQVLQGETEPITSSL

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.0e-0932.21Show/hide
Query:  QLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTVFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPN---ITLHSVLHV
        QL   L+      P S F   +   + AL S   S+ W+LDSGAT H+T   +  +++ P      V +ADGS+  I   GS  LS     + LH++L+V
Subjt:  QLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTVFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPN---ITLHSVLHV

Query:  PKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLY
        P +  NLISV +L +       F  +    +DL TG+ +      + LY
Subjt:  PKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.9e-0727.55Show/hide
Query:  RGDYWPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTVFHD
        R + W P      SS + S   Q  P +   Q    S+      +  QL++  +   +   +S F   +   + A+ S  +++ W+LDSGAT H+T   +
Subjt:  RGDYWPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTVFHD

Query:  MFTMYSPNPIQTHVKLADGSSAIIKGFGSIIL---SPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLY
          + + P      V +ADGS+  I   GS  L   S ++ L+ VL+VP +  NLISV +L +  +    F  +    +DL TG+ +      + LY
Subjt:  MFTMYSPNPIQTHVKLADGSSAIIKGFGSIIL---SPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGMTIGNADGFEGLY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCATGGCATGGCTCATTAACTCGATGGAGGAGGACATTAAAGAATTCTTCATCTTCTGCTCAACAGCAAAGGATCTTTGGAATGCGCTCACTATGGCTTTT
TCTGATTTTGATAACTCGGCTCAATTGTTGAATTACGCAATAAGGCACGTTCCTTACGACAAGAACTCGAAGGATGCTGAACGCTTCCACAAACACGTCGAGAAG
GAACGAATTTATGATTTTCTTGCAGGTCTTCGTCCGGAATTAGATGATGTGCGTGGCCGCTTACTTGCCACAAAGCCAATCTCAGCCATTGACGAAATCTTCGCA
GAAGTTCGCTGGGAGTCAAGCCGTAAACGTGTGATGATGGGTGATACACACACAAAACCTCTGTCCCTCTCACTGGAATCATCAGCTCTGGCGGCACGAGGGGAT
TATTGGCCACCTCCACCTACAAATACTCCATCTTCTCGAACCAGCTCCTCCGGCTATCAAGTGGGCCCTAGTGTGTCAAACTCCCAAGATTTGGCCATCTCTCTC
CCTCCATTTCCGAAGGCACAACTTGAACAACTCTATCGCCTCTTAACACCGCCGGTTGAGTCTACTCCTTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGT
GCAGCTTTAACAAGTCAGCAGCATTCCGATCAGTGGATCTTAGATTCGGGTGCAACTGATCATATGACCGTTTTTCATGATATGTTTACCATGTACTCACCCAAC
CCGATTCAGACACATGTCAAGCTTGCAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTATTATTCTTAGCCCAAACATTACATTGCACTCAGTGCTCCAT
GTGCCTAAATTATGTTGCAATCTAATTTCTGTTCAGAAGTTGACTCATGATTTAAAGTGTCAAGCCCTGTTCACTGACTCTAAGTGTTTGTTTCAGGACTTGATA
ACGGGCATGACGATTGGCAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCCAGGGAGAGACTGAGCCTATT
ACAAGTAGTCTTGATGGAAATTTTTGGGAGATTGACGATCTAAATACCAGAATTGAGTCCCCTCAGTCAAAAATACCTGAGATTGATGGTCTAAATATCGAGTCA
CCTCAGCCAAAAATACCTATCCTGATCGTCCCAATTACTCAGATAGAAGAGTCAGTTCCTATTATTTCCTGTAATAATGAAGATGATCAAGTCAACCCAAATCGA
AGTGACAAGCAACCAGAGACTCTTGTTTATTCTCGGCGACCAACGGTTCAAAGAGGAGTGGAGCCACCACAGCCTCAACAGCAAAGTCATGAATCCATCTCGTCC
TTAGGTACGGAACAATCTACCCTTGTGCCTCAAGACAATACTAATGATCTTGATCTTCCTATTGCACTTAGGAAGGGTGACGAAATTAAACCAAATGATGGACAC
GAAACTCTGAATGCTTATGAAGAGCGGTATGTTTACATGATTGAAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCATGGCATGGCTCATTAACTCGATGGAGGAGGACATTAAAGAATTCTTCATCTTCTGCTCAACAGCAAAGGATCTTTGGAATGCGCTCACTATGGCTTTT
TCTGATTTTGATAACTCGGCTCAATTGTTGAATTACGCAATAAGGCACGTTCCTTACGACAAGAACTCGAAGGATGCTGAACGCTTCCACAAACACGTCGAGAAG
GAACGAATTTATGATTTTCTTGCAGGTCTTCGTCCGGAATTAGATGATGTGCGTGGCCGCTTACTTGCCACAAAGCCAATCTCAGCCATTGACGAAATCTTCGCA
GAAGTTCGCTGGGAGTCAAGCCGTAAACGTGTGATGATGGGTGATACACACACAAAACCTCTGTCCCTCTCACTGGAATCATCAGCTCTGGCGGCACGAGGGGAT
TATTGGCCACCTCCACCTACAAATACTCCATCTTCTCGAACCAGCTCCTCCGGCTATCAAGTGGGCCCTAGTGTGTCAAACTCCCAAGATTTGGCCATCTCTCTC
CCTCCATTTCCGAAGGCACAACTTGAACAACTCTATCGCCTCTTAACACCGCCGGTTGAGTCTACTCCTTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGT
GCAGCTTTAACAAGTCAGCAGCATTCCGATCAGTGGATCTTAGATTCGGGTGCAACTGATCATATGACCGTTTTTCATGATATGTTTACCATGTACTCACCCAAC
CCGATTCAGACACATGTCAAGCTTGCAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTATTATTCTTAGCCCAAACATTACATTGCACTCAGTGCTCCAT
GTGCCTAAATTATGTTGCAATCTAATTTCTGTTCAGAAGTTGACTCATGATTTAAAGTGTCAAGCCCTGTTCACTGACTCTAAGTGTTTGTTTCAGGACTTGATA
ACGGGCATGACGATTGGCAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCCAGGGAGAGACTGAGCCTATT
ACAAGTAGTCTTGATGGAAATTTTTGGGAGATTGACGATCTAAATACCAGAATTGAGTCCCCTCAGTCAAAAATACCTGAGATTGATGGTCTAAATATCGAGTCA
CCTCAGCCAAAAATACCTATCCTGATCGTCCCAATTACTCAGATAGAAGAGTCAGTTCCTATTATTTCCTGTAATAATGAAGATGATCAAGTCAACCCAAATCGA
AGTGACAAGCAACCAGAGACTCTTGTTTATTCTCGGCGACCAACGGTTCAAAGAGGAGTGGAGCCACCACAGCCTCAACAGCAAAGTCATGAATCCATCTCGTCC
TTAGGTACGGAACAATCTACCCTTGTGCCTCAAGACAATACTAATGATCTTGATCTTCCTATTGCACTTAGGAAGGGTGACGAAATTAAACCAAATGATGGACAC
GAAACTCTGAATGCTTATGAAGAGCGGTATGTTTACATGATTGAAATTTGA
Protein sequenceShow/hide protein sequence
MVMAWLINSMEEDIKEFFIFCSTAKDLWNALTMAFSDFDNSAQLLNYAIRHVPYDKNSKDAERFHKHVEKERIYDFLAGLRPELDDVRGRLLATKPISAIDEIFA
EVRWESSRKRVMMGDTHTKPLSLSLESSALAARGDYWPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFPKAQLEQLYRLLTPPVESTPSSSFVAQRGIFS
AALTSQQHSDQWILDSGATDHMTVFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSIILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLI
TGMTIGNADGFEGLYYFRGPSLRNKQVLQGETEPITSSLDGNFWEIDDLNTRIESPQSKIPEIDGLNIESPQPKIPILIVPITQIEESVPIISCNNEDDQVNPNR
SDKQPETLVYSRRPTVQRGVEPPQPQQQSHESISSLGTEQSTLVPQDNTNDLDLPIALRKGDEIKPNDGHETLNAYEERYVYMIEI