; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0002422 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0002422
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr11:20753938..20755638
RNA-Seq ExpressionPI0002422
SyntenyPI0002422
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]2.5e-7347.55Show/hide
Query:  MSDGEQPQFKLDPEIERTFRRNRRRARQRQARR-MENNRNA-----------------PPPHAVPEPNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAF
        MS+G+ P F +DPEIERTFRR  R+ +QR++ + +E N +A                    HA  + N   +AHD +RP+R YA+PNLYNF PGI  P F
Subjt:  MSDGEQPQFKLDPEIERTFRRNRRRARQRQARR-MENNRNA-----------------PPPHAVPEPNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAF

Query:  GENARFEIKSIILQMIQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENA
          N RFE+K ++LQM+Q AGQFGG  GE+PH H++SF  IC++F M G+  + +R TLFP +LRDEA++W  + E GE+ TW +++EKFM+K+FPP  +A
Subjt:  GENARFEIKSIILQMIQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENA

Query:  RRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEE
        +RR+ +++F+QKD E   +AW+RFKR+VR CPHNGIP C+ ME+FY GLNK  Q  ADA  A  ++  +Y + K  LD ++ N  +
Subjt:  RRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEE

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]2.7e-7535Show/hide
Query:  MSDGEQPQFKLDPEIERTFRRNR---RRARQRQARRMENNRNAPPPHAVPEPNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQM
        MS      F  DPEIERTF R R   R+ +Q Q    +N  N   P     P  A+I  D DR IR YA P     N GI  P   +  +FE+K ++ QM
Subjt:  MSDGEQPQFKLDPEIERTFRRNR---RRARQRQARRMENNRNAPPPHAVPEPNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQM

Query:  IQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRE
        +Q  GQF G P E+PH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K+FPP+ NA+ R ++ SFQQ+D E
Subjt:  IQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRE

Query:  NLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESMDKSVVVALQ
        +L+DAW RFK ++R CPH+GI  CI ME FY GLN   +   DA     +L  SYN+    L+T+   N +W       + G +  G   +D   + +++
Subjt:  NLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESMDKSVVVALQ

Query:  GQMTAMNNLLKSMAISQVNATGGSVHA-TNQIDDMGCVGCSDPHNTDACPLNNETVAYQP----------------------------------------
         Q+ +M ++LK++++    +   S+ +  NQ  ++ CV C + H  D+CP N E+V Y                                          
Subjt:  GQMTAMNNLLKSMAISQVNATGGSVHA-TNQIDDMGCVGCSDPHNTDACPLNNETVAYQP----------------------------------------

Query:  ------TTIAPSTSSMENLLREYMQKN-------DALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRSGRNLTICD
              +  AP ++S+EN+L+EY+ KN       +AL+Q+QA+S+RNLE Q+GQLAN+   R  G+LPS+TE P   +G   E C A+TL+SG+ L    
Subjt:  ------TTIAPSTSSMENLLREYMQKN-------DALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRSGRNLTICD

Query:  PDAERSYSNSNSTAEIGSSSKIP---TLVHFPLTDNVSSSQNNDAPSKGFESKRQRNQQS
         DA+  Y +S    E   + +IP      +  ++   SS +++  P   F  + Q+ +Q+
Subjt:  PDAERSYSNSNSTAEIGSSSKIP---TLVHFPLTDNVSSSQNNDAPSKGFESKRQRNQQS

XP_017239618.1 PREDICTED: uncharacterized protein LOC108212402 [Daucus carota subsp. sativus]1.3e-7436.08Show/hide
Query:  MSDGEQPQFKLDPEIERTFRRNR---RRARQRQARRMENNRNAPPPHAVPEPNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQM
        MS+     F  DPEIERTF R R   R+ +Q Q    +N  N   P     P  A+I  D DR IR YA P     N GI  P   +  +FE+K ++ QM
Subjt:  MSDGEQPQFKLDPEIERTFRRNR---RRARQRQARRMENNRNAPPPHAVPEPNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQM

Query:  IQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRE
        +Q  GQF G P E+PH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K+FPP+ NA+   ++ SFQQ+D E
Subjt:  IQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRE

Query:  NLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESMDKSVVVALQ
        +L+DAW RFK ++R CPH+GI   I ME FY GLN   +   DA     +L  SYN+    L+T+  NN +W       + G +  G   +D   + +++
Subjt:  NLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESMDKSVVVALQ

Query:  GQMTAMNNLLKSMAISQVNATGGSVHA-TNQIDDMGCVGCSDPHNTDACPLNNETVAYQP----------------------------------------
         Q+ +M ++LK++++    +   S+ +  NQ  ++ CV C + H  D+CP N E+V Y                                          
Subjt:  GQMTAMNNLLKSMAISQVNATGGSVHA-TNQIDDMGCVGCSDPHNTDACPLNNETVAYQP----------------------------------------

Query:  ------TTIAPSTSSMENLLREYMQKN-------DALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRSGRNLTICD
              +  AP ++S+EN+L+EY+ KN       +AL+Q+QA+S+RNLE Q+GQL N+   R  G+LPS+TE P   +G   E C A+TL+SG+ L    
Subjt:  ------TTIAPSTSSMENLLREYMQKN-------DALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRSGRNLTICD

Query:  PDAERSYSNSNSTAEIGSSSK
         DA+   S   S  E  S +K
Subjt:  PDAERSYSNSNSTAEIGSSSK

XP_030483210.1 uncharacterized protein LOC115699807 [Cannabis sativa]4.2e-7336.61Show/hide
Query:  IAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQMIQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWT
        +A D D+ IR YA P     NPGI  P   +  +FE+K ++ QM+Q  GQF G P E+PH H+R F  +  SF +PG++ + LR  LFP +LRD+A+ W 
Subjt:  IAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQMIQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWT

Query:  NALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYN
        N+L    V TW +L E+F+ K+FPP +NA+ RK++ SFQQ + E+L++AW RFK ++R CPH+GIP CI ME FY GLN   +   DA     +L  SYN
Subjt:  NALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYN

Query:  RIKATLDTMTNNNEEWDED--DFGNRRGGRSRGDESMDKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVHATNQIDDMGCVGCSDPHNTDACPLNNET
             ++ ++NNN +W       G +  G       ++   + AL  Q+ +M+N++K+M++ Q            Q++++ CV CS+ H  D CP N  +
Subjt:  RIKATLDTMTNNNEEWDED--DFGNRRGGRSRGDESMDKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVHATNQIDDMGCVGCSDPHNTDACPLNNET

Query:  VAYQ-------------------------------PTTIAP---------------STSSMENLLRE--------------YMQKNDALLQNQASSIRNL
        V Y                                P +  P                TSS+E+++R+              YM KND  +Q+QA+S+R L
Subjt:  VAYQ-------------------------------PTTIAP---------------STSSMENLLRE--------------YMQKNDALLQNQASSIRNL

Query:  EVQLGQLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRSGRNL
        E Q+GQLAN+   R QG+LPS+TE P   R   KE C AV LRSG+ L
Subjt:  EVQLGQLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRSGRNL

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]6.7e-7938.99Show/hide
Query:  EPNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQMIQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRD
        E N   +A D  R IR YA P     NPGI  P   +   FE+K ++ QM+Q  GQFGG P E+PH HIRSF  +  SF + G+S E LR  LFP +LRD
Subjt:  EPNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQMIQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRD

Query:  EAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDM
         A+ W N L    V  W+ L EKF++K+FPP  NA+ R ++MSFQQ + E   DAW RFK ++R CPH+GIP CI +E FY GLN A +   DA     +
Subjt:  EAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDM

Query:  LKSSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESMDKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVH--ATNQIDDMGCVGCSDPHNTDAC
        L  SYN     L+ + +NN +W      NR     +    ++   + AL  QM +M N+LK+M +      GGSV   A  Q     CV C D H  + C
Subjt:  LKSSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESMDKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVH--ATNQIDDMGCVGCSDPHNTDAC

Query:  PLNNETVAY-------------------------------------------QPTTIAP------STSSMENLLREYMQKNDALLQNQASSIRNLEVQLG
        P N  +V Y                                           QP    P       TSS+E+L+R+YM KND ++Q+QA+S+RNLEVQLG
Subjt:  PLNNETVAY-------------------------------------------QPTTIAP------STSSMENLLREYMQKNDALLQNQASSIRNLEVQLG

Query:  QLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRSGRNLTICDPDAERSYSNSNSTAEIGSSSKIPTLVHFPLTDNVSSSQNNDAPSKGFE
        QLAND   R QG+LPS+TE P   R   KE C AVTLRSG+ +   +  A RS   S+   E G   K P      +   V++S  + A  K  +
Subjt:  QLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRSGRNLTICDPDAERSYSNSNSTAEIGSSSKIPTLVHFPLTDNVSSSQNNDAPSKGFE

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129452.5e-6333.75Show/hide
Query:  DPEIERTFRRNRRR----ARQRQARRMENNRNAPPPHAVPEPNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQMIQNAGQFGGH
        DP+IERTFRR+RR     A   Q    +NN N          NA  +  + +R +R Y  P +   +  I  P+   N  FEIK   +QMIQ++ QF G 
Subjt:  DPEIERTFRRNRRR----ARQRQARRMENNRNAPPPHAVPEPNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQMIQNAGQFGGH

Query:  PGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFK
        P ++P+ H+ +F  IC +F   G++ + +R  LFP +LRD+AK W N+L +G + TW+ L +KF+ KFFPP + A+ R  + SF Q D E+L++AW RFK
Subjt:  PGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFK

Query:  RMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESMDKSVVVALQGQMTAMNNLL
         ++R CPH+GIP+ + ++ FY GL  + +   DA     ++  +       L+ M +NN +W  +  G+R   ++ G   +D   +  L  Q+ A++  L
Subjt:  RMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESMDKSVVVALQGQMTAMNNLL

Query:  KSMAISQVNATGGSVHATNQIDDMGCVGCSDPHNTDACPLNNETVAY-----------------------------------QPTTIAP-----------
         ++           VHA  Q   + C  C D H+ D CP N+E+V +                                    P  I P           
Subjt:  KSMAISQVNATGGSVHATNQIDDMGCVGCSDPHNTDACPLNNETVAY-----------------------------------QPTTIAP-----------

Query:  --STSSMENLLREYMQKNDALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRSGRNL
            S +E LL +Y+ K DA++Q+Q +S+RNLE Q+GQLAN  + R QGSLPS+T    Q     KE+C A+TLRSG+ +
Subjt:  --STSSMENLLREYMQKNDALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRSGRNL

A0A6J1DW02 uncharacterized protein LOC1110248971.5e-6032.14Show/hide
Query:  LDPEIERTFRRNRRRARQRQARRMENNR------------------NAPPPHAVPEP---------------NAAYIAHDLDRPIRSYATPNLYNFNPGI
        LDPEIERT R+ R+  R R+    +  R                  + PP   V  P               N   +A + D  +R YA     NF+ GI
Subjt:  LDPEIERTFRRNRRRARQRQARRMENNR------------------NAPPPHAVPEP---------------NAAYIAHDLDRPIRSYATPNLYNFNPGI

Query:  AYPAFGENARFEIKSIILQMIQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFP
          P    +  FE+K ++ QM+Q  G FGG   E+PH+H++SF  I  +F +PGI+ +    TLFP +L+D+A+   NA   G + TW  L+EKF+ KFFP
Subjt:  AYPAFGENARFEIKSIILQMIQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFP

Query:  PHENARRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEEWDEDDFGNR
        P  +A  R++++SF+Q DRE +H+AW RFK ++R C ++G+P C  +E F+ GL+   +   +        K ++N I   L+ + ++NE W        
Subjt:  PHENARRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEEWDEDDFGNR

Query:  RGGRSRGDES--MDKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVHATN----------QIDDMGC-----------VGCSDPHNTDACPLNNETVAY
        R    + D +  +   +  ++Q +M  MN  LK MA+   N     +              Q++D+ C            G S   N      N +   Y
Subjt:  RGGRSRGDES--MDKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVHATN----------QIDDMGC-----------VGCSDPHNTDACPLNNETVAY

Query:  QPTT-------------------IAPSTSSMENLLREYMQKNDALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRS
         P T                   I  + S++EN+++EYM + DA++Q+QA+S+RN   QLG LAN+   R QGS P +TE+P   R   KE+C AVTLRS
Subjt:  QPTT-------------------IAPSTSSMENLLREYMQKNDALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRS

Query:  G
        G
Subjt:  G

A0A6J1EQ90 uncharacterized protein LOC1114364111.4e-5831.51Show/hide
Query:  QFKLDPEIERTFRRNRRRARQRQARRMEN-------NRNAPPPHAVPE-----PNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIIL
        +F LDPEIERTFRR  ++ ++   + ++        NR    P  +        N  ++A D +R IR+YA P +   NP I  P   +   FE+K ++ 
Subjt:  QFKLDPEIERTFRRNRRRARQRQARRMEN-------NRNAPPPHAVPE-----PNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIIL

Query:  QMIQNAGQFGGHPGENPHEHIRSFYSI-------CASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKL
        QM+Q  GQF G P E+PH H++SF  +         SF   G+  + +R +LFP  LRD AK W N L  G + +W+ L E F+ K+FPP  NAR + ++
Subjt:  QMIQNAGQFGGHPGENPHEHIRSFYSI-------CASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKL

Query:  MSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESM
        ++FQQ + E L +A  RFK M+R CPH+G+P CI ME FY GLN   +Q  DA     +L  +YN     L+ + +NN +W   D  +  G ++RG   +
Subjt:  MSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESM

Query:  DKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVH---ATNQIDDMGCVGCSDPHNTDACPLNNETVAY-------------------------------
        D   + ++  Q+ ++ N+L+++A+ Q +     VH   A NQ     CV C + H  D CP N  ++ Y                               
Subjt:  DKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVH---ATNQIDDMGCVGCSDPHNTDACPLNNETVAY-------------------------------

Query:  -------------------------------------QPTTIAPSTS--SMENLLREYMQKNDALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTE
                                             + TT A  TS  S+E+L++EYM KNDA++Q+Q +S+RNLEVQ+G   N           ++T+
Subjt:  -------------------------------------QPTTIAPSTS--SMENLLREYMQKNDALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTE

Query:  MPNQARGSRKE
          N+    +KE
Subjt:  MPNQARGSRKE

A0A6J1G7Q6 uncharacterized protein LOC1114515981.2e-6232.96Show/hide
Query:  NAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQMIQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEA
        NA ++A D +R IR+YA P +   NP I  P   +   FE+K ++ QM+Q  GQF G   ++PH H++SF  +  SF   G+  + +R + F  +LRD A
Subjt:  NAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQMIQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEA

Query:  KRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLK
        K W N L  G + +W+ L EKF+ K+FPP  +AR R ++++FQ+ + E L +AW RFK  +R CPH+G+P CI +E FY GLN A +Q  DA    D+L 
Subjt:  KRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLK

Query:  SSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESMDKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVHATN---QIDDMGCVGCSDPHNTDACP
         +YN     L+ + +NN +W   D  +  G ++R  E ++   + ++  Q+ +M N+L+++A  Q +      H      Q     CV C + H  D CP
Subjt:  SSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESMDKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVHATN---QIDDMGCVGCSDPHNTDACP

Query:  LNNETVAY--------------------------------------------------------------QPTTIAPSTSS--------MENLLREYMQK
         N  ++ Y                                                              Q TT    TS         +E+L++EYM +
Subjt:  LNNETVAY--------------------------------------------------------------QPTTIAPSTSS--------MENLLREYMQK

Query:  NDALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTEMPNQ
        NDA++Q+Q  S+RNLEVQ+GQLAN+   R  G LP++TEMP +
Subjt:  NDALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTEMPNQ

U5CUI2 Retrotrans_gag domain-containing protein3.6e-6242.81Show/hide
Query:  NAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQMIQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEA
        N   +A D  R IR YA P     NPGI  P   +  +FE+K ++ QM+Q  GQF G P E+PH H+RSF  +  SF + G+S E LR  LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQMIQNAGQFGGHPGENPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEA

Query:  KRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLK
        + W N L    V  W+ L EKF++K+FPP  NA+ R ++MSFQQ + E+  DAW RFK ++R CPH+GIP CI ME FY GLN A +   DA     +L 
Subjt:  KRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPECILMEVFYFGLNKARQQTADAVFANDMLK

Query:  SSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDES-MDKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVHATNQIDDMGCVGCSDPHNTDACPLN
         SYN     L+T+ +NN +W      N R   SR     ++   + AL  QM +M N+LK+++I   NA      A  Q DD+ CV C + H  + CP N
Subjt:  SSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDES-MDKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVHATNQIDDMGCVGCSDPHNTDACPLN

Query:  NETVAY
         E+V Y
Subjt:  NETVAY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGACGGTGAACAACCACAATTCAAACTTGACCCTGAAATTGAGCGAACATTTCGACGTAACCGGCGAAGAGCAAGGCAGAGACAAGCTAGAAGAATGGAAAACAA
CAGAAATGCCCCTCCGCCGCATGCTGTCCCAGAACCAAATGCTGCCTACATAGCACACGACTTGGATAGGCCGATTAGATCTTATGCGACACCCAACCTCTACAACTTCA
ACCCAGGAATCGCTTACCCTGCATTCGGCGAGAACGCCAGGTTTGAAATCAAATCTATAATACTTCAGATGATTCAGAACGCCGGACAATTCGGCGGTCATCCTGGGGAA
AATCCACACGAACATATAAGAAGTTTCTACTCCATCTGCGCTTCCTTCCACATGCCAGGCATCTCACCTGAGGAATTGAGATTCACCCTATTTCCGTTAACTTTGAGGGA
CGAGGCGAAGAGGTGGACAAACGCCTTGGAAGATGGCGAGGTGGGAACATGGGACCAATTAATAGAAAAATTTATGAAGAAATTTTTCCCACCTCACGAAAACGCTAGAA
GAAGGAAGAAGCTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACGCGTGGAGTAGGTTTAAGAGGATGGTGAGAGCATGCCCCCACAATGGCATTCCTGAA
TGCATATTGATGGAGGTTTTCTATTTTGGGCTAAACAAGGCTAGACAGCAGACTGCTGATGCTGTGTTTGCAAACGATATGTTAAAGAGCTCCTACAACCGAATTAAGGC
GACGCTGGATACAATGACCAACAACAATGAAGAATGGGATGAAGATGATTTCGGCAATCGTCGAGGAGGACGATCAAGAGGTGATGAAAGCATGGATAAGAGCGTCGTGG
TGGCATTGCAAGGACAAATGACTGCGATGAACAATCTACTTAAATCCATGGCAATATCGCAGGTCAACGCCACAGGAGGCTCTGTGCATGCGACTAACCAAATTGATGAC
ATGGGATGCGTAGGTTGCAGCGATCCTCATAACACTGACGCATGCCCACTCAATAATGAAACTGTCGCGTACCAGCCCACCACCATTGCTCCATCCACCTCATCTATGGA
AAACCTCCTCCGCGAATATATGCAGAAAAATGATGCTCTTCTGCAAAATCAAGCTTCATCAATTCGCAATCTGGAGGTACAGTTAGGGCAGCTCGCCAACGATTTCTCCA
GAAGACTGCAAGGATCCCTCCCAAGCAATACAGAAATGCCAAATCAGGCGAGGGGATCTCGTAAAGAGAAGTGTCACGCAGTGACACTACGCAGCGGAAGAAATTTAACC
ATCTGCGACCCTGATGCTGAACGTAGCTACTCCAATTCTAATTCTACTGCCGAGATTGGCAGTTCAAGTAAAATTCCTACTCTTGTACATTTCCCTTTAACTGATAATGT
TTCTTCCTCGCAGAATAATGACGCTCCAAGCAAGGGCTTTGAGAGCAAGAGGCAACGGAATCAACAGAGCAAAGCGTCTCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTGACGGTGAACAACCACAATTCAAACTTGACCCTGAAATTGAGCGAACATTTCGACGTAACCGGCGAAGAGCAAGGCAGAGACAAGCTAGAAGAATGGAAAACAA
CAGAAATGCCCCTCCGCCGCATGCTGTCCCAGAACCAAATGCTGCCTACATAGCACACGACTTGGATAGGCCGATTAGATCTTATGCGACACCCAACCTCTACAACTTCA
ACCCAGGAATCGCTTACCCTGCATTCGGCGAGAACGCCAGGTTTGAAATCAAATCTATAATACTTCAGATGATTCAGAACGCCGGACAATTCGGCGGTCATCCTGGGGAA
AATCCACACGAACATATAAGAAGTTTCTACTCCATCTGCGCTTCCTTCCACATGCCAGGCATCTCACCTGAGGAATTGAGATTCACCCTATTTCCGTTAACTTTGAGGGA
CGAGGCGAAGAGGTGGACAAACGCCTTGGAAGATGGCGAGGTGGGAACATGGGACCAATTAATAGAAAAATTTATGAAGAAATTTTTCCCACCTCACGAAAACGCTAGAA
GAAGGAAGAAGCTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACGCGTGGAGTAGGTTTAAGAGGATGGTGAGAGCATGCCCCCACAATGGCATTCCTGAA
TGCATATTGATGGAGGTTTTCTATTTTGGGCTAAACAAGGCTAGACAGCAGACTGCTGATGCTGTGTTTGCAAACGATATGTTAAAGAGCTCCTACAACCGAATTAAGGC
GACGCTGGATACAATGACCAACAACAATGAAGAATGGGATGAAGATGATTTCGGCAATCGTCGAGGAGGACGATCAAGAGGTGATGAAAGCATGGATAAGAGCGTCGTGG
TGGCATTGCAAGGACAAATGACTGCGATGAACAATCTACTTAAATCCATGGCAATATCGCAGGTCAACGCCACAGGAGGCTCTGTGCATGCGACTAACCAAATTGATGAC
ATGGGATGCGTAGGTTGCAGCGATCCTCATAACACTGACGCATGCCCACTCAATAATGAAACTGTCGCGTACCAGCCCACCACCATTGCTCCATCCACCTCATCTATGGA
AAACCTCCTCCGCGAATATATGCAGAAAAATGATGCTCTTCTGCAAAATCAAGCTTCATCAATTCGCAATCTGGAGGTACAGTTAGGGCAGCTCGCCAACGATTTCTCCA
GAAGACTGCAAGGATCCCTCCCAAGCAATACAGAAATGCCAAATCAGGCGAGGGGATCTCGTAAAGAGAAGTGTCACGCAGTGACACTACGCAGCGGAAGAAATTTAACC
ATCTGCGACCCTGATGCTGAACGTAGCTACTCCAATTCTAATTCTACTGCCGAGATTGGCAGTTCAAGTAAAATTCCTACTCTTGTACATTTCCCTTTAACTGATAATGT
TTCTTCCTCGCAGAATAATGACGCTCCAAGCAAGGGCTTTGAGAGCAAGAGGCAACGGAATCAACAGAGCAAAGCGTCTCAATAG
Protein sequenceShow/hide protein sequence
MSDGEQPQFKLDPEIERTFRRNRRRARQRQARRMENNRNAPPPHAVPEPNAAYIAHDLDRPIRSYATPNLYNFNPGIAYPAFGENARFEIKSIILQMIQNAGQFGGHPGE
NPHEHIRSFYSICASFHMPGISPEELRFTLFPLTLRDEAKRWTNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKKLMSFQQKDRENLHDAWSRFKRMVRACPHNGIPE
CILMEVFYFGLNKARQQTADAVFANDMLKSSYNRIKATLDTMTNNNEEWDEDDFGNRRGGRSRGDESMDKSVVVALQGQMTAMNNLLKSMAISQVNATGGSVHATNQIDD
MGCVGCSDPHNTDACPLNNETVAYQPTTIAPSTSSMENLLREYMQKNDALLQNQASSIRNLEVQLGQLANDFSRRLQGSLPSNTEMPNQARGSRKEKCHAVTLRSGRNLT
ICDPDAERSYSNSNSTAEIGSSSKIPTLVHFPLTDNVSSSQNNDAPSKGFESKRQRNQQSKASQ