; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0018061 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0018061
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr02:11658110..11667210
RNA-Seq ExpressionPI0018061
SyntenyPI0018061
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]5.3e-7736.96Show/hide
Query:  FKLDPEIERTF---RGNQRRARQRQIRRMENNRNAPPPQADPEPNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFG
        F  DPEIERTF   R  QR+ +Q Q+   +N  N   P     P   +I  D DR IR YA P     + GI  P   +  +FE+K VM QM+Q +GQF 
Subjt:  FKLDPEIERTF---RGNQRRARQRQIRRMENNRNAPPPQADPEPNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFG

Query:  GHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSR
        G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K+FPP+ NA+ R E+ SFQQ+D E+L+DAW R
Subjt:  GHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSR

Query:  FKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDRNAVVALQGQMTAMNNL
        FK +++ CPH+GI  CI ME FY  LN  T+  VDA     +L  +YNQ    L+T+A  N +W        + G+      D +++ +++ Q+ +M ++
Subjt:  FKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDRNAVVALQGQMTAMNNL

Query:  LKSMAISQVNVAGNSMAA-ANQIDEMGCVGCGGPHNTDACPLNTETVAF---------------------------------------------------
        LK++++        S+++  NQ   + CV CG  H  D+CP N E+V +                                                   
Subjt:  LKSMAISQVNVAGNSMAA-ANQIDEMGCVGCGGPHNTDACPLNTETVAF---------------------------------------------------

Query:  --STSSMENLFREYMQKN-------DALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTKMPNQAGGSGKEKCHAVTLRSGRNL
           ++S+EN+ +EY+ KN       +AL+QSQA+S+RNLE Q+GQLA++   R  G+LPS+T+ P    G G E C A+TL+SG+ L
Subjt:  --STSSMENLFREYMQKN-------DALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTKMPNQAGGSGKEKCHAVTLRSGRNL

XP_017239618.1 PREDICTED: uncharacterized protein LOC108212402 [Daucus carota subsp. sativus]1.1e-7436.55Show/hide
Query:  FKLDPEIERTF---RGNQRRARQRQIRRMENNRNAPPPQADPEPNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFG
        F  DPEIERTF   R  QR+ +Q Q+   +N  N   P     P   +I  D DR IR YA P     + GI  P   +  +FE+K VM QM+Q +GQF 
Subjt:  FKLDPEIERTF---RGNQRRARQRQIRRMENNRNAPPPQADPEPNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFG

Query:  GHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSR
        G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K+FPP+ NA+   E+ SFQQ+D E+L+DAW R
Subjt:  GHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSR

Query:  FKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDRNAVVALQGQMTAMNNL
        FK +++ CPH+GI   I ME FY  LN  T+  VDA     +L  +YNQ    L+T+A NN +W        + G+      D +++ +++ Q+ +M ++
Subjt:  FKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDRNAVVALQGQMTAMNNL

Query:  LKSMAISQVNVAGNSMAA-ANQIDEMGCVGCGGPHNTDACPLNTETVAF---------------------------------------------------
        LK++++        S+++  NQ   + CV CG  H  D+CP N E+V +                                                   
Subjt:  LKSMAISQVNVAGNSMAA-ANQIDEMGCVGCGGPHNTDACPLNTETVAF---------------------------------------------------

Query:  --STSSMENLFREYMQKN-------DALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTKMPNQAGGSGKEKCHAVTLRSGRNL
           ++S+EN+ +EY+ KN       +AL+QSQA+S+RNLE Q+GQL ++   R  G+LPS+T+ P    G G E C A+TL+SG+ L
Subjt:  --STSSMENLFREYMQKN-------DALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTKMPNQAGGSGKEKCHAVTLRSGRNL

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]2.5e-7937.83Show/hide
Query:  ADPEPNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLT
        A  E N   +A D  R IR YA P     +PGI  P   +   FE+K VM QM+Q VGQFGG P EDPH HIRSF  +  SF + G+S E LR  LFP +
Subjt:  ADPEPNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLT

Query:  LRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFV
        LRD A+ W N L    V  W+ L EKF++K+FPP  NA+ R E+MSFQQ + E   DAW RFK +++ CPH+GIP CI +E FY  LN A+   +DA   
Subjt:  LRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFV

Query:  DGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDG-MDRNAVVALQGQMTAMNNLLKSMAISQVNVAGN-SMAAANQIDEMGCVGCGGPHNTD
          +L  +YN+    L+ +A+NN +W      NR     K  G ++ +A+ AL  QM +M N+LK+M     N+ G+   AAA Q  +  CV CG  H  +
Subjt:  DGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDG-MDRNAVVALQGQMTAMNNLLKSMAISQVNVAGN-SMAAANQIDEMGCVGCGGPHNTD

Query:  ACPLNTETVAF--------------------------------------------------------STSSMENLFREYMQKNDALLQSQASSIRNLEVQ
         CP N  +V +                                                         TSS+E+L R+YM KND ++QSQA+S+RNLEVQ
Subjt:  ACPLNTETVAF--------------------------------------------------------STSSMENLFREYMQKNDALLQSQASSIRNLEVQ

Query:  LGQLASDFSGRQQGSLPSNTKMPNQAGGSGKEKCHAVTLRSGRNLTIRNPDADVVTPLLTLLPRLAVQIRVCSDGPESKRPTSRTSGEHKTPQYIHLKHD
        LGQLA+D   R QG+LPS+T+ P +    GKE C AVTLRSG+ +     +++V         R      +  +G   K+P   TS     P        
Subjt:  LGQLASDFSGRQQGSLPSNTKMPNQAGGSGKEKCHAVTLRSGRNLTIRNPDADVVTPLLTLLPRLAVQIRVCSDGPESKRPTSRTSGEHKTPQYIHLKHD

Query:  KG--KVLLKPPPEAAENFLEVEVDNQ
            K L KPPP   + F + + D Q
Subjt:  KG--KVLLKPPPEAAENFLEVEVDNQ

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]4.6e-7338.41Show/hide
Query:  IAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWA
        +  D  R IR YA P     +PGI  P   +  +FE+K VM QM+Q VGQF   P EDPH H+RSF  +  SF + G+S E  R  LFP +LRD A+ W 
Subjt:  IAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWA

Query:  NALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYN
        N L    V  W+   EKF++K+FPP  NA+ R E+MSF Q + E+  DAW RFK +++ CPH+GIP CI ME FY  LN  ++  +DA     +L  +YN
Subjt:  NALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYN

Query:  QIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDRNAVVALQGQMTAMNNLLKSMAISQVNVAGNSMAAANQIDEMGCVGCGGPHNTDACPLNTETV--
        +    L+T+A+NN +W         G R     ++ +A+ AL  QM +M N+LK+++I   N      AAA Q D++ CV C   H  + CP N E+V  
Subjt:  QIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDRNAVVALQGQMTAMNNLLKSMAISQVNVAGNSMAAANQIDEMGCVGCGGPHNTDACPLNTETV--

Query:  -----------AFST-------------------------------------------------SSMENLFREYMQKNDALLQSQASSIRNLEVQLGQLA
                   AFS                                                  SS+E+L R+YM KNDA++QSQA+ +RNLE+QLG LA
Subjt:  -----------AFST-------------------------------------------------SSMENLFREYMQKNDALLQSQASSIRNLEVQLGQLA

Query:  SDFSGRQQGSLPSNTKMPNQAGGSGKEKCHAVTLRSGRNL
        ++   R QGSLPS+T+ P +    GKE+C ++ LRSG++L
Subjt:  SDFSGRQQGSLPSNTKMPNQAGGSGKEKCHAVTLRSGRNL

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa]1.0e-7238.74Show/hide
Query:  SPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMK
        +PGI  P   +   FE+K VM QM+Q VGQFGG P EDPH HI SF  +  SF + G+S E LR  LFP +LRD A+ W N L    V  W+ L E F++
Subjt:  SPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMK

Query:  KFFPPHENARKRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDD
        K+FPP  NA+ R E+MSFQQ + E   DAW RFK +++ CPH+GIP CI +E FY  LN A+   +DA     +L  +YN+    L+ +A+NN +W    
Subjt:  KFFPPHENARKRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDD

Query:  FGNRRGGRPKDDG-MDRNAVVALQGQMTAMNNLLKSMAISQVNVAGN-SMAAANQIDEMGCVGCGGPHNTDACPLNTETVAF------------------
          NR     K  G ++ +A+ AL  QM +M N+LK+M     N+ G+   AAA Q  E+ CV CG  H  + CP N  +V +                  
Subjt:  FGNRRGGRPKDDG-MDRNAVVALQGQMTAMNNLLKSMAISQVNVAGN-SMAAANQIDEMGCVGCGGPHNTDACPLNTETVAF------------------

Query:  -------------------------------------------STSSMENLFREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTKMPN
                                                    TSS+E+L R+YM KNDA++QSQA+S+RNLEVQLGQLA+D   R QG+LPS+T+ P 
Subjt:  -------------------------------------------STSSMENLFREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTKMPN

Query:  QAGGSGKEKCHAVTLRSGRNLTIRNPDADVVTPLLTLLPRLAVQIRVCSDGPESKRPTSRTSGE-HKTPQYIHLK
        +     KE C AVTLRSG+ +     +++VVT       R A+           K P     GE  K  Q +HLK
Subjt:  QAGGSGKEKCHAVTLRSGRNLTIRNPDADVVTPLLTLLPRLAVQIRVCSDGPESKRPTSRTSGE-HKTPQYIHLK

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129458.0e-6332.78Show/hide
Query:  DPEIERTFRGNQRR----ARQRQIRRMENNRNAPPPQADPEPNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGH
        DP+IERTFR ++R     A   Q    +NN N          NA  +  + +R +R Y  P +      I  P    N  FEIK   +QMIQ+  QF G 
Subjt:  DPEIERTFRGNQRR----ARQRQIRRMENNRNAPPPQADPEPNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGH

Query:  PGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSRFK
        P +DP+ H+ +F  IC +F  +G++ + +R  LFP +LRD+AK W N+L +G + TW+ L +KF+ KFFPP + A+ R ++ SF Q D E+L++AW RFK
Subjt:  PGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSRFK

Query:  RMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDRNAVVALQGQMTAMNNLLK
         +++ CPH+GIP  + ++ FY  L  + +  +DA     ++          L+ MA+NN +W  +    R G R      + +A+  L  Q+ A++  L 
Subjt:  RMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDRNAVVALQGQMTAMNNLLK

Query:  SMAISQVNVAGNSMAAANQIDEMGCVGCGGPHNTDACPLNTETVAF------------------------------------------------------
        ++ +  V    NS+          C  CG  H+ D CP N+E+V F                                                      
Subjt:  SMAISQVNVAGNSMAAANQIDEMGCVGCGGPHNTDACPLNTETVAF------------------------------------------------------

Query:  -STSSMENLFREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTKMPNQAGGSGKEKCHAVTLRSGRNL
           S +E L  +Y+ K DA++QSQ +S+RNLE Q+GQLA+  + R QGSLPS+T    Q    GKE+C A+TLRSG+ +
Subjt:  -STSSMENLFREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTKMPNQAGGSGKEKCHAVTLRSGRNL

A0A6J1EEI2 uncharacterized protein LOC1114333943.5e-5831.85Show/hide
Query:  RENKFMSDSE-QPFKLDPEIERTFRGNQRRARQRQIRRMENNRNAPPPQADPEPNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVML
        +E K M++   Q  +L  ++ R F      A Q +I                  NA ++A D +R IR+YA P +   +P I  P   +   FE+K VM 
Subjt:  RENKFMSDSE-QPFKLDPEIERTFRGNQRRARQRQIRRMENNRNAPPPQADPEPNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVML

Query:  QMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRD
        QM+Q +GQF G P EDPH H++SF  +  SF    +  + +R +LFP +LRD AK W N L  G + +W+ L+EKF+ K+FPP  NAR R E++ FQQ +
Subjt:  QMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRD

Query:  RENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDRNAVVAL
         + L +AW RFK M++ CPH+G+P CI ME FY  LN AT+Q VDA     +L  TYN+    L+ +A+NN +W +        GR     ++ +A+ ++
Subjt:  RENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDRNAVVAL

Query:  QGQMTAMNNLLKSMAISQ---VNVAGNSMAAANQIDEMGCVGCGGPHNTDACPLNTETV-----------------------------------------
          Q+ ++ N+L+++A+ Q   +    +++A  NQ     CV CG  H  D CP N  ++                                         
Subjt:  QGQMTAMNNLLKSMAISQ---VNVAGNSMAAANQIDEMGCVGCGGPHNTDACPLNTETV-----------------------------------------

Query:  ------------------AFST------------------SSMENLFREYMQKNDALLQSQASSIRNLEVQ
                          A+S+                  +S+E+L +EYM KND ++Q+Q +S+RNLEVQ
Subjt:  ------------------AFST------------------SSMENLFREYMQKNDALLQSQASSIRNLEVQ

A0A6J1EQ90 uncharacterized protein LOC1114364111.2e-5831.43Show/hide
Query:  FKLDPEIERTFRGNQRRARQ------------RQIRRMENNRNAPPPQADPEPNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQ
        F LDPEIERTFR   ++ ++             Q+ R   N      Q     N  ++A D +R IR+YA P +   +P I  P   +   FE+K VM Q
Subjt:  FKLDPEIERTFRGNQRRARQ------------RQIRRMENNRNAPPPQADPEPNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQ

Query:  MIQNVGQFGGHPGEDPHEHIRSFYSI-------CASFHMSGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELM
        M+Q +GQF G P EDPH H++SF  +         SF   G+  + +R +LFP  LRD AK W N L  G + +W+ L E F+ K+FPP  NAR + E++
Subjt:  MIQNVGQFGGHPGEDPHEHIRSFYSI-------CASFHMSGISPEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELM

Query:  SFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDR
        +FQQ + E L +A  RFK M++ CPH+G+P CI ME FY  LN  T+Q VDA     +L  TYN+    L+ +A+NN +W +        GR     ++ 
Subjt:  SFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDR

Query:  NAVVALQGQMTAMNNLLKSMAISQ---VNVAGNSMAAANQIDEMGCVGCGGPHNTDACPLNTETVAF---------------------------------
        +A+ ++  Q+ ++ N+L+++A+ Q   +    ++ AA NQ     CV CG  H  D CP N  ++ +                                 
Subjt:  NAVVALQGQMTAMNNLLKSMAISQ---VNVAGNSMAAANQIDEMGCVGCGGPHNTDACPLNTETVAF---------------------------------

Query:  --------------------------------------------STSSMENLFREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTKMP
                                                    S +S+E+L +EYM KNDA++QSQ +S+RNLEVQ+G   +   G       ++T+  
Subjt:  --------------------------------------------STSSMENLFREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTKMP

Query:  NQAGGSGKE
        N+     KE
Subjt:  NQAGGSGKE

A0A6J1G7Q6 uncharacterized protein LOC1114515985.7e-6132.21Show/hide
Query:  NATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEA
        NA ++A D +R IR+YA P +   +P I  P   +   FE+K VM QM+Q +GQF G   +DPH H++SF  +  SF   G+  + +R + F  +LRD A
Subjt:  NATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLK
        K W N L  G + +W+ L EKF+ K+FPP  +AR R E+++FQ+ + E L +AW RFK  ++ CPH+G+P CI +E FY  LN AT+Q VDA     +L 
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLK

Query:  STYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDRNAVVALQGQMTAMNNLLKSMAISQ---VNVAGNSMAAANQIDEMGCVGCGGPHNTDACPL
         TYN+    L+ +A+NN +W +        G+   + ++ +A+ ++  Q+ +M N+L+++A  Q   +    ++     Q     CV CG  H  D CP 
Subjt:  STYNQIKTTLDTMANNNEEWDEDDFGNRRGGRPKDDGMDRNAVVALQGQMTAMNNLLKSMAISQ---VNVAGNSMAAANQIDEMGCVGCGGPHNTDACPL

Query:  NTETVAF-----------------------------------------------------------------------------STSSMENLFREYMQKN
        N  ++ +                                                                             S + +E+L +EYM +N
Subjt:  NTETVAF-----------------------------------------------------------------------------STSSMENLFREYMQKN

Query:  DALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTKMPNQAG
        DA++QSQ  S+RNLEVQ+GQLA++   R  G LP++T+MP + G
Subjt:  DALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTKMPNQAG

U5CUI2 Retrotrans_gag domain-containing protein1.0e-6243.46Show/hide
Query:  NATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEA
        N   +A D  R IR YA P     +PGI  P   +  +FE+K VM QM+Q VGQF G P EDPH H+RSF  +  SF + G+S E LR  LFP +LRD A
Subjt:  NATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEA

Query:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLK
        + W N L    V  W+ L EKF++K+FPP  NA+ R E+MSFQQ + E+  DAW RFK +++ CPH+GIP CI ME FY  LN A+   +DA     +L 
Subjt:  KRWANALEDGEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLK

Query:  STYNQIKTTLDTMANNNEEWDEDDFGNRRG--GRPKDDGMDRNAVVALQGQMTAMNNLLKSMAISQVNVAGNSMAAANQIDEMGCVGCGGPHNTDACPLN
         +YN+    L+T+A+NN +W      N R    R     ++ +A+ AL  QM +M N+LK+++I   N      AAA Q D++ CV CG  H  + CP N
Subjt:  STYNQIKTTLDTMANNNEEWDEDDFGNRRG--GRPKDDGMDRNAVVALQGQMTAMNNLLKSMAISQVNVAGNSMAAANQIDEMGCVGCGGPHNTDACPLN

Query:  TETVAF
         E+V +
Subjt:  TETVAF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTCCTGCTTAGCAGGCGGCGAAAGAAATAGAGATATTCTCTTCGCCGAACTTATCCTTAGCATATTACTCAGAAATGACGAAATATGTGGCGGAAGCACCGAGAG
GTTGCTCGCCTTAATTAAGGGTGAAACTCAGTTTCAGTTGAATCACGCGTTCAAACCCGAAGAACGTGAGAACAAGTTTATGAGTGACAGCGAACAACCATTCAAACTTG
ACCCTGAGATTGAACGAACATTTCGGGGTAATCAGCGAAGAGCAAGGCAAAGACAAATTCGTAGAATGGAAAATAACAGAAATGCTCCTCCGCCGCAAGCTGACCCAGAA
CCCAATGCCACCTATATCGCACATGACTTGGACAGGCCAATTAGATCTTACGCGACACCCAACCTTTATAACTTCAGTCCAGGAATCGCCTACCCTGTATTTGGCGAGAA
CGCCAGATTTGAAATCAAATCTGTTATGCTTCAGATGATTCAGAACGTCGGACAATTCGGCGGGCATCCTGGGGAAGACCCACACGAGCATATAAGGAGTTTCTACTCCA
TCTGCGCTTCTTTCCACATGTCAGGCATCTCACCTGAAGAATTAAGATTCGCCCTCTTCCCGTTAACTCTGAGGGATGAGGCGAAGAGGTGGGCAAATGCCTTGGAAGAT
GGCGAGGTTGGAACTTGGGATCAATTAATAGAAAAATTTATGAAGAAATTCTTCCCACCTCATGAAAATGCAAGAAAAAGGAAGGAGCTTATGAGCTTCCAGCAGAGGGA
TAGAGAAAACCTACATGATGCGTGGAGTAGGTTTAAAAGGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGAGGTGTTCTATTTTAGACTGA
ACAAGGCAACAGAGCAGACTGTTGATGCTGTGTTTGTAGACGGTATGTTGAAAAGTACATACAACCAGATTAAGACGACGCTGGATACGATGGCCAACAACAACGAAGAA
TGGGATGAAGATGATTTCGGCAATCGCAGAGGAGGACGACCAAAAGATGATGGTATGGATAGGAATGCCGTGGTGGCACTGCAGGGACAAATGACTGCGATGAACAATTT
GCTTAAATCAATGGCAATATCGCAAGTTAACGTCGCAGGAAACTCTATGGCTGCGGCTAACCAAATTGATGAAATGGGATGTGTGGGATGCGGGGGTCCCCATAACACTG
ACGCATGCCCACTCAACACGGAGACCGTCGCATTCTCCACCTCTTCCATGGAAAACCTCTTCCGCGAATACATGCAGAAAAATGATGCTCTTCTGCAAAGCCAGGCTTCA
TCAATACGTAATCTGGAGGTACAGTTAGGTCAGCTAGCTAGTGATTTCTCCGGAAGACAACAAGGATCCCTCCCAAGTAATACAAAAATGCCAAATCAGGCAGGAGGATC
TGGTAAAGAGAAGTGTCACGCGGTGACACTTCGCAGTGGAAGGAATTTAACCATCCGCAATCCTGACGCTGACGTAGTTACCCCACTTCTAACTCTACTGCCGAGATTGG
CAGTTCAAATTAGAGTATGTTCTGATGGACCTGAATCAAAACGACCCACATCGCGGACAAGTGGCGAACACAAAACGCCGCAATACATCCACTTGAAGCACGACAAGGGG
AAGGTTCTACTGAAACCTCCGCCGGAAGCAGCTGAAAATTTCTTGGAAGTTGAAGTTGATAATCAGGATACAGAGGCGGTGCTCGAATTCCTTAATGAACGAGCAAAGAA
GAGGAAGGAGGCCCACATTAAAAGAACTAAGGAAGCTCGTCGCCGAAAAGACGAGCAGCTTCACAAGAAAATCTCTGACAAGCTCGCACAAGTCTCGTTCGCTAAAACGA
GGAAAACTATCGAGGCAGTTAAGGCTGCCTTAAAAAGGAAAGAAGAAAAGAAAAAGATGTTCGCAGAGTTAAGCGAACAAGTGACGGAGCTCCCCGCGAAAGCAAGAGCA
TTGGAGCCAGAAAGAAACCTCGACGCGATCGTTGAAGAATTCGAGGAAGAGCTGGAGGCGATGAGCCCACTTGATGACGGGCCATCGTCAAAAAAATCAAGGGAGGTCGT
AGGACCATCAAGAGGAAGAAAGAAACTTGGACGTTCTGGACCTGAAAAACGCCTATCAGGCGGCGACACCATAAGCAAGACACCCTTTATCAACTCTCTCATCAAAGTTG
AAAAGGGGTTGTTTCCGTTCAATGGTCAACTCCCTGACTTCCTCTACGCGCCAATTCAGGCGTTTGGATGGAAGTCATTTTTCAAAGGGCACACCAAGATACGATTAGGA
GTGGTAGAAAAATTCTACGCAGCTAAACTCAACGCTGAAGAGTTTAGCGTACAAATCAGTGGAAAGACAGTGAGTTTCAACGCGGAGGCCATCAATGCGTTGTATGAATT
GCCCAACAATGTTGAAACCCCAGGGCAATTATACGTAGAAAGTCCTACGAAGAGGATGGCCCGTGAAGCGTTGGAAGTCATCGCATGGCCTGGGGCCGCATGGGAAGTAA
CGCCAAGAGGGAAGTATCAGTTGTATCCACACCAACTGACCACTGAAGCAAGTGTGTGGCTATTCTTTATCAAGAAGAAGATCTTCCTAACACGCCATGATAGCACCATC
AATTTGGAGTCAGCGATGCTACTTTATTGCATCCTAGCAAAGAAGCGTGTTAACCTTGGTGAACTTATAGCCACAACCATTCTGGCATGGATGCGAGCTCCCAAAGGTGC
GATGCCCTTCCCTTCAACCGTTGAGGCCCTTTGCCTTAAAGCTTTGTCATTCTTATCCGCCATCCAAACAATCTCAATACCTGGCGGACTGTGTAATCAAATAGCTTTAA
ACCGCATGATTACTTTCCATGGACACAAAGAAATGGAAAGGCGGGCAAAGACATTAGGTGACACGCCTGAAGGAATGGCCCTAGCAGAAAGAAAAAGAAAGGCCCCAGTC
GTCGCATCAACCCCACCTAAAGCAAAAAAAACAAAGGTTCTTGCGACGAAGCAACTTCCACTAAAATTTCCCCACTCCTCATCTCGCCCAATACAGCGAGCCCCACCATC
AGTCCAAAATTCCAGCAACTCCAATCCCCCCTGTGCTTCTTCGCCCATTCCAATCACTCCACCATCCCCAAATATCTCTCCCCGCCATTCACCTCTTCCCCACATTCGTT
CCCCCACCAACATTCCTCACCTTTTCCCACAACCGCCTACACCTCCACCCACAAAATCCACTTCCCCACTTCACTCCAAATCACCCTCACCAAGGCGAGCTGAACCCCTT
TCGCCTTTTCTTCTTTCGCCTATCATGGACCTGACCGTTCTTCGCCATGACCAACCCGCGACAAACACTGCGGTCGTTGAGGTTTCTTCGCCCATCACCCATCCAACCAA
CCGGCCTCTACAACCTTCCCCCATTCTTCTAATCTCAAAAGAGGGCACACTTCCAACCAACCACCCATCCCAACAATCACCACCACTGCCCATTATGGCCGCCGCGGAAA
AAGTTGATGACCCACACGTTAAGGACAAAAACCCCATCCTTAATGAAGTTGGCGAGACTACTTCCTCTGCGCATACCCCCATCGCTCAACCTTCTACCACACCGGAGACG
AAGATTTCGCCGAAATGCTGGGTTCCTTTGTGTGTAAGCCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGTCCTGCTTAGCAGGCGGCGAAAGAAATAGAGATATTCTCTTCGCCGAACTTATCCTTAGCATATTACTCAGAAATGACGAAATATGTGGCGGAAGCACCGAGAG
GTTGCTCGCCTTAATTAAGGGTGAAACTCAGTTTCAGTTGAATCACGCGTTCAAACCCGAAGAACGTGAGAACAAGTTTATGAGTGACAGCGAACAACCATTCAAACTTG
ACCCTGAGATTGAACGAACATTTCGGGGTAATCAGCGAAGAGCAAGGCAAAGACAAATTCGTAGAATGGAAAATAACAGAAATGCTCCTCCGCCGCAAGCTGACCCAGAA
CCCAATGCCACCTATATCGCACATGACTTGGACAGGCCAATTAGATCTTACGCGACACCCAACCTTTATAACTTCAGTCCAGGAATCGCCTACCCTGTATTTGGCGAGAA
CGCCAGATTTGAAATCAAATCTGTTATGCTTCAGATGATTCAGAACGTCGGACAATTCGGCGGGCATCCTGGGGAAGACCCACACGAGCATATAAGGAGTTTCTACTCCA
TCTGCGCTTCTTTCCACATGTCAGGCATCTCACCTGAAGAATTAAGATTCGCCCTCTTCCCGTTAACTCTGAGGGATGAGGCGAAGAGGTGGGCAAATGCCTTGGAAGAT
GGCGAGGTTGGAACTTGGGATCAATTAATAGAAAAATTTATGAAGAAATTCTTCCCACCTCATGAAAATGCAAGAAAAAGGAAGGAGCTTATGAGCTTCCAGCAGAGGGA
TAGAGAAAACCTACATGATGCGTGGAGTAGGTTTAAAAGGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGAGGTGTTCTATTTTAGACTGA
ACAAGGCAACAGAGCAGACTGTTGATGCTGTGTTTGTAGACGGTATGTTGAAAAGTACATACAACCAGATTAAGACGACGCTGGATACGATGGCCAACAACAACGAAGAA
TGGGATGAAGATGATTTCGGCAATCGCAGAGGAGGACGACCAAAAGATGATGGTATGGATAGGAATGCCGTGGTGGCACTGCAGGGACAAATGACTGCGATGAACAATTT
GCTTAAATCAATGGCAATATCGCAAGTTAACGTCGCAGGAAACTCTATGGCTGCGGCTAACCAAATTGATGAAATGGGATGTGTGGGATGCGGGGGTCCCCATAACACTG
ACGCATGCCCACTCAACACGGAGACCGTCGCATTCTCCACCTCTTCCATGGAAAACCTCTTCCGCGAATACATGCAGAAAAATGATGCTCTTCTGCAAAGCCAGGCTTCA
TCAATACGTAATCTGGAGGTACAGTTAGGTCAGCTAGCTAGTGATTTCTCCGGAAGACAACAAGGATCCCTCCCAAGTAATACAAAAATGCCAAATCAGGCAGGAGGATC
TGGTAAAGAGAAGTGTCACGCGGTGACACTTCGCAGTGGAAGGAATTTAACCATCCGCAATCCTGACGCTGACGTAGTTACCCCACTTCTAACTCTACTGCCGAGATTGG
CAGTTCAAATTAGAGTATGTTCTGATGGACCTGAATCAAAACGACCCACATCGCGGACAAGTGGCGAACACAAAACGCCGCAATACATCCACTTGAAGCACGACAAGGGG
AAGGTTCTACTGAAACCTCCGCCGGAAGCAGCTGAAAATTTCTTGGAAGTTGAAGTTGATAATCAGGATACAGAGGCGGTGCTCGAATTCCTTAATGAACGAGCAAAGAA
GAGGAAGGAGGCCCACATTAAAAGAACTAAGGAAGCTCGTCGCCGAAAAGACGAGCAGCTTCACAAGAAAATCTCTGACAAGCTCGCACAAGTCTCGTTCGCTAAAACGA
GGAAAACTATCGAGGCAGTTAAGGCTGCCTTAAAAAGGAAAGAAGAAAAGAAAAAGATGTTCGCAGAGTTAAGCGAACAAGTGACGGAGCTCCCCGCGAAAGCAAGAGCA
TTGGAGCCAGAAAGAAACCTCGACGCGATCGTTGAAGAATTCGAGGAAGAGCTGGAGGCGATGAGCCCACTTGATGACGGGCCATCGTCAAAAAAATCAAGGGAGGTCGT
AGGACCATCAAGAGGAAGAAAGAAACTTGGACGTTCTGGACCTGAAAAACGCCTATCAGGCGGCGACACCATAAGCAAGACACCCTTTATCAACTCTCTCATCAAAGTTG
AAAAGGGGTTGTTTCCGTTCAATGGTCAACTCCCTGACTTCCTCTACGCGCCAATTCAGGCGTTTGGATGGAAGTCATTTTTCAAAGGGCACACCAAGATACGATTAGGA
GTGGTAGAAAAATTCTACGCAGCTAAACTCAACGCTGAAGAGTTTAGCGTACAAATCAGTGGAAAGACAGTGAGTTTCAACGCGGAGGCCATCAATGCGTTGTATGAATT
GCCCAACAATGTTGAAACCCCAGGGCAATTATACGTAGAAAGTCCTACGAAGAGGATGGCCCGTGAAGCGTTGGAAGTCATCGCATGGCCTGGGGCCGCATGGGAAGTAA
CGCCAAGAGGGAAGTATCAGTTGTATCCACACCAACTGACCACTGAAGCAAGTGTGTGGCTATTCTTTATCAAGAAGAAGATCTTCCTAACACGCCATGATAGCACCATC
AATTTGGAGTCAGCGATGCTACTTTATTGCATCCTAGCAAAGAAGCGTGTTAACCTTGGTGAACTTATAGCCACAACCATTCTGGCATGGATGCGAGCTCCCAAAGGTGC
GATGCCCTTCCCTTCAACCGTTGAGGCCCTTTGCCTTAAAGCTTTGTCATTCTTATCCGCCATCCAAACAATCTCAATACCTGGCGGACTGTGTAATCAAATAGCTTTAA
ACCGCATGATTACTTTCCATGGACACAAAGAAATGGAAAGGCGGGCAAAGACATTAGGTGACACGCCTGAAGGAATGGCCCTAGCAGAAAGAAAAAGAAAGGCCCCAGTC
GTCGCATCAACCCCACCTAAAGCAAAAAAAACAAAGGTTCTTGCGACGAAGCAACTTCCACTAAAATTTCCCCACTCCTCATCTCGCCCAATACAGCGAGCCCCACCATC
AGTCCAAAATTCCAGCAACTCCAATCCCCCCTGTGCTTCTTCGCCCATTCCAATCACTCCACCATCCCCAAATATCTCTCCCCGCCATTCACCTCTTCCCCACATTCGTT
CCCCCACCAACATTCCTCACCTTTTCCCACAACCGCCTACACCTCCACCCACAAAATCCACTTCCCCACTTCACTCCAAATCACCCTCACCAAGGCGAGCTGAACCCCTT
TCGCCTTTTCTTCTTTCGCCTATCATGGACCTGACCGTTCTTCGCCATGACCAACCCGCGACAAACACTGCGGTCGTTGAGGTTTCTTCGCCCATCACCCATCCAACCAA
CCGGCCTCTACAACCTTCCCCCATTCTTCTAATCTCAAAAGAGGGCACACTTCCAACCAACCACCCATCCCAACAATCACCACCACTGCCCATTATGGCCGCCGCGGAAA
AAGTTGATGACCCACACGTTAAGGACAAAAACCCCATCCTTAATGAAGTTGGCGAGACTACTTCCTCTGCGCATACCCCCATCGCTCAACCTTCTACCACACCGGAGACG
AAGATTTCGCCGAAATGCTGGGTTCCTTTGTGTGTAAGCCAATGA
Protein sequenceShow/hide protein sequence
MQSCLAGGERNRDILFAELILSILLRNDEICGGSTERLLALIKGETQFQLNHAFKPEERENKFMSDSEQPFKLDPEIERTFRGNQRRARQRQIRRMENNRNAPPPQADPE
PNATYIAHDLDRPIRSYATPNLYNFSPGIAYPVFGENARFEIKSVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMSGISPEELRFALFPLTLRDEAKRWANALED
GEVGTWDQLIEKFMKKFFPPHENARKRKELMSFQQRDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATEQTVDAVFVDGMLKSTYNQIKTTLDTMANNNEE
WDEDDFGNRRGGRPKDDGMDRNAVVALQGQMTAMNNLLKSMAISQVNVAGNSMAAANQIDEMGCVGCGGPHNTDACPLNTETVAFSTSSMENLFREYMQKNDALLQSQAS
SIRNLEVQLGQLASDFSGRQQGSLPSNTKMPNQAGGSGKEKCHAVTLRSGRNLTIRNPDADVVTPLLTLLPRLAVQIRVCSDGPESKRPTSRTSGEHKTPQYIHLKHDKG
KVLLKPPPEAAENFLEVEVDNQDTEAVLEFLNERAKKRKEAHIKRTKEARRRKDEQLHKKISDKLAQVSFAKTRKTIEAVKAALKRKEEKKKMFAELSEQVTELPAKARA
LEPERNLDAIVEEFEEELEAMSPLDDGPSSKKSREVVGPSRGRKKLGRSGPEKRLSGGDTISKTPFINSLIKVEKGLFPFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLG
VVEKFYAAKLNAEEFSVQISGKTVSFNAEAINALYELPNNVETPGQLYVESPTKRMAREALEVIAWPGAAWEVTPRGKYQLYPHQLTTEASVWLFFIKKKIFLTRHDSTI
NLESAMLLYCILAKKRVNLGELIATTILAWMRAPKGAMPFPSTVEALCLKALSFLSAIQTISIPGGLCNQIALNRMITFHGHKEMERRAKTLGDTPEGMALAERKRKAPV
VASTPPKAKKTKVLATKQLPLKFPHSSSRPIQRAPPSVQNSSNSNPPCASSPIPITPPSPNISPRHSPLPHIRSPTNIPHLFPQPPTPPPTKSTSPLHSKSPSPRRAEPL
SPFLLSPIMDLTVLRHDQPATNTAVVEVSSPITHPTNRPLQPSPILLISKEGTLPTNHPSQQSPPLPIMAAAEKVDDPHVKDKNPILNEVGETTSSAHTPIAQPSTTPET
KISPKCWVPLCVSQ