CuGenDBv2

Lag0011670 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011670
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr1:30310724..30312614
RNA-Seq Expression: Lag0011670
Synteny: Lag0011670
Gene Ontology terms: NA
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal
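
The genome location above follows a `chrom:start..end` pattern. As a minimal sketch of how such a string might be parsed, the snippet below splits it into its parts and computes the span. The function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr1:30310724..30312614")
print(chrom, start, end, span_length(start, end))
```

If the coordinates were instead 0-based and half-open, the span would be one base shorter, so the convention is worth confirming against the database documentation.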


Homology
GenBank top hits
KAA0049700.1 T4.5 [Cucumis melo var. makuwa] (e-value: 3.9e-57; identity: 38.34%)
Query:  SSETSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAI---IRTEVQPPRVNPDHEIWYERDQALITLINA
        SS T   + A   S + + +L+NICNL+S+RLDSTN+ LW+FQ++ +LK+HKL+ ++DG+   P        T   PP+ NP +E W  +DQAL+T+INA
Subjt:  SSETSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAI---IRTEVQPPRVNPDHEIWYERDQALITLINA

Query:  TLTQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY------------------
        TL+  AL+YV+G  +SK+VWD L K +SS +R+N++ LK++LQ++ K+  E+IDAY++R+KEI +KLA VS  I+ EDL+IY                  
Subjt:  TLTQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY------------------

Query:  --------------KTEETTLDKQSKVEDATVVSHLALTANLESQGRGGWRPQGTRGKGTGGFFDNNHGRGRG-GNSFSSG--NSNGSGPGQFSSSSFSG
                      + EE+ L KQSK +D+     + L++   SQ      P           FDNN  RG G G  +  G  + +    G  SS     
Subjt:  --------------KTEETTLDKQSKVEDATVVSHLALTANLESQGRGGWRPQGTRGKGTGGFFDNNHGRGRG-GNSFSSG--NSNGSGPGQFSSSSFSG

Query:  QNSSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENITDKTSGQ
         + +   CQIC +  H AL CFNRMNY+FQGRHPP +L AM AS +   NAF    L   +S  LTD+ CN  +TS++  ++++  YNG+E +    +GQ
Subjt:  QNSSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENITDKTSGQ

Query:  ILFHGPNINGLY--PISTSPVS-AAQVGLFAHI
             P  + +Y  P+  S V+  A    FAHI
Subjt:  ILFHGPNINGLY--PISTSPVS-AAQVGLFAHI

KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis] (e-value: 5.8e-61; identity: 32.14%)
Query:  SSETSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTE--VQPPRVNPDHEIWYERDQALITLINAT
        ++ T++ +   N S + + +L+NICNL++ RLDS+NY  W+FQIS +LK+H L  Y+DG+   P   ++ E      ++NP+++IW  +DQAL+TL+NAT
Subjt:  SSETSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTE--VQPPRVNPDHEIWYERDQALITLINAT

Query:  LTQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY------------------K
        L+QTALS+VIG  TS+E W  LE+ FS+STR+NI+ LK+ L ++SK   ++ID+Y++++K+  + LA+VSV+I+ ED++IY                  K
Subjt:  LTQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY------------------K

Query:  TEETTLDK---QSKVEDATVVS-------------HLALTANLESQGRGGWRPQGTRGKGTG-GFFDNNHGRGRGGNSFSS---GNSNGSGP-GQFSSSS
        +E  TL++     K+E+ T+ S              +A           G+ P    G+G G G F N  GR      F S   G SN   P  Q   S+
Subjt:  TEETTLDK---QSKVEDATVVS-------------HLALTANLESQGRGGWRPQGTRGKGTG-GFFDNNHGRGRGGNSFSS---GNSNGSGP-GQFSSSS

Query:  FSGQNSSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENIT---
            NS  V CQIC+K  H+AL C++RM++S+QG+ P  +L AM+A+ ++ ++  P        + W TDT    H+T++L NLN    Y GD+NIT   
Subjt:  FSGQNSSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENIT---

Query:  ------------------------------------------------------------DKTSGQILFHGPNINGLYPISTSPVS-----AAQVGLF--
                                                                    DK + Q+LF GP+ +GLYP+ TS ++     + Q  L   
Subjt:  ------------------------------------------------------------DKTSGQILFHGPNINGLYPISTSPVS-----AAQVGLF--

Query:  -----------------------AHIGTKVTPTVWHDRLGHPCSSILRSVLKSFGLS
                               A++G +V+  +WHDRLGHP ++ L+S+L S  ++
Subjt:  -----------------------AHIGTKVTPTVWHDRLGHPCSSILRSVLKSFGLS

KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus] (e-value: 1.3e-57; identity: 39.64%)
Query:  SSETSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPE-AIIRTEVQPPRVNPDHEIWYERDQALITLINATL
        SS T   + A   S + + +L+NICNL+S+RLDSTN+ LW+FQ++ +LK+HKLF +VDG+   P+ +   T   PP+ NP +E W  +DQAL+T+INATL
Subjt:  SSETSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPE-AIIRTEVQPPRVNPDHEIWYERDQALITLINATL

Query:  TQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY--------------------
        +  AL+YV+G  +SK+VWD L K +SS +R+N++ LK++LQ++ K+  E+IDAY++R+KEI +KLA VS  I+ EDL+IY                    
Subjt:  TQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY--------------------

Query:  ------------KTEETTLDKQSKVEDATVVSHLALTANLESQGRGGWRPQGTRGKGTGGFFDNNHGRGRG-GNSFSSG--NSNGSGPGQFSSSSFSGQN
                    + EE+ L KQSK +D+     + L++   SQ      P           F+NN  RG G G ++  G  + +    G   S      +
Subjt:  ------------KTEETTLDKQSKVEDATVVSHLALTANLESQGRGGWRPQGTRGKGTGGFFDNNHGRGRG-GNSFSSG--NSNGSGPGQFSSSSFSGQN

Query:  SSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENI
         +   CQIC +  H AL CFNRMNY+FQGRHPP +L AM AS +   NAF    L   +S  LTD+ CN H+TS++  ++++  YNG+E +
Subjt:  SSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENI

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus] (e-value: 1.3e-57; identity: 39.64%)
Query:  SSETSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPE-AIIRTEVQPPRVNPDHEIWYERDQALITLINATL
        SS T   + A   S + + +L+NICNL+S+RLDSTN+ LW+FQ++ +LK+HKLF +VDG+   P+ +   T   PP+ NP +E W  +DQAL+T+INATL
Subjt:  SSETSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPE-AIIRTEVQPPRVNPDHEIWYERDQALITLINATL

Query:  TQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY--------------------
        +  AL+YV+G  +SK+VWD L K +SS +R+N++ LK++LQ++ K+  E+IDAY++R+KEI +KLA VS  I+ EDL+IY                    
Subjt:  TQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY--------------------

Query:  ------------KTEETTLDKQSKVEDATVVSHLALTANLESQGRGGWRPQGTRGKGTGGFFDNNHGRGRG-GNSFSSG--NSNGSGPGQFSSSSFSGQN
                    + EE+ L KQSK +D+     + L++   SQ      P           F+NN  RG G G ++  G  + +    G   S      +
Subjt:  ------------KTEETTLDKQSKVEDATVVSHLALTANLESQGRGGWRPQGTRGKGTGGFFDNNHGRGRG-GNSFSSG--NSNGSGPGQFSSSSFSGQN

Query:  SSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENI
         +   CQIC +  H AL CFNRMNY+FQGRHPP +L AM AS +   NAF    L   +S  LTD+ CN H+TS++  ++++  YNG+E +
Subjt:  SSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENI

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia] (e-value: 2.1e-63; identity: 39.85%)
Query:  TSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAII----RTEVQP------PRVNPDHEIWYERDQALIT
        TS+        ++ + +L+NICNLVS+RLDST++ LW+FQ++ +LK+HKLF ++DGS+  P   +     TE QP      P +NP  E W  +DQAL+T
Subjt:  TSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAII----RTEVQP------PRVNPDHEIWYERDQALIT

Query:  LINATLTQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY--------------
        LINATL+  AL+YV+   TSK+VW+ LEKH+SS++RTN++ LK++LQS+ K++ E+IDAYV+R+KEI +K A VS+ I+ E L+IY              
Subjt:  LINATLTQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY--------------

Query:  ------------------KTEETTLDKQSKVEDATVVSHLALTANLESQGR-GGWRPQGTRGKGTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFSSSSF
                          K+EE+ ++KQ K ED     +    ++ +SQ R   + P  +  +G G     N+GRG+   +F+   +N  G G+ S + F
Subjt:  ------------------KTEETTLDKQSKVEDATVVSHLALTANLESQGR-GGWRPQGTRGKGTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFSSSSF

Query:  -SGQNSSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAA----SASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTA---YNGD
         S Q  ++  CQIC K  H AL C+NRMN+ FQGRHPP +L AM A    S  ++ N+ P        + WL D+ CN H+T++L+NL++++    YNG+
Subjt:  -SGQNSSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAA----SASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTA---YNGD

Query:  ENIT
        ENI+
Subjt:  ENIT

TrEMBL top hits
A0A2N9F9F8 Uncharacterized protein (e-value: 6.5e-58; identity: 34.92%)
Query:  SSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAII--RTEVQPPRVNPDHEIWYERDQALITLINATLTQTALSYVIGC
        S  T + +L+N+ NL+SV+LDSTN+ +W+ Q+S +LK++ +  YVDG++  P   +          VNP+ ++W  RDQ L+ LIN+TL+ + LS V+G 
Subjt:  SSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAII--RTEVQPPRVNPDHEIWYERDQALITLINATLTQTALSYVIGC

Query:  QTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDL--------------------------------II
         +++EVW TLE  F+S++R N++ LK EL ++ K S E+I++Y+++VK   +KL AV  +ID E+L                                ++
Subjt:  QTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDL--------------------------------II

Query:  YKTEETTLDKQSKVEDATVVSH-LALTA----NLESQGRGGWRPQGTRGKGTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFSSSSFSGQNSSKVNCQIC
         +TEE +  + S   D+   SH +A+ A    N  S  +  +    T+ +G G    NN  RGRGG  ++S  +  S   Q +S        S+  CQIC
Subjt:  YKTEETTLDKQSKVEDATVVSH-LALTA----NLESQGRGGWRPQGTRGKGTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFSSSSFSGQNSSKVNCQIC

Query:  HKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENITDKTSGQILFHGPNINGL
         K  H AL C++RM++++QGRHPP KL AMA++++            Q    WLTDT    HLT+ LTNL  +  Y G E   D  SG++L+ G + NGL
Subjt:  HKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENITDKTSGQILFHGPNINGL

Query:  YPISTSP------VSAAQVGLFAHIGTKVTPTVWHDRLGHPCSSILRSVLKSFGLSVAHSN
        YPI T P       SA+   + A + +K    +WH RLGHP   +L S + +    ++ SN
Subjt:  YPISTSP------VSAAQVGLFAHIGTKVTPTVWHDRLGHPCSSILRSVLKSFGLSVAHSN

A0A2N9G7E3 Integrase catalytic domain-containing protein (e-value: 5.0e-58; identity: 33.58%)
Query:  AANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTEVQPPRVNPDHEIWYERDQALITLINATLTQTALSYVIG
        ++N + T + +L+NI NLVSV+LD TNY LW+FQI+  LK++KL   VDGS   PE   R     P +N D   W  +DQALI++I ATL+ +AL+ VIG
Subjt:  AANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTEVQPPRVNPDHEIWYERDQALITLINATLTQTALSYVIG

Query:  CQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLII------------YKTEETTLDKQSKVEDATV
         +++K VWDTLEK F+S +R+N++ LK +L S+ K++ E+I+ Y++++KE  +KL AV V I+AE+++             + +   T +     E+  V
Subjt:  CQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLII------------YKTEETTLDKQSKVEDATV

Query:  V---SHLALTANLESQ------GRGGWRPQGTRGK---------------GTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFS--SSSFSGQNSSKVNCQ
        +      +L  N ES          G  P+GT G                G GG F+N  G GRGG +F++ NSN  G    S   S+++ Q SS+  CQ
Subjt:  V---SHLALTANLESQ------GRGGWRPQGTRGK---------------GTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFS--SSSFSGQNSSKVNCQ

Query:  ICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENIT---------------
        IC+K  H AL C++RM+++FQG+HPPTKL AMA S+++             S+ W++DT    H T +L NL  +  YNG++ +T               
Subjt:  ICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENIT---------------

Query:  ------------------------------------------------DKTSGQILFHGPNINGLYPISTSPVSAAQVGL---------------FAHIG
                                                        D  SG++L+ G N  GLYPI   P  + +V                  A+  
Subjt:  ------------------------------------------------DKTSGQILFHGPNINGLYPISTSPVSAAQVGL---------------FAHIG

Query:  TKVTPTVWHDRLGHPCSSILRSVLKSFGLSVAHSNT
        TKV+ + WH RLGHP S IL+SV K    S   S++
Subjt:  TKVTPTVWHDRLGHPCSSILRSVLKSFGLSVAHSNT

A0A2N9IB37 Uncharacterized protein (e-value: 1.1e-57; identity: 33.4%)
Query:  AANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTEVQPPRVNPDHEIWYERDQALITLINATLTQTALSYVIG
        ++N + T + +L+NI NLVSV+LD TNY LW+FQI+  LK++KL   VDGS   PE   R     P +N D   W  +DQALI++I ATL+ +AL+ VIG
Subjt:  AANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTEVQPPRVNPDHEIWYERDQALITLINATLTQTALSYVIG

Query:  CQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLII------------YKTEETTLDKQSKVEDATV
         +++K VWDTLEK F+S +R+N++ LK +L S+ K++ E+I+ Y++++KE  +KL A+ V I+AE+++             + +   T +     E+  V
Subjt:  CQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLII------------YKTEETTLDKQSKVEDATV

Query:  V---SHLALTANLESQ------GRGGWRPQGTRGK---------------GTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFS--SSSFSGQNSSKVNCQ
        +      +L  N ES          G  P+GT G                G GG F+N  G GRGG +F++ NSN  G    S   S+++ Q SS+  CQ
Subjt:  V---SHLALTANLESQ------GRGGWRPQGTRGK---------------GTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFS--SSSFSGQNSSKVNCQ

Query:  ICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENIT---------------
        IC+K  H AL C++RM+++FQG+HPPTKL AMA S+++             S+ W++DT    H T +L NL  +  YNG++ +T               
Subjt:  ICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENIT---------------

Query:  ------------------------------------------------DKTSGQILFHGPNINGLYPISTSPVSAAQVGL---------------FAHIG
                                                        D  SG++L+ G N  GLYPI   P  + +V                  A+  
Subjt:  ------------------------------------------------DKTSGQILFHGPNINGLYPISTSPVSAAQVGL---------------FAHIG

Query:  TKVTPTVWHDRLGHPCSSILRSVLKSFGLSVAHSNT
        TKV+ + WH RLGHP S IL+SV K    S   S++
Subjt:  TKVTPTVWHDRLGHPCSSILRSVLKSFGLSVAHSNT

A0A5J5A1U7 Integrase catalytic domain-containing protein (e-value: 2.8e-61; identity: 32.14%)
Query:  SSETSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTE--VQPPRVNPDHEIWYERDQALITLINAT
        ++ T++ +   N S + + +L+NICNL++ RLDS+NY  W+FQIS +LK+H L  Y+DG+   P   ++ E      ++NP+++IW  +DQAL+TL+NAT
Subjt:  SSETSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTE--VQPPRVNPDHEIWYERDQALITLINAT

Query:  LTQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY------------------K
        L+QTALS+VIG  TS+E W  LE+ FS+STR+NI+ LK+ L ++SK   ++ID+Y++++K+  + LA+VSV+I+ ED++IY                  K
Subjt:  LTQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY------------------K

Query:  TEETTLDK---QSKVEDATVVS-------------HLALTANLESQGRGGWRPQGTRGKGTG-GFFDNNHGRGRGGNSFSS---GNSNGSGP-GQFSSSS
        +E  TL++     K+E+ T+ S              +A           G+ P    G+G G G F N  GR      F S   G SN   P  Q   S+
Subjt:  TEETTLDK---QSKVEDATVVS-------------HLALTANLESQGRGGWRPQGTRGKGTG-GFFDNNHGRGRGGNSFSS---GNSNGSGP-GQFSSSS

Query:  FSGQNSSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENIT---
            NS  V CQIC+K  H+AL C++RM++S+QG+ P  +L AM+A+ ++ ++  P        + W TDT    H+T++L NLN    Y GD+NIT   
Subjt:  FSGQNSSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENIT---

Query:  ------------------------------------------------------------DKTSGQILFHGPNINGLYPISTSPVS-----AAQVGLF--
                                                                    DK + Q+LF GP+ +GLYP+ TS ++     + Q  L   
Subjt:  ------------------------------------------------------------DKTSGQILFHGPNINGLYPISTSPVS-----AAQVGLF--

Query:  -----------------------AHIGTKVTPTVWHDRLGHPCSSILRSVLKSFGLS
                               A++G +V+  +WHDRLGHP ++ L+S+L S  ++
Subjt:  -----------------------AHIGTKVTPTVWHDRLGHPCSSILRSVLKSFGLS

A0A6J1D9L6 uncharacterized protein LOC111018892 (e-value: 1.0e-63; identity: 39.85%)
Query:  TSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAII----RTEVQP------PRVNPDHEIWYERDQALIT
        TS+        ++ + +L+NICNLVS+RLDST++ LW+FQ++ +LK+HKLF ++DGS+  P   +     TE QP      P +NP  E W  +DQAL+T
Subjt:  TSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAII----RTEVQP------PRVNPDHEIWYERDQALIT

Query:  LINATLTQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY--------------
        LINATL+  AL+YV+   TSK+VW+ LEKH+SS++RTN++ LK++LQS+ K++ E+IDAYV+R+KEI +K A VS+ I+ E L+IY              
Subjt:  LINATLTQTALSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIY--------------

Query:  ------------------KTEETTLDKQSKVEDATVVSHLALTANLESQGR-GGWRPQGTRGKGTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFSSSSF
                          K+EE+ ++KQ K ED     +    ++ +SQ R   + P  +  +G G     N+GRG+   +F+   +N  G G+ S + F
Subjt:  ------------------KTEETTLDKQSKVEDATVVSHLALTANLESQGR-GGWRPQGTRGKGTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFSSSSF

Query:  -SGQNSSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAA----SASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTA---YNGD
         S Q  ++  CQIC K  H AL C+NRMN+ FQGRHPP +L AM A    S  ++ N+ P        + WL D+ CN H+T++L+NL++++    YNG+
Subjt:  -SGQNSSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAA----SASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTA---YNGD

Query:  ENIT
        ENI+
Subjt:  ENIT

SwissProt top hits
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 7.0e-25; identity: 25.05%)
Query:  SVSILN-NICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTEVQPPRVNPDHEIWYERDQALITLINATLTQTALSYVIGCQTSKE
        + SILN N+ N+   +L STNY +W  Q+  L   ++L  ++DGS  +P A I T+   PRVNPD+  W  +D+ + + +   ++ +    V    T+ +
Subjt:  SVSILN-NICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTEVQPPRVNPDHEIWYERDQALITLINATLTQTALSYVIGCQTSKE

Query:  VWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLI--------------------------IYKTEETTLDKQ
        +W+TL K +++ +  ++  L+T+L+  +K + +TID Y++ +    ++LA +   +D ++ +                          + +  E  L+ +
Subjt:  VWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLI--------------------------IYKTEETTLDKQ

Query:  SKVEDATVVSHLALTANLESQGRGGWRPQGTRGKGTGGFFDNNHGRGRGGNSFSS-GNSNGSGPGQFSSSSFSGQNSSKV----NCQICHKYNHNALGCF
        SK+   +  + + +TAN  S           R   T     NN+  G   N + +  N+N S P Q SS++F   N+        CQIC    H+A  C 
Subjt:  SKVEDATVVSHLALTANLESQGRGGWRPQGTRGKGTGGFFDNNHGRGRGGNSFSS-GNSNGSGPGQFSSSSFSGQNSSKV----NCQICHKYNHNALGCF

Query:  NRMNY--SFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENI------------TDKTSGQI-------
           ++  S   + PP+        A+    +      P  S+ WL D+    H+TS+  NL++   Y G +++            T  TS          
Subjt:  NRMNY--SFQGRHPPTKLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDENI------------TDKTSGQI-------

Query:  --LFHGPNI-------------NGL----YPISTS----------------------PVSAAQ-VGLFAHIGTKVTPTVWHDRLGHPCSSILRSVLKSFG
          + + PNI             NG+    +P S                        P++++Q V LFA   +K T + WH RLGHP  SIL SV+ ++ 
Subjt:  --LFHGPNI-------------NGL----YPISTS----------------------PVSAAQ-VGLFAHIGTKVTPTVWHDRLGHPCSSILRSVLKSFG

Query:  LSV
        LSV
Subjt:  LSV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 3.5e-16; identity: 25.05%)
Query:  SVSILN-NICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTEVQPPRVNPDHEIWYERDQALITLINATLTQTALSYVIGCQTSKE
        + +ILN N+ N+   +L STNY +W  Q+  L   ++L  ++DGS  +P A I T+   PRVNPD+  W  +D+ + + I   ++ +    V    T+ +
Subjt:  SVSILN-NICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTEVQPPRVNPDHEIWYERDQALITLINATLTQTALSYVIGCQTSKE

Query:  VWDTLEKHFSSSTRTNIIGLK-----TELQSVSKQSGETIDAYVRRVKE-IVNKLAAVSVVIDAEDLIIYKTE--ETTLDKQSKVEDATVVSHLALTANL
        +W+TL K +++ +  ++  L+      +L  + K      D  V RV E + +    V   I A+D     TE  E  ++++SK+        + +TAN+
Subjt:  VWDTLEKHFSSSTRTNIIGLK-----TELQSVSKQSGETIDAYVRRVKE-IVNKLAAVSVVIDAEDLIIYKTE--ETTLDKQSKVEDATVVSHLALTANL

Query:  ESQGRGGWRPQGTRGKGTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFSSSSFSGQNSSKV----NCQICHKYNHNALGC-----FNRMNYSFQGRHPPT
         +           R   T     N +   RG N   + N+N S   Q SSS     N         CQIC    H+A  C     F       Q   P T
Subjt:  ESQGRGGWRPQGTRGKGTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFSSSSFSGQNSSKV----NCQICHKYNHNALGC-----FNRMNYSFQGRHPPT

Query:  KLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDEN----------ITDKTSGQI-----------LFHGPNIN------
          +  A  A +          P  ++ WL D+    H+TS+  NL+    Y G ++          IT   S  +           + + PNI+      
Subjt:  KLEAMAASASSIANAFPSGTLPQESSVWLTDTRCNAHLTSELTNLNVSTAYNGDEN----------ITDKTSGQI-----------LFHGPNIN------

Query:  -----------GLYPISTS----------------------PVSAAQ-VGLFAHIGTKVTPTVWHDRLGHPCSSILRSVLKSFGLSV
                     +P S                        P++++Q V +FA   +K T + WH RLGHP  +IL SV+ +  L V
Subjt:  -----------GLYPISTS----------------------PVSAAQ-VGLFAHIGTKVTPTVWHDRLGHPCSSILRSVLKSFGLSV

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGCAATCTTCTGCAAATCAGTCTTCCGAGACTTCAGCCCTAAATCCAGCGGCAAATTCTTCGAATACATCAGTTTCTATCCTCAACAATATCTGCAATTTAGTCTCCGT
TCGTCTTGATTCTACGAACTATTTTCTATGGCAGTTCCAGATTTCTCCTCTCCTGAAATCGCATAAGCTCTTTAAGTATGTTGATGGCTCAATCAAAGTTCCTGAGGCGA
TTATTCGTACTGAAGTACAACCGCCGCGCGTCAATCCTGACCATGAGATCTGGTATGAACGAGATCAGGCGCTGATCACCTTGATTAACGCCACTTTGACTCAGACGGCG
TTATCGTATGTTATCGGTTGTCAGACCTCCAAGGAAGTATGGGATACCTTGGAGAAGCACTTCTCTTCGTCTACTCGAACCAACATTATTGGCCTCAAAACTGAGTTACA
GAGCGTTTCGAAACAGTCTGGTGAAACAATTGATGCGTATGTCCGCCGTGTGAAGGAAATCGTCAACAAATTGGCCGCTGTATCTGTTGTAATTGATGCTGAAGATCTGA
TCATATACAAAACAGAAGAAACGACACTTGATAAACAATCCAAGGTTGAGGATGCTACTGTTGTCTCACATCTGGCTTTGACGGCAAATCTTGAATCTCAAGGGCGAGGG
GGATGGCGCCCACAAGGCACTCGTGGTAAAGGCACTGGCGGTTTCTTTGACAACAATCATGGACGTGGTCGAGGAGGTAATTCCTTCTCTTCGGGCAACTCCAATGGTTC
TGGTCCAGGTCAGTTTTCGTCCTCTTCCTTCTCTGGACAGAATTCAAGCAAAGTGAATTGTCAAATCTGTCACAAATACAATCACAATGCCCTTGGCTGCTTCAATCGAA
TGAACTACTCATTCCAAGGAAGGCATCCTCCGACGAAACTCGAGGCAATGGCTGCCTCTGCCAGTTCGATTGCTAATGCTTTCCCTTCTGGAACTTTACCTCAGGAATCC
AGTGTTTGGTTGACAGATACAAGGTGCAACGCACATTTGACCAGTGAGTTAACAAATCTGAACGTGTCGACTGCATACAACGGTGATGAGAACATTACAGACAAAACATC
AGGCCAAATTCTATTCCACGGACCCAACATTAATGGTTTGTATCCTATATCCACCTCACCTGTTTCAGCAGCACAAGTAGGTCTCTTTGCTCACATTGGTACCAAGGTCA
CTCCTACTGTATGGCACGATAGGTTAGGCCATCCATGCTCCTCTATACTTCGTTCTGTTCTTAAATCTTTTGGATTGTCTGTTGCTCACAGTAATACTATTTATTATGCT
GTTGAGCGACTGGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAACAGAATTGCCACATCACAGCTCGTTAG
mRNA sequence
ATGCAATCTTCTGCAAATCAGTCTTCCGAGACTTCAGCCCTAAATCCAGCGGCAAATTCTTCGAATACATCAGTTTCTATCCTCAACAATATCTGCAATTTAGTCTCCGT
TCGTCTTGATTCTACGAACTATTTTCTATGGCAGTTCCAGATTTCTCCTCTCCTGAAATCGCATAAGCTCTTTAAGTATGTTGATGGCTCAATCAAAGTTCCTGAGGCGA
TTATTCGTACTGAAGTACAACCGCCGCGCGTCAATCCTGACCATGAGATCTGGTATGAACGAGATCAGGCGCTGATCACCTTGATTAACGCCACTTTGACTCAGACGGCG
TTATCGTATGTTATCGGTTGTCAGACCTCCAAGGAAGTATGGGATACCTTGGAGAAGCACTTCTCTTCGTCTACTCGAACCAACATTATTGGCCTCAAAACTGAGTTACA
GAGCGTTTCGAAACAGTCTGGTGAAACAATTGATGCGTATGTCCGCCGTGTGAAGGAAATCGTCAACAAATTGGCCGCTGTATCTGTTGTAATTGATGCTGAAGATCTGA
TCATATACAAAACAGAAGAAACGACACTTGATAAACAATCCAAGGTTGAGGATGCTACTGTTGTCTCACATCTGGCTTTGACGGCAAATCTTGAATCTCAAGGGCGAGGG
GGATGGCGCCCACAAGGCACTCGTGGTAAAGGCACTGGCGGTTTCTTTGACAACAATCATGGACGTGGTCGAGGAGGTAATTCCTTCTCTTCGGGCAACTCCAATGGTTC
TGGTCCAGGTCAGTTTTCGTCCTCTTCCTTCTCTGGACAGAATTCAAGCAAAGTGAATTGTCAAATCTGTCACAAATACAATCACAATGCCCTTGGCTGCTTCAATCGAA
TGAACTACTCATTCCAAGGAAGGCATCCTCCGACGAAACTCGAGGCAATGGCTGCCTCTGCCAGTTCGATTGCTAATGCTTTCCCTTCTGGAACTTTACCTCAGGAATCC
AGTGTTTGGTTGACAGATACAAGGTGCAACGCACATTTGACCAGTGAGTTAACAAATCTGAACGTGTCGACTGCATACAACGGTGATGAGAACATTACAGACAAAACATC
AGGCCAAATTCTATTCCACGGACCCAACATTAATGGTTTGTATCCTATATCCACCTCACCTGTTTCAGCAGCACAAGTAGGTCTCTTTGCTCACATTGGTACCAAGGTCA
CTCCTACTGTATGGCACGATAGGTTAGGCCATCCATGCTCCTCTATACTTCGTTCTGTTCTTAAATCTTTTGGATTGTCTGTTGCTCACAGTAATACTATTTATTATGCT
GTTGAGCGACTGGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAACAGAATTGCCACATCACAGCTCGTTAG
Protein sequence
MQSSANQSSETSALNPAANSSNTSVSILNNICNLVSVRLDSTNYFLWQFQISPLLKSHKLFKYVDGSIKVPEAIIRTEVQPPRVNPDHEIWYERDQALITLINATLTQTA
LSYVIGCQTSKEVWDTLEKHFSSSTRTNIIGLKTELQSVSKQSGETIDAYVRRVKEIVNKLAAVSVVIDAEDLIIYKTEETTLDKQSKVEDATVVSHLALTANLESQGRG
GWRPQGTRGKGTGGFFDNNHGRGRGGNSFSSGNSNGSGPGQFSSSSFSGQNSSKVNCQICHKYNHNALGCFNRMNYSFQGRHPPTKLEAMAASASSIANAFPSGTLPQES
SVWLTDTRCNAHLTSELTNLNVSTAYNGDENITDKTSGQILFHGPNINGLYPISTSPVSAAQVGLFAHIGTKVTPTVWHDRLGHPCSSILRSVLKSFGLSVAHSNTIYYA
VERLEGANSVLQQNWEQNCHITAR
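
The relationship between the CDS and protein sequences listed above can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE`, `translate`, and `cds_tail` are hypothetical, and only the final line of the CDS is used here to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalise case
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon: do not include it in the protein
            break
        protein.append(residue)
    return "".join(protein)


# Last line of the CDS shown above; it starts on a codon boundary
# because the preceding 12 lines total 1320 nt (a multiple of three).
cds_tail = (
    "GTTGAGCGACTGGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAACAGAATTGCCACATCACAGCTCGTTAG"
)
print(translate(cds_tail))
```

Passing the full CDS (all lines joined) to `translate` should, under the same assumption, reproduce the complete protein sequence; a mismatch would suggest a frame shift or a different translation table.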