; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011774 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011774
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr1:32577075..32578589
RNA-Seq ExpressionLag0011774
SyntenyLag0011774
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.5e-9544.44Show/hide
Query:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN
        MSE  L Q L C  AKEIW+ L  IF+SR+LAQ M+ K+K  NI+KG   + EY  KI +CVDALA+I K +  +DHILYIL+GLG++Y+ M+SVI+A+ 
Subjt:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN

Query:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTVQNTAK---------------KDSFGQPYG---------------------------------
         S SVQ+V++LLLT ES+ ESK    T   LP+ N+  Q T K                 S+ Q  G                                 
Subjt:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTVQNTAK---------------KDSFGQPYG---------------------------------

Query:  ------------------------NHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGF---HISRLGYGSITSPSN
                                N+ PQM AM+ A   N D NWYPDSGATNHLT++L N+S+ SEY G NQ+   NGSG    H   + + S T P  
Subjt:  ------------------------NHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGF---HISRLGYGSITSPSN

Query:  HVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLS---KLQPSTNSNASPTVQHSSLSCNQPTAFSVSSV
          F L+NLL VPSITKNLISVSQ AKDN VFFEFHP  C VKDL T + LL+G L++GLYKF +    K    +NSN  P         N P        
Subjt:  HVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLS---KLQPSTNSNASPTVQHSSLSCNQPTAFSVSSV

Query:  QYLNNMSLDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVA
               LD+WH+RLGH  L +VK VL +  +S  T    +FC ACA+GKHHA+PFS S T Y+ PL+LI  DLWGP+  +S +GFRYYISFV A
Subjt:  QYLNNMSLDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVA

KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum]2.5e-7939.09Show/hide
Query:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN
        MSES+  Q + C  + ++W  + Q+F +R  A++M+ K + Q ++KG  +M +Y+ K+K  +D LAA    IP +D IL+IL G+G EYE +V  + ++ 
Subjt:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN

Query:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTV---QNTAKKDSFGQP------------------YGNHFP-----------------------
         S S+ +V ALLL HE RIE+ +    +   P+ N+T    Q  A+  S  QP                  + N  P                       
Subjt:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTV---QNTAKKDSFGQP------------------YGNHFP-----------------------

Query:  -----------QMQAMLVAPSFNQDCN------------WYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGFHISRLGYGSITS-PSNHVFHLHN
                   Q Q    +PS+                 WYPDSGA++H+TN+LGN+S+SSEY G ++V +GNG+G  IS +G  ++   PS+  F L N
Subjt:  -----------QMQAMLVAPSFNQDCN------------WYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGFHISRLGYGSITS-PSNHVFHLHN

Query:  LLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDL-SKLQPSTNSNASPTVQHSSLSCNQPTAFSVSSVQYLNNMSLD
        LLHVP ITKNLISVS+ A DN V+FEFHP+FC+VKD AT   LLRGTLH GLY+F+L S++    +   SP    SS+S   P      S   L   +LD
Subjt:  LLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDL-SKLQPSTNSNASPTVQHSSLSCNQPTAFSVSSVQYLNNMSLD

Query:  IWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVA
         WH RLGH +++ VK+VL +C      N   SFC++C +GK+H +PF  STT +S+P +++ +DLWGP++  SR+G RYYISFV A
Subjt:  IWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVA

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.9e-7235.19Show/hide
Query:  LEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKNGSQSV
        L Q + C+ A E+W+ + Q FNS+  A++M  KS+ Q ++K G TM +Y++K+K   D LA    +I   DHIL I+ GLG EYE +++VI++K  S S+
Subjt:  LEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKNGSQSV

Query:  QDVIALLLTHESRIESK--------------------SSINTNGV----LPTANLTVQNTAKKDSF--------GQPYGNHFPQ----------------
        Q V + L+ HE RI  K                    SS N+NG         N    N   + SF        G+  G   PQ                
Subjt:  QDVIALLLTHESRIESK--------------------SSINTNGV----LPTANLTVQNTAKKDSF--------GQPYGNHFPQ----------------

Query:  -----------------------------------------------------MQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLI
                                                             M+AM+  P   Q+C W+PDSGATNH+T++LGN++  +EY GN+++ +
Subjt:  -----------------------------------------------------MQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLI

Query:  GNGSGFHISRLGYGSITSPS--NHVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSKLQPSTNSNASP
        GNG+G  IS +G     S S  N V  L N+L VP+I KNL+SVSQ A+DN V+FEFHP  C VKD +    LL+G LH+GLY+F+LSK      S  S 
Subjt:  GNGSGFHISRLGYGSITSPS--NHVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSKLQPSTNSNASP

Query:  TVQHSSLSCNQPTAFSVSSVQYLNNMS-----LDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLW
        +   + L+C   +     +  +    +      D+WH+RLGH A  +V +VL + K  F T + +S C+AC +GK H +PF  S T Y+ PL+L+V+DLW
Subjt:  TVQHSSLSCNQPTAFSVSSVQYLNNMS-----LDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLW

Query:  GPSYKLSRHGFRYYISFVVA
        GP+   S +GF YY+SFV A
Subjt:  GPSYKLSRHGFRYYISFVVA

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.5e-9544.44Show/hide
Query:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN
        MSE  L Q L C  AKEIW+ L  IF+SR+LAQ M+ K+K  NI+KG   + EY  KI +CVDALA+I K +  +DHILYIL+GLG++Y+ M+SVI+A+ 
Subjt:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN

Query:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTVQNTAK---------------KDSFGQPYG---------------------------------
         S SVQ+V++LLLT ES+ ESK    T   LP+ N+  Q T K                 S+ Q  G                                 
Subjt:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTVQNTAK---------------KDSFGQPYG---------------------------------

Query:  ------------------------NHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGF---HISRLGYGSITSPSN
                                N+ PQM AM+ A   N D NWYPDSGATNHLT++L N+S+ SEY G NQ+   NGSG    H   + + S T P  
Subjt:  ------------------------NHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGF---HISRLGYGSITSPSN

Query:  HVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLS---KLQPSTNSNASPTVQHSSLSCNQPTAFSVSSV
          F L+NLL VPSITKNLISVSQ AKDN VFFEFHP  C VKDL T + LL+G L++GLYKF +    K    +NSN  P         N P        
Subjt:  HVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLS---KLQPSTNSNASPTVQHSSLSCNQPTAFSVSSV

Query:  QYLNNMSLDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVA
               LD+WH+RLGH  L +VK VL +  +S  T    +FC ACA+GKHHA+PFS S T Y+ PL+LI  DLWGP+  +S +GFRYYISFV A
Subjt:  QYLNNMSLDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVA

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia]2.5e-8744.39Show/hide
Query:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN
        MSE  L Q L C   KEIW  L   F SR+LA++M++KSK +N++KG   +  Y  KIK  VD+LA   K +P +DHI++IL+ LG E++ +VSVI+ + 
Subjt:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN

Query:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTVQNTAKKDSFGQPYGNHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNN
          QS+Q+  +   +H    + +SS            +  +T  + +FG  +G   PQMQAM+VA  FN+D  WYPDSGATNH+TN+ GN S+ S+Y GN 
Subjt:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTVQNTAKKDSFGQPYGNHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNN

Query:  QVLIGNGSGFHISRLGYG-----SITSPSNHVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSK----
        ++ +GNG+   IS +G       S ++ S  VFHL NLLHVP I KNLIS+S  AKDN VFFEFHP+   VKDL T + L +GT+H+ LY+F+L K    
Subjt:  QVLIGNGSGFHISRLGYG-----SITSPSNHVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSK----

Query:  --LQPSTNSNASPTVQHSSLS-CNQPTAFSVSSVQYLNNMSLDIWHQRLGHS-ALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSS
             S+ SN SPT+ +S L   N P   +  +    N   LDIWH+R GHS  L +V+ V+R+C      N  ++ C+  AIGK H +PF  S T Y++
Subjt:  --LQPSTNSNASPTVQHSSLS-CNQPTAFSVSSVQYLNNMSLDIWHQRLGHS-ALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSS

Query:  PLKLIVTDLWGPSYKLSRHGFRYYISFV
        PL+L+V DLWG +Y  S++GFRY++SFV
Subjt:  PLKLIVTDLWGPSYKLSRHGFRYYISFV

TrEMBL top hitse value%identityAlignment
A0A2Z7AWA7 Integrase catalytic domain-containing protein1.2e-7939.09Show/hide
Query:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN
        MSES+  Q + C  + ++W  + Q+F +R  A++M+ K + Q ++KG  +M +Y+ K+K  +D LAA    IP +D IL+IL G+G EYE +V  + ++ 
Subjt:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN

Query:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTV---QNTAKKDSFGQP------------------YGNHFP-----------------------
         S S+ +V ALLL HE RIE+ +    +   P+ N+T    Q  A+  S  QP                  + N  P                       
Subjt:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTV---QNTAKKDSFGQP------------------YGNHFP-----------------------

Query:  -----------QMQAMLVAPSFNQDCN------------WYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGFHISRLGYGSITS-PSNHVFHLHN
                   Q Q    +PS+                 WYPDSGA++H+TN+LGN+S+SSEY G ++V +GNG+G  IS +G  ++   PS+  F L N
Subjt:  -----------QMQAMLVAPSFNQDCN------------WYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGFHISRLGYGSITS-PSNHVFHLHN

Query:  LLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDL-SKLQPSTNSNASPTVQHSSLSCNQPTAFSVSSVQYLNNMSLD
        LLHVP ITKNLISVS+ A DN V+FEFHP+FC+VKD AT   LLRGTLH GLY+F+L S++    +   SP    SS+S   P      S   L   +LD
Subjt:  LLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDL-SKLQPSTNSNASPTVQHSSLSCNQPTAFSVSSVQYLNNMSLD

Query:  IWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVA
         WH RLGH +++ VK+VL +C      N   SFC++C +GK+H +PF  STT +S+P +++ +DLWGP++  SR+G RYYISFV A
Subjt:  IWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVA

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-7235.19Show/hide
Query:  LEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKNGSQSV
        L Q + C+ A E+W+ + Q FNS+  A++M  KS+ Q ++K G TM +Y++K+K   D LA    +I   DHIL I+ GLG EYE +++VI++K  S S+
Subjt:  LEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKNGSQSV

Query:  QDVIALLLTHESRIESK--------------------SSINTNGV----LPTANLTVQNTAKKDSF--------GQPYGNHFPQ----------------
        Q V + L+ HE RI  K                    SS N+NG         N    N   + SF        G+  G   PQ                
Subjt:  QDVIALLLTHESRIESK--------------------SSINTNGV----LPTANLTVQNTAKKDSF--------GQPYGNHFPQ----------------

Query:  -----------------------------------------------------MQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLI
                                                             M+AM+  P   Q+C W+PDSGATNH+T++LGN++  +EY GN+++ +
Subjt:  -----------------------------------------------------MQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLI

Query:  GNGSGFHISRLGYGSITSPS--NHVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSKLQPSTNSNASP
        GNG+G  IS +G     S S  N V  L N+L VP+I KNL+SVSQ A+DN V+FEFHP  C VKD +    LL+G LH+GLY+F+LSK      S  S 
Subjt:  GNGSGFHISRLGYGSITSPS--NHVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSKLQPSTNSNASP

Query:  TVQHSSLSCNQPTAFSVSSVQYLNNMS-----LDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLW
        +   + L+C   +     +  +    +      D+WH+RLGH A  +V +VL + K  F T + +S C+AC +GK H +PF  S T Y+ PL+L+V+DLW
Subjt:  TVQHSSLSCNQPTAFSVSSVQYLNNMS-----LDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLW

Query:  GPSYKLSRHGFRYYISFVVA
        GP+   S +GF YY+SFV A
Subjt:  GPSYKLSRHGFRYYISFVVA

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-9544.44Show/hide
Query:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN
        MSE  L Q L C  AKEIW+ L  IF+SR+LAQ M+ K+K  NI+KG   + EY  KI +CVDALA+I K +  +DHILYIL+GLG++Y+ M+SVI+A+ 
Subjt:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN

Query:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTVQNTAK---------------KDSFGQPYG---------------------------------
         S SVQ+V++LLLT ES+ ESK    T   LP+ N+  Q T K                 S+ Q  G                                 
Subjt:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTVQNTAK---------------KDSFGQPYG---------------------------------

Query:  ------------------------NHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGF---HISRLGYGSITSPSN
                                N+ PQM AM+ A   N D NWYPDSGATNHLT++L N+S+ SEY G NQ+   NGSG    H   + + S T P  
Subjt:  ------------------------NHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGF---HISRLGYGSITSPSN

Query:  HVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLS---KLQPSTNSNASPTVQHSSLSCNQPTAFSVSSV
          F L+NLL VPSITKNLISVSQ AKDN VFFEFHP  C VKDL T + LL+G L++GLYKF +    K    +NSN  P         N P        
Subjt:  HVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLS---KLQPSTNSNASPTVQHSSLSCNQPTAFSVSSV

Query:  QYLNNMSLDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVA
               LD+WH+RLGH  L +VK VL +  +S  T    +FC ACA+GKHHA+PFS S T Y+ PL+LI  DLWGP+  +S +GFRYYISFV A
Subjt:  QYLNNMSLDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVA

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-9544.44Show/hide
Query:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN
        MSE  L Q L C  AKEIW+ L  IF+SR+LAQ M+ K+K  NI+KG   + EY  KI +CVDALA+I K +  +DHILYIL+GLG++Y+ M+SVI+A+ 
Subjt:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN

Query:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTVQNTAK---------------KDSFGQPYG---------------------------------
         S SVQ+V++LLLT ES+ ESK    T   LP+ N+  Q T K                 S+ Q  G                                 
Subjt:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTVQNTAK---------------KDSFGQPYG---------------------------------

Query:  ------------------------NHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGF---HISRLGYGSITSPSN
                                N+ PQM AM+ A   N D NWYPDSGATNHLT++L N+S+ SEY G NQ+   NGSG    H   + + S T P  
Subjt:  ------------------------NHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGF---HISRLGYGSITSPSN

Query:  HVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLS---KLQPSTNSNASPTVQHSSLSCNQPTAFSVSSV
          F L+NLL VPSITKNLISVSQ AKDN VFFEFHP  C VKDL T + LL+G L++GLYKF +    K    +NSN  P         N P        
Subjt:  HVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLS---KLQPSTNSNASPTVQHSSLSCNQPTAFSVSSV

Query:  QYLNNMSLDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVA
               LD+WH+RLGH  L +VK VL +  +S  T    +FC ACA+GKHHA+PFS S T Y+ PL+LI  DLWGP+  +S +GFRYYISFV A
Subjt:  QYLNNMSLDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVA

A0A6J1DSS1 uncharacterized protein LOC1110235861.2e-8744.39Show/hide
Query:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN
        MSE  L Q L C   KEIW  L   F SR+LA++M++KSK +N++KG   +  Y  KIK  VD+LA   K +P +DHI++IL+ LG E++ +VSVI+ + 
Subjt:  MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKN

Query:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTVQNTAKKDSFGQPYGNHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNN
          QS+Q+  +   +H    + +SS            +  +T  + +FG  +G   PQMQAM+VA  FN+D  WYPDSGATNH+TN+ GN S+ S+Y GN 
Subjt:  GSQSVQDVIALLLTHESRIESKSSINTNGVLPTANLTVQNTAKKDSFGQPYGNHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNN

Query:  QVLIGNGSGFHISRLGYG-----SITSPSNHVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSK----
        ++ +GNG+   IS +G       S ++ S  VFHL NLLHVP I KNLIS+S  AKDN VFFEFHP+   VKDL T + L +GT+H+ LY+F+L K    
Subjt:  QVLIGNGSGFHISRLGYG-----SITSPSNHVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSK----

Query:  --LQPSTNSNASPTVQHSSLS-CNQPTAFSVSSVQYLNNMSLDIWHQRLGHS-ALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSS
             S+ SN SPT+ +S L   N P   +  +    N   LDIWH+R GHS  L +V+ V+R+C      N  ++ C+  AIGK H +PF  S T Y++
Subjt:  --LQPSTNSNASPTVQHSSLS-CNQPTAFSVSSVQYLNNMSLDIWHQRLGHS-ALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSS

Query:  PLKLIVTDLWGPSYKLSRHGFRYYISFV
        PL+L+V DLWG +Y  S++GFRY++SFV
Subjt:  PLKLIVTDLWGPSYKLSRHGFRYYISFV

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-0822.49Show/hide
Query:  DCNWYPDSGATNHLT--NNLGNMSMSSEYLGNNQVLIGNGSGFHISRLGYGSITSPSNHVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVK
        +  W  D+ A++H T   +L    ++ ++     V +GN S   I+ +G   I +       L ++ HVP +  NLIS   + +D    +  +  + + K
Subjt:  DCNWYPDSGATNHLT--NNLGNMSMSSEYLGNNQVLIGNGSGFHISRLGYGSITSPSNHVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVK

Query:  -DLATRRALLRGTLHEGLYKFDLSKLQPSTNSNASPTVQHSSLSCNQPTAFSVSSVQYLNNMSLDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCN
          L   + + RGTL+              TN+     +    L+  Q            + +S+D+WH+R+GH +   ++ + +    S+        C+
Subjt:  -DLATRRALLRGTLHEGLYKFDLSKLQPSTNSNASPTVQHSSLSCNQPTAFSVSSVQYLNNMSLDIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCN

Query:  ACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFV
         C  GK H + F  S+    + L L+ +D+ GP    S  G +Y+++F+
Subjt:  ACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFV

P93293 Uncharacterized mitochondrial protein AtMg003006.2e-0435.53Show/hide
Query:  IWHQRLGHSALSVVKEVLRNCKSSFPTNAVTS--FCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWG-PSYKLS
        +WH RL H +   ++ +++  K    ++ V+S  FC  C  GK H + FS    T  +PL  + +DLWG PS  LS
Subjt:  IWHQRLGHSALSVVKEVLRNCKSSFPTNAVTS--FCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWG-PSYKLS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.9e-3828.42Show/hide
Query:  QCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKNGSQSVQDVIA
        + T A +IW+ L +I+ +     + +++++ +   KG  T+++Y+  +    D LA + K +  ++ +  +L  L  EY+ ++  IAAK+   ++ ++  
Subjt:  QCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKNGSQSVQDVIA

Query:  LLLTHESRIESKSSI----------------------------------NTNGVLP----TANLTVQNTAKKDSFG------------------------
         LL HES+I + SS                                   N N   P    + N    N   K   G                        
Subjt:  LLLTHESRIESKSSI----------------------------------NTNGVLP----TANLTVQNTAKKDSFG------------------------

Query:  ----QPYGNHFP-QMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGFHISRLGYGSITSPSNHVFHLHNLLHVPSITKNL
            QP     P Q +A L   S     NW  DSGAT+H+T++  N+S+   Y G + V++ +GS   IS  G  S+++ S  + +LHN+L+VP+I KNL
Subjt:  ----QPYGNHFP-QMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGFHISRLGYGSITSPSNHVFHLHNLLHVPSITKNL

Query:  ISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSKLQP-STNSNASPTVQHSSLSCNQPTAFSVSSVQYLNNMSLDIWHQRLGHSAL
        ISV ++   N V  EF P    VKDL T   LL+G   + LY++ ++  QP S  ++ S    HSS                        WH RLGH A 
Subjt:  ISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSKLQP-STNSNASPTVQHSSLSCNQPTAFSVSSVQYLNNMSLDIWHQRLGHSAL

Query:  SVVKEVLRNCKSSF--PTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFV
        S++  V+ N   S   P++   S C+ C I K + +PFS ST   + PL+ I +D+W  S  LS   +RYY+ FV
Subjt:  SVVKEVLRNCKSSF--PTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-3034.41Show/hide
Query:  QNTAKKDSFGQPYGNHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGFHISRLGYGSITSPSNHVFHLHNLLHVPS
        Q+T  +     P+    P+    + +P +N + NW  DSGAT+H+T++  N+S    Y G + V+I +GS   I+  G  S+ + S+    L+ +L+VP+
Subjt:  QNTAKKDSFGQPYGNHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGFHISRLGYGSITSPSNHVFHLHNLLHVPS

Query:  ITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSKLQPSTNSNASPTVQHSSLSCNQPTAFSVSSVQYLNNMSLDIWHQRLG
        I KNLISV ++   N V  EF P    VKDL T   LL+G   + LY++ ++  Q + +  ASP        C++ T  S              WH RLG
Subjt:  ITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSKLQPSTNSNASPTVQHSSLSCNQPTAFSVSSVQYLNNMSLDIWHQRLG

Query:  HSALSVVKEVLRNCKSSF--PTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFV
        H +L+++  V+ N       P++ + S C+ C I K H +PFS ST T S PL+ I +D+W  S  LS   +RYY+ FV
Subjt:  HSALSVVKEVLRNCKSSF--PTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFV

Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.4e-0624.16Show/hide
Query:  CTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKNGSQSVQDVIAL
        CT A+++W  L  +F     A+ ++ +++ +       +++EY  K+K   D L  ++  I     ++++L+GL  +Y+ +++VI  K+   S  +  ++
Subjt:  CTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKNGSQSVQDVIAL

Query:  LLTHESRI--ESKSSINTNGVLPTANLTVQNTAKKDSFGQPYGNHFPQM
        LL  ESR+  +SKSS++       +N+      +++ + Q Y N+   M
Subjt:  LLTHESRI--ESKSSINTNGVLPTANLTVQNTAKKDSFGQPYGNHFPQM

ATMG00300.1 Gag-Pol-related retrotransposon family protein4.4e-0535.53Show/hide
Query:  IWHQRLGHSALSVVKEVLRNCKSSFPTNAVTS--FCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWG-PSYKLS
        +WH RL H +   ++ +++  K    ++ V+S  FC  C  GK H + FS    T  +PL  + +DLWG PS  LS
Subjt:  IWHQRLGHSALSVVKEVLRNCKSSFPTNAVTS--FCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWG-PSYKLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGAGTCTAGTTTAGAGCAAGCCCTACAGTGTACTTATGCGAAAGAGATTTGGGATTGCCTTCTTCAAATTTTTAATTCTAGGCATCTTGCTCAAATGATGAAAAT
AAAATCGAAATCGCAAAATATTCAGAAAGGGGGTTCGACTATGAATGAATATGTCTCTAAGATTAAAAAGTGTGTTGATGCTCTAGCCGCTATAGAAAAGGAGATTCCAG
TAGAAGATCATATTTTGTATATATTGTCTGGCCTTGGTGCGGAATATGAAATAATGGTCTCGGTTATTGCTGCTAAAAATGGGTCTCAGTCTGTCCAAGATGTAATTGCG
CTGCTTTTAACTCATGAGAGTCGAATAGAGAGTAAATCCTCTATTAATACAAATGGAGTTTTACCCACTGCTAACCTCACTGTTCAAAACACTGCTAAAAAAGATAGCTT
TGGTCAGCCATATGGAAATCACTTTCCTCAAATGCAAGCTATGTTGGTAGCTCCTAGTTTTAATCAGGATTGTAACTGGTATCCAGACTCTGGTGCTACAAACCATTTGA
CCAATAATCTTGGTAATATGTCCATGAGTTCTGAATATCTTGGCAATAATCAGGTTCTTATCGGCAATGGTTCAGGTTTTCATATTTCTCGACTTGGATATGGCTCTATT
ACTTCTCCTTCTAATCATGTTTTTCATCTTCACAACTTGTTACATGTTCCTTCAATTACTAAAAATCTAATTAGTGTCAGCCAAATCGCCAAAGATAATGCTGTTTTCTT
TGAGTTTCACCCAAATTTTTGTGTTGTGAAGGACCTAGCAACTCGACGAGCACTTCTCCGAGGGACTCTTCATGAAGGACTATACAAGTTCGATCTGTCAAAGCTTCAAC
CATCCACAAATTCGAATGCATCCCCTACTGTCCAACATTCTTCTTTGTCTTGCAATCAACCTACTGCCTTTTCTGTTTCCTCTGTTCAATATTTAAATAATATGTCTCTA
GATATTTGGCATCAACGTTTAGGTCATTCTGCTCTTTCAGTTGTTAAAGAAGTTCTTCGCAATTGTAAATCAAGTTTTCCTACCAATGCAGTCACTTCTTTTTGCAATGC
TTGTGCTATAGGAAAACATCATGCAATTCCTTTTTCTCCCTCTACTACTACCTATTCTTCACCATTGAAATTAATTGTGACTGATTTATGGGGACCATCTTACAAATTGT
CTCGCCATGGTTTTCGATATTATATTAGCTTTGTTGTTGCTGTAAACCCGAGACTTCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTGAGTCTAGTTTAGAGCAAGCCCTACAGTGTACTTATGCGAAAGAGATTTGGGATTGCCTTCTTCAAATTTTTAATTCTAGGCATCTTGCTCAAATGATGAAAAT
AAAATCGAAATCGCAAAATATTCAGAAAGGGGGTTCGACTATGAATGAATATGTCTCTAAGATTAAAAAGTGTGTTGATGCTCTAGCCGCTATAGAAAAGGAGATTCCAG
TAGAAGATCATATTTTGTATATATTGTCTGGCCTTGGTGCGGAATATGAAATAATGGTCTCGGTTATTGCTGCTAAAAATGGGTCTCAGTCTGTCCAAGATGTAATTGCG
CTGCTTTTAACTCATGAGAGTCGAATAGAGAGTAAATCCTCTATTAATACAAATGGAGTTTTACCCACTGCTAACCTCACTGTTCAAAACACTGCTAAAAAAGATAGCTT
TGGTCAGCCATATGGAAATCACTTTCCTCAAATGCAAGCTATGTTGGTAGCTCCTAGTTTTAATCAGGATTGTAACTGGTATCCAGACTCTGGTGCTACAAACCATTTGA
CCAATAATCTTGGTAATATGTCCATGAGTTCTGAATATCTTGGCAATAATCAGGTTCTTATCGGCAATGGTTCAGGTTTTCATATTTCTCGACTTGGATATGGCTCTATT
ACTTCTCCTTCTAATCATGTTTTTCATCTTCACAACTTGTTACATGTTCCTTCAATTACTAAAAATCTAATTAGTGTCAGCCAAATCGCCAAAGATAATGCTGTTTTCTT
TGAGTTTCACCCAAATTTTTGTGTTGTGAAGGACCTAGCAACTCGACGAGCACTTCTCCGAGGGACTCTTCATGAAGGACTATACAAGTTCGATCTGTCAAAGCTTCAAC
CATCCACAAATTCGAATGCATCCCCTACTGTCCAACATTCTTCTTTGTCTTGCAATCAACCTACTGCCTTTTCTGTTTCCTCTGTTCAATATTTAAATAATATGTCTCTA
GATATTTGGCATCAACGTTTAGGTCATTCTGCTCTTTCAGTTGTTAAAGAAGTTCTTCGCAATTGTAAATCAAGTTTTCCTACCAATGCAGTCACTTCTTTTTGCAATGC
TTGTGCTATAGGAAAACATCATGCAATTCCTTTTTCTCCCTCTACTACTACCTATTCTTCACCATTGAAATTAATTGTGACTGATTTATGGGGACCATCTTACAAATTGT
CTCGCCATGGTTTTCGATATTATATTAGCTTTGTTGTTGCTGTAAACCCGAGACTTCCCTAG
Protein sequenceShow/hide protein sequence
MSESSLEQALQCTYAKEIWDCLLQIFNSRHLAQMMKIKSKSQNIQKGGSTMNEYVSKIKKCVDALAAIEKEIPVEDHILYILSGLGAEYEIMVSVIAAKNGSQSVQDVIA
LLLTHESRIESKSSINTNGVLPTANLTVQNTAKKDSFGQPYGNHFPQMQAMLVAPSFNQDCNWYPDSGATNHLTNNLGNMSMSSEYLGNNQVLIGNGSGFHISRLGYGSI
TSPSNHVFHLHNLLHVPSITKNLISVSQIAKDNAVFFEFHPNFCVVKDLATRRALLRGTLHEGLYKFDLSKLQPSTNSNASPTVQHSSLSCNQPTAFSVSSVQYLNNMSL
DIWHQRLGHSALSVVKEVLRNCKSSFPTNAVTSFCNACAIGKHHAIPFSPSTTTYSSPLKLIVTDLWGPSYKLSRHGFRYYISFVVAVNPRLP