; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034851 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034851
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr3:11612298..11613485
RNA-Seq ExpressionLag0034851
SyntenyLag0034851
Gene Ontology termsGO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]1.5e-8152.39Show/hide
Query:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK
        T +  PP     Q NP YE+W+AKDQALMT+IN TLSP  LAY+VG TSSKQ W+VL K YSS SR+N+VNLKS LQ+I KK DES D Y+KRIKE+KDK
Subjt:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK

Query:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG
        LA VS  +++ED+LIYALNGLP  +NTFRTSM TRSQ +TF ELHVLL+ EE A+ KQSK DD+  QP+ + +S  + + LS     + NF   R +G G
Subjt:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG

Query:  RNQGHG-YGFNPSGRGN--FNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSS
        +N GHG + F+   RG+    + + +  +H          +CQIC R  H+A+DCFNRMNY+FQGRH P QLAAMVASQN ++ +  +SS          
Subjt:  RNQGHG-YGFNPSGRGN--FNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSS

Query:  TWLTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSS
          LTDSGCN  +T+D+N   +A EY G++QV VG  Q+ PI H+G   L  +S S
Subjt:  TWLTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSS

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]1.9e-8151.53Show/hide
Query:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK
        T   +  + V  Q NP YE+W+AKDQALMT+IN TLSP  LAY+VG TSSKQ W+VL K YSS SR+N+VNLKS LQ+I KK DES D Y+KRIKE+KDK
Subjt:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK

Query:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG
        LA VS  +++ED+LIYALNGLP  +NTFRTSM TRSQ +TF ELHVLL+ EE A+ KQSK DD+  QP+ + +S  + + LS       NF   R +G G
Subjt:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG

Query:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW
        ++ GHG + F+   RG+        SS  Q S      +CQIC R  H+A+DCFNRMNY+FQGRH P QLAAMVASQN ++ +  +SS            
Subjt:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW

Query:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNL
        LTDSGCN  +T+D+N   +A EY G++QV +G  Q+ P+ H+G      +S S  +  L
Subjt:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNL

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]4.3e-8153.06Show/hide
Query:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK
        T   +  + V  Q NP YE+W+AKDQALMT+IN TLSP  LAY+VG TSSKQ W+VL K YSS SR+N+VNLKS LQ+I KK DES D Y+KRIKE+KDK
Subjt:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK

Query:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG
        LA VS  +++ED+LIYALNGLP  +NTFRTSM TRSQ +TF ELHVLL+ EE A+ KQSK DD+  QP+ + +S  + + LS       NF   R +G G
Subjt:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG

Query:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW
        ++ GHG + F+   RG+        SS  Q S      +CQIC R  H+A+DCFNRMNY+FQGRH P QLAAMVASQN ++ +  +SS            
Subjt:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW

Query:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNG
        LTDSGCN  +T+D+N   +A EY G++QV +G  Q+ P+ H+G
Subjt:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNG

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]4.3e-8153.04Show/hide
Query:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK
        T +  PP     Q NP YE+W+AKDQALMT+IN TLSP  LAY+VG TSSKQ W+VL K YSS SR+N+VNLKS LQ+I KK DES D Y+KRIKE+KDK
Subjt:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK

Query:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG
        LA VS  +++ED+LIYALNGLP  +NTFRTSM TRSQ +TF ELHVLL+ EE A+ KQSK DD+  QP+ + +S  + + LS     + NF   R +G G
Subjt:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG

Query:  RNQGHG-YGFNPSGRGN--FNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSS
        +N GHG + F+   RG+    + + +  +H          +CQIC R  H+A+DCFNRMNY+FQGRH P QLAAMVASQN ++ +  +SS          
Subjt:  RNQGHG-YGFNPSGRGN--FNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSS

Query:  TWLTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNG
          LTDSGCN  +T+D+N   +A EY G++QV VG  Q+ PI H+G
Subjt:  TWLTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNG

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]1.9e-8151.53Show/hide
Query:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK
        T   +  + V  Q NP YE+W+AKDQALMT+IN TLSP  LAY+VG TSSKQ W+VL K YSS SR+N+VNLKS LQ+I KK DES D Y+KRIKE+KDK
Subjt:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK

Query:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG
        LA VS  +++ED+LIYALNGLP  +NTFRTSM TRSQ +TF ELHVLL+ EE A+ KQSK DD+  QP+ + +S  + + LS       NF   R +G G
Subjt:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG

Query:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW
        ++ GHG + F+   RG+        SS  Q S      +CQIC R  H+A+DCFNRMNY+FQGRH P QLAAMVASQN ++ +  +SS            
Subjt:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW

Query:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNL
        LTDSGCN  +T+D+N   +A EY G++QV +G  Q+ P+ H+G      +S S  +  L
Subjt:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNL

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X29.4e-8251.53Show/hide
Query:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK
        T   +  + V  Q NP YE+W+AKDQALMT+IN TLSP  LAY+VG TSSKQ W+VL K YSS SR+N+VNLKS LQ+I KK DES D Y+KRIKE+KDK
Subjt:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK

Query:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG
        LA VS  +++ED+LIYALNGLP  +NTFRTSM TRSQ +TF ELHVLL+ EE A+ KQSK DD+  QP+ + +S  + + LS       NF   R +G G
Subjt:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG

Query:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW
        ++ GHG + F+   RG+        SS  Q S      +CQIC R  H+A+DCFNRMNY+FQGRH P QLAAMVASQN ++ +  +SS            
Subjt:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW

Query:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNL
        LTDSGCN  +T+D+N   +A EY G++QV +G  Q+ P+ H+G      +S S  +  L
Subjt:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNL

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X32.1e-8153.06Show/hide
Query:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK
        T   +  + V  Q NP YE+W+AKDQALMT+IN TLSP  LAY+VG TSSKQ W+VL K YSS SR+N+VNLKS LQ+I KK DES D Y+KRIKE+KDK
Subjt:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK

Query:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG
        LA VS  +++ED+LIYALNGLP  +NTFRTSM TRSQ +TF ELHVLL+ EE A+ KQSK DD+  QP+ + +S  + + LS       NF   R +G G
Subjt:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG

Query:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW
        ++ GHG + F+   RG+        SS  Q S      +CQIC R  H+A+DCFNRMNY+FQGRH P QLAAMVASQN ++ +  +SS            
Subjt:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW

Query:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNG
        LTDSGCN  +T+D+N   +A EY G++QV +G  Q+ P+ H+G
Subjt:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNG

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X19.4e-8251.53Show/hide
Query:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK
        T   +  + V  Q NP YE+W+AKDQALMT+IN TLSP  LAY+VG TSSKQ W+VL K YSS SR+N+VNLKS LQ+I KK DES D Y+KRIKE+KDK
Subjt:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK

Query:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG
        LA VS  +++ED+LIYALNGLP  +NTFRTSM TRSQ +TF ELHVLL+ EE A+ KQSK DD+  QP+ + +S  + + LS       NF   R +G G
Subjt:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG

Query:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW
        ++ GHG + F+   RG+        SS  Q S      +CQIC R  H+A+DCFNRMNY+FQGRH P QLAAMVASQN ++ +  +SS            
Subjt:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW

Query:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNL
        LTDSGCN  +T+D+N   +A EY G++QV +G  Q+ P+ H+G      +S S  +  L
Subjt:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNL

A0A5D3CLI6 T4.52.3e-8050.54Show/hide
Query:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK
        T   +  + V  Q NP YE+W+AKDQALMT+IN TLSP  LAY+VG TSSKQ W+VL K YSS SR+N+VNLKS LQ+I KK DES D Y+KRIKE+KDK
Subjt:  TATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK

Query:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG
        LA VS  +++ED+LIYALNGLP  +NTFRTSM TRSQ +TF ELHVLL+ EE A+ KQSK DD+  QP+ + +S  + + LS       NF   R +G G
Subjt:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG

Query:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW
        ++ GHG + F+   RG+        SS  Q S      +CQIC R  H+A+DCFNRMNY+FQGRH P QLAAMVASQN ++ +  +SS            
Subjt:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTW

Query:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNLLYVPHIATNLMFV
        LTDSGCN  +T+D+N   +A EY G++QV +G  Q+ P+ H+        S+   + +     HIA  L FV
Subjt:  LTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNLLYVPHIATNLMFV

A0A6J1D9L6 uncharacterized protein LOC1110188921.8e-8053.52Show/hide
Query:  TEAPPAPVSS--QINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK
        TE+ P   +S   INP +E+W+AKDQALMTLIN TLS   LAY+V   +SKQ WEVLEKHYSS+SRTN+VNLKS LQSI KK++ES D YVKRIKE+KDK
Subjt:  TEAPPAPVSS--QINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDK

Query:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG
         A VS+ ++ E +LIYALNGL   +NT  TSM TR+QS++F ELHV +K+EE AIEKQ K++D +TQP+A+FAS       SQ   S+ + +Q    GRG
Subjt:  LAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRG

Query:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSS-NGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSST
        +N G G   F P+     NQGR   S +  +S     R  CQIC +  H+A+DC+NRMN+HFQGRH P QLAAMVA QN SY    +S        S +T
Subjt:  RNQGHG-YGFNPSGRGNFNQGRDIFSSHTQSS-NGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSST

Query:  WLTDSGCNALLTADLNNF---CIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSS
        WL DS CN  +TADL+N     IAS+Y G++ +SVG  QS PI H G G +  S+
Subjt:  WLTDSGCNALLTADLNNF---CIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSS

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.5e-2325.44Show/hide
Query:  METATEAPPAPVSS----QINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRI
        ++ +T  PPA + +    ++NP Y  W  +D+ + + +   +S  V   +   T++ Q WE L K Y++ S  ++  L++ L+  T K  ++ DDY++ +
Subjt:  METATEAPPAPVSS----QINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRI

Query:  KELKDKLAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQE
            D+LA++   +D ++ +   L  LP  +      +  +    T  E+H          E+    +  +   S+      T N +S R+ ++ N    
Subjt:  KELKDKLAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQE

Query:  RSSGRGRNQGHGYGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNY--HFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGV
         ++   RN  +    N +    + Q    F  +   S   +   CQIC    HSA  C    ++      +  P+         N++           G 
Subjt:  RSSGRGRNQGHGYGFNPSGRGNFNQGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNY--HFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGV

Query:  SGSSSTWLTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNLLYVPHIATNLMFVHQLCVDNNCIIIFDSHKFVVQ
          SS+ WL DSG    +T+D NN  +   Y G D V V    ++PI H GS SL T S    L N+LYVP+I  NL+ V++LC  N   + F    F V+
Subjt:  SGSSSTWLTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNLLYVPHIATNLMFVHQLCVDNNCIIIFDSHKFVVQ

Query:  D
        D
Subjt:  D

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-1726.62Show/hide
Query:  METATEAPPAPVSS----QINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRI
        ++ +T  PPA + +    ++NP Y  W  +D+ + + I   +S  V   +   T++ Q WE L K Y++ S  ++  L+                ++ R 
Subjt:  METATEAPPAPVSS----QINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRI

Query:  KELKDKLAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQE
            D+LA++   +D ++ +   L  LP  +      +  +    +  E+H      E  I ++SK    L   SA     T      + +N++RN    
Subjt:  KELKDKLAIVSVIVDKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQE

Query:  RSSGRGRNQGHGYGFNPSGRGNFNQGRDIFSSHTQSSNGQVRV---SCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNG
          + RG N+ +    N + R N  Q     SS ++S N Q +     CQIC    HSA  C     + FQ     T       S    +    + +V++ 
Subjt:  RSSGRGRNQGHGYGFNPSGRGNFNQGRDIFSSHTQSSNGQVRV---SCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNG

Query:  VSGSSSTWLTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNLLYVPHIATNLMFVHQLCVDNNCIIIFDSHKFVV
           +++ WL DSG    +T+D NN      Y G D V +    ++PI H GS SL TSS S  L  +LYVP+I  NL+ V++LC  N   + F    F V
Subjt:  VSGSSSTWLTDSGCNALLTADLNNFCIASEYAGDDQVSVGRAQSLPIFHNGSGSLHTSSSSFQLTNLLYVPHIATNLMFVHQLCVDNNCIIIFDSHKFVV

Query:  QD
        +D
Subjt:  QD

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGACTGCAACTGAAGCACCTCCTGCTCCAGTATCTTCTCAGATTAATCCACCCTATGAGGAGTGGGTTGCCAAAGATCAAGCTTTAATGACACTGATCAATGTCAC
TCTGTCGCCGATAGTTTTAGCCTATCTCGTAGGTTGCACATCATCCAAACAAGCCTGGGAGGTCCTTGAAAAGCACTACTCTTCGAGCTCAAGAACCAACATTGTCAATT
TGAAGTCAGGTCTTCAATCAATAACAAAGAAGTCAGATGAGTCAAGCGATGATTATGTTAAACGAATCAAAGAACTCAAGGATAAATTAGCTATTGTTTCTGTTATTGTG
GATAAAGAAGATATTCTTATATATGCTCTAAATGGCTTACCTTTTGTTTTTAACACTTTTCGCACATCTATGATAACTCGGTCACAGTCGATTACATTCAATGAGCTACA
CGTCCTTCTGAAAACAGAGGAGGTTGCTATTGAGAAACAATCAAAGCAAGATGATGCCCTAACTCAGCCTTCTGCCATGTTTGCATCTCAAACAACACCCAATTATCTCT
CTCAACGTTCCAACTCCTCTAGAAATTTTAGTCAAGAAAGGTCATCTGGTCGTGGACGCAATCAAGGTCATGGTTATGGATTCAATCCTTCTGGTCGTGGAAATTTTAAT
CAAGGCAGAGACATATTTTCTTCTCATACACAGTCTTCTAACGGACAAGTTCGAGTTTCATGTCAAATTTGTCAACGCCCTGAACACAGTGCAATTGATTGTTTTAACAG
AATGAACTACCATTTTCAAGGGCGTCATCTACCTACGCAACTAGCAGCCATGGTGGCCTCTCAAAATGTTTCTTACTGCAACACAACATCAAGTAGTGTGGATAATGGAG
TTTCTGGTTCCTCTTCTACTTGGTTAACCGATTCTGGGTGTAATGCTCTTCTTACCGCTGATCTCAATAATTTTTGTATTGCTTCAGAATATGCAGGTGATGATCAAGTA
TCAGTAGGCAGAGCCCAATCCCTCCCAATTTTTCACAATGGCTCAGGTTCTCTCCATACATCTTCCTCTTCTTTCCAACTTACTAATCTTCTCTATGTTCCTCATATTGC
TACCAACCTTATGTTTGTCCATCAACTATGTGTAGACAACAACTGTATTATCATATTTGACTCACACAAGTTTGTTGTTCAGGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGACTGCAACTGAAGCACCTCCTGCTCCAGTATCTTCTCAGATTAATCCACCCTATGAGGAGTGGGTTGCCAAAGATCAAGCTTTAATGACACTGATCAATGTCAC
TCTGTCGCCGATAGTTTTAGCCTATCTCGTAGGTTGCACATCATCCAAACAAGCCTGGGAGGTCCTTGAAAAGCACTACTCTTCGAGCTCAAGAACCAACATTGTCAATT
TGAAGTCAGGTCTTCAATCAATAACAAAGAAGTCAGATGAGTCAAGCGATGATTATGTTAAACGAATCAAAGAACTCAAGGATAAATTAGCTATTGTTTCTGTTATTGTG
GATAAAGAAGATATTCTTATATATGCTCTAAATGGCTTACCTTTTGTTTTTAACACTTTTCGCACATCTATGATAACTCGGTCACAGTCGATTACATTCAATGAGCTACA
CGTCCTTCTGAAAACAGAGGAGGTTGCTATTGAGAAACAATCAAAGCAAGATGATGCCCTAACTCAGCCTTCTGCCATGTTTGCATCTCAAACAACACCCAATTATCTCT
CTCAACGTTCCAACTCCTCTAGAAATTTTAGTCAAGAAAGGTCATCTGGTCGTGGACGCAATCAAGGTCATGGTTATGGATTCAATCCTTCTGGTCGTGGAAATTTTAAT
CAAGGCAGAGACATATTTTCTTCTCATACACAGTCTTCTAACGGACAAGTTCGAGTTTCATGTCAAATTTGTCAACGCCCTGAACACAGTGCAATTGATTGTTTTAACAG
AATGAACTACCATTTTCAAGGGCGTCATCTACCTACGCAACTAGCAGCCATGGTGGCCTCTCAAAATGTTTCTTACTGCAACACAACATCAAGTAGTGTGGATAATGGAG
TTTCTGGTTCCTCTTCTACTTGGTTAACCGATTCTGGGTGTAATGCTCTTCTTACCGCTGATCTCAATAATTTTTGTATTGCTTCAGAATATGCAGGTGATGATCAAGTA
TCAGTAGGCAGAGCCCAATCCCTCCCAATTTTTCACAATGGCTCAGGTTCTCTCCATACATCTTCCTCTTCTTTCCAACTTACTAATCTTCTCTATGTTCCTCATATTGC
TACCAACCTTATGTTTGTCCATCAACTATGTGTAGACAACAACTGTATTATCATATTTGACTCACACAAGTTTGTTGTTCAGGACTAA
Protein sequenceShow/hide protein sequence
METATEAPPAPVSSQINPPYEEWVAKDQALMTLINVTLSPIVLAYLVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSGLQSITKKSDESSDDYVKRIKELKDKLAIVSVIV
DKEDILIYALNGLPFVFNTFRTSMITRSQSITFNELHVLLKTEEVAIEKQSKQDDALTQPSAMFASQTTPNYLSQRSNSSRNFSQERSSGRGRNQGHGYGFNPSGRGNFN
QGRDIFSSHTQSSNGQVRVSCQICQRPEHSAIDCFNRMNYHFQGRHLPTQLAAMVASQNVSYCNTTSSSVDNGVSGSSSTWLTDSGCNALLTADLNNFCIASEYAGDDQV
SVGRAQSLPIFHNGSGSLHTSSSSFQLTNLLYVPHIATNLMFVHQLCVDNNCIIIFDSHKFVVQD