; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008483 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008483
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposable element protein
Genome locationchr9:23154178..23158207
RNA-Seq ExpressionLag0008483
SyntenyLag0008483
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBG99665.1 transposable element gene [Prunus dulcis]2.3e-6850.17Show/hide
Query:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH
        ++L  GFFWPTLFKDA+ F   CD CQR GNL  R++MPLT IL +++FDVW                    VDYVSKWVE IA   +DAK V  FL+ +
Subjt:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH

Query:  IFARFGTPTALVSDEA-------------KYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPL--GAIRMLQL
        IF RFGTP A++SD               KYGI H++AT YHPQ +GQ EISNREIK ILEK V+ +RKDWS RLD+ALWAYR+AYKTP+  G  R LQL
Subjt:  IFARFGTPTALVSDEA-------------KYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPL--GAIRMLQL

Query:  NELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK-------------------------------------DEKDGSVFKVNGQRVKHYWGEEF
        NELEE R  +YENAK+YKEKTK +HDKKI  K F KGQK                                     + KDGS FKVNG R+K Y+   F
Subjt:  NELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK-------------------------------------DEKDGSVFKVNGQRVKHYWGEEF

KAA3473721.1 retroelement pol polyprotein-like [Gossypium australe]1.3e-6646.32Show/hide
Query:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH
        ++L  GFFWPTLFKDA+ + K CD CQR GN+  R+EMP T I+E ELFDVW                    V YVSKWVE  A    DAK V +FLQ H
Subjt:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH

Query:  IFARFGTPTALVSDEA-------------KYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLGAI-------
        +F RFGTP A++SDE              K+G+KH++AT+YHPQ NGQAE++N+EIK ILEKVV  +++DWS RLD+ALWAY+  YKTPLG         
Subjt:  IFARFGTPTALVSDEA-------------KYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLGAI-------

Query:  ------------------------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGSVFKVNGQRVKHYWGEEFRLK
                                      RMLQLNELEEFR FSYENAK+ KE+ K WHDK I+ +EF  G+     G  F+VN QR+KHY+GE     
Subjt:  ------------------------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGSVFKVNGQRVKHYWGEEFRLK

Query:  YPSLRLDLILHGLVDFILQYFIIGSL
              D IL  L D  +  F I  L
Subjt:  YPSLRLDLILHGLVDFILQYFIIGSL

XP_012841295.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105961608 [Erythranthe guttata]1.4e-6833.99Show/hide
Query:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH
        ++L  GFFWPTLFKD + F   CD CQR GN+  R E+PL  ++EVE FDVW                    VDYVSKWVE  A   +D+  V  FL+  
Subjt:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH

Query:  IFARFGTPTALVSDE-------------AKYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLGAI-------
        IF RFGTP A+++D              AKYG+KH++A +YHPQ NGQAE++NREIK ILEKVV P+ KDWS +LD+ALWAYR+AYKTPLG         
Subjt:  IFARFGTPTALVSDE-------------AKYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLGAI-------

Query:  ------------------------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGSVFKVNGQRVKHYWGEEFRLK
                                      RMLQLNEL+E R  +YE+ ++YKEKTK WHD KI  +EF KGQ      S  K+   ++K  W   F + 
Subjt:  ------------------------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGSVFKVNGQRVKHYWGEEFRLK

Query:  YPSLRLDLILHGLVDFILQYFIIGSLDLALNFRSNIRVAFCWSLIYLQPLNLETKPIISTTVQEGDVVKNQETEAEEQVAGRRGVRVVW--NTPLPSTSN
             LD+  HG V+ +         +   +F+ N +      L + Q      +    T +QE  +V       E     RR VR+ +  N     T N
Subjt:  YPSLRLDLILHGLVDFILQYFIIGSLDLALNFRSNIRVAFCWSLIYLQPLNLETKPIISTTVQEGDVVKNQETEAEEQVAGRRGVRVVW--NTPLPSTSN

Query:  FEEEKREAENKAKEEEARKAEEERLHEQRENRGKGIAEASGEIEEPREPFIRFVNELARAKYQKVLKRDFLFERGFGSDMPRF--LESRIASLGWRQFCA
           +  ++  K   +    ++ E       ++GKG  + + +I  P +    F  + A A+++K   +  + ER +   +P+F  L       GW+    
Subjt:  FEEEKREAENKAKEEEARKAEEERLHEQRENRGKGIAEASGEIEEPREPFIRFVNELARAKYQKVLKRDFLFERGFGSDMPRF--LESRIASLGWRQFCA

Query:  KPDPVNANIVREFYAN--LDVKDDFEV-IVRGVPVQWSPEAINNLFDLQDFPHAVFNEMMVA
            +N ++VREFYAN  + + ++  V  VRG  ++W+  AIN+ + L+      + +M +A
Subjt:  KPDPVNANIVREFYAN--LDVKDDFEV-IVRGVPVQWSPEAINNLFDLQDFPHAVFNEMMVA

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]1.7e-6643.37Show/hide
Query:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH
        ++L  GFFWP++F+D++   K CD CQR GN+  R E+PL  ILEVELFDVW                    VDYVSKWVE IA   +DAK V +FL  +
Subjt:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH

Query:  IFARFGTPTALVSDE-------------AKYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLGAI-------
        IF RFGTP A++SDE             +KYG+KH+IA +YHPQ NGQAEISNREIK ILEK V+ +RKDW+ +LD+ALWAYR+A+KTP+G         
Subjt:  IFARFGTPTALVSDE-------------AKYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLGAI-------

Query:  ------------------------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK--------------------------
                                      R+LQLNE++EFR  +YENAK+YKE+TK WHDK+I  +EF  GQ+                          
Subjt:  ------------------------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK--------------------------

Query:  ----------DEKDGSVFKVNGQRVKHYWGEE
                   +K G +F+VNGQR+KHY+GE+
Subjt:  ----------DEKDGSVFKVNGQRVKHYWGEE

XP_042757945.1 uncharacterized protein LOC111885853 [Lactuca sativa]1.1e-6845.51Show/hide
Query:  MRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQS
        +R+LH GF+WP+LFKDA+ F K+CD CQR GN+G R EMPL+ I+EVELFDVW                    VDYVSKWVE +AC ++DA+TV  FL+ 
Subjt:  MRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQS

Query:  HIFARFGTPTALVSDE-------------AKYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLGAI------
         IF+RFGTP A++SDE             AKY IKHR+AT+YHPQ NG AE +N+++K ILEKVV+ SRKDW+ +LD+ LWAYR+AY+T LG        
Subjt:  HIFARFGTPTALVSDE-------------AKYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLGAI------

Query:  -------------------------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK-------------------------
                                       RM QL ELEEFR  +YENAK+ KEK K WHDKKI  +EF +GQ                          
Subjt:  -------------------------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK-------------------------

Query:  ------------DEKDGSVFKVNGQRVKHYWGEE
                    +EKDG  F VNGQRVKHY+GEE
Subjt:  ------------DEKDGSVFKVNGQRVKHYWGEE

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase6.9e-6643.23Show/hide
Query:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH
        +IL  GFFWP LFKDAH F   CD CQR GN+  R EMPL  ILEVELFDVW                    VDYVSKWVE  A   +D+K V  F++ +
Subjt:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH

Query:  IFARFGTPTALVSDE-------------AKYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLG---------
        IF RFGTP A++SD              +KYG+KH+I+T YHPQ +GQ E+SNREIK ILEK V  +RKDWS RLDEALWAYR+AYKTP+G         
Subjt:  IFARFGTPTALVSDE-------------AKYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLG---------

Query:  ---------------AI-------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQ---------------------------
                       AI             R+LQLNEL+EFR  +YENAK+YKEK K WH+KKI  + F  GQ                           
Subjt:  ---------------AI-------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQ---------------------------

Query:  ----------KDEKDGSVFKVNGQRVKHYWGEEFRLKYPSLRLDLIL
                  +++   + FKVN QR+KHYWGE    ++ S+ L+ ++
Subjt:  ----------KDEKDGSVFKVNGQRVKHYWGEEFRLKYPSLRLDLIL

A0A2G9HBV9 DNA-directed DNA polymerase3.4e-6543.31Show/hide
Query:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH
        +IL  GFFWP LFKDA+ F   CD CQR GN+  R EMPL  ILEVELFDVW                    VDYVSKWVE +A   +D+K V  F++ +
Subjt:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH

Query:  IFARFGTPTALVSDE-------------AKYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLG---------
        IF RFGTP A++S+              +KYG+KH+I+T YHPQ +GQ E+SNREIK ILEK V  +RKDWS RLDEALWAYR+A+KTP+G         
Subjt:  IFARFGTPTALVSDE-------------AKYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLG---------

Query:  ---------------AI-------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQ---------------------------
                       AI             R+LQLNEL+EFR  +YENAK+YKEKTK WHDKKI  + F  GQ                           
Subjt:  ---------------AI-------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQ---------------------------

Query:  ----------KDEKDGSVFKVNGQRVKHYWGEEFRLKYPSLRLD
                  ++E   + FKVN QR+KHYWG      + S+ L+
Subjt:  ----------KDEKDGSVFKVNGQRVKHYWGEEFRLKYPSLRLD

A0A2K3NJZ5 Integrase catalytic domain-containing protein (Fragment)2.0e-6542.57Show/hide
Query:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH
        ++L  G FWPTLFKDA  + K+CD CQR GN+  R+EMP   +LEVE+FDVW                    VDYVSKWVE IA   +DA+ V  FL+ +
Subjt:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH

Query:  IFARFGTPTALVSDEA-------------KYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPL----------
        IF+RFG P AL+SDE              KY + HRIAT YHPQ +GQ E+SNR+IK ILEK V+ SRKDWS +LD+ALWAYR+A+KTP+          
Subjt:  IFARFGTPTALVSDEA-------------KYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPL----------

Query:  ---------------------------GAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQ---------------------------
                                   G  R+LQL+EL+EFR F+YENAK++KEKTK WHDKKI+++EF +GQ                           
Subjt:  ---------------------------GAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQ---------------------------

Query:  ----------KDEKDGSVFKVNGQRVKHYWGEEFRLKYPSLRL
                  +D      FKVNGQR+K Y+G+E  L   S+ L
Subjt:  ----------KDEKDGSVFKVNGQRVKHYWGEEFRLKYPSLRL

A0A4Y1R6P1 Reverse transcriptase1.1e-6850.17Show/hide
Query:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH
        ++L  GFFWPTLFKDA+ F   CD CQR GNL  R++MPLT IL +++FDVW                    VDYVSKWVE IA   +DAK V  FL+ +
Subjt:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH

Query:  IFARFGTPTALVSDEA-------------KYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPL--GAIRMLQL
        IF RFGTP A++SD               KYGI H++AT YHPQ +GQ EISNREIK ILEK V+ +RKDWS RLD+ALWAYR+AYKTP+  G  R LQL
Subjt:  IFARFGTPTALVSDEA-------------KYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPL--GAIRMLQL

Query:  NELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK-------------------------------------DEKDGSVFKVNGQRVKHYWGEEF
        NELEE R  +YENAK+YKEKTK +HDKKI  K F KGQK                                     + KDGS FKVNG R+K Y+   F
Subjt:  NELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK-------------------------------------DEKDGSVFKVNGQRVKHYWGEEF

A0A5B6VWJ0 Retroelement pol polyprotein-like6.2e-6746.32Show/hide
Query:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH
        ++L  GFFWPTLFKDA+ + K CD CQR GN+  R+EMP T I+E ELFDVW                    V YVSKWVE  A    DAK V +FLQ H
Subjt:  RILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEVIACHQSDAKTVARFLQSH

Query:  IFARFGTPTALVSDEA-------------KYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLGAI-------
        +F RFGTP A++SDE              K+G+KH++AT+YHPQ NGQAE++N+EIK ILEKVV  +++DWS RLD+ALWAY+  YKTPLG         
Subjt:  IFARFGTPTALVSDEA-------------KYGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLGAI-------

Query:  ------------------------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGSVFKVNGQRVKHYWGEEFRLK
                                      RMLQLNELEEFR FSYENAK+ KE+ K WHDK I+ +EF  G+     G  F+VN QR+KHY+GE     
Subjt:  ------------------------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGSVFKVNGQRVKHYWGEEFRLK

Query:  YPSLRLDLILHGLVDFILQYFIIGSL
              D IL  L D  +  F I  L
Subjt:  YPSLRLDLILHGLVDFILQYFIIGSL

SwissProt top hitse value%identityAlignment
A1Z651 Gag-Pol polyprotein4.8e-0835.65Show/hide
Query:  VWVDYVSKWVEVIACHQSDAKTVARFLQSHIFARFGTPTALVSDEAK-------------YGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPS-RK
        V+VD  S WVE     +  AK V++ L   IF RFG P  L SD                 GI  ++  +Y PQ++GQ E  NR IK  L K+   S  +
Subjt:  VWVDYVSKWVEVIACHQSDAKTVARFLQSHIFARFGTPTALVSDEAK-------------YGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPS-RK

Query:  DWSFRLDEALWAYRS
        DW   L  AL+  R+
Subjt:  DWSFRLDEALWAYRS

P10273 Gag-Pol polyprotein1.1e-0733.04Show/hide
Query:  VWVDYVSKWVEVIACHQSDAKTVARFLQSHIFARFGTPTALVSDEAK-------------YGIKHRIATSYHPQANGQAEISNREIKAILEKV-VHPSRK
        V++D  S W E        AK VA+ L   IF R+G P  L SD                 GI  ++  +Y PQ++GQ E  NR IK  L K+ +    K
Subjt:  VWVDYVSKWVEVIACHQSDAKTVARFLQSHIFARFGTPTALVSDEAK-------------YGIKHRIATSYHPQANGQAEISNREIKAILEKV-VHPSRK

Query:  DWSFRLDEALWAYRS
        DW   L   L+  R+
Subjt:  DWSFRLDEALWAYRS

P92516 Uncharacterized mitochondrial protein AtMg007503.1e-1561.4Show/hide
Query:  ILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWVDYVSK
        +L  GF+WPT FKDAH F   CDACQR+GN   R+EMP  +ILEVE+FDVW  Y  K
Subjt:  ILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWVDYVSK

Q2F7J0 Gag-Pol polyprotein4.8e-0835.65Show/hide
Query:  VWVDYVSKWVEVIACHQSDAKTVARFLQSHIFARFGTPTALVSDEAK-------------YGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPS-RK
        V+VD  S WVE     +  AK V++ L   IF RFG P  L SD                 GI  ++  +Y PQ++GQ E  NR IK  L K+   S  +
Subjt:  VWVDYVSKWVEVIACHQSDAKTVARFLQSHIFARFGTPTALVSDEAK-------------YGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPS-RK

Query:  DWSFRLDEALWAYRS
        DW   L  AL+  R+
Subjt:  DWSFRLDEALWAYRS

Q2F7J3 Gag-Pol polyprotein6.3e-0835.65Show/hide
Query:  VWVDYVSKWVEVIACHQSDAKTVARFLQSHIFARFGTPTALVSDEAK-------------YGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPS-RK
        V+VD  S WVE     +  AK V + L   IF RFG P  L SD                 GI  ++  +Y PQ++GQ E  NR IK  L K+   S  +
Subjt:  VWVDYVSKWVEVIACHQSDAKTVARFLQSHIFARFGTPTALVSDEAK-------------YGIKHRIATSYHPQANGQAEISNREIKAILEKVVHPS-RK

Query:  DWSFRLDEALWAYRS
        DW   L  AL+  R+
Subjt:  DWSFRLDEALWAYRS

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein2.2e-1661.4Show/hide
Query:  ILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWVDYVSK
        +L  GF+WPT FKDAH F   CDACQR+GN   R+EMP  +ILEVE+FDVW  Y  K
Subjt:  ILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWVDYVSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGATTTTGCACTGCGGGTTTTTCTGGCCTACGTTATTTAAGGATGCCCATTGGTTCTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAACTTAGGGCCTAGAGA
TGAAATGCCTCTTACTTACATTTTAGAAGTTGAATTATTCGATGTATGGGTTGATTATGTGTCTAAGTGGGTGGAGGTCATTGCATGCCATCAGAGTGATGCCAAGACAG
TAGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTACGGCTCTAGTGAGTGATGAGGCTAAGTATGGGATTAAGCATAGGATAGCTACCTCTTATCAC
CCACAAGCAAATGGTCAAGCTGAAATTAGTAATAGGGAAATTAAAGCTATTTTAGAGAAAGTAGTCCATCCATCTAGAAAGGATTGGTCTTTTAGGTTGGATGAGGCTCT
TTGGGCTTATAGATCAGCTTATAAGACTCCCCTAGGAGCAATAAGAATGCTGCAGCTTAATGAGTTAGAAGAATTTCGTCAATTTTCTTACGAAAATGCGAAAATGTATA
AGGAAAAGACTAAGCTGTGGCATGACAAGAAAATTAAATCTAAGGAGTTTGTCAAGGGTCAAAAAGATGAAAAAGATGGGAGTGTGTTCAAGGTAAATGGACAGCGTGTG
AAGCATTATTGGGGTGAGGAGTTTCGGTTGAAATATCCTTCCCTAAGGTTAGATTTGATTTTGCATGGTTTAGTAGATTTTATCCTTCAGTATTTTATTATTGGTAGTTT
AGATTTGGCTTTGAATTTTCGCAGTAATATTAGGGTTGCATTTTGCTGGAGCCTTATTTATTTACAACCGCTGAATCTTGAGACAAAACCGATAATTTCTACTACGGTTC
AAGAAGGGGATGTTGTGAAAAATCAGGAAACGGAGGCTGAAGAGCAAGTCGCAGGAAGGCGGGGCGTGAGGGTGGTTTGGAACACTCCATTGCCTTCGACGTCGAACTTT
GAGGAAGAAAAGAGGGAAGCTGAAAATAAGGCAAAAGAAGAGGAGGCAAGGAAGGCAGAAGAAGAGCGTTTGCACGAACAAAGAGAAAATAGGGGCAAAGGAATTGCTGA
AGCATCGGGTGAGATTGAGGAGCCGAGGGAACCGTTCATTCGCTTCGTCAACGAACTTGCCAGAGCAAAATATCAGAAAGTACTGAAGCGTGATTTCTTGTTCGAACGGG
GATTTGGCAGTGATATGCCTAGGTTTTTGGAGTCTAGAATAGCAAGCCTGGGGTGGAGACAGTTTTGTGCCAAGCCTGATCCTGTCAATGCCAACATCGTTCGGGAATTC
TACGCCAATCTTGACGTGAAGGATGATTTTGAAGTTATAGTGCGAGGAGTGCCTGTACAATGGAGCCCAGAGGCCATTAATAATTTGTTTGATCTTCAGGACTTTCCACA
TGCAGTTTTCAATGAGATGATGGTTGCCCATCGAGCGACCAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGATTTTGCACTGCGGGTTTTTCTGGCCTACGTTATTTAAGGATGCCCATTGGTTCTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAACTTAGGGCCTAGAGA
TGAAATGCCTCTTACTTACATTTTAGAAGTTGAATTATTCGATGTATGGGTTGATTATGTGTCTAAGTGGGTGGAGGTCATTGCATGCCATCAGAGTGATGCCAAGACAG
TAGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTACGGCTCTAGTGAGTGATGAGGCTAAGTATGGGATTAAGCATAGGATAGCTACCTCTTATCAC
CCACAAGCAAATGGTCAAGCTGAAATTAGTAATAGGGAAATTAAAGCTATTTTAGAGAAAGTAGTCCATCCATCTAGAAAGGATTGGTCTTTTAGGTTGGATGAGGCTCT
TTGGGCTTATAGATCAGCTTATAAGACTCCCCTAGGAGCAATAAGAATGCTGCAGCTTAATGAGTTAGAAGAATTTCGTCAATTTTCTTACGAAAATGCGAAAATGTATA
AGGAAAAGACTAAGCTGTGGCATGACAAGAAAATTAAATCTAAGGAGTTTGTCAAGGGTCAAAAAGATGAAAAAGATGGGAGTGTGTTCAAGGTAAATGGACAGCGTGTG
AAGCATTATTGGGGTGAGGAGTTTCGGTTGAAATATCCTTCCCTAAGGTTAGATTTGATTTTGCATGGTTTAGTAGATTTTATCCTTCAGTATTTTATTATTGGTAGTTT
AGATTTGGCTTTGAATTTTCGCAGTAATATTAGGGTTGCATTTTGCTGGAGCCTTATTTATTTACAACCGCTGAATCTTGAGACAAAACCGATAATTTCTACTACGGTTC
AAGAAGGGGATGTTGTGAAAAATCAGGAAACGGAGGCTGAAGAGCAAGTCGCAGGAAGGCGGGGCGTGAGGGTGGTTTGGAACACTCCATTGCCTTCGACGTCGAACTTT
GAGGAAGAAAAGAGGGAAGCTGAAAATAAGGCAAAAGAAGAGGAGGCAAGGAAGGCAGAAGAAGAGCGTTTGCACGAACAAAGAGAAAATAGGGGCAAAGGAATTGCTGA
AGCATCGGGTGAGATTGAGGAGCCGAGGGAACCGTTCATTCGCTTCGTCAACGAACTTGCCAGAGCAAAATATCAGAAAGTACTGAAGCGTGATTTCTTGTTCGAACGGG
GATTTGGCAGTGATATGCCTAGGTTTTTGGAGTCTAGAATAGCAAGCCTGGGGTGGAGACAGTTTTGTGCCAAGCCTGATCCTGTCAATGCCAACATCGTTCGGGAATTC
TACGCCAATCTTGACGTGAAGGATGATTTTGAAGTTATAGTGCGAGGAGTGCCTGTACAATGGAGCCCAGAGGCCATTAATAATTTGTTTGATCTTCAGGACTTTCCACA
TGCAGTTTTCAATGAGATGATGGTTGCCCATCGAGCGACCAATTAA
Protein sequenceShow/hide protein sequence
MRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWVDYVSKWVEVIACHQSDAKTVARFLQSHIFARFGTPTALVSDEAKYGIKHRIATSYH
PQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRSAYKTPLGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGSVFKVNGQRV
KHYWGEEFRLKYPSLRLDLILHGLVDFILQYFIIGSLDLALNFRSNIRVAFCWSLIYLQPLNLETKPIISTTVQEGDVVKNQETEAEEQVAGRRGVRVVWNTPLPSTSNF
EEEKREAENKAKEEEARKAEEERLHEQRENRGKGIAEASGEIEEPREPFIRFVNELARAKYQKVLKRDFLFERGFGSDMPRFLESRIASLGWRQFCAKPDPVNANIVREF
YANLDVKDDFEVIVRGVPVQWSPEAINNLFDLQDFPHAVFNEMMVAHRATN