CuGenDBv2

Lag0021940 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021940
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase Ty1/copia-type domain-containing protein
Genome location: chr7:14265667..14271480
RNA-Seq Expression: Lag0021940
Synteny: Lag0021940
Gene Ontology terms: NA
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology
GenBank top hits | e value | % identity | Alignment
CAN78706.1 hypothetical protein VITISV_028658 [Vitis vinifera] | 8.3e-47 | 26.3
Query:  SRIQNPVSTPISSFPYDLATPPYLSDTTSKIED--YPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSM
        +R  NP +T  S     +A  P     T ++ED  +      +   ++G+GL  F+    Q+PPK +  G      +PNP++  + RQD+L+ SW L S+
Subjt:  SRIQNPVSTPISSFPYDLATPPYLSDTTSKIED--YPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSM

Query:  SNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVNLYQQLNN
         +  L +V+ C +A ++W  ++  F+S++ A+V+  K+++   KK  L+++DY  K+KN  D LA AG KIS  DH+L I++GLG EY+S + +      
Subjt:  SNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVNLYQQLNN

Query:  NNHQFKKARGETMEVKGIRITMESETTSEIGITMGNHNANFVVVLDTLQSAVTSDLSG----GIKNQASAS----------TPHGFFPPHQPP-------
             KK+      V    I  E     +  I+  + + N+        S+ + + +G    G +N+                HG  P + P        
Subjt:  NNHQFKKARGETMEVKGIRITMESETTSEIGITMGNHNANFVVVLDTLQSAVTSDLSG----GIKNQASAS----------TPHGFFPPHQPP-------

Query:  ----------------YTTYSLRHDLNKENQ--------------WYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWPNLLEGK
                         T Y  + D +                   +PDSGA+NHVT DL NLN G +Y G++K+H GNG   ++ HIGL  +P+     
Subjt:  ----------------YTTYSLRHDLNKENQ--------------WYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWPNLLEGK

Query:  VAYELYQF----SLEK---ATSQHSTGGCPSSKFHSQIESWDIQTTSTIDKYAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLPLIKRSLYA
            L       +++K   + SQ +       +FH ++ + ++            EP   +EA+    W +AM  E+ ALM+N  WS ++LP  K  +  
Subjt:  VAYELYQF----SLEK---ATSQHSTGGCPSSKFHSQIESWDIQTTSTIDKYAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLPLIKRSLYA

Query:  NGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNLPGRTW
            + +  P          +     R+   +A  +   +R +D+NN FL+G L EEV+M  PPG     + +    L+  +   +Y ++  P R W
Subjt:  NGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNLPGRTW

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 1.7e-44 | 33.33
Query:  VRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSK---IPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTK
        + T+L+ + L +F++ +++ P K++ S +  S+     PNP Y+ W RQD LISSW LGSMS EIL+++L C++AK+IW  L   FSSR LAQ ++ K K
Subjt:  VRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSK---IPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTK

Query:  LETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVNLYQQLNNNNHQFKKARGETMEVKGIRITMESETTSEIGITMGNHNA
        L   KKGS+ LK+YF+KI   VD+LA+  K +S DDH+L+IL GLGS+Y S +++     ++            EV  + +T ES+  S++       + 
Subjt:  LETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVNLYQQLNNNNHQFKKARGETMEVKGIRITMESETTSEIGITMGNHNA

Query:  NFVVVLDTLQSAVTS---------------DLSGGIKNQAS--------------------------------ASTPHGFFP-PHQPPYTTYS-------
        N  +V  T +    S               +  GG  N  S                                 S   G+ P  H   YT  +       
Subjt:  NFVVVLDTLQSAVTS---------------DLSGGIKNQAS--------------------------------ASTPHGFFP-PHQPPYTTYS-------

Query:  --LRHDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNW
             DLN ++ WYPDSGA+NH+T  LSNL++G++Y G N+++  NG+ L + H G +++
Subjt:  --LRHDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNW

RVW13474.1 Retrovirus-related Pol polyprotein from transposon RE2 [Vitis vinifera] | 4.4e-48 | 25.96
Query:  VRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLET
        +   ++G+GL  F+     +PPK +  G      +PNP++  + RQD+L+ SW + S+ +  L +V+ C +  ++W  ++  F+S++ A+V+  K++++ 
Subjt:  VRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLET

Query:  TKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDS-------------------------------------TVNLYQQLNN-----
         KK  L+++DY  K+KN  D LA AG KIS  DH+L I++GLG EY+S                                     +VN   Q +N     
Subjt:  TKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDS-------------------------------------TVNLYQQLNN-----

Query:  -------------NNHQF------------KKARGETMEVKGIRITMESETTSEIGITMG----NHNANFVVVLDTLQSAVTSDLSGGIKNQASASTPHG
                     N +QF             + RG      GI+   + +  ++ G T+      ++ NF   +          L  G KN AS S    
Subjt:  -------------NNHQF------------KKARGETMEVKGIRITMESETTSEIGITMG----NHNANFVVVLDTLQSAVTSDLSGGIKNQASASTPHG

Query:  FFPPHQPPYTTYSLRHDLNKENQ--------------WYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWPNLL---EGKVAYEL
                 T Y  + D +                  W+PDSGA+NHVT DL NLN GT+Y G++K+H GNG  L++ HIGL   P+     +G +   L
Subjt:  FFPPHQPPYTTYSLRHDLNKENQ--------------WYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWPNLL---EGKVAYEL

Query:  YQFSLEK--------------------------------ATSQHSTGGCPSSKFHS----------------QIESWDIQTTS-----------------
        YQF L K                                A S H     P S   S                  ES  I ++S                 
Subjt:  YQFSLEK--------------------------------ATSQHSTGGCPSSKFHS----------------QIESWDIQTTS-----------------

Query:  --------------------------------TIDKYAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLP-----------------------
                                        T+D   E EP  ++E +    W +AM  E+ ALM+N  WS + LP                       
Subjt:  --------------------------------TIDKYAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLP-----------------------

Query:  LIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNL
          K  L A G S+  G   D     SPVVKPTTIRV+  +A++  W +R +D+NN FL+G L EEV+M  PPG     + +    L+  +   +Y ++  
Subjt:  LIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNL

Query:  PGRTW
        P R W
Subjt:  PGRTW

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia] | 1.8e-57 | 42.02
Query:  DYPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKI---PNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLA
        D P +   V T+++GHGL  +ID D + P +FIQ+GD  +S     PNPEY  WI+QD LIS W LGSMS EILS++LDC   K+IWT+L   F+SRNLA
Subjt:  DYPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKI---PNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLA

Query:  QVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVNLYQQLNNNNHQFKKARGETMEVKGIRITMESETTSEIG
        +V++LK+KLE  KKGS++LK+YF+KIKNLVDSLA AGK++  DDH++HIL  LG E+DS V++                    +   +     +  S  G
Subjt:  QVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVNLYQQLNNNNHQFKKARGETMEVKGIRITMESETTSEIG

Query:  ITMGNHNANFVVVLDTLQSAVTSDLSGGIKNQASASTPHGFFPPHQPPYTTYSLRHDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGAS
         + G            +QS+     +G   +   A +  G F    P      + +D N++  WYPDSGA+NHVT+D  N ++G+ Y G+ K+  GNG +
Subjt:  ITMGNHNANFVVVLDTLQSAVTSDLSGGIKNQASASTPHGFFPPHQPPYTTYSLRHDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGAS

Query:  LDVLHIG
        L + HIG
Subjt:  LDVLHIG

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia] | 3.3e-03 | 55.56
Query:  YAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLPLIKR
        Y EIEPPLVK AL+  +WV AM  EY+AL+RN  WS +  P  K+
Subjt:  YAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLPLIKR

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia] | 8.9e-49 | 33.77
Query:  VRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKI---PNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTK
        +RT+L+G+GL  +ID +   P +F+Q+ +D SS      NP Y  WI+QD LIS+W LGSM+ +ILS++LDC++A++IWTVL   F+SR LA+V++LK K
Subjt:  VRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKI---PNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTK

Query:  LETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVN-----------------LYQQLNNNNHQFKKARGETMEVKGIRITM
        LE  KKG+LSLKDYF+KIKNLVDSLA AGKK+S +DH++HIL GLG E+D+ ++                 L QQ   N      + G    V       
Subjt:  LETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVN-----------------LYQQLNNNNHQFKKARGETMEVKGIRITM

Query:  ESETTSEIGITMGNHNANFVVVLDTLQSAVTSDLSGGIKN-----------------------------------QASASTPHGF---FPPHQPPYTTYS
          +           H +N+     + +   T++ S   +N                                     +  +P GF   FP + P +  +S
Subjt:  ESETTSEIGITMGNHNANFVVVLDTLQSAVTSDLSGGIKN-----------------------------------QASASTPHGF---FPPHQPPYTTYS

Query:  -----------------------LRHDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWP
                               +  D N+++ WY DSG +NHVT++  N ++G++Y GD K+  GNG        G  NWP
Subjt:  -----------------------LRHDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWP

TrEMBL top hits | e value | % identity | Alignment
A0A2N9FW32 Uncharacterized protein | 1.3e-45 | 27.61
Query:  LKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKG
        L+G  L  F+D     P  ++     P    PNP ++ W+ QD +I S  + S+S  +L+ V+ C T++D+W  L   F++ + A+ + ++ +L T KKG
Subjt:  LKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKG

Query:  SLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTV-------------NLYQQLNNNNHQFKKARGET-MEVKGIRITMESETTSEIGI
          S+ DYF     LVD+LAA  + +++++ +  +L GLGSEY+S +              LY  L +   +  +A+ +  + + G   T    ++S    
Subjt:  SLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTV-------------NLYQQLNNNNHQFKKARGET-MEVKGIRITMESETTSEIGI

Query:  TMGNHNANFVVVLDTLQSAVTSDLSGGIKNQASASTPHGFFPPHQ-------PPYTTY-----SLRHDLNK-----------ENQWYPDSGASNHVTSDL
          G+ N  F       +S+  +  + G +N+   S+ +G  P  Q          T Y     S   D NK           ++QWY DSGA++H+T+DL
Subjt:  TMGNHNANFVVVLDTLQSAVTSDLSGGIKNQASASTPHGFFPPHQ-------PPYTTY-----SLRHDLNK-----------ENQWYPDSGASNHVTSDL

Query:  SNLNMGTD-YTGDNKVHEGNGASL--------DVLHIGLLNWPNL---LEGKVAYE------------------------LYQFSLEKATSQHSTGGC--
        +NLN+  D Y G + +H GN  S+              ++N P +   LE   A +                        L   +  +A+SQ        
Subjt:  SNLNMGTD-YTGDNKVHEGNGASL--------DVLHIGLLNWPNL---LEGKVAYE------------------------LYQFSLEKATSQHSTGGC--

Query:  --PSSKFH------SQIESWDIQTTSTI-------------DKYAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWS--------------WL-----
          P+S  H      +QI    I T  TI                +  EP     A+KS  W +AM  E++AL+RN  W+              W+     
Subjt:  --PSSKFH------SQIESWDIQTTSTI-------------DKYAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWS--------------WL-----

Query:  ----NLPLIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSF
             +   K  L A G  +  G   D     SPV+KPTT+R + ++A++ GW LR +DI N FLHG L+EEVFMS PPG      P+ V  L       
Subjt:  ----NLPLIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSF

Query:  VYFIRNLPGRTW
        +Y ++  P R W
Subjt:  VYFIRNLPGRTW

A0A2N9HLF0 Reverse transcriptase Ty1/copia-type domain-containing protein | 1.6e-48 | 28.04
Query:  SRIQNPVSTPISSFPYDLATPPYLSDTTSKI----EDYPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLG
        S   +  +T  +S   +  TP   S T  +I    ++Y    T +   + G+ + H ID     PPK I S   PS  IPNP Y TW   D L+ S  + 
Subjt:  SRIQNPVSTPISSFPYDLATPPYLSDTTSKI----EDYPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLG

Query:  SMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTV------
        ++S  ++S ++   ++ ++W  L   FSS++ A+V++ +  L T KK + ++ +YF K K   D LA+ G+ +S +D V ++L GL S+YDS +      
Subjt:  SMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTV------

Query:  -------NLYQQLNNNNHQFKKARGETMEVKGIRITMESETTSEIGITMGNHNANFVVVLDTLQSAVTSDLSGGIKNQASASTPHG--------FFPPHQ
               +LY  L    H+ +  +  T+   G+  T    T             NF    +          +G   N  S+S  +             H 
Subjt:  -------NLYQQLNNNNHQFKKARGETMEVKGIRITMESETTSEIGITMGNHNANFVVVLDTLQSAVTSDLSGGIKNQASASTPHG--------FFPPHQ

Query:  PPYTTYSLRHDLNKENQ------------------WYPDSGASNHVTSDLSNLNMGTD-YTGDNKVHEGNGASLDVL------HIGLLNWPNLLEGKVAY
         P  T   R + N + Q                  WYPD+GA+NH+TSDLSNLN+  + Y G ++VH GNG +   L      H+ L + P  +      
Subjt:  PPYTTYSLRHDLNKENQ------------------WYPDSGASNHVTSDLSNLNMGTD-YTGDNKVHEGNGASLDVL------HIGLLNWPNLLEGKVAY

Query:  ELYQFSLEKATSQHS--TGGCPSSKFHSQI-----------ESWDIQTTS--TIDKYAEIEPPLVK----------------------EALKSSHWVDAM
         +   +   +T   S  +   PS  + + +               +QT S   I K  ++ P ++K                      EA KS+ W  AM
Subjt:  ELYQFSLEKATSQHS--TGGCPSSKFHSQI-----------ESWDIQTTS--TIDKYAEIEPPLVK----------------------EALKSSHWVDAM

Query:  RTEYEALMRNDIWSWLNLPLIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPH
         TE+ AL++N  W+ L  P  K  L A G  + +G  +D     SPV+KP TIR + ++A A  W +R +D+ N FLHG L+E+V+M+ PPG +  S P+
Subjt:  RTEYEALMRNDIWSWLNLPLIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPH

Query:  MVSHLLLWISSFVYFIRNLPGRTW
         V HL       +Y ++  P R W
Subjt:  MVSHLLLWISSFVYFIRNLPGRTW

A0A6J1DSS1 uncharacterized protein LOC111023586 | 8.7e-58 | 42.02
Query:  DYPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKI---PNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLA
        D P +   V T+++GHGL  +ID D + P +FIQ+GD  +S     PNPEY  WI+QD LIS W LGSMS EILS++LDC   K+IWT+L   F+SRNLA
Subjt:  DYPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKI---PNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLA

Query:  QVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVNLYQQLNNNNHQFKKARGETMEVKGIRITMESETTSEIG
        +V++LK+KLE  KKGS++LK+YF+KIKNLVDSLA AGK++  DDH++HIL  LG E+DS V++                    +   +     +  S  G
Subjt:  QVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVNLYQQLNNNNHQFKKARGETMEVKGIRITMESETTSEIG

Query:  ITMGNHNANFVVVLDTLQSAVTSDLSGGIKNQASASTPHGFFPPHQPPYTTYSLRHDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGAS
         + G            +QS+     +G   +   A +  G F    P      + +D N++  WYPDSGA+NHVT+D  N ++G+ Y G+ K+  GNG +
Subjt:  ITMGNHNANFVVVLDTLQSAVTSDLSGGIKNQASASTPHGFFPPHQPPYTTYSLRHDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGAS

Query:  LDVLHIG
        L + HIG
Subjt:  LDVLHIG

A0A6J1DSS1 uncharacterized protein LOC111023586 | 1.6e-03 | 55.56
Query:  YAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLPLIKR
        Y EIEPPLVK AL+  +WV AM  EY+AL+RN  WS +  P  K+
Subjt:  YAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLPLIKR

A0A6J1DSS1 uncharacterized protein LOC111023586 | 4.3e-49 | 33.77
Query:  VRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKI---PNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTK
        +RT+L+G+GL  +ID +   P +F+Q+ +D SS      NP Y  WI+QD LIS+W LGSM+ +ILS++LDC++A++IWTVL   F+SR LA+V++LK K
Subjt:  VRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKI---PNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTK

Query:  LETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVN-----------------LYQQLNNNNHQFKKARGETMEVKGIRITM
        LE  KKG+LSLKDYF+KIKNLVDSLA AGKK+S +DH++HIL GLG E+D+ ++                 L QQ   N      + G    V       
Subjt:  LETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVN-----------------LYQQLNNNNHQFKKARGETMEVKGIRITM

Query:  ESETTSEIGITMGNHNANFVVVLDTLQSAVTSDLSGGIKN-----------------------------------QASASTPHGF---FPPHQPPYTTYS
          +           H +N+     + +   T++ S   +N                                     +  +P GF   FP + P +  +S
Subjt:  ESETTSEIGITMGNHNANFVVVLDTLQSAVTSDLSGGIKN-----------------------------------QASASTPHGF---FPPHQPPYTTYS

Query:  -----------------------LRHDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWP
                               +  D N+++ WY DSG +NHVT++  N ++G++Y GD K+  GNG        G  NWP
Subjt:  -----------------------LRHDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWP

A5B6L3 Uncharacterized protein | 4.0e-47 | 26.3
Query:  SRIQNPVSTPISSFPYDLATPPYLSDTTSKIED--YPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSM
        +R  NP +T  S     +A  P     T ++ED  +      +   ++G+GL  F+    Q+PPK +  G      +PNP++  + RQD+L+ SW L S+
Subjt:  SRIQNPVSTPISSFPYDLATPPYLSDTTSKIED--YPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSM

Query:  SNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVNLYQQLNN
         +  L +V+ C +A ++W  ++  F+S++ A+V+  K+++   KK  L+++DY  K+KN  D LA AG KIS  DH+L I++GLG EY+S + +      
Subjt:  SNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVNLYQQLNN

Query:  NNHQFKKARGETMEVKGIRITMESETTSEIGITMGNHNANFVVVLDTLQSAVTSDLSG----GIKNQASAS----------TPHGFFPPHQPP-------
             KK+      V    I  E     +  I+  + + N+        S+ + + +G    G +N+                HG  P + P        
Subjt:  NNHQFKKARGETMEVKGIRITMESETTSEIGITMGNHNANFVVVLDTLQSAVTSDLSG----GIKNQASAS----------TPHGFFPPHQPP-------

Query:  ----------------YTTYSLRHDLNKENQ--------------WYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWPNLLEGK
                         T Y  + D +                   +PDSGA+NHVT DL NLN G +Y G++K+H GNG   ++ HIGL  +P+     
Subjt:  ----------------YTTYSLRHDLNKENQ--------------WYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWPNLLEGK

Query:  VAYELYQF----SLEK---ATSQHSTGGCPSSKFHSQIESWDIQTTSTIDKYAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLPLIKRSLYA
            L       +++K   + SQ +       +FH ++ + ++            EP   +EA+    W +AM  E+ ALM+N  WS ++LP  K  +  
Subjt:  VAYELYQF----SLEK---ATSQHSTGGCPSSKFHSQIESWDIQTTSTIDKYAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLPLIKRSLYA

Query:  NGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNLPGRTW
            + +  P          +     R+   +A  +   +R +D+NN FL+G L EEV+M  PPG     + +    L+  +   +Y ++  P R W
Subjt:  NGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNLPGRTW

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 1.5e-06 | 25.84
Query:  IPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGS-LSLKDYFMKIKNLVDSLAAAGKKISQDD
        +PN   ++W + +    S  +  +S+  L+      TA+ I   L+A +  ++LA  L L+ +L + K  S +SL  +F     L+  L AAG KI + D
Subjt:  IPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGS-LSLKDYFMKIKNLVDSLAAAGKKISQDD

Query:  HVLHILKGLGSEYDSTVNLYQQLNNNNHQFKKARGETMEVKGIRITMESETTSEIGITMGNHNANFVVVLDTLQSAVT
         + H+L  L S YD  +   + L+  N      +   ++ + I+I  +   TS+  +    HN N     +  ++ VT
Subjt:  HVLHILKGLGSEYDSTVNLYQQLNNNNHQFKKARGETMEVKGIRITMESETTSEIGITMGNHNANFVVVLDTLQSAVT

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.9e-09 | 27.27
Query:  QIESWDIQTTSTIDKYAEIEPPLVKEAL---KSSHWVDAMRTEYEALMRNDIWSWLNLPLIKR-----------------------SLYANGSSRSRGTP
        ++ES    +T  +    + EP  +KE L   + +  + AM+ E E+L +N  +  + LP  KR                        L   G  + +G  
Subjt:  QIESWDIQTTSTIDKYAEIEPPLVKEAL---KSSHWVDAMRTEYEALMRNDIWSWLNLPLIKR-----------------------SLYANGSSRSRGTP

Query:  MDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNLPGRTW
         D +   SPVVK T+IR + ++A +    +  +D+   FLHG L EE++M  P G     K HMV  L    +  +Y ++  P R W
Subjt:  MDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNLPGRTW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.1e-20 | 25
Query:  VRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLET
        V     G+ L  F+D    +PP  I  G D + ++ NP+Y  W RQD LI S  LG++S  +   V    TA  IW  L   +++ +   V +L+T+L+ 
Subjt:  VRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLET

Query:  TKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVN-------------LYQQLNNNNHQFKKARGETMEVKGIRITMESETTSE
          KG+ ++ DY   +    D LA  GK +  D+ V  +L+ L  EY   ++             ++++L N+  +       T+            TT+ 
Subjt:  TKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVN-------------LYQQLNNNNHQFKKARGETMEVKGIRITMESETTSE

Query:  IGITMGNHNANFVVVLDTLQSAVTSDLSGGIKNQASASTPH------------------------GFFPPHQP--PYTTYSLRHDL-----NKENQWYPD
             GN N  +    +   S      S       + S P+                              QP  P+T +  R +L        N W  D
Subjt:  IGITMGNHNANFVVVLDTLQSAVTSDLSGGIKNQASASTPH------------------------GFFPPHQP--PYTTYSLRHDL-----NKENQWYPD

Query:  SGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIG
        SGA++H+TSD +NL++   YTG + V   +G+++ + H G
Subjt:  SGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 6.0e-16 | 31.95
Query:  AEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWS---------------WL---------NLPLIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRV
        AE EP    +ALK   W +AM +E  A + N  W                W+         +L   K  L A G ++  G  +D     SPV+K T+IR+
Subjt:  AEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWS---------------WL---------NLPLIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRV

Query:  LFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNLPGRTW
        +  +A+   WP+R +D+NN FL G L ++V+MS PPG +D+ +P+ V  L       +Y ++  P R W
Subjt:  LFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNLPGRTW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 3.9e-15 | 24.92
Query:  TSDLSGGIKNQASASTPHGFFPPHQPPYTTYSLRHD--LNKENQWYPDSGASNHVT----SDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWPNLL
        +S +S    ++ +A + +G  P  QP  T  S  +   LN  N   P   + N  +    S +S+ ++ T  T  ++ +  + +S           P L 
Subjt:  TSDLSGGIKNQASASTPHGFFPPHQPPYTTYSLRHD--LNKENQWYPDSGASNHVT----SDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWPNLL

Query:  EGKVAYELYQFSLEKATSQHSTGGCPSSKFHSQIESWDIQTTSTIDKYAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWS---------------WL
            A  + Q + +   + HS             + +   T+      A  EP    +A+K   W  AM +E  A + N  W                W+
Subjt:  EGKVAYELYQFSLEKATSQHSTGGCPSSKFHSQIESWDIQTTSTIDKYAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWS---------------WL

Query:  ---------NLPLIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLL
                 +L   K  L A G ++  G  +D     SPV+K T+IR++  +A+   WP+R +D+NN FL G L +EV+MS PPG +D+ +P  V  L  
Subjt:  ---------NLPLIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLL

Query:  WISSFVYFIRNLPGRTW
             +Y ++  P R W
Subjt:  WISSFVYFIRNLPGRTW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 6.3e-13 | 23.53
Query:  VRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLET
        V     G+ L  F+D    +PP  I  G D   ++ NP+Y  W RQD LI S  LG++S  +   V    TA  IW  L   +++ +   V +L      
Subjt:  VRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLET

Query:  TKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDS---------------------------------------TVNLYQQLNNNNH
                     +     D LA  GK +  D+ V  +L+ L  +Y                                         T N+    N N +
Subjt:  TKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDS---------------------------------------TVNLYQQLNNNNH

Query:  QFKKARGETMEVKGIRITMESETTSEIGITMGN-----------------HNANFVVVLDTLQSAVTSDLSGGIKNQASASTPHGFFPPHQPPYTTYSLR
        + +  RG+            S   S  G    N                 H+A     L   QS           NQ  +++P   F P Q P    ++ 
Subjt:  QFKKARGETMEVKGIRITMESETTSEIGITMGN-----------------HNANFVVVLDTLQSAVTSDLSGGIKNQASASTPHGFFPPHQPPYTTYSLR

Query:  HDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWP
           N  N W  DSGA++H+TSD +NL+    YTG + V   +G+++ + H G  + P
Subjt:  HDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWP

Arabidopsis top hits | e value | % identity | Alignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 4.9e-05 | 26.58
Query:  NPVSTPISSF--PYDLATPPYLS-DTTSKIED-YPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSMSN
        +P S P S +  P D+  P   S    SK ED Y       R+ L+      FID     P  F            +P Y+ W + + ++  W + SM++
Subjt:  NPVSTPISSF--PYDLATPPYLS-DTTSKIED-YPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSMSN

Query:  EILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGSLSLKDYFMKIKNL
        ++L  V+  ETA  +W  L   F      ++ +L+ +L T ++G  S+++YF K+  +
Subjt:  EILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGSLSLKDYFMKIKNL

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 5.6e-09 | 25.23
Query:  WIRQDNLISSWFLGSMS-NEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKG
        W ++D ++     G+++  +     +   T++DIW  +  +F +   A+ L+L ++L T   G + + DY+ K+K L DSL      ++  + V+++L G
Subjt:  WIRQDNLISSWFLGSMS-NEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKG

Query:  LGSEYDSTVNL
        L  ++D+ +N+
Subjt:  LGSEYDSTVNL

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 5.6e-09 | 24.24
Query:  EPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLP-----------------------LIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTM
        EP    EA +   W  AM  E  A+     W    LP                         K  L A G ++  G  +D +   SPV K T+++++  +
Subjt:  EPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLP-----------------------LIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTM

Query:  ALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNLPGRTW
        +  + + L  +DI+N FL+G L+EE++M  PPG   +    +  + + ++   +Y ++    R W
Subjt:  ALAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNLPGRTW

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 1.2e-11 | 31.5
Query:  GDDPSSKIPNPEYE-TWIRQDNLISSWFLGSMSNEILSKVLDCE-TAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAA
        G    S  P P  E  W  +D L+  W  G++++ +L  ++    TA+D+W  L   F     A+ L+ + +L TT    LS+ +Y  K+K+L D L   
Subjt:  GDDPSSKIPNPEYE-TWIRQDNLISSWFLGSMSNEILSKVLDCE-TAKDIWTVLNARFSSRNLAQVLKLKTKLETTKKGSLSLKDYFMKIKNLVDSLAAA

Query:  GKKISQDDHVLHILKGLGSEYDSTVNL
           IS    V+H+L GL  +YD  +N+
Subjt:  GKKISQDDHVLHILKGLGSEYDSTVNL


Sequences
CDS sequence
ATGACCTCTCACCCTCCTCACTCTCTCACGCCGTTCTTGACCATTCAACCCGCCGCCGCCACTATCGACTTTTCACTCTGTCGTCTCCGCCCTCGACTTTTCACCCCGCC
ACCGCCGCCTTCGAGTCTCGACACACCTAGAACAATCCAGATTCGGTACCTAATGGCCCCTTCATTTCCAAAGATCCCATGGAGTCCCCTCAATCTACATTGGTCTTGTA
ACGCCCCAGGGTCCAGGATTCAGAATCCGGTTTCGACACCGATCTCGTCATTCCCCTACGACCTAGCCACGCCACCATACTTGTCTGATACTACTTCTAAGATTGAAGAT
TATCCTCACAAACCAACATGTGTTCGTACTAGTTTGAAAGGACATGGCCTTGGGCACTTTATCGATGATGATGCCCAAATACCTCCCAAATTCATTCAATCCGGAGATGA
TCCCTCCTCCAAGATTCCCAATCCAGAGTACGAAACATGGATTCGGCAAGATAATCTCATCTCGTCCTGGTTCTTGGGATCAATGTCGAATGAAATTTTGTCTAAAGTTC
TTGATTGCGAAACTGCCAAAGACATTTGGACAGTATTGAATGCCCGTTTCTCCTCACGGAATCTTGCTCAGGTCTTAAAGCTCAAGACCAAACTGGAAACCACCAAGAAA
GGTAGTCTGAGTCTTAAAGACTACTTCATGAAGATTAAAAATCTTGTGGATTCGCTTGCAGCTGCAGGAAAGAAGATATCACAAGACGATCATGTACTCCATATTCTCAA
GGGCTTGGGGTCAGAATATGACTCTACAGTCAATCTGTACCAACAGCTAAACAACAACAACCATCAGTTCAAGAAGGCCAGAGGCGAAACAATGGAGGTCAAAGGAATTC
GAATAACAATGGAGTCAGAAACAACCAGCGAAATTGGAATAACAATGGGAAACCACAATGCCAACTTTGTGGTCGTTTTGGACACACTACAGTCCGCTGTTACTTCAGAT
TTGAGCGGTGGTATCAAGAATCAGGCTTCTGCTTCAACTCCTCATGGCTTCTTTCCTCCACATCAGCCTCCTTATACAACATATTCTCTTCGACATGACCTAAACAAAGA
GAACCAGTGGTATCCAGACTCAGGGGCATCCAACCATGTTACTAGTGATCTTTCTAATCTCAATATGGGGACTGATTACACAGGCGACAATAAAGTTCATGAAGGCAATG
GTGCAAGTTTGGATGTTCTTCATATTGGACTCCTTAACTGGCCAAACCTTCTCGAAGGCAAGGTAGCTTATGAACTCTATCAATTTTCACTGGAAAAGGCTACATCCCAA
CACTCTACAGGTGGTTGTCCTTCTTCAAAATTCCATTCACAAATCGAAAGCTGGGATATTCAAACCACGAGTACTATTGACAAATATGCTGAGATTGAACCTCCTTTAGT
GAAAGAGGCTCTCAAGAGTTCCCATTGGGTTGATGCTATGAGAACTGAGTATGAAGCTTTGATGAGGAATGATATATGGTCTTGGTTGAACCTCCCACTGATAAAAAGGT
CATTGTATGCAAATGGGTCTTCAAGATCAAGAGGAACTCCGATGGATCTGTTGCTCGTTACAAGCCCAGTTGTTAAACCTACTACCATTCGGGTCCTCTTTACTATGGCA
CTTGCTTTTGGATGGCCTCTTCGCCATGTTGACATAAATAATGTTTTCCTTCACGGTCTTCTGAATGAGGAAGTATTTATGTCACACCCACCTGGAATGCTTGATCAGTC
GAAACCTCATATGGTGTCTCATCTTCTGCTATGGATCAGCTCATTTGTTTACTTCATCAGAAATTTGCCTGGAAGGACCTGGCCGGCTACTTGA
mRNA sequence
ATGACCTCTCACCCTCCTCACTCTCTCACGCCGTTCTTGACCATTCAACCCGCCGCCGCCACTATCGACTTTTCACTCTGTCGTCTCCGCCCTCGACTTTTCACCCCGCC
ACCGCCGCCTTCGAGTCTCGACACACCTAGAACAATCCAGATTCGGTACCTAATGGCCCCTTCATTTCCAAAGATCCCATGGAGTCCCCTCAATCTACATTGGTCTTGTA
ACGCCCCAGGGTCCAGGATTCAGAATCCGGTTTCGACACCGATCTCGTCATTCCCCTACGACCTAGCCACGCCACCATACTTGTCTGATACTACTTCTAAGATTGAAGAT
TATCCTCACAAACCAACATGTGTTCGTACTAGTTTGAAAGGACATGGCCTTGGGCACTTTATCGATGATGATGCCCAAATACCTCCCAAATTCATTCAATCCGGAGATGA
TCCCTCCTCCAAGATTCCCAATCCAGAGTACGAAACATGGATTCGGCAAGATAATCTCATCTCGTCCTGGTTCTTGGGATCAATGTCGAATGAAATTTTGTCTAAAGTTC
TTGATTGCGAAACTGCCAAAGACATTTGGACAGTATTGAATGCCCGTTTCTCCTCACGGAATCTTGCTCAGGTCTTAAAGCTCAAGACCAAACTGGAAACCACCAAGAAA
GGTAGTCTGAGTCTTAAAGACTACTTCATGAAGATTAAAAATCTTGTGGATTCGCTTGCAGCTGCAGGAAAGAAGATATCACAAGACGATCATGTACTCCATATTCTCAA
GGGCTTGGGGTCAGAATATGACTCTACAGTCAATCTGTACCAACAGCTAAACAACAACAACCATCAGTTCAAGAAGGCCAGAGGCGAAACAATGGAGGTCAAAGGAATTC
GAATAACAATGGAGTCAGAAACAACCAGCGAAATTGGAATAACAATGGGAAACCACAATGCCAACTTTGTGGTCGTTTTGGACACACTACAGTCCGCTGTTACTTCAGAT
TTGAGCGGTGGTATCAAGAATCAGGCTTCTGCTTCAACTCCTCATGGCTTCTTTCCTCCACATCAGCCTCCTTATACAACATATTCTCTTCGACATGACCTAAACAAAGA
GAACCAGTGGTATCCAGACTCAGGGGCATCCAACCATGTTACTAGTGATCTTTCTAATCTCAATATGGGGACTGATTACACAGGCGACAATAAAGTTCATGAAGGCAATG
GTGCAAGTTTGGATGTTCTTCATATTGGACTCCTTAACTGGCCAAACCTTCTCGAAGGCAAGGTAGCTTATGAACTCTATCAATTTTCACTGGAAAAGGCTACATCCCAA
CACTCTACAGGTGGTTGTCCTTCTTCAAAATTCCATTCACAAATCGAAAGCTGGGATATTCAAACCACGAGTACTATTGACAAATATGCTGAGATTGAACCTCCTTTAGT
GAAAGAGGCTCTCAAGAGTTCCCATTGGGTTGATGCTATGAGAACTGAGTATGAAGCTTTGATGAGGAATGATATATGGTCTTGGTTGAACCTCCCACTGATAAAAAGGT
CATTGTATGCAAATGGGTCTTCAAGATCAAGAGGAACTCCGATGGATCTGTTGCTCGTTACAAGCCCAGTTGTTAAACCTACTACCATTCGGGTCCTCTTTACTATGGCA
CTTGCTTTTGGATGGCCTCTTCGCCATGTTGACATAAATAATGTTTTCCTTCACGGTCTTCTGAATGAGGAAGTATTTATGTCACACCCACCTGGAATGCTTGATCAGTC
GAAACCTCATATGGTGTCTCATCTTCTGCTATGGATCAGCTCATTTGTTTACTTCATCAGAAATTTGCCTGGAAGGACCTGGCCGGCTACTTGA
Protein sequence
MTSHPPHSLTPFLTIQPAAATIDFSLCRLRPRLFTPPPPPSSLDTPRTIQIRYLMAPSFPKIPWSPLNLHWSCNAPGSRIQNPVSTPISSFPYDLATPPYLSDTTSKIED
YPHKPTCVRTSLKGHGLGHFIDDDAQIPPKFIQSGDDPSSKIPNPEYETWIRQDNLISSWFLGSMSNEILSKVLDCETAKDIWTVLNARFSSRNLAQVLKLKTKLETTKK
GSLSLKDYFMKIKNLVDSLAAAGKKISQDDHVLHILKGLGSEYDSTVNLYQQLNNNNHQFKKARGETMEVKGIRITMESETTSEIGITMGNHNANFVVVLDTLQSAVTSD
LSGGIKNQASASTPHGFFPPHQPPYTTYSLRHDLNKENQWYPDSGASNHVTSDLSNLNMGTDYTGDNKVHEGNGASLDVLHIGLLNWPNLLEGKVAYELYQFSLEKATSQ
HSTGGCPSSKFHSQIESWDIQTTSTIDKYAEIEPPLVKEALKSSHWVDAMRTEYEALMRNDIWSWLNLPLIKRSLYANGSSRSRGTPMDLLLVTSPVVKPTTIRVLFTMA
LAFGWPLRHVDINNVFLHGLLNEEVFMSHPPGMLDQSKPHMVSHLLLWISSFVYFIRNLPGRTWPAT