CuGenDBv2

Spg024750 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg024750
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: scaffold12:14926206..14928889
RNA-Seq Expression: Spg024750
Synteny: Spg024750
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR025558 - Domain of unknown function DUF4283
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
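The genome location above uses a `seqid:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention), the string can be split and the genomic span computed as follows. The helper names `parse_location` and `span_length` are illustrative only and are not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold12:14926206..14928889")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 2684 bp on scaffold12.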


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0039309.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 2.4e-60; % identity: 24.6)
Query:  RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGSRISI
        RS  I+RK F +  D+Y++ +   +TE G + + S+ ++ + L+W+ ST  +L + P + +FF E R  E+ + + K  N  G   EI ++ +   +  I
Subjt:  RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGSRISI

Query:  LIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQPKVS------------------HPLNTSPPDLVWTDI
        L+P    K  W SF S+I+  PK      ++P                    RSY + + + +  +S                   P ++  P L+   +
Subjt:  LIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQPKVS------------------HPLNTSPPDLVWTDI

Query:  IVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSED
        ++V+RF+  DDW  I  ++        + N F   K L+H      A  LC++  W+ +GK+ ++F      ++       SYGGW   R +    W+  
Subjt:  IVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSED

Query:  VFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEK
         F+ IG +CGG +  +  T    NL+EA+LK+R N +GF+P+ + +     G    VQ+                              SE K L     
Subjt:  VFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEK

Query:  ENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNISLP----VGPTNL-----KIGQKGSTSGLSPKLVIV
               R+++   +  +P  EQ  F  D +    P+L ++   S  ++ +  E+PS  LK+ I  P      PT L           +T+  S   ++ 
Subjt:  ENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNISLP----VGPTNL-----KIGQKGSTSGLSPKLVIV

Query:  GSDTEAYLSS-------PSPINS-------PHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAG-QSAINNKVNAPAPDILTNHPQPETPPINQP
        G   +  L         PS + S         K++ + P       T FNPD    P    P +      + ++  K +   P +  N  Q +   I QP
Subjt:  GSDTEAYLSS-------PSPINS-------PHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAG-QSAINNKVNAPAPDILTNHPQPETPPINQP

Query:  MFALPEYLRHIAPILSEHGLCI---MAIPPFLPPKR-------------------KTVTTTGKKTKLQRELDNLKTTVHYDKTAS--------LALTEGV
        +  +   L       S+ GL +   +   P L P +                   + V  T +      E  N    V+Y K               +  
Subjt:  MFALPEYLRHIAPILSEHGLCI---MAIPPFLPPKR-------------------KTVTTTGKKTKLQRELDNLKTTVHYDKTAS--------LALTEGV

Query:  QETKTSNIDNHLIKSLWSSSHIGWSSLDSVGA--------------SGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDF
        ++T +    N L+  L  +        DS GA              +GGILI+W     ++    +G FS+S + F +   S+WLT +YGP +R  R++ 
Subjt:  QETKTSNIDNHLIKSLWSSSHIGWSSLDSVGA--------------SGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDF

Query:  WKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRI
        W++LH+L  L    WI+GGD NV R   E +     + S  + N  I+   L+D PL N  +TWS+  + P  S +DRFL +      F       L R 
Subjt:  WKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRI

Query:  TSDHYPLSL--TFGDINWGPGPFRFENSWLQIASFREVLDNWW
        TSDH+PL    +   + WGP PFR  +  L    F+  ++ WW
Subjt:  TSDHYPLSL--TFGDINWGPGPFRFENSWLQIASFREVLDNWW

KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa] (e value: 1.0e-71; % identity: 27.65)
Query:  NHPTRSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGS
        N   R  S+++K F ++ D+ SR S   ITE G   S S+++T  SL WL  TF  L   P T +FF E R  ++ L V+ ++N+ GY  EI ++ + G 
Subjt:  NHPTRSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGS

Query:  RISILIPSESNKQGWFSFFSLIS--------DYPKEFNHQTSKPQSR---SYKEILQQKQPK--------VSHPLNTSPPDLVWT-DIIVVQRFYQRDDW
        +  IL+P   +K GW  F  +++        + P    +   K + +   SY      + P+         S   ++S  D   T     ++R    DDW
Subjt:  RISILIPSESNKQGWFSFFSLIS--------DYPKEFNHQTSKPQSR---SYKEILQQKQPK--------VSHPLNTSPPDLVWT-DIIVVQRFYQRDDW

Query:  PSIRSSILSSISNRCS---INPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSC
          I   +      + S     PF  +KALL + D++ A  LCK+  W+ +G   +KF   +  A+       SYGGW   R +    W+ + F  IG++ 
Subjt:  PSIRSSILSSISNRCS---INPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSC

Query:  GGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEKENTQVIARE
        GG++  +  +   + L EA +KV++N TGF+P+ I +      EG    I+ +T          +W + R  S      K+ +++ N             
Subjt:  GGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEKENTQVIARE

Query:  SSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEE---PSVELKNNISLPVGPTNLKIGQKGSTSGLSPKLVIVGSDTE----------
                +P  EQ  F  +   +  P+L +S+K  KG + +  +    P+  +K   +  V   NL        S  S K+      +E          
Subjt:  SSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEE---PSVELKNNISLPVGPTNLKIGQKGSTSGLSPKLVIVGSDTE----------

Query:  ---AYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPL------AIMPPESTNAGQSAINNKV-----NAPAPDILTNHPQPETPPINQPMFALPE
            Y    SPIN+  K++   P    F  +  +     L L      A M  +S    QS I  KV      +   +      Q +   I+   F L  
Subjt:  ---AYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPL------AIMPPESTNAGQSAINNKV-----NAPAPDILTNHPQPETPPINQPMFALPE

Query:  YLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYD--KTASLALTEGVQETKTS---NIDNHLIKSLWSSSHIGWSSLDSVG
         L HI+ +LS+        P ++P    + T+  +   ++  L ++ T  H D  K     + E  ++ + S    + + L ++    +    S  +SV 
Subjt:  YLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYD--KTASLALTEGVQETKTS---NIDNHLIKSLWSSSHIGWSSLDSVG

Query:  ASGGILIMWSEPDFTI----KETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMR
            I I+   P+  +    ++ I G FSVSI V   +G S+WL+ IYGP++R+ R  FW+EL +L  +    WILGGDFNV RW  E S     + SM+
Subjt:  ASGGILIMWSEPDFTI----KETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMR

Query:  IFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFRFENSWLQIASFREVLDNWW
         FN  I+   L+D PL N  FTWS+      LS +DRFL S    N F       L R TSDH+P+ L    I+WGP PFRF N++L+   +++ ++ WW
Subjt:  IFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFRFENSWLQIASFREVLDNWW

Query:  NHNS
         + S
Subjt:  NHNS

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 5.5e-73; % identity: 26.04)
Query:  PSASLPWNHPT--------RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNK
        PS +  +N P         RS  ++RK F +  D+YS+ +   +TE G + + S+ ++ + L+W+  T  +L   P T +FF E R  E  + + K  N 
Subjt:  PSASLPWNHPT--------RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNK

Query:  HGYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQP-------------KVSH
         G   EI ++     +  IL+P   +K GW SF S+I+  PK      ++P                    RSY + + + +P               SH
Subjt:  HGYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQP-------------KVSH

Query:  PLNTS-----PPDLVWTDIIVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTV
          + S       DL+   +++V+RF+  DDW  I  ++        + N F   KAL+H      A  LC++  WS +GK+ ++F   +   +       
Subjt:  PLNTS-----PPDLVWTDIIVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTV

Query:  SYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKG
        SYGGW   R +    W+   F+ IG +C G +  +  T    NL+EAR+KVR N +GF+P+++ +                  +  G + F +     +G
Subjt:  SYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKG

Query:  SYEVEEDKSESKDLNLEEKENTQVIARESSPIHERESPIMEQPPF-NEDSISVDF-PNLTDSTKSSKGKEPLIMEEPSVELKNNISLP-------VGPTN
         + +E      +++ L          R+++   +  +P  EQ  F   ++IS DF    +D  KSS   +P  ++   ++   N +LP       V  +N
Subjt:  SYEVEEDKSESKDLNLEEKENTQVIARESSPIHERESPIMEQPPF-NEDSISVDF-PNLTDSTKSSKGKEPLIMEEPSVELKNNISLP-------VGPTN

Query:  LKIGQKGS----TSGLSPKLVI--VGSDTEAYLSSPSPIN---SPHKINLDPPPTHDFDLTIFNPDQ---HHLP--------------------------
        L      S     SG+S   V+       +  L   S +N   S  K++ + P        IFNPD    +H P                          
Subjt:  LKIGQKGS----TSGLSPKLVI--VGSDTEAYLSSPSPIN---SPHKINLDPPPTHDFDLTIFNPDQ---HHLP--------------------------

Query:  -----------------------------------LAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILSEHGLCIM
                                           L  +P    N      +N  NA   DI      PETP +  P+               +H     
Subjt:  -----------------------------------LAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILSEHGLCIM

Query:  AIPPFLPPKRKTVTTTGKKTKLQREL--DNLKTTVHYDKTASLALTEGVQETKTSNI---DNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIK
                K K   +   K +L   L  + LK +   D + +   T  +     S +   +  +IKSLW S+ I W + ++ G+SGGILI+W   + ++ 
Subjt:  AIPPFLPPKRKTVTTTGKKTKLQREL--DNLKTTVHYDKTASLALTEGVQETKTSNI---DNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIK

Query:  ETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCF
           +GLFS+S +  + +  S+WLT +YGP +R  RI FW ELH+L  L    WILGGD NV R   E +     + + R+ N  I+   L+D PL N  F
Subjt:  ETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCF

Query:  TWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGD--INWGPGPFRFENSWLQIASFREVLDNWWNHNSLQ
        TWS+  + P  S IDRFL +    N F       L R TSDH+PL     +  ++WGP PFR  +  L    F+  +  WW  NS+Q
Subjt:  TWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGD--INWGPGPFRFENSWLQIASFREVLDNWWNHNSLQ

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 2.4e-60; % identity: 24.6)
Query:  RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGSRISI
        RS  I+RK F +  D+Y++ +   +TE G + + S+ ++ + L+W+ ST  +L + P + +FF E R  E+ + + K  N  G   EI ++ +   +  I
Subjt:  RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGSRISI

Query:  LIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQPKVS------------------HPLNTSPPDLVWTDI
        L+P    K  W SF S+I+  PK      ++P                    RSY + + + +  +S                   P ++  P L+   +
Subjt:  LIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQPKVS------------------HPLNTSPPDLVWTDI

Query:  IVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSED
        ++V+RF+  DDW  I  ++        + N F   K L+H      A  LC++  W+ +GK+ ++F      ++       SYGGW   R +    W+  
Subjt:  IVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSED

Query:  VFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEK
         F+ IG +CGG +  +  T    NL+EA+LK+R N +GF+P+ + +     G    VQ+                              SE K L     
Subjt:  VFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEK

Query:  ENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNISLP----VGPTNL-----KIGQKGSTSGLSPKLVIV
               R+++   +  +P  EQ  F  D +    P+L ++   S  ++ +  E+PS  LK+ I  P      PT L           +T+  S   ++ 
Subjt:  ENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNISLP----VGPTNL-----KIGQKGSTSGLSPKLVIV

Query:  GSDTEAYLSS-------PSPINS-------PHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAG-QSAINNKVNAPAPDILTNHPQPETPPINQP
        G   +  L         PS + S         K++ + P       T FNPD    P    P +      + ++  K +   P +  N  Q +   I QP
Subjt:  GSDTEAYLSS-------PSPINS-------PHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAG-QSAINNKVNAPAPDILTNHPQPETPPINQP

Query:  MFALPEYLRHIAPILSEHGLCI---MAIPPFLPPKR-------------------KTVTTTGKKTKLQRELDNLKTTVHYDKTAS--------LALTEGV
        +  +   L       S+ GL +   +   P L P +                   + V  T +      E  N    V+Y K               +  
Subjt:  MFALPEYLRHIAPILSEHGLCI---MAIPPFLPPKR-------------------KTVTTTGKKTKLQRELDNLKTTVHYDKTAS--------LALTEGV

Query:  QETKTSNIDNHLIKSLWSSSHIGWSSLDSVGA--------------SGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDF
        ++T +    N L+  L  +        DS GA              +GGILI+W     ++    +G FS+S + F +   S+WLT +YGP +R  R++ 
Subjt:  QETKTSNIDNHLIKSLWSSSHIGWSSLDSVGA--------------SGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDF

Query:  WKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRI
        W++LH+L  L    WI+GGD NV R   E +     + S  + N  I+   L+D PL N  +TWS+  + P  S +DRFL +      F       L R 
Subjt:  WKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRI

Query:  TSDHYPLSL--TFGDINWGPGPFRFENSWLQIASFREVLDNWW
        TSDH+PL    +   + WGP PFR  +  L    F+  ++ WW
Subjt:  TSDHYPLSL--TFGDINWGPGPFRFENSWLQIASFREVLDNWW

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia] (e value: 1.2e-64; % identity: 50.64)
Query:  VQETKTSNIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGD
        +QETK S +D  ++KSLWS+  I WS+LD+ G + GILI+W++PD    E I+G+FS++I+  ++DGF FW++ IYGPS  EF   FW+EL DL+ L  +
Subjt:  VQETKTSNIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGD

Query:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGD
         WIL GDFNVTRWSWEKS+GR +T+SM +FN  I    L+D+PL NG  TWS    N   SLID FL++  C++K G     R+ R TSDH+P+ L FG 
Subjt:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGD

Query:  INWGPGPFRFENSWLQIASFREVLDNWWNHNSL
         NWG  PFRFEN WL   +F+  L+ WW +  L
Subjt:  INWGPGPFRFENSWLQIASFREVLDNWWNHNSL

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7TDG1 LINE-1 retrotransposable element ORF2 protein (e value: 1.2e-60; % identity: 24.6)
Query:  RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGSRISI
        RS  I+RK F +  D+Y++ +   +TE G + + S+ ++ + L+W+ ST  +L + P + +FF E R  E+ + + K  N  G   EI ++ +   +  I
Subjt:  RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGSRISI

Query:  LIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQPKVS------------------HPLNTSPPDLVWTDI
        L+P    K  W SF S+I+  PK      ++P                    RSY + + + +  +S                   P ++  P L+   +
Subjt:  LIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQPKVS------------------HPLNTSPPDLVWTDI

Query:  IVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSED
        ++V+RF+  DDW  I  ++        + N F   K L+H      A  LC++  W+ +GK+ ++F      ++       SYGGW   R +    W+  
Subjt:  IVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSED

Query:  VFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEK
         F+ IG +CGG +  +  T    NL+EA+LK+R N +GF+P+ + +     G    VQ+                              SE K L     
Subjt:  VFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEK

Query:  ENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNISLP----VGPTNL-----KIGQKGSTSGLSPKLVIV
               R+++   +  +P  EQ  F  D +    P+L ++   S  ++ +  E+PS  LK+ I  P      PT L           +T+  S   ++ 
Subjt:  ENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNISLP----VGPTNL-----KIGQKGSTSGLSPKLVIV

Query:  GSDTEAYLSS-------PSPINS-------PHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAG-QSAINNKVNAPAPDILTNHPQPETPPINQP
        G   +  L         PS + S         K++ + P       T FNPD    P    P +      + ++  K +   P +  N  Q +   I QP
Subjt:  GSDTEAYLSS-------PSPINS-------PHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAG-QSAINNKVNAPAPDILTNHPQPETPPINQP

Query:  MFALPEYLRHIAPILSEHGLCI---MAIPPFLPPKR-------------------KTVTTTGKKTKLQRELDNLKTTVHYDKTAS--------LALTEGV
        +  +   L       S+ GL +   +   P L P +                   + V  T +      E  N    V+Y K               +  
Subjt:  MFALPEYLRHIAPILSEHGLCI---MAIPPFLPPKR-------------------KTVTTTGKKTKLQRELDNLKTTVHYDKTAS--------LALTEGV

Query:  QETKTSNIDNHLIKSLWSSSHIGWSSLDSVGA--------------SGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDF
        ++T +    N L+  L  +        DS GA              +GGILI+W     ++    +G FS+S + F +   S+WLT +YGP +R  R++ 
Subjt:  QETKTSNIDNHLIKSLWSSSHIGWSSLDSVGA--------------SGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDF

Query:  WKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRI
        W++LH+L  L    WI+GGD NV R   E +     + S  + N  I+   L+D PL N  +TWS+  + P  S +DRFL +      F       L R 
Subjt:  WKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRI

Query:  TSDHYPLSL--TFGDINWGPGPFRFENSWLQIASFREVLDNWW
        TSDH+PL    +   + WGP PFR  +  L    F+  ++ WW
Subjt:  TSDHYPLSL--TFGDINWGPGPFRFENSWLQIASFREVLDNWW

A0A5A7TTA1 DUF4283 domain-containing protein (e value: 5.0e-72; % identity: 27.65)
Query:  NHPTRSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGS
        N   R  S+++K F ++ D+ SR S   ITE G   S S+++T  SL WL  TF  L   P T +FF E R  ++ L V+ ++N+ GY  EI ++ + G 
Subjt:  NHPTRSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGS

Query:  RISILIPSESNKQGWFSFFSLIS--------DYPKEFNHQTSKPQSR---SYKEILQQKQPK--------VSHPLNTSPPDLVWT-DIIVVQRFYQRDDW
        +  IL+P   +K GW  F  +++        + P    +   K + +   SY      + P+         S   ++S  D   T     ++R    DDW
Subjt:  RISILIPSESNKQGWFSFFSLIS--------DYPKEFNHQTSKPQSR---SYKEILQQKQPK--------VSHPLNTSPPDLVWT-DIIVVQRFYQRDDW

Query:  PSIRSSILSSISNRCS---INPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSC
          I   +      + S     PF  +KALL + D++ A  LCK+  W+ +G   +KF   +  A+       SYGGW   R +    W+ + F  IG++ 
Subjt:  PSIRSSILSSISNRCS---INPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSC

Query:  GGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEKENTQVIARE
        GG++  +  +   + L EA +KV++N TGF+P+ I +      EG    I+ +T          +W + R  S      K+ +++ N             
Subjt:  GGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEKENTQVIARE

Query:  SSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEE---PSVELKNNISLPVGPTNLKIGQKGSTSGLSPKLVIVGSDTE----------
                +P  EQ  F  +   +  P+L +S+K  KG + +  +    P+  +K   +  V   NL        S  S K+      +E          
Subjt:  SSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEE---PSVELKNNISLPVGPTNLKIGQKGSTSGLSPKLVIVGSDTE----------

Query:  ---AYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPL------AIMPPESTNAGQSAINNKV-----NAPAPDILTNHPQPETPPINQPMFALPE
            Y    SPIN+  K++   P    F  +  +     L L      A M  +S    QS I  KV      +   +      Q +   I+   F L  
Subjt:  ---AYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPL------AIMPPESTNAGQSAINNKV-----NAPAPDILTNHPQPETPPINQPMFALPE

Query:  YLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYD--KTASLALTEGVQETKTS---NIDNHLIKSLWSSSHIGWSSLDSVG
         L HI+ +LS+        P ++P    + T+  +   ++  L ++ T  H D  K     + E  ++ + S    + + L ++    +    S  +SV 
Subjt:  YLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYD--KTASLALTEGVQETKTS---NIDNHLIKSLWSSSHIGWSSLDSVG

Query:  ASGGILIMWSEPDFTI----KETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMR
            I I+   P+  +    ++ I G FSVSI V   +G S+WL+ IYGP++R+ R  FW+EL +L  +    WILGGDFNV RW  E S     + SM+
Subjt:  ASGGILIMWSEPDFTI----KETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMR

Query:  IFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFRFENSWLQIASFREVLDNWW
         FN  I+   L+D PL N  FTWS+      LS +DRFL S    N F       L R TSDH+P+ L    I+WGP PFRF N++L+   +++ ++ WW
Subjt:  IFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFRFENSWLQIASFREVLDNWW

Query:  NHNS
         + S
Subjt:  NHNS

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein (e value: 1.2e-60; % identity: 24.6)
Query:  RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGSRISI
        RS  I+RK F +  D+Y++ +   +TE G + + S+ ++ + L+W+ ST  +L + P + +FF E R  E+ + + K  N  G   EI ++ +   +  I
Subjt:  RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGSRISI

Query:  LIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQPKVS------------------HPLNTSPPDLVWTDI
        L+P    K  W SF S+I+  PK      ++P                    RSY + + + +  +S                   P ++  P L+   +
Subjt:  LIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQPKVS------------------HPLNTSPPDLVWTDI

Query:  IVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSED
        ++V+RF+  DDW  I  ++        + N F   K L+H      A  LC++  W+ +GK+ ++F      ++       SYGGW   R +    W+  
Subjt:  IVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSED

Query:  VFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEK
         F+ IG +CGG +  +  T    NL+EA+LK+R N +GF+P+ + +     G    VQ+                              SE K L     
Subjt:  VFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEK

Query:  ENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNISLP----VGPTNL-----KIGQKGSTSGLSPKLVIV
               R+++   +  +P  EQ  F  D +    P+L ++   S  ++ +  E+PS  LK+ I  P      PT L           +T+  S   ++ 
Subjt:  ENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNISLP----VGPTNL-----KIGQKGSTSGLSPKLVIV

Query:  GSDTEAYLSS-------PSPINS-------PHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAG-QSAINNKVNAPAPDILTNHPQPETPPINQP
        G   +  L         PS + S         K++ + P       T FNPD    P    P +      + ++  K +   P +  N  Q +   I QP
Subjt:  GSDTEAYLSS-------PSPINS-------PHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAG-QSAINNKVNAPAPDILTNHPQPETPPINQP

Query:  MFALPEYLRHIAPILSEHGLCI---MAIPPFLPPKR-------------------KTVTTTGKKTKLQRELDNLKTTVHYDKTAS--------LALTEGV
        +  +   L       S+ GL +   +   P L P +                   + V  T +      E  N    V+Y K               +  
Subjt:  MFALPEYLRHIAPILSEHGLCI---MAIPPFLPPKR-------------------KTVTTTGKKTKLQRELDNLKTTVHYDKTAS--------LALTEGV

Query:  QETKTSNIDNHLIKSLWSSSHIGWSSLDSVGA--------------SGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDF
        ++T +    N L+  L  +        DS GA              +GGILI+W     ++    +G FS+S + F +   S+WLT +YGP +R  R++ 
Subjt:  QETKTSNIDNHLIKSLWSSSHIGWSSLDSVGA--------------SGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDF

Query:  WKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRI
        W++LH+L  L    WI+GGD NV R   E +     + S  + N  I+   L+D PL N  +TWS+  + P  S +DRFL +      F       L R 
Subjt:  WKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRI

Query:  TSDHYPLSL--TFGDINWGPGPFRFENSWLQIASFREVLDNWW
        TSDH+PL    +   + WGP PFR  +  L    F+  ++ WW
Subjt:  TSDHYPLSL--TFGDINWGPGPFRFENSWLQIASFREVLDNWW

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein (e value: 2.7e-73; % identity: 26.04)
Query:  PSASLPWNHPT--------RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNK
        PS +  +N P         RS  ++RK F +  D+YS+ +   +TE G + + S+ ++ + L+W+  T  +L   P T +FF E R  E  + + K  N 
Subjt:  PSASLPWNHPT--------RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNK

Query:  HGYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQP-------------KVSH
         G   EI ++     +  IL+P   +K GW SF S+I+  PK      ++P                    RSY + + + +P               SH
Subjt:  HGYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQP-------------KVSH

Query:  PLNTS-----PPDLVWTDIIVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTV
          + S       DL+   +++V+RF+  DDW  I  ++        + N F   KAL+H      A  LC++  WS +GK+ ++F   +   +       
Subjt:  PLNTS-----PPDLVWTDIIVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTV

Query:  SYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKG
        SYGGW   R +    W+   F+ IG +C G +  +  T    NL+EAR+KVR N +GF+P+++ +                  +  G + F +     +G
Subjt:  SYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEGITVQIRGLTGETIGREQFNEWHQLRKG

Query:  SYEVEEDKSESKDLNLEEKENTQVIARESSPIHERESPIMEQPPF-NEDSISVDF-PNLTDSTKSSKGKEPLIMEEPSVELKNNISLP-------VGPTN
         + +E      +++ L          R+++   +  +P  EQ  F   ++IS DF    +D  KSS   +P  ++   ++   N +LP       V  +N
Subjt:  SYEVEEDKSESKDLNLEEKENTQVIARESSPIHERESPIMEQPPF-NEDSISVDF-PNLTDSTKSSKGKEPLIMEEPSVELKNNISLP-------VGPTN

Query:  LKIGQKGS----TSGLSPKLVI--VGSDTEAYLSSPSPIN---SPHKINLDPPPTHDFDLTIFNPDQ---HHLP--------------------------
        L      S     SG+S   V+       +  L   S +N   S  K++ + P        IFNPD    +H P                          
Subjt:  LKIGQKGS----TSGLSPKLVI--VGSDTEAYLSSPSPIN---SPHKINLDPPPTHDFDLTIFNPDQ---HHLP--------------------------

Query:  -----------------------------------LAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILSEHGLCIM
                                           L  +P    N      +N  NA   DI      PETP +  P+               +H     
Subjt:  -----------------------------------LAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILSEHGLCIM

Query:  AIPPFLPPKRKTVTTTGKKTKLQREL--DNLKTTVHYDKTASLALTEGVQETKTSNI---DNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIK
                K K   +   K +L   L  + LK +   D + +   T  +     S +   +  +IKSLW S+ I W + ++ G+SGGILI+W   + ++ 
Subjt:  AIPPFLPPKRKTVTTTGKKTKLQREL--DNLKTTVHYDKTASLALTEGVQETKTSNI---DNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIK

Query:  ETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCF
           +GLFS+S +  + +  S+WLT +YGP +R  RI FW ELH+L  L    WILGGD NV R   E +     + + R+ N  I+   L+D PL N  F
Subjt:  ETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCF

Query:  TWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGD--INWGPGPFRFENSWLQIASFREVLDNWWNHNSLQ
        TWS+  + P  S IDRFL +    N F       L R TSDH+PL     +  ++WGP PFR  +  L    F+  +  WW  NS+Q
Subjt:  TWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGD--INWGPGPFRFENSWLQIASFREVLDNWWNHNSLQ

A0A6J1E2G6 uncharacterized protein LOC111025405 (e value: 6.0e-65; % identity: 50.64)
Query:  VQETKTSNIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGD
        +QETK S +D  ++KSLWS+  I WS+LD+ G + GILI+W++PD    E I+G+FS++I+  ++DGF FW++ IYGPS  EF   FW+EL DL+ L  +
Subjt:  VQETKTSNIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGD

Query:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGD
         WIL GDFNVTRWSWEKS+GR +T+SM +FN  I    L+D+PL NG  TWS    N   SLID FL++  C++K G     R+ R TSDH+P+ L FG 
Subjt:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGD

Query:  INWGPGPFRFENSWLQIASFREVLDNWWNHNSL
         NWG  PFRFEN WL   +F+  L+ WW +  L
Subjt:  INWGPGPFRFENSWLQIASFREVLDNWWNHNSL

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACTTCTCCTTCAGCCTCACTCCCATGGAACCATCCTACCCGATCCATTTCCATAGATCGAAAAACCTTCTCTATAGCTTTTGATGAATACTCACGTGGAAGTAAAGC
AAAAATCACTGAAAAAGGTAGAAATTTCTCCAAGTCTCTGTCTCTCACTTGGAAATCCCTCAACTGGTTAGCCTCCACTTTCAACACCCTTGCCAAAGAACCTTGTACGT
ACAAATTCTTCTCAGAATTTCGTGGGGATGAATATATTCTTTGTGTTGAAAAACTCAACAATAAACATGGATACTTTGTGGAGATCAACCAACTGCAGAACTCTGGGAGC
AGAATCAGCATTCTCATCCCTTCCGAAAGCAACAAACAAGGTTGGTTTTCTTTTTTCTCCTTAATTTCAGATTATCCAAAGGAGTTCAACCACCAGACATCAAAGCCGCA
ATCACGATCGTATAAGGAGATCCTCCAACAGAAGCAACCAAAGGTTTCCCATCCACTGAATACTTCTCCTCCAGATTTGGTTTGGACAGATATAATTGTTGTGCAGAGGT
TCTATCAGCGTGATGATTGGCCATCCATTCGTTCATCAATTCTCTCCTCCATATCCAATCGATGCTCTATTAATCCCTTCCAAGACAACAAAGCTTTGCTCCATGTCTAT
GATCGCAAGACAGCCCTTGAGTTATGCAAATCGACTGAATGGTCTCAAATTGGAAAACATCGGCTGAAATTCTACCCTTTGACATCGAAAGCATATAAGCAAGATAATTT
CACTGTTTCTTATGGTGGTTGGATAGAAGTTCGAAACCTCTCCCCTGTCTATTGGTCTGAAGATGTATTTCGATTCATTGGTGATAGTTGTGGAGGCTACCTAACAACCT
CAAGTCACACCGATAGGATGATCAATCTCATGGAAGCTCGTTTGAAGGTCCGACAGAATTCCACAGGCTTCATTCCATCATCGATTGCCCTCCCTATTGCCCTAGTCGGC
GAAGGAATTACAGTCCAAATTCGGGGACTTACCGGAGAGACAATCGGACGGGAACAATTTAATGAGTGGCACCAATTACGGAAGGGAAGTTATGAAGTAGAGGAGGATAA
ATCGGAATCAAAAGATTTGAATTTAGAGGAAAAAGAGAATACACAAGTGATTGCCCGAGAATCTTCACCGATTCATGAGAGAGAATCACCGATTATGGAGCAGCCACCAT
TTAATGAAGATTCAATATCGGTTGATTTTCCTAATTTAACAGATTCCACAAAATCCTCCAAAGGAAAAGAGCCATTAATAATGGAAGAGCCATCGGTGGAATTAAAAAAT
AACATATCTCTTCCTGTGGGCCCTACGAATTTGAAAATTGGTCAAAAGGGCTCAACATCTGGTCTTAGCCCAAAATTGGTTATTGTAGGCTCTGATACTGAAGCTTATTT
ATCCAGCCCATCTCCAATCAATTCACCTCATAAGATTAATTTGGACCCACCCCCCACACACGATTTTGATCTCACCATTTTTAACCCTGACCAACATCATCTTCCTTTAG
CCATTATGCCGCCAGAATCAACAAATGCTGGCCAATCAGCCATAAATAACAAAGTGAATGCTCCAGCTCCAGATATTTTAACCAACCATCCCCAACCAGAAACACCTCCC
ATCAATCAGCCAATGTTTGCCCTCCCAGAATATCTCCGTCATATAGCTCCAATTCTTAGTGAGCATGGGTTGTGTATCATGGCTATCCCTCCATTTCTACCACCTAAAAG
GAAGACAGTTACTACTACCGGGAAGAAAACAAAACTCCAGAGAGAGCTTGATAACCTAAAAACTACAGTGCATTATGATAAAACTGCTTCTTTGGCCTTAACGGAGGGAG
TACAGGAAACAAAAACGTCTAATATAGACAATCATCTGATTAAATCCTTATGGAGTTCATCTCATATTGGTTGGTCTTCTCTCGATTCCGTTGGAGCATCAGGGGGCATC
CTTATTATGTGGAGTGAACCAGACTTCACTATCAAAGAGACAATTCAAGGTCTTTTCTCTGTCTCTATTCATGTTTTTATGGCTGATGGTTTTTCTTTTTGGCTTACAAA
TATTTATGGCCCCTCTCGACGAGAATTTCGTATTGACTTTTGGAAAGAATTACATGATCTGGCGGGTCTGGGAGGTGATCGTTGGATCCTTGGAGGAGACTTTAATGTTA
CCCGATGGTCATGGGAGAAATCTCATGGTCGTCACATCACTCGGAGTATGCGTATTTTCAACCAATTGATTGCAACTTACAAGCTTCTGGATATCCCATTACAAAATGGT
TGTTTCACCTGGTCCAGTTTTGGTGACAATCCGTATCTTTCCTTAATAGACAGATTTTTGATTTCCAAAGATTGTCTGAATAAATTCGGGTCTTCTCATCTTCTTCGGCT
TGACAGAATTACTTCAGATCACTACCCTCTTTCCCTTACTTTTGGTGATATTAATTGGGGTCCTGGGCCTTTTCGATTTGAAAATTCCTGGCTGCAAATTGCATCATTCC
GTGAGGTGTTGGATAATTGGTGGAATCACAATTCTCTTCAA
mRNA sequence
ATGACTTCTCCTTCAGCCTCACTCCCATGGAACCATCCTACCCGATCCATTTCCATAGATCGAAAAACCTTCTCTATAGCTTTTGATGAATACTCACGTGGAAGTAAAGC
AAAAATCACTGAAAAAGGTAGAAATTTCTCCAAGTCTCTGTCTCTCACTTGGAAATCCCTCAACTGGTTAGCCTCCACTTTCAACACCCTTGCCAAAGAACCTTGTACGT
ACAAATTCTTCTCAGAATTTCGTGGGGATGAATATATTCTTTGTGTTGAAAAACTCAACAATAAACATGGATACTTTGTGGAGATCAACCAACTGCAGAACTCTGGGAGC
AGAATCAGCATTCTCATCCCTTCCGAAAGCAACAAACAAGGTTGGTTTTCTTTTTTCTCCTTAATTTCAGATTATCCAAAGGAGTTCAACCACCAGACATCAAAGCCGCA
ATCACGATCGTATAAGGAGATCCTCCAACAGAAGCAACCAAAGGTTTCCCATCCACTGAATACTTCTCCTCCAGATTTGGTTTGGACAGATATAATTGTTGTGCAGAGGT
TCTATCAGCGTGATGATTGGCCATCCATTCGTTCATCAATTCTCTCCTCCATATCCAATCGATGCTCTATTAATCCCTTCCAAGACAACAAAGCTTTGCTCCATGTCTAT
GATCGCAAGACAGCCCTTGAGTTATGCAAATCGACTGAATGGTCTCAAATTGGAAAACATCGGCTGAAATTCTACCCTTTGACATCGAAAGCATATAAGCAAGATAATTT
CACTGTTTCTTATGGTGGTTGGATAGAAGTTCGAAACCTCTCCCCTGTCTATTGGTCTGAAGATGTATTTCGATTCATTGGTGATAGTTGTGGAGGCTACCTAACAACCT
CAAGTCACACCGATAGGATGATCAATCTCATGGAAGCTCGTTTGAAGGTCCGACAGAATTCCACAGGCTTCATTCCATCATCGATTGCCCTCCCTATTGCCCTAGTCGGC
GAAGGAATTACAGTCCAAATTCGGGGACTTACCGGAGAGACAATCGGACGGGAACAATTTAATGAGTGGCACCAATTACGGAAGGGAAGTTATGAAGTAGAGGAGGATAA
ATCGGAATCAAAAGATTTGAATTTAGAGGAAAAAGAGAATACACAAGTGATTGCCCGAGAATCTTCACCGATTCATGAGAGAGAATCACCGATTATGGAGCAGCCACCAT
TTAATGAAGATTCAATATCGGTTGATTTTCCTAATTTAACAGATTCCACAAAATCCTCCAAAGGAAAAGAGCCATTAATAATGGAAGAGCCATCGGTGGAATTAAAAAAT
AACATATCTCTTCCTGTGGGCCCTACGAATTTGAAAATTGGTCAAAAGGGCTCAACATCTGGTCTTAGCCCAAAATTGGTTATTGTAGGCTCTGATACTGAAGCTTATTT
ATCCAGCCCATCTCCAATCAATTCACCTCATAAGATTAATTTGGACCCACCCCCCACACACGATTTTGATCTCACCATTTTTAACCCTGACCAACATCATCTTCCTTTAG
CCATTATGCCGCCAGAATCAACAAATGCTGGCCAATCAGCCATAAATAACAAAGTGAATGCTCCAGCTCCAGATATTTTAACCAACCATCCCCAACCAGAAACACCTCCC
ATCAATCAGCCAATGTTTGCCCTCCCAGAATATCTCCGTCATATAGCTCCAATTCTTAGTGAGCATGGGTTGTGTATCATGGCTATCCCTCCATTTCTACCACCTAAAAG
GAAGACAGTTACTACTACCGGGAAGAAAACAAAACTCCAGAGAGAGCTTGATAACCTAAAAACTACAGTGCATTATGATAAAACTGCTTCTTTGGCCTTAACGGAGGGAG
TACAGGAAACAAAAACGTCTAATATAGACAATCATCTGATTAAATCCTTATGGAGTTCATCTCATATTGGTTGGTCTTCTCTCGATTCCGTTGGAGCATCAGGGGGCATC
CTTATTATGTGGAGTGAACCAGACTTCACTATCAAAGAGACAATTCAAGGTCTTTTCTCTGTCTCTATTCATGTTTTTATGGCTGATGGTTTTTCTTTTTGGCTTACAAA
TATTTATGGCCCCTCTCGACGAGAATTTCGTATTGACTTTTGGAAAGAATTACATGATCTGGCGGGTCTGGGAGGTGATCGTTGGATCCTTGGAGGAGACTTTAATGTTA
CCCGATGGTCATGGGAGAAATCTCATGGTCGTCACATCACTCGGAGTATGCGTATTTTCAACCAATTGATTGCAACTTACAAGCTTCTGGATATCCCATTACAAAATGGT
TGTTTCACCTGGTCCAGTTTTGGTGACAATCCGTATCTTTCCTTAATAGACAGATTTTTGATTTCCAAAGATTGTCTGAATAAATTCGGGTCTTCTCATCTTCTTCGGCT
TGACAGAATTACTTCAGATCACTACCCTCTTTCCCTTACTTTTGGTGATATTAATTGGGGTCCTGGGCCTTTTCGATTTGAAAATTCCTGGCTGCAAATTGCATCATTCC
GTGAGGTGTTGGATAATTGGTGGAATCACAATTCTCTTCAA
Protein sequence
MTSPSASLPWNHPTRSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGS
RISILIPSESNKQGWFSFFSLISDYPKEFNHQTSKPQSRSYKEILQQKQPKVSHPLNTSPPDLVWTDIIVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVY
DRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVG
EGITVQIRGLTGETIGREQFNEWHQLRKGSYEVEEDKSESKDLNLEEKENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEEPSVELKN
NISLPVGPTNLKIGQKGSTSGLSPKLVIVGSDTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPP
INQPMFALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYDKTASLALTEGVQETKTSNIDNHLIKSLWSSSHIGWSSLDSVGASGGI
LIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNG
CFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFRFENSWLQIASFREVLDNWWNHNSLQ
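The CDS and protein sequences listed above can be cross-checked by translating the nucleotide sequence. The sketch below assumes the standard genetic code (NCBI translation table 1) and, for brevity, translates only the first 30 nucleotides of the CDS. The names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
# Build the standard genetic code (NCBI translation table 1) in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon.

    Trailing bases that do not form a full codon are ignored;
    codons with unknown bases are rendered as 'X'.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 nucleotides of the CDS sequence shown in this record.
cds_prefix = "ATGACTTCTCCTTCAGCCTCACTCCCATGG"
print(translate(cds_prefix))  # compare with the start of the protein sequence
```

Passing the full CDS, with line breaks removed, to `translate` allows the same comparison against the complete protein sequence.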