; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moctig00246g070 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoctig00246g070
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationtig00000246_pilon:51314..54191
RNA-Seq ExpressionMoctig00246g070
SyntenyMoctig00246g070
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFS39075.1 hypothetical protein Acr_00g0061040 [Actinidia rufa]3.3e-8841.98Show/hide
Query:  SNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIF
        S+  + S PK   T    S+      QITS KL+G+N+LQWSRS  L+I G  R GY++G+I +P   DPSF +WD QNSMVMAWL+NSM++ I E ++ 
Subjt:  SNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIF

Query:  YSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGESDNSK------------------------DAERFRKHVEKERIYDFLAGLRPELNDVRGRLL
        Y T K +W+A+T+A+SD ++S+Q+F+L +++R+LRQ E   ++                        + E +RK + KER Y+FL GL P L+DVRGR+L
Subjt:  YSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGESDNSK------------------------DAERFRKHVEKERIYDFLAGLRPELNDVRGRLL

Query:  ATKPIPAIDEIFAEVRWESSRKRVMMGD-THKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTN
        + KP+P++D IF+EVR E  R+R+M+G        S++ ++SA+AAR    P SR  R+  LWCDHC R +HTK+ CW+LHG+P       D+       
Subjt:  ATKPIPAIDEIFAEVRWESSRKRVMMGD-THKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTN

Query:  TPSSRTSSSGYQVGPSVSNSQDLAISLP-PFSKALLEQLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQ-WILDSGATDHMTAFHDMFTMYSPNP
         P S+  S G +  P V+   D A S    F++A L+Q+ +L +S      S+S +A  GI S+ +     S + WI+DSGA++HM++   +F+ YS   
Subjt:  TPSSRTSSSGYQVGPSVSNSQDLAISLP-PFSKALLEQLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQ-WILDSGATDHMTAFHDMFTMYSPNP

Query:  IQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF
            V LADGS + + G G V LSPN+ LH+VL VPKL  NL+S+ KLT D  C A  + + C+FQD  +G TIG A    GLYYF
Subjt:  IQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF

RVW98618.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]8.3e-9239.85Show/hide
Query:  MVKIAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMV
        M K A MT   K  SS  + T ++    + +++   S S  +S Q+T  KLNGKN+L+W++S  L I GR +LG++NG +++P   DP+   W ++NS+V
Subjt:  MVKIAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMV

Query:  MAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQG------------------------ESDNSKDAERFRKHVEKERIY
        +AWLINSME  I +  +F  T K++W A+   +SD +NS+Q+F+L +K    RQG                        E D   D+ R +K  E +R+Y
Subjt:  MAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQG------------------------ESDNSKDAERFRKHVEKERIY

Query:  DFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGR
         FLAGL   L++VRGR+L  KP+P+I E+F+EVR E +R++VM+ D   K  +  +ESSAL ++G     S   RR   WCDHCK+  HTK+ CW++HG+
Subjt:  DFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGR

Query:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQ--WILDSGAT
        PQ      +++    ++  + +T S+  Q GP +++ +      P F+K  L  LY+L  SP  S PS S   Q     AAL+S +++    WI+DSGAT
Subjt:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQ--WILDSGAT

Query:  DHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGL
        DHMT    +F+ Y P      +K+ADGS + I G G V +SP++TLH+VLHVP L CNL+S+ K+T D +CQA F  S C FQ+L +G TIGNA    GL
Subjt:  DHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGL

Query:  YYFRGPSLRNK--QVLQGETEPITS--SLDGN
        Y+F   S   K  Q + G+   + +  SLDG+
Subjt:  YYFRGPSLRNK--QVLQGETEPITS--SLDGN

XP_022159153.1 uncharacterized protein LOC111025577 [Momordica charantia]4.7e-20475.72Show/hide
Query:  MVKIAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMV
        MVK AMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVI GRSRLGYINGTIAEPDEADPSFS+WDAQNSMV
Subjt:  MVKIAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMV

Query:  MAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGESD------------------------NSKDAERFRKHVEKERIY
        MAWLINSMEEDIKE FIFYST K+LWNALTMAFSDFDNSAQLFELHNKARSLRQGESD                        NSKDAERFRKHVEKERIY
Subjt:  MAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGESD------------------------NSKDAERFRKHVEKERIY

Query:  DFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGR
        DFLAGLRPEL+DVRGRLLATKPIPAIDEIFAEV WESSRKRVMMGDTH KPLSLSLESSALAARGPPP                                
Subjt:  DFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGR

Query:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQWILDSGATDH
                                                  S  P   A LEQLYRLLT PVESTPSSSFVAQRGI SAALT QQ+SDQWILDSGATDH
Subjt:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQWILDSGATDH

Query:  MTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYY
        MTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFG VILSPNITLHSVL VPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIG+ADGFEGLYY
Subjt:  MTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYY

Query:  FRGPSLRNKQVLQGETEPITSSL
        FRGPSLRNKQVLQG T   +S +
Subjt:  FRGPSLRNKQVLQGETEPITSSL

XP_024044151.1 uncharacterized protein LOC18046468 isoform X1 [Citrus clementina]7.8e-9041.33Show/hide
Query:  ITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFEL
        IT+ KLNG+NFLQWS+S  + I G+ ++GY+ G+I EP E DP F  WDA NSM+M+WL+NSME++I + ++F  T K+LW+A+T  +SD  NSAQ+++L
Subjt:  ITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFEL

Query:  HNKARSLRQGESDNSK------------------------DAERFRKHVEKERIYDFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMG
          + R  +QG    +K                        D+ +++K +EKER+++FLAGL  +L++VRGR+L  +P+P+  E+F+ VR E SRK VMMG
Subjt:  HNKARSLRQGESDNSK------------------------DAERFRKHVEKERIYDFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMG

Query:  DTHKKPLSLSLESSALAARGPPPP-------SSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQ-VGPSVSNS
         +       S E+SAL +  P  P         +S  ++ +WCD+C +  HT+D CW+LHG           +PP   N   S   S G+Q VG +   +
Subjt:  DTHKKPLSLSLESSALAARGPPPP-------SSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQ-VGPSVSNS

Query:  QDLAISLPPFSKALLEQLYRLLTSPVESTPSSSF--VAQRG-IFSAALTSQQNSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFG
                 F+K  LEQLYR L         SSF  +AQ+G  F+A     +  D WI+DSGATDHMT+   +F+ Y P      +K+ADGS + + G G
Subjt:  QDLAISLPPFSKALLEQLYRLLTSPVESTPSSSF--VAQRG-IFSAALTSQQNSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFG

Query:  YVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF
         + +S N+ L SVLHVP L CNL+SV K+T DL C A F+ S C FQDL +G  IG+A   +GLYYF
Subjt:  YVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina]7.8e-9041.33Show/hide
Query:  ITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFEL
        IT+ KLNG+NFLQWS+S  + I G+ ++GY+ G+I EP E DP F  WDA NSM+M+WL+NSME++I + ++F  T K+LW+A+T  +SD  NSAQ+++L
Subjt:  ITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFEL

Query:  HNKARSLRQGESDNSK------------------------DAERFRKHVEKERIYDFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMG
          + R  +QG    +K                        D+ +++K +EKER+++FLAGL  +L++VRGR+L  +P+P+  E+F+ VR E SRK VMMG
Subjt:  HNKARSLRQGESDNSK------------------------DAERFRKHVEKERIYDFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMG

Query:  DTHKKPLSLSLESSALAARGPPPP-------SSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQ-VGPSVSNS
         +       S E+SAL +  P  P         +S  ++ +WCD+C +  HT+D CW+LHG           +PP   N   S   S G+Q VG +   +
Subjt:  DTHKKPLSLSLESSALAARGPPPP-------SSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQ-VGPSVSNS

Query:  QDLAISLPPFSKALLEQLYRLLTSPVESTPSSSF--VAQRG-IFSAALTSQQNSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFG
                 F+K  LEQLYR L         SSF  +AQ+G  F+A     +  D WI+DSGATDHMT+   +F+ Y P      +K+ADGS + + G G
Subjt:  QDLAISLPPFSKALLEQLYRLLTSPVESTPSSSF--VAQRG-IFSAALTSQQNSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFG

Query:  YVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF
         + +S N+ L SVLHVP L CNL+SV K+T DL C A F+ S C FQDL +G  IG+A   +GLYYF
Subjt:  YVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF

TrEMBL top hitse value%identityAlignment
A0A438F2X4 Retrovirus-related Pol polyprotein from transposon TNT 1-946.0e-8839Show/hide
Query:  DESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDI
        +  ++G N+   S P++  +    S ++   L IT  KLNG+NF+QW++S  + I G+ +  Y+ G I +P E DP +  W  +NSMVM+WLINSM  DI
Subjt:  DESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDI

Query:  KEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGESDNS------------------------KDAERFRKHVEKERIYDFLAGLRPELND
         E F++Y T K +W+A    +S+ DN++ +FE+ +  + LRQG+S  +                        +D   ++K +EKERIY FL GL   L++
Subjt:  KEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGESDNS------------------------KDAERFRKHVEKERIYDFLAGLRPELND

Query:  VRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSALAARGPPP-PSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYR
        VRGR+L+ KP+P++ E+F+E+R E SR++VM+G  +    S +LE+SAL ARG     ++  T++N  WCDHC++  HTK+ CW LHG+P       D++
Subjt:  VRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSALAARGPPP-PSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYR

Query:  PPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTSPVESTPS---SSFVAQRGIFSAAL-TSQQNSDQWILDSGATDHMTAFHDM
        P  P      R  ++  +   S ++S        PFSK  LE L ++    ++ST +   ++ VAQ+GIF  AL   Q+N   WI+DSGA+DHMT    +
Subjt:  PPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTSPVESTPS---SSFVAQRGIFSAAL-TSQQNSDQWILDSGATDHMTAFHDM

Query:  FTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLR
        F  Y+P      V++ADG+ + + G G VI+S +ITLHSVL+VPKL CNL+S+ +LT DL C   F    C FQ L +G  IGNA+   GLY  R   +R
Subjt:  FTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLR

A0A438IPH4 Retrovirus-related Pol polyprotein from transposon RE14.0e-9239.85Show/hide
Query:  MVKIAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMV
        M K A MT   K  SS  + T ++    + +++   S S  +S Q+T  KLNGKN+L+W++S  L I GR +LG++NG +++P   DP+   W ++NS+V
Subjt:  MVKIAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMV

Query:  MAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQG------------------------ESDNSKDAERFRKHVEKERIY
        +AWLINSME  I +  +F  T K++W A+   +SD +NS+Q+F+L +K    RQG                        E D   D+ R +K  E +R+Y
Subjt:  MAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQG------------------------ESDNSKDAERFRKHVEKERIY

Query:  DFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGR
         FLAGL   L++VRGR+L  KP+P+I E+F+EVR E +R++VM+ D   K  +  +ESSAL ++G     S   RR   WCDHCK+  HTK+ CW++HG+
Subjt:  DFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGR

Query:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQ--WILDSGAT
        PQ      +++    ++  + +T S+  Q GP +++ +      P F+K  L  LY+L  SP  S PS S   Q     AAL+S +++    WI+DSGAT
Subjt:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQ--WILDSGAT

Query:  DHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGL
        DHMT    +F+ Y P      +K+ADGS + I G G V +SP++TLH+VLHVP L CNL+S+ K+T D +CQA F  S C FQ+L +G TIGNA    GL
Subjt:  DHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGL

Query:  YYFRGPSLRNK--QVLQGETEPITS--SLDGN
        Y+F   S   K  Q + G+   + +  SLDG+
Subjt:  YYFRGPSLRNK--QVLQGETEPITS--SLDGN

A0A6A2WU09 60S ribosomal protein L383.5e-8840.64Show/hide
Query:  DGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDIKEFF
        D S   S+S  +S S T +    +N S  ITS KLNG NFLQWS+S  L I GR + GY++G   +P E + S   W+A+NSM+M+WLINSM+  +   +
Subjt:  DGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDIKEFF

Query:  IFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGES------------------------DNSKDAERFRKHVEKERIYDFLAGLRPELNDVRGR
        +F  T  ++WNA+   +SD  N+ Q FEL  +   L+QGE                           +KD   F+K VEKER+++FL GL  EL++VRGR
Subjt:  IFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGES------------------------DNSKDAERFRKHVEKERIYDFLAGLRPELNDVRGR

Query:  LLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPT
        +L  +P+P+  E+F+EVR E SR+ VM+G     P     ESSAL +  PP  + R   R+   C+HC +  HTK +CW+LHG+P  +N S +       
Subjt:  LLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPT

Query:  NTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTS-PVESTP------SSSFVAQRGIFSAALTSQQNSDQWILDSGATDHMTAFHDMFT
           +SR +   +      S     A  L  FSK  LEQLY+L++S  + +TP      +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+
Subjt:  NTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTS-PVESTP------SSSFVAQRGIFSAALTSQQNSDQWILDSGATDHMTAFHDMFT

Query:  MYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNK
         Y P      VK+ADGS   I G G +I+SP++TL +VLHVPKL CNLISV ++ HD KC A  T +   FQD  +G  IGNA   +GLY+    +  NK
Subjt:  MYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNK

Query:  QV
        QV
Subjt:  QV

A0A6J1DY12 uncharacterized protein LOC1110255772.3e-20475.72Show/hide
Query:  MVKIAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMV
        MVK AMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVI GRSRLGYINGTIAEPDEADPSFS+WDAQNSMV
Subjt:  MVKIAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMV

Query:  MAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGESD------------------------NSKDAERFRKHVEKERIY
        MAWLINSMEEDIKE FIFYST K+LWNALTMAFSDFDNSAQLFELHNKARSLRQGESD                        NSKDAERFRKHVEKERIY
Subjt:  MAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGESD------------------------NSKDAERFRKHVEKERIY

Query:  DFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGR
        DFLAGLRPEL+DVRGRLLATKPIPAIDEIFAEV WESSRKRVMMGDTH KPLSLSLESSALAARGPPP                                
Subjt:  DFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGR

Query:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQWILDSGATDH
                                                  S  P   A LEQLYRLLT PVESTPSSSFVAQRGI SAALT QQ+SDQWILDSGATDH
Subjt:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQWILDSGATDH

Query:  MTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYY
        MTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFG VILSPNITLHSVL VPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIG+ADGFEGLYY
Subjt:  MTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYY

Query:  FRGPSLRNKQVLQGETEPITSSL
        FRGPSLRNKQVLQG T   +S +
Subjt:  FRGPSLRNKQVLQGETEPITSSL

A0A7J0DNJ6 Uncharacterized protein1.6e-8841.98Show/hide
Query:  SNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIF
        S+  + S PK   T    S+      QITS KL+G+N+LQWSRS  L+I G  R GY++G+I +P   DPSF +WD QNSMVMAWL+NSM++ I E ++ 
Subjt:  SNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIF

Query:  YSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGESDNSK------------------------DAERFRKHVEKERIYDFLAGLRPELNDVRGRLL
        Y T K +W+A+T+A+SD ++S+Q+F+L +++R+LRQ E   ++                        + E +RK + KER Y+FL GL P L+DVRGR+L
Subjt:  YSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGESDNSK------------------------DAERFRKHVEKERIYDFLAGLRPELNDVRGRLL

Query:  ATKPIPAIDEIFAEVRWESSRKRVMMGD-THKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTN
        + KP+P++D IF+EVR E  R+R+M+G        S++ ++SA+AAR    P SR  R+  LWCDHC R +HTK+ CW+LHG+P       D+       
Subjt:  ATKPIPAIDEIFAEVRWESSRKRVMMGD-THKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTN

Query:  TPSSRTSSSGYQVGPSVSNSQDLAISLP-PFSKALLEQLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQ-WILDSGATDHMTAFHDMFTMYSPNP
         P S+  S G +  P V+   D A S    F++A L+Q+ +L +S      S+S +A  GI S+ +     S + WI+DSGA++HM++   +F+ YS   
Subjt:  TPSSRTSSSGYQVGPSVSNSQDLAISLP-PFSKALLEQLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQ-WILDSGATDHMTAFHDMFTMYSPNP

Query:  IQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF
            V LADGS + + G G V LSPN+ LH+VL VPKL  NL+S+ KLT D  C A  + + C+FQD  +G TIG A    GLYYF
Subjt:  IQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-1220.04Show/hide
Query:  KLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEP---------DEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSA
        KL   N+L WSR    +  G    G+++G+   P            +P ++ W  Q+ ++ + ++ ++   ++      +T   +W  L   +++  +  
Subjt:  KLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEP---------DEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSA

Query:  QLFELHNKARSLRQGESDNSKDAERFRKHVEK-----------ERIYDFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLS
         + +L  + +   +G        +      ++           E++   L  L  E   V  ++ A    P + EI   +    S+   +   T      
Subjt:  QLFELHNKARSLRQGESDNSKDAERFRKHVEK-----------ERIYDFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLS

Query:  LSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLE
        + + ++A++ R     ++ +    N   D+    N++K                       P    S+    +  Q  P +   Q   +     S     
Subjt:  LSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLE

Query:  QLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPN---ITLHSVLHV
        QL   L+S     P S F   +   + AL S  +S+ W+LDSGAT H+T+  +  +++ P      V +ADGS+  I   G   LS     + LH++L+V
Subjt:  QLYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPN---ITLHSVLHV

Query:  PKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY
        P +  NLISV +L +       F  +    +DL TG  +      + LY
Subjt:  PKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-1221.54Show/hide
Query:  KLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEP---------DEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSD--FDN
        KL   N+L WSR    +  G    G+++G+   P            +P ++ W  Q+ ++ + ++ ++   ++      +T   +W  L   +++  + +
Subjt:  KLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEP---------DEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSD--FDN

Query:  SAQL-FELHNKARSLRQGESDNSKDAERFRKHVEKERIYDFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSAL
          QL F       +L     D+ +  ER             L  L  +   V  ++ A    P++ EI   +              +++   L+L S+ +
Subjt:  SAQL-FELHNKARSLRQGESDNSKDAERFRKHVEKERIYDFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHKKPLSLSLESSAL

Query:  AARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTS
             P  ++  T RN         TN  ++   +        N S  ++P       SS + S   Q  P +   Q    S+   S     QL++  ++
Subjt:  AARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQLYRLLTS

Query:  PVESTPSSSFVAQRGIFSAALTSQQNSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVIL---SPNITLHSVLHVPKLCCNLI
          +   +S F   +   + A+ S  N++ W+LDSGAT H+T+  +  + + P      V +ADGS+  I   G   L   S ++ L+ VL+VP +  NLI
Subjt:  PVESTPSSSFVAQRGIFSAALTSQQNSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVIL---SPNITLHSVLHVPKLCCNLI

Query:  SVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY
        SV +L +  +    F  +    +DL TG  +      + LY
Subjt:  SVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.8e-1225.13Show/hide
Query:  NFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQ
        N++ W       +    + G+I+GT+ +PD   P +  W+  N+MVM WL+NSM + + E  ++  T   +W  L   F    +  ++++L  +  +LRQ
Subjt:  NFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEEDIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQ

Query:  GESD-----------------------------NSKDAERFRKHVEKERIYDFLAGLR--PELNDVRGRLLATKPIPAIDEIFAEVR
        G                                N +  +R  +  EKE+ Y+FL GL+       V  +++  KP P++ E FA V+
Subjt:  GESD-----------------------------NSKDAERFRKHVEKERIYDFLAGLR--PELNDVRGRLLATKPIPAIDEIFAEVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTAAAATAGCCATGATGACTGATGTGCGCAAGGATGAGAGTTCCGACGGATCGAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACCTCACATCTC
TATCTCTGAAAACACTTCGCTCCAAATTACCTCTCCAAAACTCAATGGAAAAAATTTTCTGCAATGGTCTCGATCGGCCCTCCTAGTTATCCATGGCCGCAGTCGACTCG
GGTACATCAATGGCACGATAGCAGAACCAGATGAAGCTGACCCTTCTTTTTCCATGTGGGATGCACAAAACTCAATGGTCATGGCATGGCTCATTAACTCGATGGAGGAG
GACATTAAAGAATTCTTCATCTTCTACTCAACAACAAAGAATCTTTGGAATGCGCTCACTATGGCTTTTTCTGATTTTGATAACTCGGCTCAATTGTTTGAATTACACAA
TAAGGCACGTTCCTTACGACAAGGTGAATCCGATAACTCGAAGGATGCTGAACGCTTCCGCAAACACGTCGAGAAGGAACGAATTTATGATTTTCTTGCAGGTCTTCGTC
CGGAATTAAATGATGTGCGTGGCCGCTTACTTGCCACAAAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTCGCTGGGAGTCAAGCCGTAAACGTGTGATGATG
GGTGATACACACAAAAAACCTCTGTCCCTCTCACTGGAATCATCAGCTCTGGCGGCACGAGGTCCACCACCACCTTCATCACGATCTACTCGTCGGAACAACCTATGGTG
TGATCATTGTAAGCGCACAAACCATACAAAAGATCGGTGTTGGGAACTCCATGGTCGTCCTCAACGGAAGAATTTATCAGGGGATTATCGGCCACCTCCACCTACAAATA
CTCCATCTTCTCGAACCAGCTCCTCCGGCTATCAAGTGGGCCCTAGTGTGTCAAACTCCCAAGATTTGGCCATCTCTCTCCCTCCATTTTCGAAGGCACTACTTGAACAG
CTCTATCGCCTCTTAACATCGCCGGTTGAGTCTACTCCTTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGTGCAGCATTAACAAGTCAGCAGAATTCCGATCAGTG
GATCTTAGATTCGGGTGCAACTGATCATATGACCGCGTTTCATGATATGTTTACCATGTACTCACCCAACCCGATTCAAACACATGTCAAGCTTGCAGATGGGTCATCGG
CCATTATTAAGGGCTTTGGTTATGTTATTCTTAGCCCAAACATTACATTGCACTCAGTGCTCCATGTGCCTAAATTATGTTGCAATCTAATTTCTGTTCAGAAGTTGACT
CATGATTTAAAGTGTCAAGCCCTGTTCACTGACTCTAAGTGTTTGTTTCAGGACTTGATAACGGGAACGACGATTGGCAATGCTGATGGCTTTGAAGGGCTCTACTACTT
CAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCCAGGGGGAGACTGAGCCTATTACAAGTAGTCTTGATGGAAATTTTTGGGAGATTGACGATCTAAATACCAGAATTG
AGTCCCCTCAGTCAAAAATACCTGAGATTGATGGTCTAAATACCGAGTCACCTCAGCCAAAAATTCCTATCCCGATCGTCCCAATTACTCAGATAGAAGGGTCAGTTCCT
ATTATTTCCTGTAATAATGAAGATGATCAAGTCAACCCAAATCGAAGTGACAAGCAACCTGAGACTCTTGTTTATTCTCGGCGACAAACGGTTCAAAGAGGAGTGGAGCC
ACCACAGCTTCAACAGCAAAGTCATGAATCCATCTCATCCTTAGGAAGGGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTAAAATAGCCATGATGACTGATGTGCGCAAGGATGAGAGTTCCGACGGATCGAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACCTCACATCTC
TATCTCTGAAAACACTTCGCTCCAAATTACCTCTCCAAAACTCAATGGAAAAAATTTTCTGCAATGGTCTCGATCGGCCCTCCTAGTTATCCATGGCCGCAGTCGACTCG
GGTACATCAATGGCACGATAGCAGAACCAGATGAAGCTGACCCTTCTTTTTCCATGTGGGATGCACAAAACTCAATGGTCATGGCATGGCTCATTAACTCGATGGAGGAG
GACATTAAAGAATTCTTCATCTTCTACTCAACAACAAAGAATCTTTGGAATGCGCTCACTATGGCTTTTTCTGATTTTGATAACTCGGCTCAATTGTTTGAATTACACAA
TAAGGCACGTTCCTTACGACAAGGTGAATCCGATAACTCGAAGGATGCTGAACGCTTCCGCAAACACGTCGAGAAGGAACGAATTTATGATTTTCTTGCAGGTCTTCGTC
CGGAATTAAATGATGTGCGTGGCCGCTTACTTGCCACAAAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTCGCTGGGAGTCAAGCCGTAAACGTGTGATGATG
GGTGATACACACAAAAAACCTCTGTCCCTCTCACTGGAATCATCAGCTCTGGCGGCACGAGGTCCACCACCACCTTCATCACGATCTACTCGTCGGAACAACCTATGGTG
TGATCATTGTAAGCGCACAAACCATACAAAAGATCGGTGTTGGGAACTCCATGGTCGTCCTCAACGGAAGAATTTATCAGGGGATTATCGGCCACCTCCACCTACAAATA
CTCCATCTTCTCGAACCAGCTCCTCCGGCTATCAAGTGGGCCCTAGTGTGTCAAACTCCCAAGATTTGGCCATCTCTCTCCCTCCATTTTCGAAGGCACTACTTGAACAG
CTCTATCGCCTCTTAACATCGCCGGTTGAGTCTACTCCTTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGTGCAGCATTAACAAGTCAGCAGAATTCCGATCAGTG
GATCTTAGATTCGGGTGCAACTGATCATATGACCGCGTTTCATGATATGTTTACCATGTACTCACCCAACCCGATTCAAACACATGTCAAGCTTGCAGATGGGTCATCGG
CCATTATTAAGGGCTTTGGTTATGTTATTCTTAGCCCAAACATTACATTGCACTCAGTGCTCCATGTGCCTAAATTATGTTGCAATCTAATTTCTGTTCAGAAGTTGACT
CATGATTTAAAGTGTCAAGCCCTGTTCACTGACTCTAAGTGTTTGTTTCAGGACTTGATAACGGGAACGACGATTGGCAATGCTGATGGCTTTGAAGGGCTCTACTACTT
CAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCCAGGGGGAGACTGAGCCTATTACAAGTAGTCTTGATGGAAATTTTTGGGAGATTGACGATCTAAATACCAGAATTG
AGTCCCCTCAGTCAAAAATACCTGAGATTGATGGTCTAAATACCGAGTCACCTCAGCCAAAAATTCCTATCCCGATCGTCCCAATTACTCAGATAGAAGGGTCAGTTCCT
ATTATTTCCTGTAATAATGAAGATGATCAAGTCAACCCAAATCGAAGTGACAAGCAACCTGAGACTCTTGTTTATTCTCGGCGACAAACGGTTCAAAGAGGAGTGGAGCC
ACCACAGCTTCAACAGCAAAGTCATGAATCCATCTCATCCTTAGGAAGGGTGTGA
Protein sequenceShow/hide protein sequence
MVKIAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIHGRSRLGYINGTIAEPDEADPSFSMWDAQNSMVMAWLINSMEE
DIKEFFIFYSTTKNLWNALTMAFSDFDNSAQLFELHNKARSLRQGESDNSKDAERFRKHVEKERIYDFLAGLRPELNDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMM
GDTHKKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKALLEQ
LYRLLTSPVESTPSSSFVAQRGIFSAALTSQQNSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGYVILSPNITLHSVLHVPKLCCNLISVQKLT
HDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQGETEPITSSLDGNFWEIDDLNTRIESPQSKIPEIDGLNTESPQPKIPIPIVPITQIEGSVP
IISCNNEDDQVNPNRSDKQPETLVYSRRQTVQRGVEPPQLQQQSHESISSLGRV