CuGenDBv2

Moc11g32010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g32010
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr11:23312877..23321575
RNA-Seq Expression: Moc11g32010
Synteny: Moc11g32010
Gene Ontology terms:
GO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
IPR005162 - Retrotransposon gag domain
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology
GenBank top hits (e value, % identity, alignment)
KAE8658400.1 60S ribosomal protein L38 [Hibiscus syriacus] (e value: 8.1e-104; % identity: 43.82)
Query:  DGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESF
        D S   S+S  +S S TL+    +N S  ITS KLNG NFLQWS+S  L IRGR + GY++G   +P E + S   W+A+NSM+M+WLINSM+  +  ++
Subjt:  DGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESF

Query:  IFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGR
        +F  TA D+WNA+   +SD  N+ Q FEL+ +   L+QGE  VTQYY+ L+ LW E+D+  + EW  +KD   F+K VEKER+++FL GL  ELD+VRGR
Subjt:  IFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGR

Query:  LLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPT
        +L  +P+P+  E+F+EVR   SR+ VM+G     P     ESSAL +  PP  + R    +   C+HC +  HTK +CW+LHG+P  +N S +       
Subjt:  LLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPT

Query:  NTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLL-------TPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFT
           +SR +   +      S     A  L  FSK QLEQLY+L+       TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+
Subjt:  NTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLL-------TPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFT

Query:  MYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYFRGPSLRNK
         Y P      VK+ADGS   I G GS+I+SP++TL +VLHVPKL CNLISV ++ HD KC A  T +   FQD  +G  I NA   +GLY+    +  NK
Subjt:  MYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYFRGPSLRNK

Query:  QV
        QV
Subjt:  QV

RVW98618.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 1.6e-104; % identity: 42.19)
Query:  TMMTDVRKDESSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWL
        T MT   K  SS  + T ++    + +++   S S  +S Q+T  KLNGKN+L+W++S  L I GR +LG++NG +++P   DP+   W ++NS+V+AWL
Subjt:  TMMTDVRKDESSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWL

Query:  INSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLA
        INSME  I +  +F  TAKD+W A+   +SD +NS+Q+F+L++K    RQG+ +VT YY+ +  LW ELDLC   EW+   D+ R +K  E +R+Y FLA
Subjt:  INSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLA

Query:  GLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRK
        GL   LD+VRGR+L  KP+P+I E+F+EVR   +R+ VM+ D   K  +  +ESSAL ++G   S S   R    WCDHCK+  HTK+ CW++HG+PQ  
Subjt:  GLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRK

Query:  NLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMT
            +++    ++  + +T S+  Q GP +++ +      P F+K QL  LY+L      S PS S   Q     AAL+S + +    WI+DSGATDHMT
Subjt:  NLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMT

Query:  AFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYFR
            +F+ Y P      +K+ADGS + I G GSV +SP++TLH+VLHVP L CNL+S+ K+T D +CQA F  S C FQ+L +G TI NA    GLY+F 
Subjt:  AFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYFR

Query:  GPSLRNKQVLQV
          S   K + ++
Subjt:  GPSLRNKQVLQV

XP_022159153.1 uncharacterized protein LOC111025577 [Momordica charantia] (e value: 1.7e-223; % identity: 81.87)
Query:  MVKTTMMTDVRKDESSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
        MVKT MMTDVRKDESSDGSNTTSISLPKSTSTT HISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
Subjt:  MVKTTMMTDVRKDESSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL NKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
Subjt:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGR
        DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEV W SSRK VMMGDTHTKPLSLSLESSALAARGPPPS                               
Subjt:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGR

Query:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDH
                                                         AQLEQLYRLLT  VESTPSSSFVAQRGI SAALT QQHSDQWILDSGATDH
Subjt:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDH

Query:  MTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYY
        MTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL VPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTI +ADGFEGLYY
Subjt:  MTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYY

Query:  FRGPSLRNKQVLQ
        FRGPSLRNKQVLQ
Subjt:  FRGPSLRNKQVLQ

XP_024044151.1 uncharacterized protein LOC18046468 isoform X1 [Citrus clementina] (e value: 1.1e-103; % identity: 44.35)
Query:  ITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL
        IT+ KLNG+NFLQWS+S  + IRG+ ++GY+ G+I EP E DP F  WDA NSM+M+WL+NSME++I ++++F  TAKDLW+A+T  +SD  NSAQ+++L
Subjt:  ITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL

Query:  RNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMG
        + + R  +QG   VT+YY+ L+ LW ELD   + EWE + D+ +++K +EKER+++FLAGL  +LD+VRGR+L  +P+P+  E+F+ VR   SRK VMMG
Subjt:  RNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMG

Query:  DTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQ-VGPSVSNSQDLAISL
         +  +  +L   +      G   +  +S   + +WCD+C +  HT+D CW+LHG           +PP   N   S   S G+Q VG +   +       
Subjt:  DTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQ-VGPSVSNSQDLAISL

Query:  PPFSKAQLEQLYRLLTPLVESTPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPN
          F+K QLEQLYR L         SSF  +AQ+G  F+A     +  D WI+DSGATDHMT+   +F+ Y P      +K+ADGS + + G GS+ +S N
Subjt:  PPFSKAQLEQLYRLLTPLVESTPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPN

Query:  ITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYF
        + L SVLHVP L CNL+SV K+T DL C A F+ S C FQDL +G  I +A   +GLYYF
Subjt:  ITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYF

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina] (e value: 1.1e-103; % identity: 44.35)
Query:  ITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL
        IT+ KLNG+NFLQWS+S  + IRG+ ++GY+ G+I EP E DP F  WDA NSM+M+WL+NSME++I ++++F  TAKDLW+A+T  +SD  NSAQ+++L
Subjt:  ITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL

Query:  RNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMG
        + + R  +QG   VT+YY+ L+ LW ELD   + EWE + D+ +++K +EKER+++FLAGL  +LD+VRGR+L  +P+P+  E+F+ VR   SRK VMMG
Subjt:  RNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMG

Query:  DTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQ-VGPSVSNSQDLAISL
         +  +  +L   +      G   +  +S   + +WCD+C +  HT+D CW+LHG           +PP   N   S   S G+Q VG +   +       
Subjt:  DTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQ-VGPSVSNSQDLAISL

Query:  PPFSKAQLEQLYRLLTPLVESTPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPN
          F+K QLEQLYR L         SSF  +AQ+G  F+A     +  D WI+DSGATDHMT+   +F+ Y P      +K+ADGS + + G GS+ +S N
Subjt:  PPFSKAQLEQLYRLLTPLVESTPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPN

Query:  ITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYF
        + L SVLHVP L CNL+SV K+T DL C A F+ S C FQDL +G  I +A   +GLYYF
Subjt:  ITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYF

TrEMBL top hits (e value, % identity, alignment)
A0A438F2X4 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.5e-100; % identity: 41.4)
Query:  DESSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDI
        +  ++G N+   S P++  +    S ++   L IT  KLNG+NF+QW++S  + I G+ +  Y+ G I +P E DP +  W  +NSMVM+WLINSM  DI
Subjt:  DESSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDI

Query:  KESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDD
         E+F++Y TAK++W+A    +S+ DN++ +FE+++  + LRQG+S VT+Y++ L R W +LD+   L W+  +D   ++K +EKERIY FL GL   LD+
Subjt:  KESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDD

Query:  VRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPP-SSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYR
        VRGR+L+ KP+P++ E+F+E+R   SR+ VM+G   T+  S +LE+SAL ARG    +++  T+ N  WCDHC++  HTK+ CW LHG+P       D++
Subjt:  VRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPP-SSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYR

Query:  PPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLLTPLVESTPS---SSFVAQRGIFSAAL-TSQQHSDQWILDSGATDHMTAFHDM
        P  P      R  ++  +   S ++S        PFSK QLE L ++    ++ST +   ++ VAQ+GIF  AL   Q++   WI+DSGA+DHMT    +
Subjt:  PPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLLTPLVESTPS---SSFVAQRGIFSAAL-TSQQHSDQWILDSGATDHMTAFHDM

Query:  FTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYFRGPSLR
        F  Y+P      V++ADG+ + + G GSVI+S +ITLHSVL+VPKL CNL+S+ +LT DL C   F    C FQ L +G  I NA+   GLY  R   +R
Subjt:  FTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYFRGPSLR

A0A438IPH4 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 7.9e-105; % identity: 42.19)
Query:  TMMTDVRKDESSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWL
        T MT   K  SS  + T ++    + +++   S S  +S Q+T  KLNGKN+L+W++S  L I GR +LG++NG +++P   DP+   W ++NS+V+AWL
Subjt:  TMMTDVRKDESSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWL

Query:  INSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLA
        INSME  I +  +F  TAKD+W A+   +SD +NS+Q+F+L++K    RQG+ +VT YY+ +  LW ELDLC   EW+   D+ R +K  E +R+Y FLA
Subjt:  INSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLA

Query:  GLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRK
        GL   LD+VRGR+L  KP+P+I E+F+EVR   +R+ VM+ D   K  +  +ESSAL ++G   S S   R    WCDHCK+  HTK+ CW++HG+PQ  
Subjt:  GLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRK

Query:  NLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMT
            +++    ++  + +T S+  Q GP +++ +      P F+K QL  LY+L      S PS S   Q     AAL+S + +    WI+DSGATDHMT
Subjt:  NLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGATDHMT

Query:  AFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYFR
            +F+ Y P      +K+ADGS + I G GSV +SP++TLH+VLHVP L CNL+S+ K+T D +CQA F  S C FQ+L +G TI NA    GLY+F 
Subjt:  AFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYFR

Query:  GPSLRNKQVLQV
          S   K + ++
Subjt:  GPSLRNKQVLQV

A0A6A2WU09 60S ribosomal protein L38 (e value: 3.9e-104; % identity: 43.82)
Query:  DGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESF
        D S   S+S  +S S TL+    +N S  ITS KLNG NFLQWS+S  L IRGR + GY++G   +P E + S   W+A+NSM+M+WLINSM+  +  ++
Subjt:  DGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESF

Query:  IFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGR
        +F  TA D+WNA+   +SD  N+ Q FEL+ +   L+QGE  VTQYY+ L+ LW E+D+  + EW  +KD   F+K VEKER+++FL GL  ELD+VRGR
Subjt:  IFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGR

Query:  LLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPT
        +L  +P+P+  E+F+EVR   SR+ VM+G     P     ESSAL +  PP  + R    +   C+HC +  HTK +CW+LHG+P  +N S +       
Subjt:  LLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPT

Query:  NTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLL-------TPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFT
           +SR +   +      S     A  L  FSK QLEQLY+L+       TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+
Subjt:  NTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLL-------TPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFT

Query:  MYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYFRGPSLRNK
         Y P      VK+ADGS   I G GS+I+SP++TL +VLHVPKL CNLISV ++ HD KC A  T +   FQD  +G  I NA   +GLY+    +  NK
Subjt:  MYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYFRGPSLRNK

Query:  QV
        QV
Subjt:  QV

A0A6J1DY12 uncharacterized protein LOC111025577 (e value: 8.4e-224; % identity: 81.87)
Query:  MVKTTMMTDVRKDESSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
        MVKT MMTDVRKDESSDGSNTTSISLPKSTSTT HISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
Subjt:  MVKTTMMTDVRKDESSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL NKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
Subjt:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGR
        DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEV W SSRK VMMGDTHTKPLSLSLESSALAARGPPPS                               
Subjt:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGR

Query:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDH
                                                         AQLEQLYRLLT  VESTPSSSFVAQRGI SAALT QQHSDQWILDSGATDH
Subjt:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLAISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDH

Query:  MTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYY
        MTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVL VPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTI +ADGFEGLYY
Subjt:  MTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYY

Query:  FRGPSLRNKQVLQ
        FRGPSLRNKQVLQ
Subjt:  FRGPSLRNKQVLQ

A0A7J0DNJ6 Uncharacterized protein (e value: 2.1e-102; % identity: 44.29)
Query:  SSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKE
        SSD + +    +P ++  +L          QITS KL+G+N+LQWSRS  L+IRG  R GY++G+I +P   DPSF +WD QNSMVMAWL+NSM++ I E
Subjt:  SSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKE

Query:  SFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVR
         ++ Y TAK +W+A+T+A+SD ++S+Q+F+LR+++R+LRQ E  VTQY+SSL +LW ELDL     W  + + E +RK + KER Y+FL GL P LDDVR
Subjt:  SFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVR

Query:  GRLLATKPIPAIDEIFAEVRWGSSRKCVMMGD-THTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPP
        GR+L+ KP+P++D IF+EVR    R+ +M+G        S++ ++SA+AAR P    SR  R   LWCDHC R +HTK+ CW+LHG+P       D+   
Subjt:  GRLLATKPIPAIDEIFAEVRWGSSRKCVMMGD-THTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPP

Query:  PPTNTPSSRTSSSGYQVGPSVSNSQDLAISLP-PFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQ-WILDSGATDHMTAFHDMFTMY
             P S+  S G +  P V+   D A S    F++AQL+Q+ +L +       SS+ +   GI S+ +     S + WI+DSGA++HM++   +F+ Y
Subjt:  PPTNTPSSRTSSSGYQVGPSVSNSQDLAISLP-PFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQ-WILDSGATDHMTAFHDMFTMY

Query:  SPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYF
        S       V LADGS + + G G+V LSPN+ LH+VL VPKL  NL+S+ KLT D  C A  + + C+FQD  +G TI  A    GLYYF
Subjt:  SPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYF

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 3.5e-17; % identity: 21)
Query:  KLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEP---------DEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSA
        KL   N+L WSR    +  G    G+++G+   P            +P ++ W  Q+ ++ + ++ ++   ++ +    +TA  +W  L   +++  +  
Subjt:  KLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEP---------DEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSA

Query:  QLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRK
         + +LR + +   +G   +  Y   L   + +L L L    ++ +  ER  +++ +E         +P +D    ++ A    P + EI   +    S+ 
Subjt:  QLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRK

Query:  CVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDL
          +   T      + + ++A++ R    +++ +    N   D+    N++K                       P    S+    +  Q  P +   Q  
Subjt:  CVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDL

Query:  AISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSP
         +     S  +  QL   L+ +    P S F   +   + AL S   S+ W+LDSGAT H+T+  +  +++ P      V +ADGS+  I   GS  LS 
Subjt:  AISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSP

Query:  N---ITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLY
            + LH++L+VP +  NLISV +L +       F  +    +DL TG  +      + LY
Subjt:  N---ITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.2e-11; % identity: 19.7)
Query:  KLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEP---------DEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSA
        KL   N+L WSR    +  G    G+++G+   P            +P ++ W  Q+ ++ + ++ ++   ++ +    +TA  +W  L   +++  +  
Subjt:  KLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEP---------DEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSA

Query:  QLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRK
         + +LR   R            +  L  L   +D                      E++   L  L  +   V  ++ A    P++ EI  E       K
Subjt:  QLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWGSSRK

Query:  CVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDL
         + +      P++ ++ +          +++ +   NN   +     N+ +   W+    P       D R P P        S  G+            
Subjt:  CVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDL

Query:  AISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVIL--
               S  +  QL++  +   +   +S F   +   + A+ S  +++ W+LDSGAT H+T+  +  + + P      V +ADGS+  I   GS  L  
Subjt:  AISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVIL--

Query:  -SPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLY
         S ++ L+ VL+VP +  NLISV +L +  +    F  +    +DL TG  +      + LY
Subjt:  -SPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLY

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 9.7e-23; % identity: 32.11)
Query:  NFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQ
        N++ W       +R   + G+I+GT+ +PD   P +  W+  N+MVM WL+NSM + + ES ++  TA  +W  L   F    +  ++++LR +  +LRQ
Subjt:  NFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQ

Query:  GESDVTQYYSSLRRLWAELD--------LCLNLEWENSKDAERFRKHVEKERIYDFLAGLR--PELDDVRGRLLATKPIPAIDEIFAEVR
        G   V +Y+  L ++W EL          C     E +K AE  R   EKE+ Y+FL GL+     + V  +++  KP P++ E FA V+
Subjt:  GESDVTQYYSSLRRLWAELD--------LCLNLEWENSKDAERFRKHVEKERIYDFLAGLR--PELDDVRGRLLATKPIPAIDEIFAEVR


Sequences
CDS sequence
ATGGTTAAAACAACCATGATGACTGATGTGCGCAAGGATGAGAGTTCCGACGGATCGAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACTTCACATCTC
TATCTCTGAAAACACTTCGCTCCAAATTACCTCTCCAAAACTCAATGGAAAAAATTTTCTGCAATGGTCTCGATCGGCCCTCCTAGTTATTCGTGGCCGGAGTCGACTCG
GGTACATCAATGGCACAATAGCAGAACCAGATGAAGCTGACCCTTCTTTTTCCGTGTGGGATGCACAAAACTCAATGGTCATGGCATGGCTCATTAACTCGATGGAGGAG
GACATTAAAGAATCCTTCATCTTCTACTCAACAGCAAAGGATCTTTGGAATGCGCTCACTATGGCTTTTTCTGATTTTGATAACTCGGCTCAATTGTTTGAATTACGCAA
TAAGGCACGTTCCTTACGACAAGGTGAATCCGATGTCACCCAATACTACAGCTCATTACGTAGGTTGTGGGCTGAACTTGATTTATGTCTCAATCTCGAATGGGAGAACT
CGAAGGATGCTGAACGCTTCCGCAAACACGTCGAGAAGGAACGAATTTATGATTTTCTTGCAGGTCTTCGTCCGGAATTAGATGATGTGCGTGGCCGCTTACTTGCCACA
AAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTCGTTGGGGGTCAAGCCGTAAATGTGTGATGATGGGTGATACACACACAAAACCTCTGTCCCTCTCACTGGA
ATCATCAGCTCTGGCGGCACGAGGTCCACCACCATCTTCATCCCGATCTACTCGTCTGAACAACCTATGGTGTGATCATTGTAAGCGCACAAACCATACAAAAGATCGGT
GTTGGGAACTCCATGGTCGTCCTCAACGGAAGAATTTATCAGGGGATTATCGGCCACCTCCACCTACAAATACTCCATCTTCTCGAACCAGCTCCTCCGGCTATCAAGTG
GGCCCTAGTGTGTCAAACTCCCAAGATTTGGCCATCTCTCTCCCTCCATTTTCGAAGGCACAACTTGAACAGCTCTATCGCCTCTTAACACCGCTGGTTGAGTCTACTCC
TTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGTGCAGCTTTAACAAGTCAGCAGCATTCCGATCAGTGGATCTTAGATTCGGGTGCAACTGATCATATGACCGCTT
TTCATGATATGTTTACCATGTACTCACCCAACCCGATTCAGACACATGTCAAGCTTGCAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCA
AACATTACATTGCACTCAGTGCTCCATGTGCCTAAATTATGTTGCAATCTAATTTCTGTTCAGAAGTTGACTCATGATTTAAAGTGTCAAGCCCTATTCACTGACTCTAA
GTGTTTGTTTCAGGACTTGATAACGGGAACGACGATTGACAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCCAGG
TTTGA
mRNA sequence
ATGGTTAAAACAACCATGATGACTGATGTGCGCAAGGATGAGAGTTCCGACGGATCGAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACTTCACATCTC
TATCTCTGAAAACACTTCGCTCCAAATTACCTCTCCAAAACTCAATGGAAAAAATTTTCTGCAATGGTCTCGATCGGCCCTCCTAGTTATTCGTGGCCGGAGTCGACTCG
GGTACATCAATGGCACAATAGCAGAACCAGATGAAGCTGACCCTTCTTTTTCCGTGTGGGATGCACAAAACTCAATGGTCATGGCATGGCTCATTAACTCGATGGAGGAG
GACATTAAAGAATCCTTCATCTTCTACTCAACAGCAAAGGATCTTTGGAATGCGCTCACTATGGCTTTTTCTGATTTTGATAACTCGGCTCAATTGTTTGAATTACGCAA
TAAGGCACGTTCCTTACGACAAGGTGAATCCGATGTCACCCAATACTACAGCTCATTACGTAGGTTGTGGGCTGAACTTGATTTATGTCTCAATCTCGAATGGGAGAACT
CGAAGGATGCTGAACGCTTCCGCAAACACGTCGAGAAGGAACGAATTTATGATTTTCTTGCAGGTCTTCGTCCGGAATTAGATGATGTGCGTGGCCGCTTACTTGCCACA
AAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTCGTTGGGGGTCAAGCCGTAAATGTGTGATGATGGGTGATACACACACAAAACCTCTGTCCCTCTCACTGGA
ATCATCAGCTCTGGCGGCACGAGGTCCACCACCATCTTCATCCCGATCTACTCGTCTGAACAACCTATGGTGTGATCATTGTAAGCGCACAAACCATACAAAAGATCGGT
GTTGGGAACTCCATGGTCGTCCTCAACGGAAGAATTTATCAGGGGATTATCGGCCACCTCCACCTACAAATACTCCATCTTCTCGAACCAGCTCCTCCGGCTATCAAGTG
GGCCCTAGTGTGTCAAACTCCCAAGATTTGGCCATCTCTCTCCCTCCATTTTCGAAGGCACAACTTGAACAGCTCTATCGCCTCTTAACACCGCTGGTTGAGTCTACTCC
TTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGTGCAGCTTTAACAAGTCAGCAGCATTCCGATCAGTGGATCTTAGATTCGGGTGCAACTGATCATATGACCGCTT
TTCATGATATGTTTACCATGTACTCACCCAACCCGATTCAGACACATGTCAAGCTTGCAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCA
AACATTACATTGCACTCAGTGCTCCATGTGCCTAAATTATGTTGCAATCTAATTTCTGTTCAGAAGTTGACTCATGATTTAAAGTGTCAAGCCCTATTCACTGACTCTAA
GTGTTTGTTTCAGGACTTGATAACGGGAACGACGATTGACAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCCAGG
TTTGA
Protein sequence
MVKTTMMTDVRKDESSDGSNTTSISLPKSTSTTLHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEE
DIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLAT
KPIPAIDEIFAEVRWGSSRKCVMMGDTHTKPLSLSLESSALAARGPPPSSSRSTRLNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQV
GPSVSNSQDLAISLPPFSKAQLEQLYRLLTPLVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSP
NITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIDNADGFEGLYYFRGPSLRNKQVLQV