; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g20640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g20640
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr3:14076638..14079512
RNA-Seq ExpressionMoc03g20640
SyntenyMoc03g20640
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8658400.1 60S ribosomal protein L38 [Hibiscus syriacus]5.3e-10441.62Show/hide
Query:  DGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDIKESF
        D S   S+S  +S S T +    +N S  ITS KLNG NFLQWS+S  L IRGR + GY++G   +P E +     W+A+NSM+M+WLINSM+  +  ++
Subjt:  DGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDIKESF

Query:  IFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGR
        +F  TA D+WNA+   +SD  N+ Q FEL+ +   L+QGE  VTQYY+ L+ LW E+D+  + EW  +KD   F+K VEKER+++FL GL  ELD+VRGR
Subjt:  IFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGR

Query:  LLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPT
        +L  +P+P+  E+F+EV  E SR+ VM+G     P     ESSAL +  PP  + R   R+   C+HC +  HTK +CW+LHG+P  +N S +       
Subjt:  LLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPT

Query:  NTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFT
           +SR +   +      S     A+ L  FSK QLEQLY+L+       TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+
Subjt:  NTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFT

Query:  MYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNK
         Y P      VK+ DGS   I G GS+I+SP++TL +VLHVPKL CNLI V ++ HD KC A  T +   FQD  +G  IGNA   +GLY+    +  NK
Subjt:  MYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNK

Query:  QVLQIEESVPIISCNNE--------DDQVNPNRSDKQPET--------LVYSRRQTVQKGVEPPQPQ
        QV ++      +SC NE           +N    + Q ET         VYSRR T    V PP P+
Subjt:  QVLQIEESVPIISCNNE--------DDQVNPNRSDKQPET--------LVYSRRQTVQKGVEPPQPQ

RVW98618.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.6e-10540.28Show/hide
Query:  MVKTAMMTDVHKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMV
        M KTA MT   K  SS  + T ++    + +++   S S  +S Q+T  KLNGKN+L+W++S  L I GR +LG++NG +++P   D     W ++NS+V
Subjt:  MVKTAMMTDVHKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        +AWLINSME  I +  +F  TAKD+W A+   +SD +NS+Q+F+L++K    RQG+ +VT YY+ +  LW ELDLC   EW+   D+ R +K  E +R+Y
Subjt:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGR
         FLAGL   LD+VRGR+L  KP+P+I E+F+EV  E +R++VM+ D   K  +  +ESSAL ++G     S   RR   WCDHCK+  HTK+ CW++HG+
Subjt:  DFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGR

Query:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGAT
        PQ      +++    ++  + +T S+  Q GP +++ +      P F+K QL  LY+L   P  S PS S   Q     AAL+S + +    WI+DSGAT
Subjt:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGAT

Query:  DHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGL
        DHMT    +F+ Y P      +K+ DGS + I G GSV +SP++TLH+VLHVP L CNL+ + K+T D +CQA F  S C FQ+L +G TIGNA    GL
Subjt:  DHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGL

Query:  YYFRGPSLRNKQVLQI---------------EESVPIISCNNEDDQVNPNRSDKQPETLVYSRRQTVQKGVEP-PQPQQQS
        Y+F   S   K + +I                 S+P  + N+ +       S K  E + YSRR+   K   P P P  +S
Subjt:  YYFRGPSLRNKQVLQI---------------EESVPIISCNNEDDQVNPNRSDKQPETLVYSRRQTVQKGVEP-PQPQQQS

XP_022159153.1 uncharacterized protein LOC111025577 [Momordica charantia]6.6e-22477.34Show/hide
Query:  MVKTAMMTDVHKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMV
        MVKTAMMTDV KDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEAD  FSVWDAQNSMV
Subjt:  MVKTAMMTDVHKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL NKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
Subjt:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGR
        DFLAGLRPELDDVRGRLLA KPIPAIDEIFAEV WESSRKRVMMGDTHTKPLSLSLESSALAARGPPP                                
Subjt:  DFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGR

Query:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDH
                                                  S  P   AQLEQLYRLLT PVESTPSSSFVAQRGI SAALT QQHSDQWILDSGATDH
Subjt:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDH

Query:  MTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYY
        MTAFHDMFTMYSPNPIQTHVKL DGSSAIIKGFGSVILSPNITLHSVL VPKLCCNLI VQKLTHDLKCQALFTDSKCLFQDLITGTTIG+ADGFEGLYY
Subjt:  MTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYY

Query:  FRGPSLRNKQVLQIEESVPIISCNNEDDQVNPNRSDKQPETLVYSRRQTVQKGVEP
        FRGPSLRNKQVLQ   +        E    NPN   +    +V  R +T   G  P
Subjt:  FRGPSLRNKQVLQIEESVPIISCNNEDDQVNPNRSDKQPETLVYSRRQTVQKGVEP

XP_024044151.1 uncharacterized protein LOC18046468 isoform X1 [Citrus clementina]5.8e-10344.54Show/hide
Query:  ITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL
        IT+ KLNG+NFLQWS+S  + IRG+ ++GY+ G+I EP E D  F  WDA NSM+M+WL+NSME++I ++++F  TAKDLW+A+T  +SD  NSAQ+++L
Subjt:  ITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL

Query:  RNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMG
        + + R  +QG   VT+YY+ L+ LW ELD   + EWE + D+ +++K +EKER+++FLAGL  +LD+VRGR+L  +P+P+  E+F+ V  E SRK VMMG
Subjt:  RNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMG

Query:  DTHTKPLSLSLESSALAARGPPPP-------SSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQ-VGPSVSNS
         +       S E+SAL +  P  P         +S  ++ +WCD+C +  HT+D CW+LHG           +PP   N   S   S G+Q VG +   +
Subjt:  DTHTKPLSLSLESSALAARGPPPP-------SSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQ-VGPSVSNS

Query:  QDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFG
            T    F+K QLEQLYR L         SSF  +AQ+G  F+A     +  D WI+DSGATDHMT+   +F+ Y P      +K+ DGS + + G G
Subjt:  QDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFG

Query:  SVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF
        S+ +S N+ L SVLHVP L CNL+ V K+T DL C A F+ S C FQDL +G  IG+A   +GLYYF
Subjt:  SVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina]5.8e-10344.54Show/hide
Query:  ITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL
        IT+ KLNG+NFLQWS+S  + IRG+ ++GY+ G+I EP E D  F  WDA NSM+M+WL+NSME++I ++++F  TAKDLW+A+T  +SD  NSAQ+++L
Subjt:  ITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL

Query:  RNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMG
        + + R  +QG   VT+YY+ L+ LW ELD   + EWE + D+ +++K +EKER+++FLAGL  +LD+VRGR+L  +P+P+  E+F+ V  E SRK VMMG
Subjt:  RNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMG

Query:  DTHTKPLSLSLESSALAARGPPPP-------SSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQ-VGPSVSNS
         +       S E+SAL +  P  P         +S  ++ +WCD+C +  HT+D CW+LHG           +PP   N   S   S G+Q VG +   +
Subjt:  DTHTKPLSLSLESSALAARGPPPP-------SSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQ-VGPSVSNS

Query:  QDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFG
            T    F+K QLEQLYR L         SSF  +AQ+G  F+A     +  D WI+DSGATDHMT+   +F+ Y P      +K+ DGS + + G G
Subjt:  QDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSF--VAQRG-IFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFG

Query:  SVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF
        S+ +S N+ L SVLHVP L CNL+ V K+T DL C A F+ S C FQDL +G  IG+A   +GLYYF
Subjt:  SVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF

TrEMBL top hitse value%identityAlignment
A0A438F2X4 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-9941.2Show/hide
Query:  DESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDI
        +  ++G N+   S P++  +    S ++   L IT  KLNG+NF+QW++S  + I G+ +  Y+ G I +P E D  +  W  +NSMVM+WLINSM  DI
Subjt:  DESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDI

Query:  KESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDD
         E+F++Y TAK++W+A    +S+ DN++ +FE+++  + LRQG+S VT+Y++ L R W +LD+   L W+  +D   ++K +EKERIY FL GL   LD+
Subjt:  KESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDD

Query:  VRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPP-PSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYR
        VRGR+L+IKP+P++ E+F+E+  E SR++VM+G   T+  S +LE+SAL ARG     ++  T++N  WCDHC++  HTK+ CW LHG+P       D++
Subjt:  VRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPP-PSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYR

Query:  PPPPTNTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPS---SSFVAQRGIFSAAL-TSQQHSDQWILDSGATDHMTAFHDM
        P  P      R  ++  +   S ++S        PFSK QLE L ++    ++ST +   ++ VAQ+GIF  AL   Q++   WI+DSGA+DHMT    +
Subjt:  PPPPTNTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPS---SSFVAQRGIFSAAL-TSQQHSDQWILDSGATDHMTAFHDM

Query:  FTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLR
        F  Y+P      V++ DG+ + + G GSVI+S +ITLHSVL+VPKL CNL+ + +LT DL C   F    C FQ L +G  IGNA+   GLY  R   +R
Subjt:  FTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLR

A0A438IPH4 Retrovirus-related Pol polyprotein from transposon RE17.9e-10640.28Show/hide
Query:  MVKTAMMTDVHKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMV
        M KTA MT   K  SS  + T ++    + +++   S S  +S Q+T  KLNGKN+L+W++S  L I GR +LG++NG +++P   D     W ++NS+V
Subjt:  MVKTAMMTDVHKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        +AWLINSME  I +  +F  TAKD+W A+   +SD +NS+Q+F+L++K    RQG+ +VT YY+ +  LW ELDLC   EW+   D+ R +K  E +R+Y
Subjt:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGR
         FLAGL   LD+VRGR+L  KP+P+I E+F+EV  E +R++VM+ D   K  +  +ESSAL ++G     S   RR   WCDHCK+  HTK+ CW++HG+
Subjt:  DFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGR

Query:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGAT
        PQ      +++    ++  + +T S+  Q GP +++ +      P F+K QL  LY+L   P  S PS S   Q     AAL+S + +    WI+DSGAT
Subjt:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WILDSGAT

Query:  DHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGL
        DHMT    +F+ Y P      +K+ DGS + I G GSV +SP++TLH+VLHVP L CNL+ + K+T D +CQA F  S C FQ+L +G TIGNA    GL
Subjt:  DHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGL

Query:  YYFRGPSLRNKQVLQI---------------EESVPIISCNNEDDQVNPNRSDKQPETLVYSRRQTVQKGVEP-PQPQQQS
        Y+F   S   K + +I                 S+P  + N+ +       S K  E + YSRR+   K   P P P  +S
Subjt:  YYFRGPSLRNKQVLQI---------------EESVPIISCNNEDDQVNPNRSDKQPETLVYSRRQTVQKGVEP-PQPQQQS

A0A6A2WU09 60S ribosomal protein L382.6e-10441.62Show/hide
Query:  DGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDIKESF
        D S   S+S  +S S T +    +N S  ITS KLNG NFLQWS+S  L IRGR + GY++G   +P E +     W+A+NSM+M+WLINSM+  +  ++
Subjt:  DGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDIKESF

Query:  IFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGR
        +F  TA D+WNA+   +SD  N+ Q FEL+ +   L+QGE  VTQYY+ L+ LW E+D+  + EW  +KD   F+K VEKER+++FL GL  ELD+VRGR
Subjt:  IFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGR

Query:  LLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPT
        +L  +P+P+  E+F+EV  E SR+ VM+G     P     ESSAL +  PP  + R   R+   C+HC +  HTK +CW+LHG+P  +N S +       
Subjt:  LLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPT

Query:  NTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFT
           +SR +   +      S     A+ L  FSK QLEQLY+L+       TP   S  +SS +AQ+G +  A  +   S+ WI+DSGATDHMT    +F+
Subjt:  NTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFT

Query:  MYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNK
         Y P      VK+ DGS   I G GS+I+SP++TL +VLHVPKL CNLI V ++ HD KC A  T +   FQD  +G  IGNA   +GLY+    +  NK
Subjt:  MYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNK

Query:  QVLQIEESVPIISCNNE--------DDQVNPNRSDKQPET--------LVYSRRQTVQKGVEPPQPQ
        QV ++      +SC NE           +N    + Q ET         VYSRR T    V PP P+
Subjt:  QVLQIEESVPIISCNNE--------DDQVNPNRSDKQPET--------LVYSRRQTVQKGVEPPQPQ

A0A6J1DY12 uncharacterized protein LOC1110255773.2e-22477.34Show/hide
Query:  MVKTAMMTDVHKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMV
        MVKTAMMTDV KDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEAD  FSVWDAQNSMV
Subjt:  MVKTAMMTDVHKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL NKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
Subjt:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGR
        DFLAGLRPELDDVRGRLLA KPIPAIDEIFAEV WESSRKRVMMGDTHTKPLSLSLESSALAARGPPP                                
Subjt:  DFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGR

Query:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDH
                                                  S  P   AQLEQLYRLLT PVESTPSSSFVAQRGI SAALT QQHSDQWILDSGATDH
Subjt:  PQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDH

Query:  MTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYY
        MTAFHDMFTMYSPNPIQTHVKL DGSSAIIKGFGSVILSPNITLHSVL VPKLCCNLI VQKLTHDLKCQALFTDSKCLFQDLITGTTIG+ADGFEGLYY
Subjt:  MTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYY

Query:  FRGPSLRNKQVLQIEESVPIISCNNEDDQVNPNRSDKQPETLVYSRRQTVQKGVEP
        FRGPSLRNKQVLQ   +        E    NPN   +    +V  R +T   G  P
Subjt:  FRGPSLRNKQVLQIEESVPIISCNNEDDQVNPNRSDKQPETLVYSRRQTVQKGVEP

A0A7J0DNJ6 Uncharacterized protein4.1e-10244.65Show/hide
Query:  SNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDIKESFIF
        S+  + S PK   T    S+      QITS KL+G+N+LQWSRS  L+IRG  R GY++G+I +P   D  F +WD QNSMVMAWL+NSM++ I E ++ 
Subjt:  SNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDIKESFIF

Query:  YSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLL
        Y TAK +W+A+T+A+SD ++S+Q+F+LR+++R+LRQ E  VTQY+SSL +LW ELDL     W  + + E +RK + KER Y+FL GL P LDDVRGR+L
Subjt:  YSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLL

Query:  AIKPIPAIDEIFAEVCWESSRKRVMMGD-THTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPTN
        +IKP+P++D IF+EV  E  R+R+M+G        S++ ++SA+AAR    P SR  R+  LWCDHC R +HTK+ CW+LHG+P       D+       
Subjt:  AIKPIPAIDEIFAEVCWESSRKRVMMGD-THTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPTN

Query:  TPSSRTSSSGYQVGPSVSNSQDLATSLP-PFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ-WILDSGATDHMTAFHDMFTMYSPNP
         P S+  S G +  P V+   D A+S    F++AQL+Q+ +L +       SS+ +   GI S+ +     S + WI+DSGA++HM++   +F+ YS   
Subjt:  TPSSRTSSSGYQVGPSVSNSQDLATSLP-PFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ-WILDSGATDHMTAFHDMFTMYSPNP

Query:  IQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF
            V L DGS + + G G+V LSPN+ LH+VL VPKL  NL+ + KLT D  C A  + + C+FQD  +G TIG A    GLYYF
Subjt:  IQTHVKLVDGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.3e-1420.13Show/hide
Query:  KLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEP---------DEADLFFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSA
        KL   N+L WSR    +  G    G+++G+   P            +  ++ W  Q+ ++ + ++ ++   ++ +    +TA  +W  L   +++  +  
Subjt:  KLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEP---------DEADLFFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSA

Query:  QLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRK
         + +LR + +   +G   +  Y   L   + +L L L    ++ +  ER  +++ +E         +P +D +  +       P + EI   +    S+ 
Subjt:  QLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRK

Query:  RVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDL
          +   T      + + ++A++ R     ++ +    N   D+    N++K                       P    S+    +  Q  P +   Q  
Subjt:  RVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDL

Query:  ATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSP
           +   S  +  QL   L+      P S F   +   + AL S   S+ W+LDSGAT H+T+  +  +++ P      V + DGS+  I   GS  LS 
Subjt:  ATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSP

Query:  N---ITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY
            + LH++L+VP +  NLI V +L +       F  +    +DL TG  +      + LY
Subjt:  N---ITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.9e-0920.35Show/hide
Query:  KLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEP---------DEADLFFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSA
        KL   N+L WSR    +  G    G+++G+   P            +  ++ W  Q+ ++ + ++ ++   ++ +    +TA  +W  L   +++  +  
Subjt:  KLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEP---------DEADLFFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSA

Query:  QLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRK
         + +LR   R            +  L  L   +D                      E++   L  L  +   V  ++ A    P++ EI   +    S+ 
Subjt:  QLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLAIKPIPAIDEIFAEVCWESSRK

Query:  RVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDL
                     L+L S+ +     P  ++  T RN         TN  ++   +        N S  ++P       SS + S   Q  P +   Q  
Subjt:  RVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQVGPSVSNSQDL

Query:  ATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVIL--
          S+   S  +  QL++  +   +   +S F   +   + A+ S  +++ W+LDSGAT H+T+  +  + + P      V + DGS+  I   GS  L  
Subjt:  ATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVIL--

Query:  -SPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY
         S ++ L+ VL+VP +  NLI V +L +  +    F  +    +DL TG  +      + LY
Subjt:  -SPNITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.7e-2131.75Show/hide
Query:  NFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQ
        N++ W       +R   + G+I+GT+ +PD     +  W+  N+MVM WL+NSM + + ES ++  TA  +W  L   F    +  ++++LR +  +LRQ
Subjt:  NFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQ

Query:  GESDVTQYYSSLRRLWAELD--------LCLNLEWENSKDAERFRKHVEKERIYDFLAGLR--PELDDVRGRLLAIKPIPAIDEIFAEV
        G   V +Y+  L ++W EL          C     E +K AE  R   EKE+ Y+FL GL+     + V  +++  KP P++ E FA V
Subjt:  GESDVTQYYSSLRRLWAELD--------LCLNLEWENSKDAERFRKHVEKERIYDFLAGLR--PELDDVRGRLLAIKPIPAIDEIFAEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTAAAACAGCCATGATGACTGATGTGCACAAGGATGAGAGTTCCGACGGATCGAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACCTCACATCTC
TATCTCTGAAAACACTTCGCTCCAAATTACCTCTCCAAAACTCAACGGAAAAAATTTTCTGCAATGGTCTCGATCGGCCCTCCTAGTTATTCGTGGCCGCAGTCGACTCG
GGTACATCAATGGCACGATAGCAGAACCAGATGAAGCTGACCTTTTTTTTTCCGTGTGGGATGCACAAAACTCAATGGTCATGGCATGGCTCATTAACTCGATGGAGGAG
GACATTAAAGAATCCTTCATCTTCTACTCAACAGCAAAGGATCTTTGGAATGCGCTCACTATGGCTTTTTCTGATTTTGATAACTCGGCTCAATTGTTTGAATTACGCAA
TAAGGCACGTTCCTTACGACAAGGTGAATCCGATGTCACCCAATACTACAGCTCATTACGTAGGTTGTGGGCTGAACTTGATTTATGTCTCAATCTCGAATGGGAGAACT
CGAAGGATGCTGAACGCTTCCGCAAACACGTCGAGAAGGAACGAATTTATGATTTTCTTGCAGGTCTTCGTCCGGAATTAGATGATGTGCGTGGCCGCTTACTTGCCATA
AAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTTGCTGGGAGTCAAGCCGTAAACGTGTGATGATGGGTGATACACACACAAAACCTCTGTCCCTCTCACTGGA
ATCATCAGCTCTGGCGGCACGAGGTCCACCACCACCTTCATCCCGATCTACTCGTCGGAACAACCTATGGTGTGATCATTGTAAGCGCACAAACCATACAAAAGATCAGT
GTTGGGAACTCCATGGTCGTCCTCAACGGAAGAATTTATCAGGGGATTATCGGCCACCTCCACCTACAAATACTCCATCTTCTCGAACCAGCTCCTCCGGCTATCAAGTG
GGCCCTAGTGTGTCCAACTCCCAAGATTTGGCCACCTCTCTCCCTCCATTTTCGAAGGCACAACTTGAACAGCTCTATCGCCTCTTAACACCGCCGGTTGAGTCTACTCC
TTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGTGCAGCTTTAACAAGTCAGCAGCATTCCGATCAGTGGATCTTAGATTCGGGTGCAACTGATCATATGACCGCTT
TTCATGATATGTTTACCATGTACTCACCCAACCCGATTCAGACACATGTCAAGCTTGTAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCA
AACATTACATTGCACTCAGTGCTCCATGTGCCTAAATTATGTTGCAATCTAATTTATGTTCAGAAGTTGACTCATGATTTAAAGTGTCAAGCCCTGTTCACTGACTCTAA
GTGTTTGTTTCAGGACTTGATAACGGGAACGACGATTGGCAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCCAGA
TAGAAGAGTCAGTTCCTATTATTTCCTGTAATAATGAAGATGATCAAGTCAACCCAAATCGAAGTGACAAGCAACCTGAGACTCTTGTTTATTCTCGGCGACAAACGGTT
CAAAAAGGAGTGGAGCCACCACAGCCTCAACAGCAAAGTCATGAATCCATCTCGTCCTTAGGAAGGGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTAAAACAGCCATGATGACTGATGTGCACAAGGATGAGAGTTCCGACGGATCGAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACCTCACATCTC
TATCTCTGAAAACACTTCGCTCCAAATTACCTCTCCAAAACTCAACGGAAAAAATTTTCTGCAATGGTCTCGATCGGCCCTCCTAGTTATTCGTGGCCGCAGTCGACTCG
GGTACATCAATGGCACGATAGCAGAACCAGATGAAGCTGACCTTTTTTTTTCCGTGTGGGATGCACAAAACTCAATGGTCATGGCATGGCTCATTAACTCGATGGAGGAG
GACATTAAAGAATCCTTCATCTTCTACTCAACAGCAAAGGATCTTTGGAATGCGCTCACTATGGCTTTTTCTGATTTTGATAACTCGGCTCAATTGTTTGAATTACGCAA
TAAGGCACGTTCCTTACGACAAGGTGAATCCGATGTCACCCAATACTACAGCTCATTACGTAGGTTGTGGGCTGAACTTGATTTATGTCTCAATCTCGAATGGGAGAACT
CGAAGGATGCTGAACGCTTCCGCAAACACGTCGAGAAGGAACGAATTTATGATTTTCTTGCAGGTCTTCGTCCGGAATTAGATGATGTGCGTGGCCGCTTACTTGCCATA
AAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTTGCTGGGAGTCAAGCCGTAAACGTGTGATGATGGGTGATACACACACAAAACCTCTGTCCCTCTCACTGGA
ATCATCAGCTCTGGCGGCACGAGGTCCACCACCACCTTCATCCCGATCTACTCGTCGGAACAACCTATGGTGTGATCATTGTAAGCGCACAAACCATACAAAAGATCAGT
GTTGGGAACTCCATGGTCGTCCTCAACGGAAGAATTTATCAGGGGATTATCGGCCACCTCCACCTACAAATACTCCATCTTCTCGAACCAGCTCCTCCGGCTATCAAGTG
GGCCCTAGTGTGTCCAACTCCCAAGATTTGGCCACCTCTCTCCCTCCATTTTCGAAGGCACAACTTGAACAGCTCTATCGCCTCTTAACACCGCCGGTTGAGTCTACTCC
TTCGTCAAGTTTTGTGGCACAACGAGGTATTTTTAGTGCAGCTTTAACAAGTCAGCAGCATTCCGATCAGTGGATCTTAGATTCGGGTGCAACTGATCATATGACCGCTT
TTCATGATATGTTTACCATGTACTCACCCAACCCGATTCAGACACATGTCAAGCTTGTAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCA
AACATTACATTGCACTCAGTGCTCCATGTGCCTAAATTATGTTGCAATCTAATTTATGTTCAGAAGTTGACTCATGATTTAAAGTGTCAAGCCCTGTTCACTGACTCTAA
GTGTTTGTTTCAGGACTTGATAACGGGAACGACGATTGGCAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCCAGA
TAGAAGAGTCAGTTCCTATTATTTCCTGTAATAATGAAGATGATCAAGTCAACCCAAATCGAAGTGACAAGCAACCTGAGACTCTTGTTTATTCTCGGCGACAAACGGTT
CAAAAAGGAGTGGAGCCACCACAGCCTCAACAGCAAAGTCATGAATCCATCTCGTCCTTAGGAAGGGTGTGA
Protein sequenceShow/hide protein sequence
MVKTAMMTDVHKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITSPKLNGKNFLQWSRSALLVIRGRSRLGYINGTIAEPDEADLFFSVWDAQNSMVMAWLINSMEE
DIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLAI
KPIPAIDEIFAEVCWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHCKRTNHTKDQCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQV
GPSVSNSQDLATSLPPFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWILDSGATDHMTAFHDMFTMYSPNPIQTHVKLVDGSSAIIKGFGSVILSP
NITLHSVLHVPKLCCNLIYVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQIEESVPIISCNNEDDQVNPNRSDKQPETLVYSRRQTV
QKGVEPPQPQQQSHESISSLGRV