CuGenDBv2

Moc11g28340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g28340
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr11:20655410..20658160
RNA-Seq Expression: Moc11g28340
Synteny: Moc11g28340
Gene Ontology terms:
GO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
IPR005162 - Retrotransposon gag domain
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology
GenBank top hits (e value, % identity, alignment)
KAE8658400.1 60S ribosomal protein L38 [Hibiscus syriacus]; e value: 5.2e-99; % identity: 41.47; alignment:
Query:  DGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESF
        D S   S+S  +S S T +    +N S  IT+ KLNG NFLQWSQS  L IRGR + GY++G   +P E + S   W+A+NSM+M+WLINSM+  +  ++
Subjt:  DGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESF

Query:  IFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGR
        +F  TA D+WNA+   +SD  N+ Q FEL+ +   L+QGE  VTQYY+ L+ LW E+D+  + EW  +KD   F+K VEKER+++FL GL  ELD+VRGR
Subjt:  IFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGR

Query:  LLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDH-----------WDYRPPPLTNTPSSRTSSSG--
        +L  +P+P+  E+F+EVR E SR+ VM+G     P     ESSAL +  PP  + R   R+   C+H           W     P  N  ++R S +   
Subjt:  LLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDH-----------WDYRPPPLTNTPSSRTSSSG--

Query:  -YQVGPSVSNSQDLATSLPSFSKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTH
         +      S     A+ L +FSK QLEQLY+L+       TP   S  +SS +AQ+G +  A  +   S+ WII+SGATDHMT    +F+ Y P      
Subjt:  -YQVGPSVSNSQDLATSLPSFSKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTH

Query:  VKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQIEESVP
        VK+ADGS   I G GS+I+SP++TL +VLHVPKL CNLISV ++ HD KC A  T +   FQD  +G  IGNA   +GLY+    +  NKQV ++     
Subjt:  VKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQIEESVP

Query:  IISCNNE--------DDQFNPNRSDKQPET--------LVYSRRQTVQRGVEPPQPQ
         +SC NE            N    + Q ET         VYSRR T    V PP P+
Subjt:  IISCNNE--------DDQFNPNRSDKQPET--------LVYSRRQTVQRGVEPPQPQ

RVW98618.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]; e value: 2.2e-102; % identity: 39.11; alignment:
Query:  MVKTAMMTNVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
        M KTA MT   K  SS  + T ++    + +++   S S  +S Q+T  KLNGKN+L+W+QS  L I GR +LG++NG +++P   DP+   W ++NS+V
Subjt:  MVKTAMMTNVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        +AWLINSME  I +  +F  TAKD+W A+   +SD +NS+Q+F+L++K    RQG+ +VT YY+ +  LW ELDLC   EW+   D+ R +K  E +R+Y
Subjt:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDH-----------WDYRPP
         FLAGL   LD+VRGR+L  KP+P+I E+F+EVR E +R++VM+ D   K  +  +ESSAL ++G     S   RR   WCDH           W     
Subjt:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDH-----------WDYRPP

Query:  PLTNTPSSRTSSSGYQVGPSVSNSQDLATSLPSFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WIINSGATDHMTAFHDMFTMY
        P      + +    +Q   + S    + +  P+F+K QL  LY+L   P  S PS S   Q     AAL+S + +    WII+SGATDHMT    +F+ Y
Subjt:  PLTNTPSSRTSSSGYQVGPSVSNSQDLATSLPSFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WIINSGATDHMTAFHDMFTMY

Query:  SPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQV
         P      +K+ADGS + I G GSV +SP++TLH+VLHVP L CNL+S+ K+T D +CQA F  S C FQ+L +G TIGNA    GLY+F   S   K +
Subjt:  SPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQV

Query:  LQI---------------EESVPIISCNNEDDQFNPNRSDKQPETLVYSRRQTVQRGVEP-PQPQQQSHESISSLGTE-QSTLVPQDNTNDLDLPIALRK
         +I                 S+P  + N+ +       S K  E + YSRR+   +   P P P    HE  S L  E  S+  P +N  D   P+    
Subjt:  LQI---------------EESVPIISCNNEDDQFNPNRSDKQPETLVYSRRQTVQRGVEP-PQPQQQSHESISSLGTE-QSTLVPQDNTNDLDLPIALRK

Query:  DLGSQS
        +  S+S
Subjt:  DLGSQS

XP_022159153.1 uncharacterized protein LOC111025577 [Momordica charantia]; e value: 5.9e-228; % identity: 80.26; alignment:
Query:  MVKTAMMTNVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
        MVKTAMMT+VRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQIT+PKLNGKNFLQWS+S LLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
Subjt:  MVKTAMMTNVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL NKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
Subjt:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHWDYRPPPLTNTPSSRTS
        DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEV WESSRKRVMMGDTHTKPLSLSLESSALAARGPPP                     P+         
Subjt:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHWDYRPPPLTNTPSSRTS

Query:  SSGYQVGPSVSNSQDLATSLPSFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLAD
                                 AQLEQLYRLLT PVESTPSSSFVAQRGI SAALT QQHSDQWI++SGATDHMTAFHDMFTMYSPNPIQTHVKLAD
Subjt:  SSGYQVGPSVSNSQDLATSLPSFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLAD

Query:  GSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQIEESVPIISCN
        GSSAIIKGFGSVILSPNITLHSVL VPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIG+ADGFEGLYYFRGPSLRNKQVLQ   +       
Subjt:  GSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQIEESVPIISCN

Query:  NEDDQFNPNRSDKQPETLVYSRRQT--VQRGVEPPQP
         E    NPN   +    +V  R +T  + RG  P  P
Subjt:  NEDDQFNPNRSDKQPETLVYSRRQT--VQRGVEPPQP

XP_024044151.1 uncharacterized protein LOC18046468 isoform X1 [Citrus clementina]; e value: 4.7e-100; % identity: 45.18; alignment:
Query:  ITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL
        IT  KLNG+NFLQWSQS  + IRG+ ++GY+ G+I EP E DP F  WDA NSM+M+WL+NSME++I ++++F  TAKDLW+A+T  +SD  NSAQ+++L
Subjt:  ITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL

Query:  RNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMG
        + + R  +QG   VT+YY+ L+ LW ELD   + EWE + D+ +++K +EKER+++FLAGL  +LD+VRGR+L  +P+P+  E+F+ VR E SRK VMMG
Subjt:  RNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMG

Query:  DTHTKPLSLSLESSALAARGPPPP-------SSRSTRRNNLWCDH-----------WDY--RPPPLTNTPSSRTSSSGYQ-VGPSVSNSQDLATSLPSFS
         +       S E+SAL +  P  P         +S  ++ +WCD+           W    +PP L N   S   S G+Q VG +   +    T    F+
Subjt:  DTHTKPLSLSLESSALAARGPPPP-------SSRSTRRNNLWCDH-----------WDY--RPPPLTNTPSSRTSSSGYQ-VGPSVSNSQDLATSLPSFS

Query:  KAQLEQLYRLLTPPVESTPSSSF--VAQRG-IFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLH
        K QLEQLYR L         SSF  +AQ+G  F+A     +  D WII+SGATDHMT+   +F+ Y P      +K+ADGS + + G GS+ +S N+ L 
Subjt:  KAQLEQLYRLLTPPVESTPSSSF--VAQRG-IFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLH

Query:  SVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF
        SVLHVP L CNL+SV K+T DL C A F+ S C FQDL +G  IG+A   +GLYYF
Subjt:  SVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina]; e value: 4.7e-100; % identity: 45.18; alignment:
Query:  ITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL
        IT  KLNG+NFLQWSQS  + IRG+ ++GY+ G+I EP E DP F  WDA NSM+M+WL+NSME++I ++++F  TAKDLW+A+T  +SD  NSAQ+++L
Subjt:  ITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL

Query:  RNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMG
        + + R  +QG   VT+YY+ L+ LW ELD   + EWE + D+ +++K +EKER+++FLAGL  +LD+VRGR+L  +P+P+  E+F+ VR E SRK VMMG
Subjt:  RNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMG

Query:  DTHTKPLSLSLESSALAARGPPPP-------SSRSTRRNNLWCDH-----------WDY--RPPPLTNTPSSRTSSSGYQ-VGPSVSNSQDLATSLPSFS
         +       S E+SAL +  P  P         +S  ++ +WCD+           W    +PP L N   S   S G+Q VG +   +    T    F+
Subjt:  DTHTKPLSLSLESSALAARGPPPP-------SSRSTRRNNLWCDH-----------WDY--RPPPLTNTPSSRTSSSGYQ-VGPSVSNSQDLATSLPSFS

Query:  KAQLEQLYRLLTPPVESTPSSSF--VAQRG-IFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLH
        K QLEQLYR L         SSF  +AQ+G  F+A     +  D WII+SGATDHMT+   +F+ Y P      +K+ADGS + + G GS+ +S N+ L 
Subjt:  KAQLEQLYRLLTPPVESTPSSSF--VAQRG-IFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLH

Query:  SVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF
        SVLHVP L CNL+SV K+T DL C A F+ S C FQDL +G  IG+A   +GLYYF
Subjt:  SVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF

TrEMBL top hits (e value, % identity, alignment)
A0A438IPH4 Retrovirus-related Pol polyprotein from transposon RE1; e value: 1.1e-102; % identity: 39.11; alignment:
Query:  MVKTAMMTNVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
        M KTA MT   K  SS  + T ++    + +++   S S  +S Q+T  KLNGKN+L+W+QS  L I GR +LG++NG +++P   DP+   W ++NS+V
Subjt:  MVKTAMMTNVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        +AWLINSME  I +  +F  TAKD+W A+   +SD +NS+Q+F+L++K    RQG+ +VT YY+ +  LW ELDLC   EW+   D+ R +K  E +R+Y
Subjt:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDH-----------WDYRPP
         FLAGL   LD+VRGR+L  KP+P+I E+F+EVR E +R++VM+ D   K  +  +ESSAL ++G     S   RR   WCDH           W     
Subjt:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDH-----------WDYRPP

Query:  PLTNTPSSRTSSSGYQVGPSVSNSQDLATSLPSFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WIINSGATDHMTAFHDMFTMY
        P      + +    +Q   + S    + +  P+F+K QL  LY+L   P  S PS S   Q     AAL+S + +    WII+SGATDHMT    +F+ Y
Subjt:  PLTNTPSSRTSSSGYQVGPSVSNSQDLATSLPSFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ--WIINSGATDHMTAFHDMFTMY

Query:  SPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQV
         P      +K+ADGS + I G GSV +SP++TLH+VLHVP L CNL+S+ K+T D +CQA F  S C FQ+L +G TIGNA    GLY+F   S   K +
Subjt:  SPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQV

Query:  LQI---------------EESVPIISCNNEDDQFNPNRSDKQPETLVYSRRQTVQRGVEP-PQPQQQSHESISSLGTE-QSTLVPQDNTNDLDLPIALRK
         +I                 S+P  + N+ +       S K  E + YSRR+   +   P P P    HE  S L  E  S+  P +N  D   P+    
Subjt:  LQI---------------EESVPIISCNNEDDQFNPNRSDKQPETLVYSRRQTVQRGVEP-PQPQQQSHESISSLGTE-QSTLVPQDNTNDLDLPIALRK

Query:  DLGSQS
        +  S+S
Subjt:  DLGSQS

A0A6A2WU09 60S ribosomal protein L38; e value: 2.5e-99; % identity: 41.47; alignment:
Query:  DGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESF
        D S   S+S  +S S T +    +N S  IT+ KLNG NFLQWSQS  L IRGR + GY++G   +P E + S   W+A+NSM+M+WLINSM+  +  ++
Subjt:  DGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESF

Query:  IFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGR
        +F  TA D+WNA+   +SD  N+ Q FEL+ +   L+QGE  VTQYY+ L+ LW E+D+  + EW  +KD   F+K VEKER+++FL GL  ELD+VRGR
Subjt:  IFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGR

Query:  LLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDH-----------WDYRPPPLTNTPSSRTSSSG--
        +L  +P+P+  E+F+EVR E SR+ VM+G     P     ESSAL +  PP  + R   R+   C+H           W     P  N  ++R S +   
Subjt:  LLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDH-----------WDYRPPPLTNTPSSRTSSSG--

Query:  -YQVGPSVSNSQDLATSLPSFSKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTH
         +      S     A+ L +FSK QLEQLY+L+       TP   S  +SS +AQ+G +  A  +   S+ WII+SGATDHMT    +F+ Y P      
Subjt:  -YQVGPSVSNSQDLATSLPSFSKAQLEQLYRLL-------TPPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTH

Query:  VKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQIEESVP
        VK+ADGS   I G GS+I+SP++TL +VLHVPKL CNLISV ++ HD KC A  T +   FQD  +G  IGNA   +GLY+    +  NKQV ++     
Subjt:  VKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQIEESVP

Query:  IISCNNE--------DDQFNPNRSDKQPET--------LVYSRRQTVQRGVEPPQPQ
         +SC NE            N    + Q ET         VYSRR T    V PP P+
Subjt:  IISCNNE--------DDQFNPNRSDKQPET--------LVYSRRQTVQRGVEPPQPQ

A0A6A2ZFN0 Gag-Pol-p199; e value: 1.7e-92; % identity: 39.39; alignment:
Query:  WSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESD
        WSQS  L IRG  + GY++GT  +  E D     W+A+NSM+M+WLINSM+  +  +++F  TA D+WNA+   +SD  N+ Q FEL+ +   L+QGE  
Subjt:  WSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESD

Query:  VTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLES
        VTQYY+ L+ LW E+D+  + EW  +KD   F+K VEKE +++FL GL  ELD+VRGR+L  +P+P+  E+F+EVR E SR+ +M+G     P     ES
Subjt:  VTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLES

Query:  SALAARGPPPPSSRSTRRNNLWCDH-----------WDYRPPPLTNTPSSRTSSSG---YQVGPSVSNSQDLATSLPSFSKAQLEQLYRLL-------TP
        SAL +  P   + R   R+   C+H           W     P  N  ++R S +    Y      S     AT L +FSK QLEQLY+L+       TP
Subjt:  SALAARGPPPPSSRSTRRNNLWCDH-----------WDYRPPPLTNTPSSRTSSSG---YQVGPSVSNSQDLATSLPSFSKAQLEQLYRLL-------TP

Query:  PVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQ
           S  +SS +AQ+G +  A  +   S+ WII+SGATDHMT    +F+ Y P      VK+ADGS   I G GS+I+SP++TL +VLHVPKL CNLISV 
Subjt:  PVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQ

Query:  KLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQIEESVPIISCNNE--------DDQFNPNRSDKQPET--------LVYS
        ++ HD KC A  T +   FQD  +G  IGNA   +GLY+    +  +KQV Q       + C NE            N    + Q ET         VYS
Subjt:  KLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQIEESVPIISCNNE--------DDQFNPNRSDKQPET--------LVYS

Query:  RRQTVQRGVEPPQPQQQSHESISSLGTEQSTLVPQDN---TNDLDLPIALRKDLGS
        RR T    V P     Q  +S    GT   + +   N   + +   PIALRK + S
Subjt:  RRQTVQRGVEPPQPQQQSHESISSLGTEQSTLVPQDN---TNDLDLPIALRKDLGS

A0A6J1DY12 uncharacterized protein LOC111025577; e value: 2.8e-228; % identity: 80.26; alignment:
Query:  MVKTAMMTNVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
        MVKTAMMT+VRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQIT+PKLNGKNFLQWS+S LLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
Subjt:  MVKTAMMTNVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL NKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
Subjt:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHWDYRPPPLTNTPSSRTS
        DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEV WESSRKRVMMGDTHTKPLSLSLESSALAARGPPP                     P+         
Subjt:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHWDYRPPPLTNTPSSRTS

Query:  SSGYQVGPSVSNSQDLATSLPSFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLAD
                                 AQLEQLYRLLT PVESTPSSSFVAQRGI SAALT QQHSDQWI++SGATDHMTAFHDMFTMYSPNPIQTHVKLAD
Subjt:  SSGYQVGPSVSNSQDLATSLPSFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLAD

Query:  GSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQIEESVPIISCN
        GSSAIIKGFGSVILSPNITLHSVL VPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIG+ADGFEGLYYFRGPSLRNKQVLQ   +       
Subjt:  GSSAIIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQIEESVPIISCN

Query:  NEDDQFNPNRSDKQPETLVYSRRQT--VQRGVEPPQP
         E    NPN   +    +V  R +T  + RG  P  P
Subjt:  NEDDQFNPNRSDKQPETLVYSRRQT--VQRGVEPPQP

A0A7J0DNJ6 Uncharacterized protein; e value: 6.2e-98; % identity: 39.8; alignment:
Query:  SNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIF
        S+  + S PK   T    S+      QIT+ KL+G+N+LQWS+S  L+IRG  R GY++G+I +P   DPSF +WD QNSMVMAWL+NSM++ I E ++ 
Subjt:  SNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIF

Query:  YSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLL
        Y TAK +W+A+T+A+SD ++S+Q+F+LR+++R+LRQ E  VTQY+SSL +LW ELDL     W  + + E +RK + KER Y+FL GL P LDDVRGR+L
Subjt:  YSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLL

Query:  ATKPIPAIDEIFAEVRWESSRKRVMMGD-THTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDH-----------WDYRPPPLTNTPSSRTSSSGYQV
        + KP+P++D IF+EVR E  R+R+M+G        S++ ++SA+AAR    P SR  R+  LWCDH           W     P+   P S+  S G + 
Subjt:  ATKPIPAIDEIFAEVRWESSRKRVMMGD-THTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDH-----------WDYRPPPLTNTPSSRTSSSGYQV

Query:  GPSVSNSQDLATSLP-SFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ-WIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSA
         P V+   D A+S   +F++AQL+Q+ +L +       SS+ +   GI S+ +     S + WII+SGA++HM++   +F+ YS       V LADGS +
Subjt:  GPSVSNSQDLATSLP-SFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQ-WIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSA

Query:  IIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF-------------RG--PSLRNKQVLQ
         + G G+V LSPN+ LH+VL VPKL  NL+S+ KLT D  C A  + + C+FQD  +G TIG A    GLYYF             RG   S R+ Q+  
Subjt:  IIKGFGSVILSPNITLHSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYF-------------RG--PSLRNKQVLQ

Query:  IEESV--PIISCNNEDDQFNPNRS-------------------------DKQPETLVYSRRQTVQRGVEPPQPQQQSHESISSLGTEQSTLVPQDNTNDL
        +   +  P    ++    F   RS                           +P  L Y RR+ V    +P  P    H S SS G++ + L PQ     L
Subjt:  IEESV--PIISCNNEDDQFNPNRS-------------------------DKQPETLVYSRRQTVQRGVEPPQPQQQSHESISSLGTEQSTLVPQDNTNDL

Query:  DLPIALRK
        D+PIALRK
Subjt:  DLPIALRK

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 1.5e-16; % identity: 21.04; alignment:
Query:  KLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEP---------DEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSA
        KL   N+L WS+    +  G    G+++G+   P            +P ++ W  Q+ ++ + ++ ++   ++ +    +TA  +W  L   +++  +  
Subjt:  KLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEP---------DEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSA

Query:  QLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRK
         + +LR + +   +G   +  Y   L   + +L L L    ++ +  ER  +++ +E         +P +D    ++ A    P + EI   +    S+ 
Subjt:  QLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRK

Query:  RVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHWDYRPPPLTNTPSSRTSSSGY----QVGPSVSNSQDLATSLPSFSKAQLEQLYRLLT
          +   T      + + ++A++ R     ++ +    N   + +D R     + P  ++S++ +    Q  P +   Q     +   S  +  QL   L+
Subjt:  RVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHWDYRPPPLTNTPSSRTSSSGY----QVGPSVSNSQDLATSLPSFSKAQLEQLYRLLT

Query:  PPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPN---ITLHSVLHVPKLCCNL
              P S F   +   + AL S   S+ W+++SGAT H+T+  +  +++ P      V +ADGS+  I   GS  LS     + LH++L+VP +  NL
Subjt:  PPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPN---ITLHSVLHVPKLCCNL

Query:  ISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY
        ISV +L +       F  +    +DL TG  +      + LY
Subjt:  ISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 3.2e-11; % identity: 20; alignment:
Query:  KLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEP---------DEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSA
        KL   N+L WS+    +  G    G+++G+   P            +P ++ W  Q+ ++ + ++ ++   ++ +    +TA  +W  L   +++  +  
Subjt:  KLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEP---------DEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSA

Query:  QLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRK
         + +LR   R            +  L  L   +D                      E++   L  L  +   V  ++ A    P++ EI   +    S K
Subjt:  QLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRK

Query:  RVMMGDTHTKPLSLSLESSALA-------ARGPPPPSSRSTRRNNLWCDHWDYRPPPLTNTPSSRTSSSGYQVGPSVSNSQDLATSLPSFSKAQLEQLYR
         + +      P++ ++ +            RG     + +  R+N W               SS + S   Q  P +   Q    S+   S  +  QL++
Subjt:  RVMMGDTHTKPLSLSLESSALA-------ARGPPPPSSRSTRRNNLWCDHWDYRPPPLTNTPSSRTSSSGYQVGPSVSNSQDLATSLPSFSKAQLEQLYR

Query:  LLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVIL---SPNITLHSVLHVPKLC
          +   +   +S F   +   + A+ S  +++ W+++SGAT H+T+  +  + + P      V +ADGS+  I   GS  L   S ++ L+ VL+VP + 
Subjt:  LLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVIL---SPNITLHSVLHVPKLC

Query:  CNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY
         NLISV +L +  +    F  +    +DL TG  +      + LY
Subjt:  CNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLY

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink); e value: 1.1e-22; % identity: 32.11; alignment:
Query:  NFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQ
        N++ W       +R   + G+I+GT+ +PD   P +  W+  N+MVM WL+NSM + + ES ++  TA  +W  L   F    +  ++++LR +  +LRQ
Subjt:  NFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQ

Query:  GESDVTQYYSSLRRLWAELD--------LCLNLEWENSKDAERFRKHVEKERIYDFLAGLR--PELDDVRGRLLATKPIPAIDEIFAEVR
        G   V +Y+  L ++W EL          C     E +K AE  R   EKE+ Y+FL GL+     + V  +++  KP P++ E FA V+
Subjt:  GESDVTQYYSSLRRLWAELD--------LCLNLEWENSKDAERFRKHVEKERIYDFLAGLR--PELDDVRGRLLATKPIPAIDEIFAEVR


Sequences
CDS sequence
ATGGTTAAAACAGCCATGATGACTAATGTGCGCAAGGATGAGAGTTCCGACGGATCGAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACCTCAC
ATCTCTATCTCTGAAAACACTTCGCTCCAAATTACCGCTCCAAAACTCAACGGAAAAAATTTTCTGCAATGGTCTCAATCGGACCTCCTAGTTATTCGTGGCCGC
AGTCGACTCGGGTACATCAATGGCACGATAGCAGAACCAGATGAAGCTGACCCTTCTTTTTCCGTGTGGGATGCACAAAACTCAATGGTCATGGCATGGCTCATT
AACTCGATGGAGGAGGACATTAAAGAATCCTTCATCTTCTACTCAACAGCAAAGGATCTTTGGAATGCGCTCACTATGGCTTTTTCTGATTTTGATAACTCGGCT
CAATTGTTTGAATTACGCAATAAGGCACGTTCCTTACGACAAGGTGAATCCGATGTCACCCAATACTATAGCTCATTACGTAGGTTGTGGGCTGAACTTGATTTA
TGTCTCAATCTCGAATGGGAGAACTCGAAGGATGCTGAACGCTTCCGCAAACACGTCGAGAAGGAACGAATTTATGATTTTCTTGCAGGTCTTCGTCCGGAATTA
GATGATGTGCGTGGCCGCTTACTTGCCACAAAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTCGCTGGGAGTCAAGCCGTAAACGTGTGATGATGGGT
GATACACACACAAAACCTCTGTCCCTCTCACTGGAATCATCAGCTCTGGCGGCACGAGGTCCACCACCACCTTCATCCCGATCTACTCGTCGGAACAACCTATGG
TGTGATCATTGGGATTATCGGCCACCTCCACTTACAAATACTCCATCTTCTCGAACCAGCTCCTCCGGCTATCAAGTGGGCCCTAGTGTGTCAAACTCCCAAGAT
TTGGCCACCTCTCTCCCTTCATTTTCGAAGGCACAACTTGAACAGCTCTATCGCCTCTTAACACCGCCGGTTGAGTCTACTCCTTCGTCAAGTTTTGTGGCACAA
CGAGGTATTTTTAGTGCAGCTTTAACAAGTCAGCAGCATTCCGATCAGTGGATCATAAATTCGGGTGCAACTGATCATATGACCGCTTTTCATGATATGTTTACC
ATGTACTCACCCAACCCGATTCAGACACATGTCAAGCTTGCAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCAAACATTACATTG
CACTCAGTGCTCCATGTGCCTAAATTATGTTGCAATCTAATTTCTGTTCAGAAGTTGACTCATGATTTAAAGTGTCAAGCCTTGTTCACTGACTCTAAGTGTTTG
TTTCAGGACTTGATAACGGGAACGACGATTGGCAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCCAGATA
GAAGAGTCAGTTCCTATTATTTCCTGTAATAATGAAGATGATCAATTCAACCCAAATCGAAGTGACAAGCAACCTGAGACTCTTGTTTATTCTCGGCGACAAACG
GTTCAAAGAGGAGTGGAGCCACCACAGCCTCAACAGCAAAGTCATGAATCCATCTCGTCCTTAGGTACTGAACAATCTACCCTTGTGCCTCAAGACAATACTAAT
GATCTTGATCTTCCTATTGCACTTAGGAAGGACCTTGGCTCTCAATCTGGAAGCCAATGA
mRNA sequence
ATGGTTAAAACAGCCATGATGACTAATGTGCGCAAGGATGAGAGTTCCGACGGATCGAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACCTCAC
ATCTCTATCTCTGAAAACACTTCGCTCCAAATTACCGCTCCAAAACTCAACGGAAAAAATTTTCTGCAATGGTCTCAATCGGACCTCCTAGTTATTCGTGGCCGC
AGTCGACTCGGGTACATCAATGGCACGATAGCAGAACCAGATGAAGCTGACCCTTCTTTTTCCGTGTGGGATGCACAAAACTCAATGGTCATGGCATGGCTCATT
AACTCGATGGAGGAGGACATTAAAGAATCCTTCATCTTCTACTCAACAGCAAAGGATCTTTGGAATGCGCTCACTATGGCTTTTTCTGATTTTGATAACTCGGCT
CAATTGTTTGAATTACGCAATAAGGCACGTTCCTTACGACAAGGTGAATCCGATGTCACCCAATACTATAGCTCATTACGTAGGTTGTGGGCTGAACTTGATTTA
TGTCTCAATCTCGAATGGGAGAACTCGAAGGATGCTGAACGCTTCCGCAAACACGTCGAGAAGGAACGAATTTATGATTTTCTTGCAGGTCTTCGTCCGGAATTA
GATGATGTGCGTGGCCGCTTACTTGCCACAAAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTCGCTGGGAGTCAAGCCGTAAACGTGTGATGATGGGT
GATACACACACAAAACCTCTGTCCCTCTCACTGGAATCATCAGCTCTGGCGGCACGAGGTCCACCACCACCTTCATCCCGATCTACTCGTCGGAACAACCTATGG
TGTGATCATTGGGATTATCGGCCACCTCCACTTACAAATACTCCATCTTCTCGAACCAGCTCCTCCGGCTATCAAGTGGGCCCTAGTGTGTCAAACTCCCAAGAT
TTGGCCACCTCTCTCCCTTCATTTTCGAAGGCACAACTTGAACAGCTCTATCGCCTCTTAACACCGCCGGTTGAGTCTACTCCTTCGTCAAGTTTTGTGGCACAA
CGAGGTATTTTTAGTGCAGCTTTAACAAGTCAGCAGCATTCCGATCAGTGGATCATAAATTCGGGTGCAACTGATCATATGACCGCTTTTCATGATATGTTTACC
ATGTACTCACCCAACCCGATTCAGACACATGTCAAGCTTGCAGATGGGTCATCGGCCATTATTAAGGGCTTTGGTTCTGTTATTCTTAGCCCAAACATTACATTG
CACTCAGTGCTCCATGTGCCTAAATTATGTTGCAATCTAATTTCTGTTCAGAAGTTGACTCATGATTTAAAGTGTCAAGCCTTGTTCACTGACTCTAAGTGTTTG
TTTCAGGACTTGATAACGGGAACGACGATTGGCAATGCTGATGGCTTTGAAGGGCTCTACTACTTCAGAGGACCAAGTCTAAGAAATAAACAAGTTCTCCAGATA
GAAGAGTCAGTTCCTATTATTTCCTGTAATAATGAAGATGATCAATTCAACCCAAATCGAAGTGACAAGCAACCTGAGACTCTTGTTTATTCTCGGCGACAAACG
GTTCAAAGAGGAGTGGAGCCACCACAGCCTCAACAGCAAAGTCATGAATCCATCTCGTCCTTAGGTACTGAACAATCTACCCTTGTGCCTCAAGACAATACTAAT
GATCTTGATCTTCCTATTGCACTTAGGAAGGACCTTGGCTCTCAATCTGGAAGCCAATGA
Protein sequence
MVKTAMMTNVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITAPKLNGKNFLQWSQSDLLVIRGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLI
NSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPEL
DDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLSLESSALAARGPPPPSSRSTRRNNLWCDHWDYRPPPLTNTPSSRTSSSGYQVGPSVSNSQD
LATSLPSFSKAQLEQLYRLLTPPVESTPSSSFVAQRGIFSAALTSQQHSDQWIINSGATDHMTAFHDMFTMYSPNPIQTHVKLADGSSAIIKGFGSVILSPNITL
HSVLHVPKLCCNLISVQKLTHDLKCQALFTDSKCLFQDLITGTTIGNADGFEGLYYFRGPSLRNKQVLQIEESVPIISCNNEDDQFNPNRSDKQPETLVYSRRQT
VQRGVEPPQPQQQSHESISSLGTEQSTLVPQDNTNDLDLPIALRKDLGSQSGSQ