CuGenDBv2

Moc02g07030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g07030
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr2:4973578..4975209
RNA-Seq Expression: Moc02g07030
Synteny: Moc02g07030
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: NA
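
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split and the genomic span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers introduced only for this illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string such as 'chr2:4973578..4975209' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    name = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return name, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


name, start, end = parse_location("chr2:4973578..4975209")
print(name, start, end, span_length(start, end))  # chr2 4973578 4975209 1632
```

Under that assumption the span is 1,632 bp, which can be compared against the length of the CDS listed in the Sequences section.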


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 3.0e-117 | % identity: 47.86
Query:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNI--TVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMS
        +NT   S  N   Q+   GNKIS VKL DD FLLW+ Q+LTAL+ + LE+F++ E   PSK +  T     S++ T N  Y  WK+QD+LI+SWLLGSMS
Subjt:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNI--TVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMS

Query:  EEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDP
        EE+L+QML C+++KEIW  L  +FSSR LA+ M+ K+KL N+KKGS+ LKEYF K  Q VDAL + +KP+S  DHI+++LAGLG+++ S +SVISA  D 
Subjt:  EEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDP

Query:  PTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNNNR------GRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQ
        P++QE  SLLL QE +NE   I +E +LPSVN+ TQ  ++KG  +     + N++NN       GRG  RSNRGR  N N + QCQ+C + G++A RC+ 
Subjt:  PTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNNNR------GRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQ

Query:  RFDRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSS
        R+       NSS  S   H T SY + +N  HPQM+A   + +LN D+NWYPDSGA +H+T+ L NLSIG+E  G N++   N SGL ++H GS    SS
Subjt:  RFDRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSS

Query:  SAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSA
        + P   F LNNLL VP ITKNLISVS+FAK N+V+FEFH + C+VKDL TGQ+LLQG ++DGLY F+++        PS    H S+S+T   + +  + 
Subjt:  SAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSA

Query:  SPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHP
         P ++ P                  +LD+WH RLGHP
Subjt:  SPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHP

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e value: 5.2e-85 | % identity: 36.35
Query:  VINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMSEEVLSQMLECETSKEI
        VI+P +++ T++L DDNFL+W+ Q+  A++G+GLE F+     +P K +T   +       N ++ ++++QD L+ SWLL S+    L Q++ C ++ E+
Subjt:  VINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMSEEVLSQMLECETSKEI

Query:  WSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDPPTLQETYSLLLAQEGR
        W+ ++  F+S++ A+VM  KS+++ LKK  L++++Y  K K   D L  A   IS TDHI+ ++ GLG E++S ++VIS+    P+LQ   S L+A EGR
Subjt:  WSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDPPTLQETYSLLLAQEGR

Query:  NERNAITTEVSLPSVNLTTQEQSK--------KGRPTNSTD----------TRGNWNNNRGRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQRFDR
              + ++   SVN T+Q  ++         G P++             TRG++ +NRGRG  R+  G       + QCQLC +FGHT  RC+ R+D 
Subjt:  NERNAITTEVSLPSVNLTTQEQSK--------KGRPTNSTD----------TRGNWNNNRGRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQRFDR

Query:  SF-------------------QGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDS
        +F                    G + S SS G      Y +  N D+ +M A   + E  ++  W+PDSGA +HVT+DLGNL+ GAE +GN+++ +GN +
Subjt:  SF-------------------QGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDS

Query:  GLNVSHIGSTFLKSSSAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHG
        GL +SHIG +   SSS+P+ +  L N+L VP I KNL+SVS+FA+ NNVYFEFH   CFVKD     +LLQG +  GLY F+L K     +   ++ +  
Subjt:  GLNVSHIGSTFLKSSSAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHG

Query:  SSSSTITPQVLHTSASPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHPA
        +  +     ++H   S    FP  T+S+            V D+WH RLGHPA
Subjt:  SSSSTITPQVLHTSASPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHPA

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 3.0e-117 | % identity: 47.86
Query:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNI--TVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMS
        +NT   S  N   Q+   GNKIS VKL DD FLLW+ Q+LTAL+ + LE+F++ E   PSK +  T     S++ T N  Y  WK+QD+LI+SWLLGSMS
Subjt:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNI--TVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMS

Query:  EEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDP
        EE+L+QML C+++KEIW  L  +FSSR LA+ M+ K+KL N+KKGS+ LKEYF K  Q VDAL + +KP+S  DHI+++LAGLG+++ S +SVISA  D 
Subjt:  EEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDP

Query:  PTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNNNR------GRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQ
        P++QE  SLLL QE +NE   I +E +LPSVN+ TQ  ++KG  +     + N++NN       GRG  RSNRGR  N N + QCQ+C + G++A RC+ 
Subjt:  PTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNNNR------GRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQ

Query:  RFDRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSS
        R+       NSS  S   H T SY + +N  HPQM+A   + +LN D+NWYPDSGA +H+T+ L NLSIG+E  G N++   N SGL ++H GS    SS
Subjt:  RFDRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSS

Query:  SAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSA
        + P   F LNNLL VP ITKNLISVS+FAK N+V+FEFH + C+VKDL TGQ+LLQG ++DGLY F+++        PS    H S+S+T   + +  + 
Subjt:  SAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSA

Query:  SPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHP
         P ++ P                  +LD+WH RLGHP
Subjt:  SPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHP

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] | e value: 4.1e-98 | % identity: 52.1
Query:  SSKNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNI-TVEGEGSSSS-TQNQEYLNWKQQDKLITSWLLGS
        SS   S+ +   Q  + INPG+K+S V+L DDN LLW+ Q+ TALQG+GLE +ID     P++ + T E E SSSS  QN  Y  W +QDKLI++WLLGS
Subjt:  SSKNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNI-TVEGEGSSSS-TQNQEYLNWKQQDKLITSWLLGS

Query:  MSEEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACV
        M+E++LSQML+C++++EIW+VL  +F+SR LARVM+LK KLEN KKG+LSLK+YF K K +VD+L  A K +S  DHIMH+LAGLG EFD+ +SVI+A  
Subjt:  MSEEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACV

Query:  DPPTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNN--NRGRG-GNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQR
         P TLQE  SLLL QEGRNERN I ++ SLPSVNLT  + SKK     S     + +N   RGRG  NRS+  RNW  N + QCQ+CGRFGHTA RCY R
Subjt:  DPPTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNN--NRGRG-GNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQR

Query:  FDRSFQGPNSSASSF---GFHP-----TPSYGSSSNP------------DHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLV
        F+R+F GPN + ++F   GF       TPS+ + S+P               QM A  ++Q+ NRD+NWY DSG  +HVTN+ GN S+G+E HG+ ++ V
Subjt:  FDRSFQGPNSSASSF---GFHP-----TPSYGSSSNP------------DHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLV

Query:  GNDSG
        GN +G
Subjt:  GNDSG

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia] | e value: 2.2e-91 | % identity: 44.93
Query:  RLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSST---QNQEYLNWKQQDKLITSWLLGSMSEEVLSQMLECETSKEIWSVLNSLFSSRNLARVME
        + QVLTA+QGHGLE +ID +   PS+ I   G+G +SST    N EY +W +QDKLI+ WLLGSMSEE+LSQML+C   KEIW++L   F+SRNLARVM+
Subjt:  RLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSST---QNQEYLNWKQQDKLITSWLLGSMSEEVLSQMLECETSKEIWSVLNSLFSSRNLARVME

Query:  LKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDPPTLQETYSLLLAQEGRNERNAITTEVSLPSVNLT
        LKSKLEN+KKGS++LK YF K K +VD+L  A K +   DHIMH+LA LG EFDS VSVIS    P ++QE  S           N         S    
Subjt:  LKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDPPTLQETYSLLLAQEGRNERNAITTEVSLPSVNLT

Query:  TQEQSKKGRPTNSTDTRGNWNNNRGRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQRFDRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTL
         Q QS  G  ++ST  + N+                            G FG                                GS+     PQM A  +
Subjt:  TQEQSKKGRPTNSTDTRGNWNNNRGRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQRFDRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTL

Query:  SQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSSSAPSN---IFLLNNLLHVPQITKNLISVSKFAKYNNVYFE
        + + NRD  WYPDSGA +HVTND GN S+G++ HGN ++ VGN + L++SHIGS  L+S SA ++   +F L NLLHVPQI KNLIS+S FAK N+V+FE
Subjt:  SQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSSSAPSN---IFLLNNLLHVPQITKNLISVSKFAKYNNVYFE

Query:  FHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSASPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHP
        FH SN FVKDL TGQ+L QG V D LY F L KA S    P ++ S  ++S TI   +L  S SP    P              C  SVLDIWH R GH 
Subjt:  FHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSASPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHP

Query:  AFL
         FL
Subjt:  AFL

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.5e-85 | % identity: 36.35
Query:  VINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMSEEVLSQMLECETSKEI
        VI+P +++ T++L DDNFL+W+ Q+  A++G+GLE F+     +P K +T   +       N ++ ++++QD L+ SWLL S+    L Q++ C ++ E+
Subjt:  VINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMSEEVLSQMLECETSKEI

Query:  WSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDPPTLQETYSLLLAQEGR
        W+ ++  F+S++ A+VM  KS+++ LKK  L++++Y  K K   D L  A   IS TDHI+ ++ GLG E++S ++VIS+    P+LQ   S L+A EGR
Subjt:  WSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDPPTLQETYSLLLAQEGR

Query:  NERNAITTEVSLPSVNLTTQEQSK--------KGRPTNSTD----------TRGNWNNNRGRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQRFDR
              + ++   SVN T+Q  ++         G P++             TRG++ +NRGRG  R+  G       + QCQLC +FGHT  RC+ R+D 
Subjt:  NERNAITTEVSLPSVNLTTQEQSK--------KGRPTNSTD----------TRGNWNNNRGRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQRFDR

Query:  SF-------------------QGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDS
        +F                    G + S SS G      Y +  N D+ +M A   + E  ++  W+PDSGA +HVT+DLGNL+ GAE +GN+++ +GN +
Subjt:  SF-------------------QGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDS

Query:  GLNVSHIGSTFLKSSSAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHG
        GL +SHIG +   SSS+P+ +  L N+L VP I KNL+SVS+FA+ NNVYFEFH   CFVKD     +LLQG +  GLY F+L K     +   ++ +  
Subjt:  GLNVSHIGSTFLKSSSAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHG

Query:  SSSSTITPQVLHTSASPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHPA
        +  +     ++H   S    FP  T+S+            V D+WH RLGHPA
Subjt:  SSSSTITPQVLHTSASPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHPA

A0A438H844 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.3e-81 | % identity: 36.84
Query:  ESSKNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSM
        E+S+ TS + ++  H         +S+ KL + NFL+WR Q+LT L+GH L+ F+     +PS+ ++ + E  + +  N ++ +W+QQD+LI SWLL S+
Subjt:  ESSKNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSM

Query:  SEEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVD
        ++ +L++M+ C+TS ++W  L   F+++  A+V + K++L N KKG LS+ +Y  K + +VD L      IS  DHI  +  GL  ++++ +  +++ +D
Subjt:  SEEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVD

Query:  PPTLQETYSLLLAQEGRNERNAITTEVSLPSV--------------NLTTQEQSKKGRPTNSTDT-----RGNWNNNRGRGGNRSNRGRNWNTNFRIQCQ
        P T++E   LLLAQE R E+N    ++S PS+              N     ++   RP   +       RGN+   +GRG  R  RG +W  N + QCQ
Subjt:  PPTLQETYSLLLAQEGRNERNAITTEVSLPSV--------------NLTTQEQSKKGRPTNSTDT-----RGNWNNNRGRGGNRSNRGRNWNTNFRIQCQ

Query:  LCGRFGHTASRCYQRFDRSFQGPNSSASSFGFHPTPSYGS-----SSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLV
        LCGR GH   +CY RFD+SF GP+      G  P  +        S N      +    + E+ +D NWYPDSGA HH+T +L NL   ++   ++ V V
Subjt:  LCGRFGHTASRCYQRFDRSFQGPNSSASSFGFHPTPSYGS-----SSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLV

Query:  GNDSGLNVSHIGSTFLKSSSAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTI
        GN  GL + HIG T   SS  PS    L  LLHVP+ITKNL+SVSKFA  N+V+FEFH ++CFVKDL T  +L+ GQ+  GLY F   + K        +
Subjt:  GNDSGLNVSHIGSTFLKSSSAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTI

Query:  PSHGSS--SSTITPQVLHTSASPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHPA----FLLSNKC
        P H SS  +ST  P    T  + +TS            PFT        +WH RLGHP+     L+ NKC
Subjt:  PSHGSS--SSTITPQVLHTSASPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHPA----FLLSNKC

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.5e-117 | % identity: 47.86
Query:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNI--TVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMS
        +NT   S  N   Q+   GNKIS VKL DD FLLW+ Q+LTAL+ + LE+F++ E   PSK +  T     S++ T N  Y  WK+QD+LI+SWLLGSMS
Subjt:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNI--TVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMS

Query:  EEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDP
        EE+L+QML C+++KEIW  L  +FSSR LA+ M+ K+KL N+KKGS+ LKEYF K  Q VDAL + +KP+S  DHI+++LAGLG+++ S +SVISA  D 
Subjt:  EEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDP

Query:  PTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNNNR------GRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQ
        P++QE  SLLL QE +NE   I +E +LPSVN+ TQ  ++KG  +     + N++NN       GRG  RSNRGR  N N + QCQ+C + G++A RC+ 
Subjt:  PTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNNNR------GRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQ

Query:  RFDRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSS
        R+       NSS  S   H T SY + +N  HPQM+A   + +LN D+NWYPDSGA +H+T+ L NLSIG+E  G N++   N SGL ++H GS    SS
Subjt:  RFDRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSS

Query:  SAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSA
        + P   F LNNLL VP ITKNLISVS+FAK N+V+FEFH + C+VKDL TGQ+LLQG ++DGLY F+++        PS    H S+S+T   + +  + 
Subjt:  SAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSA

Query:  SPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHP
         P ++ P                  +LD+WH RLGHP
Subjt:  SPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHP

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.5e-117 | % identity: 47.86
Query:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNI--TVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMS
        +NT   S  N   Q+   GNKIS VKL DD FLLW+ Q+LTAL+ + LE+F++ E   PSK +  T     S++ T N  Y  WK+QD+LI+SWLLGSMS
Subjt:  KNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNI--TVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMS

Query:  EEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDP
        EE+L+QML C+++KEIW  L  +FSSR LA+ M+ K+KL N+KKGS+ LKEYF K  Q VDAL + +KP+S  DHI+++LAGLG+++ S +SVISA  D 
Subjt:  EEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDP

Query:  PTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNNNR------GRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQ
        P++QE  SLLL QE +NE   I +E +LPSVN+ TQ  ++KG  +     + N++NN       GRG  RSNRGR  N N + QCQ+C + G++A RC+ 
Subjt:  PTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNNNR------GRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQ

Query:  RFDRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSS
        R+       NSS  S   H T SY + +N  HPQM+A   + +LN D+NWYPDSGA +H+T+ L NLSIG+E  G N++   N SGL ++H GS    SS
Subjt:  RFDRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSS

Query:  SAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSA
        + P   F LNNLL VP ITKNLISVS+FAK N+V+FEFH + C+VKDL TGQ+LLQG ++DGLY F+++        PS    H S+S+T   + +  + 
Subjt:  SAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSA

Query:  SPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHP
         P ++ P                  +LD+WH RLGHP
Subjt:  SPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHP

A0A6J1DLT9 uncharacterized protein LOC111021757 | e value: 2.0e-98 | % identity: 52.1
Query:  SSKNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNI-TVEGEGSSSS-TQNQEYLNWKQQDKLITSWLLGS
        SS   S+ +   Q  + INPG+K+S V+L DDN LLW+ Q+ TALQG+GLE +ID     P++ + T E E SSSS  QN  Y  W +QDKLI++WLLGS
Subjt:  SSKNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNI-TVEGEGSSSS-TQNQEYLNWKQQDKLITSWLLGS

Query:  MSEEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACV
        M+E++LSQML+C++++EIW+VL  +F+SR LARVM+LK KLEN KKG+LSLK+YF K K +VD+L  A K +S  DHIMH+LAGLG EFD+ +SVI+A  
Subjt:  MSEEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACV

Query:  DPPTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNN--NRGRG-GNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQR
         P TLQE  SLLL QEGRNERN I ++ SLPSVNLT  + SKK     S     + +N   RGRG  NRS+  RNW  N + QCQ+CGRFGHTA RCY R
Subjt:  DPPTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNN--NRGRG-GNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQR

Query:  FDRSFQGPNSSASSF---GFHP-----TPSYGSSSNP------------DHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLV
        F+R+F GPN + ++F   GF       TPS+ + S+P               QM A  ++Q+ NRD+NWY DSG  +HVTN+ GN S+G+E HG+ ++ V
Subjt:  FDRSFQGPNSSASSF---GFHP-----TPSYGSSSNP------------DHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLV

Query:  GNDSG
        GN +G
Subjt:  GNDSG

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 5.8e-47 | % identity: 30.78
Query:  NTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMSEEV
        +  E+ + N  I  +N  N     KLT  N+L+W  QV     G+ L  F+D    +P   I  +    ++   N +Y  WK+QDKLI S +LG++S  V
Subjt:  NTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMSEEV

Query:  LSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDPPTL
           +    T+ +IW  L  ++++ +   V +L+++L+   KG+ ++ +Y        D L    KP+   + +  +L  L  E+   +  I+A   PPTL
Subjt:  LSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDPPTL

Query:  QETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNNNRGRGGNRSNRGRNW---NTNFRI----------QCQLCGRFGHTAS
         E +  LL     N  + I    S   + +T    S +   T + +  GN  NNR    N +N  + W   +TNF            +CQ+CG  GH+A 
Subjt:  QETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNNNRGRGGNRSNRGRNW---NTNFRI----------QCQLCGRFGHTAS

Query:  RCYQRFDRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTF
        RC Q         NS        P+P       P  P+ N    S       NW  DSGA HH+T+D  NLS+     G + V+V + S + +SH GST 
Subjt:  RCYQRFDRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTF

Query:  LKSSSAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSF--PSTIPSHGSSSSTI---
        L + S P N   L+N+L+VP I KNLISV +    N V  EF  ++  VKDL TG  LLQG+  D LY + +  ++  + F  PS+  +H S  + +   
Subjt:  LKSSSAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSF--PSTIPSHGSSSSTI---

Query:  TPQVLHTSAS
         P +L++  S
Subjt:  TPQVLHTSAS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 4.0e-40 | % identity: 27.54
Query:  NTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMSEEV
        +  EI + N +I  +N  N     KLT  N+L+W  QV     G+ L  F+D   P+P   I  +    +    N +Y  W++QDKLI S +LG++S  V
Subjt:  NTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMSEEV

Query:  LSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDPPTL
           +    T+ +IW  L  ++++ +   V +L+               +  +     D L    KP+   + +  +L  L  ++   +  I+A   PP+L
Subjt:  LSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDPPTL

Query:  QETYSLLLAQEGRNERNAITTEVSLP-SVNLTTQEQSKKGRPTNSTDTRGNWNNNRGRGGN---RSNRGRNWNTN---FRIQCQLCGRFGHTASRCYQRF
         E +  L+ +E  ++  A+ +   +P + N+ T   +   R  N+     N+NNN  R  +    S+  R+ N     +  +CQ+C   GH+A RC Q  
Subjt:  QETYSLLLAQEGRNERNAITTEVSLP-SVNLTTQEQSKKGRPTNSTDTRGNWNNNRGRGGN---RSNRGRNWNTN---FRIQCQLCGRFGHTASRCYQRF

Query:  DRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSSSA
           FQ   +   S          S   P  P+ N   ++   N + NW  DSGA HH+T+D  NLS      G + V++ + S + ++H GS  L +SS 
Subjt:  DRSFQGPNSSASSFGFHPTPSYGSSSNPDHPQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSSSA

Query:  PSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSASP
          +   LN +L+VP I KNLISV +    N V  EF  ++  VKDL TG  LLQG+  D LY + +  +++ + F                      ASP
Subjt:  PSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSNCFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSASP

Query:  ATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHPAFLLSN
                           C  +    WH RLGHP+  + N
Subjt:  ATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHPAFLLSN

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e value: 1.6e-07 | % identity: 23.53
Query:  ISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMSEEVLSQMLECETSKEIWSVLNSL
        I  +   +DN++ W+++  + L+      FID   P P     +             Y  W+Q + ++  WL+ SM++++L  ++  ET+ ++W  L  +
Subjt:  ISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMSEEVLSQMLECETSKEIWSVLNSL

Query:  FSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQI
        F      ++ +L+ +L  L++G  S++EYF K  ++
Subjt:  FSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQI

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 7.6e-10 | % identity: 26.15
Query:  NQEYLNWKQQDKLITSWLLGSMS-EEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHI
        N   +NW+++D ++   L G+++ ++     +   TS++IW  + + F +   AR + L S+L     G + + +Y+ K K++ D+L     P++  + +
Subjt:  NQEYLNWKQQDKLITSWLLGSMS-EEVLSQMLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHI

Query:  MHLLAGLGTEFDSTVSVISACVDPPTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRP-TNSTDTRGNWNNNRGRG-GNRSNRGR
        M++L GL  +FD+ ++VI      P+  +  ++L  +E R +R        +   + +T     +  P TN   + GN    RGRG GN   RGR
Subjt:  MHLLAGLGTEFDSTVSVISACVDPPTLQETYSLLLAQEGRNERNAITTEVSLPSVNLTTQEQSKKGRP-TNSTDTRGNWNNNRGRG-GNRSNRGR


Sequences
CDS sequence
ATGACATTAGAATCGTCGAAAAACACCTCTGAAATATCGATCACAAATCAGCATATTCAGGTTATTAATCCTGGTAACAAGATCTCTACAGTCAAATTGACTGATGATAA
TTTCTTGTTGTGGCGATTGCAAGTTCTGACTGCTCTCCAAGGCCATGGACTGGAGGACTTCATCGATCCTGAAGGTCCAATTCCATCGAAAAATATCACTGTTGAAGGAG
AAGGTTCATCTTCATCAACTCAGAATCAGGAATATCTAAATTGGAAGCAACAAGATAAATTGATTACATCGTGGCTTCTTGGATCTATGTCTGAAGAAGTATTATCTCAA
ATGCTTGAGTGTGAAACCTCAAAAGAGATTTGGTCTGTTCTGAATAGTCTCTTTTCATCGAGAAACCTAGCTCGTGTTATGGAGTTAAAATCGAAGCTTGAAAACCTAAA
GAAAGGGAGCCTCAGTCTAAAGGAGTATTTTGCAAAGGCAAAGCAAATTGTTGATGCCCTAACTGCTGCTAGTAAACCAATTTCGAAGACTGATCATATAATGCATCTAT
TAGCCGGTCTAGGAACCGAATTCGATTCAACTGTGTCGGTAATTTCTGCGTGTGTTGATCCTCCAACACTTCAAGAAACGTATTCTTTACTACTTGCTCAAGAAGGAAGG
AACGAGAGGAATGCTATCACTACTGAGGTATCACTACCATCAGTGAATTTAACAACTCAAGAACAATCGAAGAAGGGACGTCCAACTAATTCTACAGATACTAGAGGAAA
TTGGAACAATAACAGAGGAAGAGGAGGCAATCGATCAAACCGTGGGCGAAATTGGAATACCAATTTCAGAATTCAGTGTCAACTTTGCGGTCGATTTGGCCATACTGCCT
CGAGGTGTTATCAACGCTTTGATCGGAGTTTTCAGGGCCCTAATTCGTCGGCTTCTTCGTTCGGATTTCATCCGACCCCTTCGTATGGTTCATCATCAAATCCTGATCAC
CCTCAGATGAATGCTTTTACTCTTTCTCAGGAGCTCAATCGAGACACTAACTGGTATCCAGATTCTGGTGCTTTACATCACGTCACAAATGATCTTGGAAATTTGTCTAT
TGGAGCTGAAATTCATGGCAATAACAGAGTTCTTGTAGGCAACGACTCAGGTTTGAACGTTTCACATATTGGATCTACCTTTCTTAAATCTTCTTCTGCCCCTTCTAATA
TCTTTCTGCTAAATAATCTGCTCCATGTTCCTCAAATTACCAAAAATCTCATTAGTGTTAGTAAATTTGCAAAATATAATAATGTTTACTTTGAGTTTCATTCCTCTAAT
TGTTTTGTGAAGGACCTCCAAACTGGCCAAATTCTACTCCAGGGCCAAGTTTCTGATGGGCTGTACACATTCTCTTTGGACAAGGCTAAGTCTTCTACATCATTCCCTTC
CACCATTCCTTCTCATGGCTCGTCTTCTTCAACCATCACTCCTCAGGTTCTTCATACATCAGCTTCTCCTGCTACTTCTTTTCCATGTGCTACTGATTCTGCTAAACAAT
TTACACCTTTCACACAATGTAAGCCTTCTGTTTTAGATATATGGCACTGGCGTCTTGGCCATCCAGCATTCCTACTGTCAAACAAGTGCTAA
mRNA sequence
ATGACATTAGAATCGTCGAAAAACACCTCTGAAATATCGATCACAAATCAGCATATTCAGGTTATTAATCCTGGTAACAAGATCTCTACAGTCAAATTGACTGATGATAA
TTTCTTGTTGTGGCGATTGCAAGTTCTGACTGCTCTCCAAGGCCATGGACTGGAGGACTTCATCGATCCTGAAGGTCCAATTCCATCGAAAAATATCACTGTTGAAGGAG
AAGGTTCATCTTCATCAACTCAGAATCAGGAATATCTAAATTGGAAGCAACAAGATAAATTGATTACATCGTGGCTTCTTGGATCTATGTCTGAAGAAGTATTATCTCAA
ATGCTTGAGTGTGAAACCTCAAAAGAGATTTGGTCTGTTCTGAATAGTCTCTTTTCATCGAGAAACCTAGCTCGTGTTATGGAGTTAAAATCGAAGCTTGAAAACCTAAA
GAAAGGGAGCCTCAGTCTAAAGGAGTATTTTGCAAAGGCAAAGCAAATTGTTGATGCCCTAACTGCTGCTAGTAAACCAATTTCGAAGACTGATCATATAATGCATCTAT
TAGCCGGTCTAGGAACCGAATTCGATTCAACTGTGTCGGTAATTTCTGCGTGTGTTGATCCTCCAACACTTCAAGAAACGTATTCTTTACTACTTGCTCAAGAAGGAAGG
AACGAGAGGAATGCTATCACTACTGAGGTATCACTACCATCAGTGAATTTAACAACTCAAGAACAATCGAAGAAGGGACGTCCAACTAATTCTACAGATACTAGAGGAAA
TTGGAACAATAACAGAGGAAGAGGAGGCAATCGATCAAACCGTGGGCGAAATTGGAATACCAATTTCAGAATTCAGTGTCAACTTTGCGGTCGATTTGGCCATACTGCCT
CGAGGTGTTATCAACGCTTTGATCGGAGTTTTCAGGGCCCTAATTCGTCGGCTTCTTCGTTCGGATTTCATCCGACCCCTTCGTATGGTTCATCATCAAATCCTGATCAC
CCTCAGATGAATGCTTTTACTCTTTCTCAGGAGCTCAATCGAGACACTAACTGGTATCCAGATTCTGGTGCTTTACATCACGTCACAAATGATCTTGGAAATTTGTCTAT
TGGAGCTGAAATTCATGGCAATAACAGAGTTCTTGTAGGCAACGACTCAGGTTTGAACGTTTCACATATTGGATCTACCTTTCTTAAATCTTCTTCTGCCCCTTCTAATA
TCTTTCTGCTAAATAATCTGCTCCATGTTCCTCAAATTACCAAAAATCTCATTAGTGTTAGTAAATTTGCAAAATATAATAATGTTTACTTTGAGTTTCATTCCTCTAAT
TGTTTTGTGAAGGACCTCCAAACTGGCCAAATTCTACTCCAGGGCCAAGTTTCTGATGGGCTGTACACATTCTCTTTGGACAAGGCTAAGTCTTCTACATCATTCCCTTC
CACCATTCCTTCTCATGGCTCGTCTTCTTCAACCATCACTCCTCAGGTTCTTCATACATCAGCTTCTCCTGCTACTTCTTTTCCATGTGCTACTGATTCTGCTAAACAAT
TTACACCTTTCACACAATGTAAGCCTTCTGTTTTAGATATATGGCACTGGCGTCTTGGCCATCCAGCATTCCTACTGTCAAACAAGTGCTAA
Protein sequence
MTLESSKNTSEISITNQHIQVINPGNKISTVKLTDDNFLLWRLQVLTALQGHGLEDFIDPEGPIPSKNITVEGEGSSSSTQNQEYLNWKQQDKLITSWLLGSMSEEVLSQ
MLECETSKEIWSVLNSLFSSRNLARVMELKSKLENLKKGSLSLKEYFAKAKQIVDALTAASKPISKTDHIMHLLAGLGTEFDSTVSVISACVDPPTLQETYSLLLAQEGR
NERNAITTEVSLPSVNLTTQEQSKKGRPTNSTDTRGNWNNNRGRGGNRSNRGRNWNTNFRIQCQLCGRFGHTASRCYQRFDRSFQGPNSSASSFGFHPTPSYGSSSNPDH
PQMNAFTLSQELNRDTNWYPDSGALHHVTNDLGNLSIGAEIHGNNRVLVGNDSGLNVSHIGSTFLKSSSAPSNIFLLNNLLHVPQITKNLISVSKFAKYNNVYFEFHSSN
CFVKDLQTGQILLQGQVSDGLYTFSLDKAKSSTSFPSTIPSHGSSSSTITPQVLHTSASPATSFPCATDSAKQFTPFTQCKPSVLDIWHWRLGHPAFLLSNKC
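
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and an in-frame sequence made up only of A, C, G and T. The names `CODON_TABLE` and `translate` are hypothetical and are not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    # Join wrapped lines and normalise case before splitting into codons.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    # A KeyError here means the sequence contains a non-ACGT symbol.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of the CDS shown above.
print(translate("ATGACATTAGAATCGTCGAAAAACACCTCT"))  # MTLESSKNTS
print(translate("AACAAGTGCTAA"))                    # NKC*
```

Both fragments agree with the start (`MTLESSKNTS`) and end (`NKC`) of the listed protein sequence. Passing the full CDS, with its line breaks, to `translate` would allow the same comparison over the whole length.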