; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g08050 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g08050
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr4:5818790..5821167
RNA-Seq ExpressionMoc04g08050
SyntenyMoc04g08050
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.9e-16349.48Show/hide
Query:  MEFKKASTTAAYMSNLHMGHIIYGCSHEFQLREHQTLPVVANKVMMLQETVIENE---FLVNHNPLF------------DLNPDIVSVMMTDTSTMDEKM
        M   KA++ ++ +S+ ++G +    SH  +  + Q    V  K  +  E +IE+     ++  NPLF            + + ++VSVMM D  T +  M
Subjt:  MEFKKASTTAAYMSNLHMGHIIYGCSHEFQLREHQTLPVVANKVMMLQETVIENE---FLVNHNPLF------------DLNPDIVSVMMTDTSTMDEKM

Query:  AEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQ-SQYSTSIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTK
        AEME+K++ LMK VEERD+EIA+LK+Q++  + +ESSQT  +K  DKGK + +ENQ  Q S S+ASLSVQQLQDMI N+IRAQYGG  Q+S +YSKPYTK
Subjt:  AEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQ-SQYSTSIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTK

Query:  RINDLRMPAGYQPPKFQHFDGKGNSNNILL-----------TSSKLVRMLLEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRL
        RI++LRMP GYQP KFQ FDGKGN    ++              +LVR  +     N F  T+R VSMMELTN+ QRKGEPV++YINRWRALSLDCKD+L
Subjt:  RINDLRMPAGYQPPKFQHFDGKGNSNNILL-----------TSSKLVRMLLEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRL

Query:  TELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLEKESKSVEKTS-------NEFFGVNTISPKLFSKIK-GKRIEKQ
        TELS VE+CTQGMHWELLYILQGIKP TFEELATRAHDM+LSIANRG KD LV     +   +  T        NE   V     K FSK K  K     
Subjt:  TELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLEKESKSVEKTS-------NEFFGVNTISPKLFSKIK-GKRIEKQ

Query:  KGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNHA
         G+ K   +L+ERQ+KVYPFPDSD+  MLEQL+E +LI LPEC RP+++ KVDDP YCKYHRVISH VEKCFVLKE I  LARE KIELD +E+AQ+NH 
Subjt:  KGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNHA

Query:  APISVRSQRKTSRNAIKDVVKSVDQ-GEENWATVTRQKKQKQSFAWNKSSMLSKEHEHQGKKDSRKPKSERKENKDSFRPKISATLKEFFPKKFFYIKKK
        A +++ S    S   + D  KS+ Q G      V  Q+K   S + NK                 +P  +  E    F P+   +  E  P++   +   
Subjt:  APISVRSQRKTSRNAIKDVVKSVDQ-GEENWATVTRQKKQKQSFAWNKSSMLSKEHEHQGKKDSRKPKSERKENKDSFRPKISATLKEFFPKKFFYIKKK

Query:  KENL---ASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKT
                 ++Y + EE+DN     QR SVFD IKP TTR SVFQR+SMAT KEENQC      Q S F+RL++S SKK+  ST  FD  ++  DQ ++ 
Subjt:  KENL---ASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKT

Query:  MKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLIISTNPKNGGCDESCNENES
        MKT+KAK F E NDD KIHS VPSRMKRK S+ INTEGS  VK + II TNP N G ++  +EN+S
Subjt:  MKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLIISTNPKNGGCDESCNENES

KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.7e-16347.65Show/hide
Query:  MEFKKASTTAAYMSNLHMGHIIYGCSHEFQLREHQTLPVVANKVMMLQETVIENE---FLVNHNPLF------------DLNPDIVSVMMTDTSTMDEKM
        M   KA++ ++  S+ ++G +    SH  +  + Q    V  K+ +  E +IE+     ++  NPLF            + + ++VSVMM D  T +  M
Subjt:  MEFKKASTTAAYMSNLHMGHIIYGCSHEFQLREHQTLPVVANKVMMLQETVIENE---FLVNHNPLF------------DLNPDIVSVMMTDTSTMDEKM

Query:  AEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQ-SQYSTSIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTK
         EME+K++ LMK  EERD+EIA+LK+Q++     ESSQT  +K  DKGK + QENQ  Q S S+ASLSVQQLQDMI N+IRAQYGG  Q+S +YSK YTK
Subjt:  AEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQ-SQYSTSIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTK

Query:  RINDLRMPAGYQPPKFQHFDGKGN-----------SNNILLTSSKLVRMLLEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRL
        RI++LRMP GYQPPKFQ FDGKGN             N      +LVR  +     N F          E     + +GEPV++YINRWRALSLDCKD+L
Subjt:  RINDLRMPAGYQPPKFQHFDGKGN-----------SNNILLTSSKLVRMLLEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRL

Query:  TELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLEKESKS--------VEKTSNEFFGVNTISPKLFSKIKGKRIEKQ
        TELS VE+CTQGMHWELLYILQGIKP TFEEL+TRAHDMELSIAN G+KD LV   ++  K+             NE   V     K FSK K  + E+ 
Subjt:  TELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLEKESKS--------VEKTSNEFFGVNTISPKLFSKIKGKRIEKQ

Query:  -KGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNH
          G+ K   +L+ERQ+KVYPFPDSD+  MLEQL+E +LI LPEC RP++  KVDDP YCKYHRVISHPVEKCFVLKE IL LARE KIELD +E+AQ+NH
Subjt:  -KGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNH

Query:  AA---------------------------PISVRSQRKT-SRNAIKDVVKSVDQGEENWATVTRQKK------QKQSFAWNKSSMLSKEHEHQGKKDSR-
         A                           PI VR Q+KT + N+      S D+GEE W  VT +K+      Q  S    K S  +  H+ +G+++ + 
Subjt:  AA---------------------------PISVRSQRKT-SRNAIKDVVKSVDQGEENWATVTRQKK------QKQSFAWNKSSMLSKEHEHQGKKDSR-

Query:  -KPKSERKENKDSFRPKISATLKEFFPKKFF--YIKKKKENLA---------SSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEEN
         KPK  + ++KD F+P+ S  L EF P+ F   + +K  E  A          ++Y + EEVDN     QR  VF RIKP T R SVFQR+SMA  +EEN
Subjt:  -KPKSERKENKDSFRPKISATLKEFFPKKFF--YIKKKKENLA---------SSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEEN

Query:  QCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLIISTNPKNGG
        QC  S   + S F+RL++ST KK+  STSAFD  ++I DQ ++ MK++K K F E NDD KIHS VPSRMKRK S+ INTEGS  VK + II TNP N G
Subjt:  QCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLIISTNPKNGG

Query:  CDESCNENES
         ++  +EN+S
Subjt:  CDESCNENES

KAA0045440.1 retrotransposon gag protein [Cucumis melo var. makuwa]3.0e-17352.93Show/hide
Query:  NPDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQSQYSTSIASLSVQQLQDMITNTIRA
        +P+I+SVM+T+  T + +MAE+EKK++ LMK VEERDYEIA LKN IE+ D AESS    +K  DKGK + QE Q +  TSIASLSVQQLQ+MIT++I+ 
Subjt:  NPDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQSQYSTSIASLSVQQLQDMITNTIRA

Query:  QYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNILLTSSKLVRML---------LEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVE
        QYGGS+Q+  LYSKPYTKRI++LRMP GYQPPKFQ +  K   NN+LLTSSK V++L         LE++FLNRFYST+R +SMMELTN++Q+KGE V++
Subjt:  QYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNILLTSSKLVRML---------LEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVE

Query:  YINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLEKESKSVEKTS-------NEFFGVNTIS
        YINRWRALSLDCKDRLTELS VE+CTQGMHW LLYILQGIKP TF+ELATRAHDMEL++AN+G+KD L+  L  +   ++ T         E   V+   
Subjt:  YINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLEKESKSVEKTS-------NEFFGVNTIS

Query:  PKLFSKIKGKRIE-KQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLARE
         K FSK K  +IE K  G+ K   +LKERQEKVYPFP+SD+  MLEQLLE +LI L EC R ++  KVDDP YCKYHRVISHP+EKCFVLKE IL LARE
Subjt:  PKLFSKIKGKRIE-KQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLARE

Query:  GKIELDFNEIAQSNH---------------------------AAPISVR-SQRKTSRNAIKDVVKSVDQGEENWATVTRQKK------QKQSFAWNKSSM
         KIELD +E+AQ+NH                           +API V   QR  + N+        D G E W TVT QK       QK+S    K + 
Subjt:  GKIELDFNEIAQSNH---------------------------AAPISVR-SQRKTSRNAIKDVVKSVDQGEENWATVTRQKK------QKQSFAWNKSSM

Query:  LSKEHEHQGKKDSR--KPKSERKENKDSFRPKISATLKEFFPKKFFYIKKKKENL-------------ASSHYINVEEVDNFKITGQRASVFDRIKPSTT
         S  H+ +G+++ +   PK  + +++D  + + S TL EF P+ F  ++   E +              +++Y + +EV+N     QR SVFDRIKPSTT
Subjt:  LSKEHEHQGKKDSR--KPKSERKENKDSFRPKISATLKEFFPKKFFYIKKKKENL-------------ASSHYINVEEVDNFKITGQRASVFDRIKPSTT

Query:  RPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEG
        R SVFQR+SMAT +E+NQC   I  + S F+RL++STSKKN  STSAFD  ++  DQ ++ MK++KAK F E NDD KIHS +PSRMKRK  + INTEG
Subjt:  RPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEG

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.5e-16950.97Show/hide
Query:  NPDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQSQYSTSIASLSVQQLQDMITNTIRA
        +P+I+SVM+T+  T + +MAE+EKK++ LMK VEERDYEIA LKN IE+RD AESS    +K  DKGK + QE+Q Q STSIASLSVQQLQ+MI ++I+ 
Subjt:  NPDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQSQYSTSIASLSVQQLQDMITNTIRA

Query:  QYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNIL---------------LTSSKLVRML---------------------LEKEFLN
        QYGG +Q+  LY KPYTKRI++LRMP GYQPPKFQ FDGKGN    +               L   + VR L                     LE++FLN
Subjt:  QYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNIL---------------LTSSKLVRML---------------------LEKEFLN

Query:  RFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLV----
        RFYST+  VSMMELTN++Q+KGE V++YINRWRALSLDCKDRLTELS VE+CTQGMHW LLYILQGIKP TFEELATRAHDMELSIANRG+KD L+    
Subjt:  RFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLV----

Query:  ---LDLEKESKSVEKTSNEFFGVNTISPKLFSKIKGKRIE-KQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKY
            +L+   K       E   V+    K FSK K  +IE K  G+ K   +LKERQEKVYPFPDSD+  MLEQLLE +LI LPEC RP++  KVDDP Y
Subjt:  ---LDLEKESKSVEKTSNEFFGVNTISPKLFSKIKGKRIE-KQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKY

Query:  CKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNHAAPISVRSQRKTSRNAIKDVVKSVDQGEENWATVTRQKKQKQSFAWNKSSMLSKEHEH
        CKYHRVISHPVEKCFVLKE IL LARE KIELD +E+AQ+NHA         + + N IK                                        
Subjt:  CKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNHAAPISVRSQRKTSRNAIKDVVKSVDQGEENWATVTRQKKQKQSFAWNKSSMLSKEHEH

Query:  QGKKDSRKPKSERKENKDSFRPKISATLKEFFPKKFFYIKKKKENL-------------ASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMS
                      +++D  + + S TL EF P+ F  ++   E +               ++Y + +EV+N     QR SVFDRIKPSTTR SVFQR+S
Subjt:  QGKKDSRKPKSERKENKDSFRPKISATLKEFFPKKFFYIKKKKENL-------------ASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMS

Query:  MATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLII
        +AT +EENQC   I  + S  +RL++ST KK+  STS+FD  ++  DQ ++ MK+ KAK F E NDD KIHS VPSRMKRK  + INTEGS  VK + II
Subjt:  MATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLII

Query:  STNPKNGGCDESCNENES
         TNP N G ++   EN+S
Subjt:  STNPKNGGCDESCNENES

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]8.3e-17151.49Show/hide
Query:  ENEFLVNHNPLFDLN------PDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQSQYST
        EN  +  HN L + +      P+I+SVM+T+  T + +MAE+EKK++ LMK VEERDYEIA LKN IE+RD AESS    +K  DKGK + QE+Q Q ST
Subjt:  ENEFLVNHNPLFDLN------PDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQSQYST

Query:  SIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNIL---------------LTSSKLVRML--------
        SIASLSVQQLQ+MI ++I+ QYGG +Q+  LYSKPYTKRI++LRMP GYQPPKFQ FDGKGN    +               L   + VR L        
Subjt:  SIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNIL---------------LTSSKLVRML--------

Query:  -------------LEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAH
                     LE++FLNRFYST+R VSMMELTN++Q+KGE V++YINRWRALSLDCKDRLTELS VE+CTQGMHW LLYILQGIKP TFEELATRAH
Subjt:  -------------LEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAH

Query:  DMELSIANRGSKDSLV-------LDLEKESKSVEKTSNEFFGVNTISPKLFSKIKGKRIE-KQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRL
        DMELSI NRG+KD L+        +L    K       E   V+    K FSK K  +IE K  G+ K   +LKERQEKVYPF DSD+  MLEQLLE +L
Subjt:  DMELSIANRGSKDSLV-------LDLEKESKSVEKTSNEFFGVNTISPKLFSKIKGKRIE-KQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRL

Query:  IVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNHAA--------PIS-VRSQRK------TSRNAIKDVVKS
        I LP+C RPK+  KVDDP YCKYHRVISHPVEKCFVLKE IL LARE KIEL+ +E+AQ+NH A        P++ +  QRK      TS    + +V  
Subjt:  IVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNHAA--------PIS-VRSQRK------TSRNAIKDVVKS

Query:  VDQG--------EENWATVTRQKK------QKQSFAWNKSSMLSKEHEHQGKKDSR--KPKSERKENKDSFRPKISATLKEFFPKKFFYIKKKKENL---
          Q         +E W TVTRQK       QK+S    K +  S  H+ +G+++ +   PK  + +++D  + + S TL EF P+ F  ++   E +   
Subjt:  VDQG--------EENWATVTRQKK------QKQSFAWNKSSMLSKEHEHQGKKDSR--KPKSERKENKDSFRPKISATLKEFFPKKFFYIKKKKENL---

Query:  ----------ASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQF
                   +++Y + +EV+N     QR SVFDRIKPSTTR SVFQR+SMAT +EENQC   I  + S F+RL++S SKKN  STSAFD  ++  DQ 
Subjt:  ----------ASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQF

Query:  EKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTE
        ++ MK++KAK F E NDD KIHS VPSRMKRK  + INT+
Subjt:  EKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTE

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein1.4e-16349.48Show/hide
Query:  MEFKKASTTAAYMSNLHMGHIIYGCSHEFQLREHQTLPVVANKVMMLQETVIENE---FLVNHNPLF------------DLNPDIVSVMMTDTSTMDEKM
        M   KA++ ++ +S+ ++G +    SH  +  + Q    V  K  +  E +IE+     ++  NPLF            + + ++VSVMM D  T +  M
Subjt:  MEFKKASTTAAYMSNLHMGHIIYGCSHEFQLREHQTLPVVANKVMMLQETVIENE---FLVNHNPLF------------DLNPDIVSVMMTDTSTMDEKM

Query:  AEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQ-SQYSTSIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTK
        AEME+K++ LMK VEERD+EIA+LK+Q++  + +ESSQT  +K  DKGK + +ENQ  Q S S+ASLSVQQLQDMI N+IRAQYGG  Q+S +YSKPYTK
Subjt:  AEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQ-SQYSTSIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTK

Query:  RINDLRMPAGYQPPKFQHFDGKGNSNNILL-----------TSSKLVRMLLEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRL
        RI++LRMP GYQP KFQ FDGKGN    ++              +LVR  +     N F  T+R VSMMELTN+ QRKGEPV++YINRWRALSLDCKD+L
Subjt:  RINDLRMPAGYQPPKFQHFDGKGNSNNILL-----------TSSKLVRMLLEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRL

Query:  TELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLEKESKSVEKTS-------NEFFGVNTISPKLFSKIK-GKRIEKQ
        TELS VE+CTQGMHWELLYILQGIKP TFEELATRAHDM+LSIANRG KD LV     +   +  T        NE   V     K FSK K  K     
Subjt:  TELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLEKESKSVEKTS-------NEFFGVNTISPKLFSKIK-GKRIEKQ

Query:  KGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNHA
         G+ K   +L+ERQ+KVYPFPDSD+  MLEQL+E +LI LPEC RP+++ KVDDP YCKYHRVISH VEKCFVLKE I  LARE KIELD +E+AQ+NH 
Subjt:  KGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNHA

Query:  APISVRSQRKTSRNAIKDVVKSVDQ-GEENWATVTRQKKQKQSFAWNKSSMLSKEHEHQGKKDSRKPKSERKENKDSFRPKISATLKEFFPKKFFYIKKK
        A +++ S    S   + D  KS+ Q G      V  Q+K   S + NK                 +P  +  E    F P+   +  E  P++   +   
Subjt:  APISVRSQRKTSRNAIKDVVKSVDQ-GEENWATVTRQKKQKQSFAWNKSSMLSKEHEHQGKKDSRKPKSERKENKDSFRPKISATLKEFFPKKFFYIKKK

Query:  KENL---ASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKT
                 ++Y + EE+DN     QR SVFD IKP TTR SVFQR+SMAT KEENQC      Q S F+RL++S SKK+  ST  FD  ++  DQ ++ 
Subjt:  KENL---ASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKT

Query:  MKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLIISTNPKNGGCDESCNENES
        MKT+KAK F E NDD KIHS VPSRMKRK S+ INTEGS  VK + II TNP N G ++  +EN+S
Subjt:  MKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLIISTNPKNGGCDESCNENES

A0A5A7SUW1 Retrotransposon gag protein8.1e-16447.65Show/hide
Query:  MEFKKASTTAAYMSNLHMGHIIYGCSHEFQLREHQTLPVVANKVMMLQETVIENE---FLVNHNPLF------------DLNPDIVSVMMTDTSTMDEKM
        M   KA++ ++  S+ ++G +    SH  +  + Q    V  K+ +  E +IE+     ++  NPLF            + + ++VSVMM D  T +  M
Subjt:  MEFKKASTTAAYMSNLHMGHIIYGCSHEFQLREHQTLPVVANKVMMLQETVIENE---FLVNHNPLF------------DLNPDIVSVMMTDTSTMDEKM

Query:  AEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQ-SQYSTSIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTK
         EME+K++ LMK  EERD+EIA+LK+Q++     ESSQT  +K  DKGK + QENQ  Q S S+ASLSVQQLQDMI N+IRAQYGG  Q+S +YSK YTK
Subjt:  AEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQ-SQYSTSIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTK

Query:  RINDLRMPAGYQPPKFQHFDGKGN-----------SNNILLTSSKLVRMLLEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRL
        RI++LRMP GYQPPKFQ FDGKGN             N      +LVR  +     N F          E     + +GEPV++YINRWRALSLDCKD+L
Subjt:  RINDLRMPAGYQPPKFQHFDGKGN-----------SNNILLTSSKLVRMLLEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRL

Query:  TELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLEKESKS--------VEKTSNEFFGVNTISPKLFSKIKGKRIEKQ
        TELS VE+CTQGMHWELLYILQGIKP TFEEL+TRAHDMELSIAN G+KD LV   ++  K+             NE   V     K FSK K  + E+ 
Subjt:  TELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLEKESKS--------VEKTSNEFFGVNTISPKLFSKIKGKRIEKQ

Query:  -KGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNH
          G+ K   +L+ERQ+KVYPFPDSD+  MLEQL+E +LI LPEC RP++  KVDDP YCKYHRVISHPVEKCFVLKE IL LARE KIELD +E+AQ+NH
Subjt:  -KGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNH

Query:  AA---------------------------PISVRSQRKT-SRNAIKDVVKSVDQGEENWATVTRQKK------QKQSFAWNKSSMLSKEHEHQGKKDSR-
         A                           PI VR Q+KT + N+      S D+GEE W  VT +K+      Q  S    K S  +  H+ +G+++ + 
Subjt:  AA---------------------------PISVRSQRKT-SRNAIKDVVKSVDQGEENWATVTRQKK------QKQSFAWNKSSMLSKEHEHQGKKDSR-

Query:  -KPKSERKENKDSFRPKISATLKEFFPKKFF--YIKKKKENLA---------SSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEEN
         KPK  + ++KD F+P+ S  L EF P+ F   + +K  E  A          ++Y + EEVDN     QR  VF RIKP T R SVFQR+SMA  +EEN
Subjt:  -KPKSERKENKDSFRPKISATLKEFFPKKFF--YIKKKKENLA---------SSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEEN

Query:  QCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLIISTNPKNGG
        QC  S   + S F+RL++ST KK+  STSAFD  ++I DQ ++ MK++K K F E NDD KIHS VPSRMKRK S+ INTEGS  VK + II TNP N G
Subjt:  QCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLIISTNPKNGG

Query:  CDESCNENES
         ++  +EN+S
Subjt:  CDESCNENES

A0A5A7TQK6 Retrotransposon gag protein1.5e-17352.93Show/hide
Query:  NPDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQSQYSTSIASLSVQQLQDMITNTIRA
        +P+I+SVM+T+  T + +MAE+EKK++ LMK VEERDYEIA LKN IE+ D AESS    +K  DKGK + QE Q +  TSIASLSVQQLQ+MIT++I+ 
Subjt:  NPDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQSQYSTSIASLSVQQLQDMITNTIRA

Query:  QYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNILLTSSKLVRML---------LEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVE
        QYGGS+Q+  LYSKPYTKRI++LRMP GYQPPKFQ +  K   NN+LLTSSK V++L         LE++FLNRFYST+R +SMMELTN++Q+KGE V++
Subjt:  QYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNILLTSSKLVRML---------LEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVE

Query:  YINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLEKESKSVEKTS-------NEFFGVNTIS
        YINRWRALSLDCKDRLTELS VE+CTQGMHW LLYILQGIKP TF+ELATRAHDMEL++AN+G+KD L+  L  +   ++ T         E   V+   
Subjt:  YINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLEKESKSVEKTS-------NEFFGVNTIS

Query:  PKLFSKIKGKRIE-KQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLARE
         K FSK K  +IE K  G+ K   +LKERQEKVYPFP+SD+  MLEQLLE +LI L EC R ++  KVDDP YCKYHRVISHP+EKCFVLKE IL LARE
Subjt:  PKLFSKIKGKRIE-KQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLARE

Query:  GKIELDFNEIAQSNH---------------------------AAPISVR-SQRKTSRNAIKDVVKSVDQGEENWATVTRQKK------QKQSFAWNKSSM
         KIELD +E+AQ+NH                           +API V   QR  + N+        D G E W TVT QK       QK+S    K + 
Subjt:  GKIELDFNEIAQSNH---------------------------AAPISVR-SQRKTSRNAIKDVVKSVDQGEENWATVTRQKK------QKQSFAWNKSSM

Query:  LSKEHEHQGKKDSR--KPKSERKENKDSFRPKISATLKEFFPKKFFYIKKKKENL-------------ASSHYINVEEVDNFKITGQRASVFDRIKPSTT
         S  H+ +G+++ +   PK  + +++D  + + S TL EF P+ F  ++   E +              +++Y + +EV+N     QR SVFDRIKPSTT
Subjt:  LSKEHEHQGKKDSR--KPKSERKENKDSFRPKISATLKEFFPKKFFYIKKKKENL-------------ASSHYINVEEVDNFKITGQRASVFDRIKPSTT

Query:  RPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEG
        R SVFQR+SMAT +E+NQC   I  + S F+RL++STSKKN  STSAFD  ++  DQ ++ MK++KAK F E NDD KIHS +PSRMKRK  + INTEG
Subjt:  RPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEG

A0A5A7URH1 Ty3-gypsy retrotransposon protein1.7e-16950.97Show/hide
Query:  NPDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQSQYSTSIASLSVQQLQDMITNTIRA
        +P+I+SVM+T+  T + +MAE+EKK++ LMK VEERDYEIA LKN IE+RD AESS    +K  DKGK + QE+Q Q STSIASLSVQQLQ+MI ++I+ 
Subjt:  NPDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQSQYSTSIASLSVQQLQDMITNTIRA

Query:  QYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNIL---------------LTSSKLVRML---------------------LEKEFLN
        QYGG +Q+  LY KPYTKRI++LRMP GYQPPKFQ FDGKGN    +               L   + VR L                     LE++FLN
Subjt:  QYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNIL---------------LTSSKLVRML---------------------LEKEFLN

Query:  RFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLV----
        RFYST+  VSMMELTN++Q+KGE V++YINRWRALSLDCKDRLTELS VE+CTQGMHW LLYILQGIKP TFEELATRAHDMELSIANRG+KD L+    
Subjt:  RFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLV----

Query:  ---LDLEKESKSVEKTSNEFFGVNTISPKLFSKIKGKRIE-KQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKY
            +L+   K       E   V+    K FSK K  +IE K  G+ K   +LKERQEKVYPFPDSD+  MLEQLLE +LI LPEC RP++  KVDDP Y
Subjt:  ---LDLEKESKSVEKTSNEFFGVNTISPKLFSKIKGKRIE-KQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKY

Query:  CKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNHAAPISVRSQRKTSRNAIKDVVKSVDQGEENWATVTRQKKQKQSFAWNKSSMLSKEHEH
        CKYHRVISHPVEKCFVLKE IL LARE KIELD +E+AQ+NHA         + + N IK                                        
Subjt:  CKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNHAAPISVRSQRKTSRNAIKDVVKSVDQGEENWATVTRQKKQKQSFAWNKSSMLSKEHEH

Query:  QGKKDSRKPKSERKENKDSFRPKISATLKEFFPKKFFYIKKKKENL-------------ASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMS
                      +++D  + + S TL EF P+ F  ++   E +               ++Y + +EV+N     QR SVFDRIKPSTTR SVFQR+S
Subjt:  QGKKDSRKPKSERKENKDSFRPKISATLKEFFPKKFFYIKKKKENL-------------ASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMS

Query:  MATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLII
        +AT +EENQC   I  + S  +RL++ST KK+  STS+FD  ++  DQ ++ MK+ KAK F E NDD KIHS VPSRMKRK  + INTEGS  VK + II
Subjt:  MATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLII

Query:  STNPKNGGCDESCNENES
         TNP N G ++   EN+S
Subjt:  STNPKNGGCDESCNENES

A0A5D3BX77 Retrotransposon gag protein4.0e-17151.49Show/hide
Query:  ENEFLVNHNPLFDLN------PDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQSQYST
        EN  +  HN L + +      P+I+SVM+T+  T + +MAE+EKK++ LMK VEERDYEIA LKN IE+RD AESS    +K  DKGK + QE+Q Q ST
Subjt:  ENEFLVNHNPLFDLN------PDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLKNQIENRDVAESSQTLAIKINDKGKTIAQENQSQYST

Query:  SIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNIL---------------LTSSKLVRML--------
        SIASLSVQQLQ+MI ++I+ QYGG +Q+  LYSKPYTKRI++LRMP GYQPPKFQ FDGKGN    +               L   + VR L        
Subjt:  SIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNIL---------------LTSSKLVRML--------

Query:  -------------LEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAH
                     LE++FLNRFYST+R VSMMELTN++Q+KGE V++YINRWRALSLDCKDRLTELS VE+CTQGMHW LLYILQGIKP TFEELATRAH
Subjt:  -------------LEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAH

Query:  DMELSIANRGSKDSLV-------LDLEKESKSVEKTSNEFFGVNTISPKLFSKIKGKRIE-KQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRL
        DMELSI NRG+KD L+        +L    K       E   V+    K FSK K  +IE K  G+ K   +LKERQEKVYPF DSD+  MLEQLLE +L
Subjt:  DMELSIANRGSKDSLV-------LDLEKESKSVEKTSNEFFGVNTISPKLFSKIKGKRIE-KQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRL

Query:  IVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNHAA--------PIS-VRSQRK------TSRNAIKDVVKS
        I LP+C RPK+  KVDDP YCKYHRVISHPVEKCFVLKE IL LARE KIEL+ +E+AQ+NH A        P++ +  QRK      TS    + +V  
Subjt:  IVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLKERILNLAREGKIELDFNEIAQSNHAA--------PIS-VRSQRK------TSRNAIKDVVKS

Query:  VDQG--------EENWATVTRQKK------QKQSFAWNKSSMLSKEHEHQGKKDSR--KPKSERKENKDSFRPKISATLKEFFPKKFFYIKKKKENL---
          Q         +E W TVTRQK       QK+S    K +  S  H+ +G+++ +   PK  + +++D  + + S TL EF P+ F  ++   E +   
Subjt:  VDQG--------EENWATVTRQKK------QKQSFAWNKSSMLSKEHEHQGKKDSR--KPKSERKENKDSFRPKISATLKEFFPKKFFYIKKKKENL---

Query:  ----------ASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQF
                   +++Y + +EV+N     QR SVFDRIKPSTTR SVFQR+SMAT +EENQC   I  + S F+RL++S SKKN  STSAFD  ++  DQ 
Subjt:  ----------ASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQF

Query:  EKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTE
        ++ MK++KAK F E NDD KIHS VPSRMKRK  + INT+
Subjt:  EKTMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTTAAAAAGGCTTCTACAACAGCTGCTTACATGAGTAACCTTCATATGGGTCATATCATCTATGGTTGCTCTCATGAATTCCAACTACGGGAACATCAAACTCT
TCCTGTTGTTGCAAACAAAGTAATGATGCTGCAAGAAACAGTTATTGAAAATGAGTTTCTTGTTAATCATAACCCCTTATTTGATCTTAACCCTGACATTGTCTCTGTTA
TGATGACTGATACAAGCACTATGGATGAAAAAATGGCTGAAATGGAAAAGAAGCTCTCTGCATTGATGAAGACAGTGGAAGAAAGAGATTATGAAATCGCTTCCTTGAAG
AATCAGATTGAAAACAGAGATGTTGCTGAGTCGAGTCAAACCCTTGCTATTAAGATAAATGACAAGGGAAAGACTATCGCTCAAGAAAATCAATCGCAATATTCCACTTC
GATCGCTTCGCTGTCTGTTCAGCAGCTCCAAGATATGATCACGAACACGATCAGAGCGCAATACGGCGGATCTTCTCAAAGCTCCCTCTTGTATTCTAAGCCATATACTA
AGAGGATTAATGATCTGAGAATGCCTGCTGGATACCAACCTCCAAAATTCCAACATTTCGATGGGAAGGGCAACTCAAACAACATATTGCTCACTTCGTCGAAACTTGTG
AGAATGCTGCTTGAAAAAGAATTCCTAAATCGCTTTTACAGTACGAAGAGAACTGTTAGCATGATGGAGCTCACAAACTCCAAACAAAGAAAAGGCGAACCGGTTGTCGA
ATATATCAACCGATGGAGAGCTCTAAGTCTTGATTGCAAAGATAGACTTACTGAACTATCCACTGTCGAATTATGCACTCAAGGTATGCATTGGGAACTTCTTTACATCC
TTCAAGGTATAAAACCTCACACCTTTGAAGAGCTAGCAACTCGTGCTCACGATATGGAGCTAAGCATTGCTAATCGAGGAAGTAAGGATTCTTTAGTGCTGGACTTAGAG
AAAGAAAGCAAAAGTGTTGAGAAAACTTCAAATGAATTTTTTGGTGTAAACACAATCTCTCCCAAGTTGTTTTCAAAAATAAAAGGGAAGAGAATTGAGAAGCAAAAAGG
AAATAGGAAAATATGGCTAAGCTTGAAAGAAAGGCAAGAAAAAGTCTATCCTTTTCCTGATTCCGACATCCCTTATATGTTGGAACAATTGCTGGAAACACGATTGATTG
TACTTCCAGAGTGTATACGACCAAAAGAAATGAGTAAAGTTGATGATCCCAAATATTGCAAGTACCACCGAGTTATCAGTCATCCAGTGGAAAAGTGTTTTGTCCTAAAA
GAGCGTATCCTAAATTTAGCTCGAGAAGGAAAGATTGAGTTGGATTTCAATGAAATAGCTCAATCAAATCACGCTGCACCCATATCTGTTCGCTCTCAACGGAAGACTTC
AAGAAATGCCATTAAAGATGTTGTCAAATCTGTCGATCAGGGAGAGGAAAACTGGGCTACTGTGACTCGTCAGAAGAAACAAAAACAAAGTTTTGCTTGGAATAAGTCAT
CTATGCTCAGCAAAGAGCATGAACACCAAGGAAAGAAGGATTCAAGGAAGCCTAAATCAGAGAGGAAAGAAAATAAAGACTCCTTTCGACCCAAGATATCAGCGACCTTA
AAAGAATTTTTCCCAAAAAAATTCTTCTATATCAAAAAGAAGAAAGAAAATCTTGCAAGTTCACATTACATAAATGTGGAAGAGGTTGACAACTTTAAAATAACTGGGCA
AAGGGCTTCTGTCTTCGATCGTATCAAGCCTTCAACTACTCGACCTTCGGTCTTCCAGAGGATGAGTATGGCCACAACAAAAGAAGAAAATCAATGTTTTATTTCCATCT
CCATTCAACCTTCAGTCTTTCAAAGGCTAAACGTCTCCACGTCAAAGAAAAATCATTCTTCGACATCTGCTTTTGATTGTTCAGAAGTGATAGGCGATCAATTTGAAAAA
ACGATGAAGACCATGAAGGCGAAATCATTTGCTGAAGTGAATGATGACAAGAAAATTCACAGCATTGTTCCTTCCCGTATGAAGAGAAAATTCTCTCTTCTCATAAATAC
AGAAGGTTCCTTTAAAGTAAAACAAAAGCTCATCATCTCAACCAATCCTAAAAATGGAGGATGTGATGAAAGTTGTAATGAAAATGAGAGCATATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATTTAAAAAGGCTTCTACAACAGCTGCTTACATGAGTAACCTTCATATGGGTCATATCATCTATGGTTGCTCTCATGAATTCCAACTACGGGAACATCAAACTCT
TCCTGTTGTTGCAAACAAAGTAATGATGCTGCAAGAAACAGTTATTGAAAATGAGTTTCTTGTTAATCATAACCCCTTATTTGATCTTAACCCTGACATTGTCTCTGTTA
TGATGACTGATACAAGCACTATGGATGAAAAAATGGCTGAAATGGAAAAGAAGCTCTCTGCATTGATGAAGACAGTGGAAGAAAGAGATTATGAAATCGCTTCCTTGAAG
AATCAGATTGAAAACAGAGATGTTGCTGAGTCGAGTCAAACCCTTGCTATTAAGATAAATGACAAGGGAAAGACTATCGCTCAAGAAAATCAATCGCAATATTCCACTTC
GATCGCTTCGCTGTCTGTTCAGCAGCTCCAAGATATGATCACGAACACGATCAGAGCGCAATACGGCGGATCTTCTCAAAGCTCCCTCTTGTATTCTAAGCCATATACTA
AGAGGATTAATGATCTGAGAATGCCTGCTGGATACCAACCTCCAAAATTCCAACATTTCGATGGGAAGGGCAACTCAAACAACATATTGCTCACTTCGTCGAAACTTGTG
AGAATGCTGCTTGAAAAAGAATTCCTAAATCGCTTTTACAGTACGAAGAGAACTGTTAGCATGATGGAGCTCACAAACTCCAAACAAAGAAAAGGCGAACCGGTTGTCGA
ATATATCAACCGATGGAGAGCTCTAAGTCTTGATTGCAAAGATAGACTTACTGAACTATCCACTGTCGAATTATGCACTCAAGGTATGCATTGGGAACTTCTTTACATCC
TTCAAGGTATAAAACCTCACACCTTTGAAGAGCTAGCAACTCGTGCTCACGATATGGAGCTAAGCATTGCTAATCGAGGAAGTAAGGATTCTTTAGTGCTGGACTTAGAG
AAAGAAAGCAAAAGTGTTGAGAAAACTTCAAATGAATTTTTTGGTGTAAACACAATCTCTCCCAAGTTGTTTTCAAAAATAAAAGGGAAGAGAATTGAGAAGCAAAAAGG
AAATAGGAAAATATGGCTAAGCTTGAAAGAAAGGCAAGAAAAAGTCTATCCTTTTCCTGATTCCGACATCCCTTATATGTTGGAACAATTGCTGGAAACACGATTGATTG
TACTTCCAGAGTGTATACGACCAAAAGAAATGAGTAAAGTTGATGATCCCAAATATTGCAAGTACCACCGAGTTATCAGTCATCCAGTGGAAAAGTGTTTTGTCCTAAAA
GAGCGTATCCTAAATTTAGCTCGAGAAGGAAAGATTGAGTTGGATTTCAATGAAATAGCTCAATCAAATCACGCTGCACCCATATCTGTTCGCTCTCAACGGAAGACTTC
AAGAAATGCCATTAAAGATGTTGTCAAATCTGTCGATCAGGGAGAGGAAAACTGGGCTACTGTGACTCGTCAGAAGAAACAAAAACAAAGTTTTGCTTGGAATAAGTCAT
CTATGCTCAGCAAAGAGCATGAACACCAAGGAAAGAAGGATTCAAGGAAGCCTAAATCAGAGAGGAAAGAAAATAAAGACTCCTTTCGACCCAAGATATCAGCGACCTTA
AAAGAATTTTTCCCAAAAAAATTCTTCTATATCAAAAAGAAGAAAGAAAATCTTGCAAGTTCACATTACATAAATGTGGAAGAGGTTGACAACTTTAAAATAACTGGGCA
AAGGGCTTCTGTCTTCGATCGTATCAAGCCTTCAACTACTCGACCTTCGGTCTTCCAGAGGATGAGTATGGCCACAACAAAAGAAGAAAATCAATGTTTTATTTCCATCT
CCATTCAACCTTCAGTCTTTCAAAGGCTAAACGTCTCCACGTCAAAGAAAAATCATTCTTCGACATCTGCTTTTGATTGTTCAGAAGTGATAGGCGATCAATTTGAAAAA
ACGATGAAGACCATGAAGGCGAAATCATTTGCTGAAGTGAATGATGACAAGAAAATTCACAGCATTGTTCCTTCCCGTATGAAGAGAAAATTCTCTCTTCTCATAAATAC
AGAAGGTTCCTTTAAAGTAAAACAAAAGCTCATCATCTCAACCAATCCTAAAAATGGAGGATGTGATGAAAGTTGTAATGAAAATGAGAGCATATGA
Protein sequenceShow/hide protein sequence
MEFKKASTTAAYMSNLHMGHIIYGCSHEFQLREHQTLPVVANKVMMLQETVIENEFLVNHNPLFDLNPDIVSVMMTDTSTMDEKMAEMEKKLSALMKTVEERDYEIASLK
NQIENRDVAESSQTLAIKINDKGKTIAQENQSQYSTSIASLSVQQLQDMITNTIRAQYGGSSQSSLLYSKPYTKRINDLRMPAGYQPPKFQHFDGKGNSNNILLTSSKLV
RMLLEKEFLNRFYSTKRTVSMMELTNSKQRKGEPVVEYINRWRALSLDCKDRLTELSTVELCTQGMHWELLYILQGIKPHTFEELATRAHDMELSIANRGSKDSLVLDLE
KESKSVEKTSNEFFGVNTISPKLFSKIKGKRIEKQKGNRKIWLSLKERQEKVYPFPDSDIPYMLEQLLETRLIVLPECIRPKEMSKVDDPKYCKYHRVISHPVEKCFVLK
ERILNLAREGKIELDFNEIAQSNHAAPISVRSQRKTSRNAIKDVVKSVDQGEENWATVTRQKKQKQSFAWNKSSMLSKEHEHQGKKDSRKPKSERKENKDSFRPKISATL
KEFFPKKFFYIKKKKENLASSHYINVEEVDNFKITGQRASVFDRIKPSTTRPSVFQRMSMATTKEENQCFISISIQPSVFQRLNVSTSKKNHSSTSAFDCSEVIGDQFEK
TMKTMKAKSFAEVNDDKKIHSIVPSRMKRKFSLLINTEGSFKVKQKLIISTNPKNGGCDESCNENESI