; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001463 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001463
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:31635276..31636841
RNA-Seq ExpressionLag0001463
SyntenyLag0001463
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]1.4e-9249.04Show/hide
Query:  MTNANNTQVPVLTPIAAPFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET
        M NA  T  P  +  +A FSNPPLNQ+LNQ+ ++KLDR N+LLWK LALPIL+ YKLEGHLTG   CP   +   +   TT  E+       ASSS    
Subjt:  MTNANNTQVPVLTPIAAPFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET

Query:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL
        ++N  +E W+  D LLLGWLYNSMT +VA Q+M F N +  W A QD FGVQSRAEED+LRQ+ Q +RK + KM +YL VMK++VDNLGQ GSPV  R L
Subjt:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL

Query:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQRQFQN-QHSGNNRQNYNSNNGGRGY
        +SQVLLGLDE YNLV+ +IQG+ DISW +MQ++LL+FEK L+ QNTQK        ++  +P+ NM      N QR   N +  G NRQ+++        
Subjt:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQRQFQN-QHSGNNRQNYNSNNGGRGY

Query:  NIGGRNRGRGRSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSP--VQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSH
                     GN N  P  Q+CGK GHSA+ CY+RF+KE+S   VQ++   +S G  S N   PA FV++Q+  PF   P+ VVDPNWY D+ A++H
Subjt:  NIGGRNRGRGRSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSP--VQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSH

Query:  VTADYNNVGNPVDYTG
        VT + +N+ NP +Y+G
Subjt:  VTADYNNVGNPVDYTG

TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense]1.5e-8140.19Show/hide
Query:  NANNTQVPVLTPIAAPFSN--PPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET
        ++++T+ P +    +  SN   P    LNQ  +IKLDR NF+LWK +   I++ ++L+GHL  ++ CPP+ +P P    T G  DSG            +
Subjt:  NANNTQVPVLTPIAAPFSN--PPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET

Query:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL
          NP+YE WLV DQLL+GWLY+SMT  VA  VM    A   W A+++LFG  S+++ + +R   Q +RK S  M +YL  MK+  D+L  AG P     L
Subjt:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL

Query:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSN--NQRQFQNQHSGNNRQNYNSNNGGRG
         + +L GLD EY  +V +I+ R   +W E+   LL ++ +LE  N   +  +L LSS    PSA++ TN+ +N  N  +  NQ + N   N   N GG  
Subjt:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSN--NQRQFQNQHSGNNRQNYNSNNGGRG

Query:  YNIGGRNRGRGRSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSHV
           GG  R RGR   N+N RP  QVCGK GHSA  CY R+D  Y        G+ P  NS N  SP+ FV +         PE V D  WYAD+ A++HV
Subjt:  YNIGGRNRGRGRSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSHV

Query:  TADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTD-EVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLRDG
        T D  N+    +Y G+E + VGNG +L IS VG   L     +++ LK +L VP+I KNL+SVS+L+ DN +FIEFH   C VKDK +   V++G L++G
Subjt:  TADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTD-EVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLRDG

Query:  LYQLENVQATAGVNV
        LYQLE     +  N+
Subjt:  LYQLENVQATAGVNV

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]7.9e-9150.79Show/hide
Query:  VATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNLVSQVLLGLDEEYNLVVAMIQGRGDISW
        +A Q+M F NAK  W A QDLFGVQSRAEED+LRQ+FQ +RK      DYLR+MK++ D LGQAGSPV  R  +SQ LLGLDE YN V+A+IQG+ +ISW
Subjt:  VATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNLVSQVLLGLDEEYNLVVAMIQGRGDISW

Query:  SEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQRQFQN-QHSGNNRQNYNSNNGGRGYNIGGRNRGRGRSYGNSNYRPVYQVCGK
         +MQ+ELL FEKRLE Q+TQK+T ++      +    N+  NR S++ R++ N Q  GNNR N     G  G+NIG   RGRG+  GN   +P  QVC K
Subjt:  SEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQRQFQN-QHSGNNRQNYNSNNGGRGYNIGGRNRGRGRSYGNSNYRPVYQVCGK

Query:  PGHSAIACYHRFDKEY-SPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSHVTADYNNVGNPVDYTGNERVTVGNGNKL
         GHSA+ CY+RF+KE+ SP+       S   N +   +    VT QS N F    + V++ NWY D+ A++H+T +Y+N+ NP +Y+G E++ VGNG+ L
Subjt:  PGHSAIACYHRFDKEY-SPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSHVTADYNNVGNPVDYTGNERVTVGNGNKL

Query:  TISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLRDGLYQLENVQ
         IS +GN+ LTD +N L LKN+LCVP I KNLVSVSKL +DN ++IEFH  +C +KDKD+GRT++  T++DGLY L+ ++
Subjt:  TISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLRDGLYQLENVQ

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]1.9e-10547.55Show/hide
Query:  MTNANNTQVPVLTPIAAPFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET
        M NA  T  P  +  +A FSNPPLNQ+LNQ+T++KLDR N+LLWK LALPIL+ YKLEGHLT    CP   +   +   TT  E+       ASSS    
Subjt:  MTNANNTQVPVLTPIAAPFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET

Query:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL
        ++NP +E W+  D LLLGWLYNSMT +VA Q+M F N +  W A QD FGVQSRAEED+LRQ+ Q +RK                               
Subjt:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL

Query:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQR-QFQNQHSGNNRQNYNSNNGGRGY
              GLDE YNLV+ +IQG+ DISW +MQ++LL+FEKRL+ QNTQK     N  ++  +P+ NM      N QR Q   +  G NRQ+++        
Subjt:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQR-QFQNQHSGNNRQNYNSNNGGRGY

Query:  NIGGRNRGRGRSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSP--VQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSH
                     GN N  P  Q+CGK GHSA+ CY+RF+KE+S   VQN+   +S G  S N   PA FV++Q+  PF   P+ VVDPNWY D+ A++H
Subjt:  NIGGRNRGRGRSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSP--VQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSH

Query:  VTADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGR
        VT + +N+ NP +Y+G E+VTVGNGN+L IS VGN+CLTD   +L LKNILCVP IAKNL+SVSKL +DN I+IEFH   C +KDK +G+
Subjt:  VTADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGR

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]1.7e-10147.53Show/hide
Query:  NNTQVPVLTPIA----APFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET
        +N   P++TP A    A F++PPLNQLLNQ+TSIK+DRGNFLLW+NLALPILRSYKL  +LTG K CPP  +     V T    D+ T +  ++SS    
Subjt:  NNTQVPVLTPIA----APFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET

Query:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL
         LNP YEAW+VVD+LLLGWLYNSM ++VA QVM F  ++  W A+Q+LFGVQSRAE DYL+QVFQQ+ K SL+M +YL++MKSH DNL  AGS VS R+L
Subjt:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL

Query:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNR-FSNNQRQFQNQHSGNNRQNYNSNNGGRGY
        VSQVL GLDEEYN +V  +QG+ ++SWSEM AELL +EKRLE QN+ KS   +N +     PS N    R F  NQR     ++GNN    N++ GG GY
Subjt:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNR-FSNNQRQFQNQHSGNNRQNYNSNNGGRGY

Query:  ---NIGGRNRGRG-RSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAAS
           + G RNRGRG +   + N+ P        G +  A +H                                    T+  V  PE V+DP+WYAD+ A+
Subjt:  ---NIGGRNRGRG-RSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAAS

Query:  SHVTADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLR
        SHVTA+ NNV   VDY+G E V V NGNKL+IS +G++ +     +L+LK++L VP IAKNL                        DK SGRT++KGTL+
Subjt:  SHVTADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLR

Query:  DGLYQLE
        D LY+L+
Subjt:  DGLYQLE

TrEMBL top hitse value%identityAlignment
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X19.4e-10647.55Show/hide
Query:  MTNANNTQVPVLTPIAAPFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET
        M NA  T  P  +  +A FSNPPLNQ+LNQ+T++KLDR N+LLWK LALPIL+ YKLEGHLT    CP   +   +   TT  E+       ASSS    
Subjt:  MTNANNTQVPVLTPIAAPFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET

Query:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL
        ++NP +E W+  D LLLGWLYNSMT +VA Q+M F N +  W A QD FGVQSRAEED+LRQ+ Q +RK                               
Subjt:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL

Query:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQR-QFQNQHSGNNRQNYNSNNGGRGY
              GLDE YNLV+ +IQG+ DISW +MQ++LL+FEKRL+ QNTQK     N  ++  +P+ NM      N QR Q   +  G NRQ+++        
Subjt:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQR-QFQNQHSGNNRQNYNSNNGGRGY

Query:  NIGGRNRGRGRSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSP--VQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSH
                     GN N  P  Q+CGK GHSA+ CY+RF+KE+S   VQN+   +S G  S N   PA FV++Q+  PF   P+ VVDPNWY D+ A++H
Subjt:  NIGGRNRGRGRSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSP--VQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSH

Query:  VTADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGR
        VT + +N+ NP +Y+G E+VTVGNGN+L IS VGN+CLTD   +L LKNILCVP IAKNL+SVSKL +DN I+IEFH   C +KDK +G+
Subjt:  VTADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGR

A0A5A7SIT7 Uncharacterized protein7.0e-9349.04Show/hide
Query:  MTNANNTQVPVLTPIAAPFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET
        M NA  T  P  +  +A FSNPPLNQ+LNQ+ ++KLDR N+LLWK LALPIL+ YKLEGHLTG   CP   +   +   TT  E+       ASSS    
Subjt:  MTNANNTQVPVLTPIAAPFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET

Query:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL
        ++N  +E W+  D LLLGWLYNSMT +VA Q+M F N +  W A QD FGVQSRAEED+LRQ+ Q +RK + KM +YL VMK++VDNLGQ GSPV  R L
Subjt:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL

Query:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQRQFQN-QHSGNNRQNYNSNNGGRGY
        +SQVLLGLDE YNLV+ +IQG+ DISW +MQ++LL+FEK L+ QNTQK        ++  +P+ NM      N QR   N +  G NRQ+++        
Subjt:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQRQFQN-QHSGNNRQNYNSNNGGRGY

Query:  NIGGRNRGRGRSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSP--VQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSH
                     GN N  P  Q+CGK GHSA+ CY+RF+KE+S   VQ++   +S G  S N   PA FV++Q+  PF   P+ VVDPNWY D+ A++H
Subjt:  NIGGRNRGRGRSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSP--VQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSH

Query:  VTADYNNVGNPVDYTG
        VT + +N+ NP +Y+G
Subjt:  VTADYNNVGNPVDYTG

A0A5C7IJ06 Uncharacterized protein7.3e-8240.19Show/hide
Query:  NANNTQVPVLTPIAAPFSN--PPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPN-----EVATTGAEDSGTEVNEASS
        ++++T  P +    +  SN   P    LNQ  +IKLDR NF+LWK +   I++ ++L+GHL  ++ CPP+ +P P         T G  DSG        
Subjt:  NANNTQVPVLTPIAAPFSN--PPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPN-----EVATTGAEDSGTEVNEASS

Query:  STVETVLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPV
            +  NP+YE WLV DQLL+GWLY+SMT  VA  VM    A   W A+++LFG  S+++ + +R   Q +RK S  M +YL  MK+  D+L  AG P 
Subjt:  STVETVLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPV

Query:  STRNLVSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSN--NQRQFQNQHSGNNRQNYNSN
            L +  L GLD EY  +V +I+ R   +W E+   LL ++ +LE  N   +  +L LSS    PSA++ TN+ +N  N  +  NQ + N   N   N
Subjt:  STRNLVSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSN--NQRQFQNQHSGNNRQNYNSN

Query:  NGGRGYNIGGRNRGRGRSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNA
         GG     GG  R RGR   N+N RP  QVCGK GHSA  CY R+D  Y        G+ P  NS N  SP+ FV +         PE V D  WYAD+ 
Subjt:  NGGRGYNIGGRNRGRGRSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNA

Query:  ASSHVTADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTD-EVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKG
        A++HVT D  N+    DY G+E + VGNG +L IS VG   L     +++ LK +L VP+I KNL+SVS+L+ DN +FIEFH   C VKDK +G  V++G
Subjt:  ASSHVTADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTD-EVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKG

Query:  TLRDGLYQLENVQATAGVNV
         L++GLYQLE     +  N+
Subjt:  TLRDGLYQLENVQATAGVNV

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-9150.79Show/hide
Query:  VATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNLVSQVLLGLDEEYNLVVAMIQGRGDISW
        +A Q+M F NAK  W A QDLFGVQSRAEED+LRQ+FQ +RK      DYLR+MK++ D LGQAGSPV  R  +SQ LLGLDE YN V+A+IQG+ +ISW
Subjt:  VATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNLVSQVLLGLDEEYNLVVAMIQGRGDISW

Query:  SEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQRQFQN-QHSGNNRQNYNSNNGGRGYNIGGRNRGRGRSYGNSNYRPVYQVCGK
         +MQ+ELL FEKRLE Q+TQK+T ++      +    N+  NR S++ R++ N Q  GNNR N     G  G+NIG   RGRG+  GN   +P  QVC K
Subjt:  SEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQRQFQN-QHSGNNRQNYNSNNGGRGYNIGGRNRGRGRSYGNSNYRPVYQVCGK

Query:  PGHSAIACYHRFDKEY-SPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSHVTADYNNVGNPVDYTGNERVTVGNGNKL
         GHSA+ CY+RF+KE+ SP+       S   N +   +    VT QS N F    + V++ NWY D+ A++H+T +Y+N+ NP +Y+G E++ VGNG+ L
Subjt:  PGHSAIACYHRFDKEY-SPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSHVTADYNNVGNPVDYTGNERVTVGNGNKL

Query:  TISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLRDGLYQLENVQ
         IS +GN+ LTD +N L LKN+LCVP I KNLVSVSKL +DN ++IEFH  +C +KDKD+GRT++  T++DGLY L+ ++
Subjt:  TISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLRDGLYQLENVQ

A0A6J1DCW4 uncharacterized protein LOC1110195988.3e-10247.53Show/hide
Query:  NNTQVPVLTPIA----APFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET
        +N   P++TP A    A F++PPLNQLLNQ+TSIK+DRGNFLLW+NLALPILRSYKL  +LTG K CPP  +     V T    D+ T +  ++SS    
Subjt:  NNTQVPVLTPIA----APFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVET

Query:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL
         LNP YEAW+VVD+LLLGWLYNSM ++VA QVM F  ++  W A+Q+LFGVQSRAE DYL+QVFQQ+ K SL+M +YL++MKSH DNL  AGS VS R+L
Subjt:  VLNPQYEAWLVVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNL

Query:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNR-FSNNQRQFQNQHSGNNRQNYNSNNGGRGY
        VSQVL GLDEEYN +V  +QG+ ++SWSEM AELL +EKRLE QN+ KS   +N +     PS N    R F  NQR     ++GNN    N++ GG GY
Subjt:  VSQVLLGLDEEYNLVVAMIQGRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNR-FSNNQRQFQNQHSGNNRQNYNSNNGGRGY

Query:  ---NIGGRNRGRG-RSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAAS
           + G RNRGRG +   + N+ P        G +  A +H                                    T+  V  PE V+DP+WYAD+ A+
Subjt:  ---NIGGRNRGRG-RSYGNSNYRPVYQVCGKPGHSAIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAAS

Query:  SHVTADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLR
        SHVTA+ NNV   VDY+G E V V NGNKL+IS +G++ +     +L+LK++L VP IAKNL                        DK SGRT++KGTL+
Subjt:  SHVTADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLR

Query:  DGLYQLE
        D LY+L+
Subjt:  DGLYQLE

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.1e-3027.12Show/hide
Query:  LNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVETVLNPQYEAWLVVDQLLLGWLYNS
        LN  ++ VT  KL   N+L+W      +   Y+L G L GS + PP         AT G + +               +NP Y  W   D+L+   +  +
Subjt:  LNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVETVLNPQYEAWLVVDQLLLGWLYNS

Query:  MTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNLVSQVLLGLDEEYNLVVAMIQGRG
        ++  V   V     A   W  ++ ++   S      LR   +Q  K +  + DY++ + +  D L   G P+     V +VL  L EEY  V+  I  + 
Subjt:  MTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNLVSQVLLGLDEEYNLVVAMIQGRG

Query:  -DISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNP-SANMTTNRFSNNQRQFQNQHSGNNRQNYNSNNGGRGYNIGGRNRGRGRSYGNSNYRPVY
           + +E+   LL  E ++           L +SS  + P +AN  ++R +       N +  N   N N+NN  + +     N      + N+N    Y
Subjt:  -DISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNP-SANMTTNRFSNNQRQFQNQHSGNNRQNYNSNNGGRGYNIGGRNRGRGRSYGNSNYRPVY

Query:  ----QVCGKPGHSAIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSHVTADYNNVGNPVDYTGNERV
            Q+CG  GHSA  C        S +Q+  +      + N+ Q P+ F   Q      +G       NW  D+ A+ H+T+D+NN+     YTG + V
Subjt:  ----QVCGKPGHSAIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSHVTADYNNVGNPVDYTGNERV

Query:  TVGNGNKLTISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLRDGLYQ
         V +G+ + IS  G++ L+ +   L L NIL VP I KNL+SV +L   N + +EF      VKD ++G  +++G  +D LY+
Subjt:  TVGNGNKLTISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLRDGLYQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.0e-2525.78Show/hide
Query:  LNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVETVLNPQYEAWLVVDQLLLGWLYNS
        LN  ++ VT  KL   N+L+W      +   Y+L G L GS   PP         AT G +                 +NP Y  W   D+L+   +  +
Subjt:  LNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVETVLNPQYEAWLVVDQLLLGWLYNS

Query:  MTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNLVSQVLLGLDEEYNLVVAMIQGRG
        ++  V   V     A   W  ++ ++   S      LR +                   +  D L   G P+     V +VL  L ++Y  V+  I  + 
Subjt:  MTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNLVSQVLLGLDEEYNLVVAMIQGRG

Query:  -DISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNP-SANMTTNRFSNNQRQFQNQHSGNNRQNYNSNNGGRGYNIGGRNRGRGRSYGNSNYRPVY
           S +E+   L+  E +L           L L+S  + P +AN+ T+R +N  R   NQ++  + +NYN+NN          +  R  +     Y    
Subjt:  -DISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNP-SANMTTNRFSNNQRQFQNQHSGNNRQNYNSNNGGRGYNIGGRNRGRGRSYGNSNYRPVY

Query:  QVCGKPGHSAIAC--YHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSHVTADYNNVGNPVDYTGNERVTV
        Q+C   GHSA  C   H+F                   +N  QS + F   Q      V      + NW  D+ A+ H+T+D+NN+     YTG + V +
Subjt:  QVCGKPGHSAIAC--YHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSHVTADYNNVGNPVDYTGNERVTV

Query:  GNGNKLTISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLRDGLYQ
         +G+ + I+  G++ L     +L+L  +L VP I KNL+SV +L   NR+ +EF      VKD ++G  +++G  +D LY+
Subjt:  GNGNKLTISSVGNSCLTDEVNNLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLRDGLYQ

Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.7e-0623.77Show/hide
Query:  SIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVETVLNPQYEAWLVVDQLLLGWLYNSMTSEVATQV
        ++ L++ N+ +W+ L   +  S+ + GH+ GS +  P                  TE                 + W   D L+  W+Y ++T  +   +
Subjt:  SIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVETVLNPQYEAWLVVDQLLLGWLYNSMTSEVATQV

Query:  MSFD-NAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNLVSQVLLGLDEEYNLVVAMIQGRGDI-SWSEM
        +     A+  W ++++LF     A         + +  + L + +Y + +KS  D L    SP+S R LV  +L GL E+Y+ ++ +I+ +    S++E 
Subjt:  MSFD-NAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNLVSQVLLGLDEEYNLVVAMIQGRGDI-SWSEM

Query:  QAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQRQFQNQHSGNNRQNYNSNNGGRGYNIGGRNRGRGRSYG----NSNYR---PVYQV
        ++ LL+ E RL    + KS SSL+ ++   +PS +         Q ++  ++  NN      +N GRG +   +NRG G S G    N+N+R   P   +
Subjt:  QAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQRQFQNQHSGNNRQNYNSNNGGRGYNIGGRNRGRGRSYG----NSNYR---PVYQV

Query:  CGKP------GHSAIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAV
         G P       H     +H+  K Y P Q       P   S  +  P+     +  NP++ G   V
Subjt:  CGKP------GHSAIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAACGCCAACAACACCCAAGTCCCCGTCCTAACCCCTATTGCAGCTCCGTTTAGCAATCCTCCTTTGAATCAATTACTGAACCAAGTTACTAGTATCAAACTTGA
TAGGGGAAACTTCCTATTATGGAAGAACCTTGCCTTACCAATCCTTAGAAGTTACAAATTGGAAGGCCATCTCACTGGCTCAAAATCTTGCCCACCGAAGCGTATTCCGC
AACCAAATGAAGTTGCAACCACCGGAGCTGAAGACTCTGGAACTGAAGTTAACGAGGCGTCTAGCTCAACCGTAGAGACGGTGCTTAACCCACAATATGAAGCCTGGCTT
GTTGTTGACCAGCTCTTACTGGGATGGCTGTATAACTCCATGACGTCGGAGGTAGCCACTCAAGTAATGAGTTTCGATAATGCAAAATACTTTTGGGCAGCTATTCAGGA
CTTGTTCGGCGTCCAATCCAGGGCAGAGGAGGATTACCTCCGTCAGGTATTTCAGCAATCTCGAAAGAATAGTCTAAAAATGTCTGATTACTTACGTGTCATGAAAAGTC
ATGTAGATAACCTAGGGCAGGCTGGGAGTCCGGTTTCAACAAGGAATCTGGTATCACAAGTGTTATTGGGGCTAGACGAGGAGTATAACCTTGTGGTGGCGATGATCCAA
GGTAGAGGTGACATTTCTTGGTCAGAGATGCAAGCTGAGCTTCTGGTTTTTGAAAAACGCCTTGAGCTTCAGAACACTCAGAAAAGCACATCATCACTCAACTTATCATC
ACTCAACCTAAATCCATCTGCGAACATGACTACCAATCGGTTTTCCAATAATCAAAGGCAGTTTCAAAACCAACATAGTGGAAATAACAGACAGAACTACAACTCCAACA
ATGGAGGCAGAGGCTACAACATTGGAGGTAGGAACAGAGGCAGGGGTCGTAGTTATGGTAACTCAAACTATCGCCCTGTGTACCAAGTTTGCGGAAAGCCTGGACATTCT
GCAATTGCTTGTTATCATCGATTTGATAAAGAGTATTCACCTGTGCAAAATAAAGAAAATGGGAACAGCCCAGGTCAAAACTCCAACAATACACAATCCCCTGCAGCCTT
CGTGACAAGTCAAAGCACTAATCCATTTGTTGTTGGACCTGAGGCTGTAGTCGATCCCAATTGGTATGCGGACAACGCTGCATCTAGTCATGTTACTGCTGATTATAATA
ATGTGGGAAATCCAGTCGACTACACAGGTAATGAACGAGTGACAGTAGGTAATGGGAACAAACTTACAATCTCTTCTGTTGGTAATTCATGTTTAACTGATGAAGTCAAT
AATCTTGAGCTTAAAAACATTTTATGTGTTCCTAAGATAGCGAAAAATCTCGTTAGTGTATCAAAACTTATCGAGGATAACAGAATTTTCATTGAATTTCATAATGGCTT
CTGTCTTGTTAAGGACAAGGATTCGGGCAGAACAGTAATGAAAGGAACACTTAGAGATGGGCTATACCAACTAGAGAATGTTCAAGCTACTGCAGGAGTGAATGTTGATA
GTCAGAAAGTTCAGAAGATAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGACCAACGCCAACAACACCCAAGTCCCCGTCCTAACCCCTATTGCAGCTCCGTTTAGCAATCCTCCTTTGAATCAATTACTGAACCAAGTTACTAGTATCAAACTTGA
TAGGGGAAACTTCCTATTATGGAAGAACCTTGCCTTACCAATCCTTAGAAGTTACAAATTGGAAGGCCATCTCACTGGCTCAAAATCTTGCCCACCGAAGCGTATTCCGC
AACCAAATGAAGTTGCAACCACCGGAGCTGAAGACTCTGGAACTGAAGTTAACGAGGCGTCTAGCTCAACCGTAGAGACGGTGCTTAACCCACAATATGAAGCCTGGCTT
GTTGTTGACCAGCTCTTACTGGGATGGCTGTATAACTCCATGACGTCGGAGGTAGCCACTCAAGTAATGAGTTTCGATAATGCAAAATACTTTTGGGCAGCTATTCAGGA
CTTGTTCGGCGTCCAATCCAGGGCAGAGGAGGATTACCTCCGTCAGGTATTTCAGCAATCTCGAAAGAATAGTCTAAAAATGTCTGATTACTTACGTGTCATGAAAAGTC
ATGTAGATAACCTAGGGCAGGCTGGGAGTCCGGTTTCAACAAGGAATCTGGTATCACAAGTGTTATTGGGGCTAGACGAGGAGTATAACCTTGTGGTGGCGATGATCCAA
GGTAGAGGTGACATTTCTTGGTCAGAGATGCAAGCTGAGCTTCTGGTTTTTGAAAAACGCCTTGAGCTTCAGAACACTCAGAAAAGCACATCATCACTCAACTTATCATC
ACTCAACCTAAATCCATCTGCGAACATGACTACCAATCGGTTTTCCAATAATCAAAGGCAGTTTCAAAACCAACATAGTGGAAATAACAGACAGAACTACAACTCCAACA
ATGGAGGCAGAGGCTACAACATTGGAGGTAGGAACAGAGGCAGGGGTCGTAGTTATGGTAACTCAAACTATCGCCCTGTGTACCAAGTTTGCGGAAAGCCTGGACATTCT
GCAATTGCTTGTTATCATCGATTTGATAAAGAGTATTCACCTGTGCAAAATAAAGAAAATGGGAACAGCCCAGGTCAAAACTCCAACAATACACAATCCCCTGCAGCCTT
CGTGACAAGTCAAAGCACTAATCCATTTGTTGTTGGACCTGAGGCTGTAGTCGATCCCAATTGGTATGCGGACAACGCTGCATCTAGTCATGTTACTGCTGATTATAATA
ATGTGGGAAATCCAGTCGACTACACAGGTAATGAACGAGTGACAGTAGGTAATGGGAACAAACTTACAATCTCTTCTGTTGGTAATTCATGTTTAACTGATGAAGTCAAT
AATCTTGAGCTTAAAAACATTTTATGTGTTCCTAAGATAGCGAAAAATCTCGTTAGTGTATCAAAACTTATCGAGGATAACAGAATTTTCATTGAATTTCATAATGGCTT
CTGTCTTGTTAAGGACAAGGATTCGGGCAGAACAGTAATGAAAGGAACACTTAGAGATGGGCTATACCAACTAGAGAATGTTCAAGCTACTGCAGGAGTGAATGTTGATA
GTCAGAAAGTTCAGAAGATAGAATGA
Protein sequenceShow/hide protein sequence
MTNANNTQVPVLTPIAAPFSNPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEGHLTGSKSCPPKRIPQPNEVATTGAEDSGTEVNEASSSTVETVLNPQYEAWL
VVDQLLLGWLYNSMTSEVATQVMSFDNAKYFWAAIQDLFGVQSRAEEDYLRQVFQQSRKNSLKMSDYLRVMKSHVDNLGQAGSPVSTRNLVSQVLLGLDEEYNLVVAMIQ
GRGDISWSEMQAELLVFEKRLELQNTQKSTSSLNLSSLNLNPSANMTTNRFSNNQRQFQNQHSGNNRQNYNSNNGGRGYNIGGRNRGRGRSYGNSNYRPVYQVCGKPGHS
AIACYHRFDKEYSPVQNKENGNSPGQNSNNTQSPAAFVTSQSTNPFVVGPEAVVDPNWYADNAASSHVTADYNNVGNPVDYTGNERVTVGNGNKLTISSVGNSCLTDEVN
NLELKNILCVPKIAKNLVSVSKLIEDNRIFIEFHNGFCLVKDKDSGRTVMKGTLRDGLYQLENVQATAGVNVDSQKVQKIE